J Shanghai Jiaotong Univ Sci ›› 2023, Vol. 28 ›› Issue (1): 100-113. doi: 10.1007/s12204-023-2573-3
QIN Chao1 (秦 超), WANG Yafei1 (王亚飞), ZHANG Yuchao2 (张宇超), YIN Chengliang1∗ (殷承良)
Received: 2022-03-08
Online: 2023-01-28
Published: 2023-02-10
QIN Chao1 (秦 超), WANG Yafei1 (王亚飞), ZHANG Yuchao2 (张宇超), YIN Chengliang1∗ (殷承良). Birds-Eye-View Semantic Segmentation and Voxels Semantic Segmentation Based on Frustum Voxels Modeling and Monocular Camera[J]. J Shanghai Jiaotong Univ Sci, 2023, 28(1): 100-113.