|
[1]S. Ren, K. He, R. Girshick and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137-1149, 1 June 2017, doi: 10.1109/TPAMI.2016.2577031. [2]P. Li, X. Chen and S. Shen, "Stereo R-CNN Based 3D Object Detection for Autonomous Driving," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 7644-7652, doi: 10.1109/CVPR.2019.00783. [3]J. Sun, L. Chen, Y. Xie, S. Zhang, Q. Jiang, X. Zhou and H. Bao, "Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation," 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 10548-10557, doi: 10.1109/CVPR42600.2020.01056 [4]Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas, "PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 652-660, doi: 10.1109/CVPR.2017.16. [5]Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. of the IEEE, pp.2278-2324, Nov. 1998 [6]Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas, "PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space," Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17), 2017, pp. 5105–5114, doi: 10.5555/3295222.3295263. [7]S. Shi, X. Wang and H. Li, "PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 770-779, doi: 10.1109/CVPR.2019.00086. [8]Y. Zhou and O. Tuzel, "VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 4490-4499, doi: 10.1109/CVPR.2018.00472. [9]Z. Liu, H. Tang, Y. Lin and S. Han, "Point-Voxel CNN for Efficient 3D Deep Learning," Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS'19), 2019, pp. 5105–5114, doi: 10.5555/3454287.3454374. [10]S. Shi, C. Guo, L. Jiang, Z. Wang, J. Shi, X. Wang, and H. Li, "PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 10529-10538, doi: 10.1109/CVPR42600.2020.01054. [11]S. Pang, D. Morris and H. Radha, "CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection," 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 10386–10393, doi: 10.1109/IROS45743.2020.9341791. [12]C. R. Qi, W. Liu, C. Wu, H. Su and L. J. Guibas, "Frustum PointNets for 3D Object Detection From RGB-D Data," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 918-927, doi: 10.1109/CVPR.2018.00102. [13]Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai, “Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph,” 2022 European Conference on Computer Vision (ECCV), 2022, pp 662-679, [14]Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai, "Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion," 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), NewOrleans , LA , USA,2022 ,pp.5408-5417, doi:10.1109/CV PR52688.2022.00534. [15]A. Mahmoud, J. S. K. Hu and S. L. Waslander, "Dense Voxel Fusion for 3D Object Detection," 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 2023, pp. 663-672, doi: 10.1109/WACV56688.2023.00073. [16]Yanwei Li, Xiaojuan Qi, Yukang Chen, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia, "Voxel Field Fusion for 3D Object Detection," 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 2022, pp. 1110-1119, doi: 10.1109/CVPR52688.2022.00119. [17]Yifan Zhang, Qijian Zhang, Junhui Hou, Yixuan Yuan, Guoliang Xing, "Bidirectional Propagation for Cross-Modal 3D Object Detection," arXiv preprint arXiv:2301.09077 (2023). [18]H. Thomas, C. R. Qi, J. -E. Deschaud, B. Marcotegui, F. Goulette and L. Guibas, "KPConv: Flexible and Deformable Convolution for Point Clouds," 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 2019, pp. 6410-6419, doi: 10.1109/ICCV.2019.00651. [19]Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, "Deep residual learning for image recognition," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016. [20]S. Woo , J. Park , J.-Y. Lee, I. S. Kweon, "CBAM: Convolutional Block Attention Module," 2018 European Conference on Computer Vision (ECCV), 2018, pp. 3-19, doi: 10.48550/ arXiv.1807.06521 (arXiv version). [21]Chen Chen, Zhe Chen, Jing Zhang, and Dacheng Tao, "Sasa: Semantics-augmented set abstraction for point-based 3d object detection," Proceedings of the AAAI Conference on Artificial Intelligence, volume 1, pp. 221–229, 2022. [22]Zetong Yang, Yanan Sun, Shu Liu, and Jiaya Jia, " 3dssd: Point-based 3d single stage object detector," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11040–11048, 2020. [23]A. Geiger, P. Lenz and R. Urtasun, "Are we ready for autonomous driving? The KITTI vision benchmark suite," 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp. 3354-3361, doi: 10.1109/CVPR.2012.6248074. [24]D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” Proceedings of International Conference on Learning Representations (ICLR Poster), 2015, doi: 10.48550/arXiv.1412.6980 (arXiv version).
|