Journal of Guangdong University of Technology ›› 2023, Vol. 40 ›› Issue (04): 18-23.doi: 10.12052/gdutxb.220145

• Computer Science and Technology • Previous Articles     Next Articles

A Channel-splited Based Dual-branch Block for 3D Point Cloud Processing

Zhong Geng-jun, Li Dong   

  1. School of Automation, Guangdong University of Technology, Guangzhou 510006, China
  • Received:2022-09-19 Online:2023-07-25 Published:2023-08-02

Abstract: In this paper, we propose a channel-split-based dual-branch block (CDBlock) based on the PointMLP method, which splits input features into two groups along the channel dimension and feeds them into different branches of the dual-branch network module. Specifically, our proposed CDBlock consists of a lightweight branch and a heavyweight branch. The lightweight branch, which uses the structure of the residual MLP, is designed to extract coarse semantic features. The heavyweight branch, which uses the bottleneck network as backbone, is responsible for extracting more deeper and distinguishable information. By doing this, our proposed CDBlock effectively improves the network's feature extraction and learning capabilities. Experimental results show that our proposed method outperforms the existing PointMLP by achieving an overall accuracy of 86.2% and a class average accuracy of 84.97% on ScanObjectNN dataset. In particular, our approach achieves better results with using fewer parameters and computational cost than the PointMLP. Additionally, our approach also achieves encouraging performance on the ShapeNetPart dataset.

Key words: 3D point cloud, deep learning, multi-branch, residual network

CLC Number: 

  • TP391.4
[1] QI C R, SU H, MO K, et al. Pointnet: deep learning on point sets for 3D classification and segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, HI: IEEE, 2017: 652-660.
[2] QI C R, YI L, SU H, et al. Pointnet++: deep hierarchical feature learning on point sets in a metric space[J]. Advances in Neural Information Processing Systems, 2017, 30: 5099-5108.
[3] THOMAS H, QI C R, DESCHAUD J E, et al. Kpconv: flexible and deformable convolution for point clouds[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Long Beach: IEEE, 2019: 6411-6420.
[4] WU W, QI Z, FUXIN L. Pointconv: deep convolutional networks on 3D point clouds[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seoul: IEEE, 2019: 9621-9630.
[5] XU M, DING R, ZHAO H, et al. Paconv: position adaptive convolution with dynamic kernel assembling on point clouds[C]//Proceedings of the IEEE/ CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2021: 3173-3182.
[6] RAN H, LIU J, WANG C. Surface representation for point clouds[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans: IEEE, 2022: 18942-18952.
[7] LIU Z, HU H, CAO Y, et al. A closer look at local aggregation operators in point cloud analysis[C]// European Conference on Computer Vision. Glasgow: ECCV, 2020: 326-342.
[8] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[C]//International Conference on Learning Representations. Cambridge: ICLR, 2021: 1-6.
[9] ZHAO H, JIANG L, JIA J, et al. Point transformer[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Montreal: IEEE, 2021: 16259-16268.
[10] GUO M H, CAI J X, LIU Z N, et al. Pct: point cloud transformer[J]. Computational Visual Media, 2021, 7(2): 187-199.
[11] XU MA, CAN Q, HAOXUAN Y, et al. Rethinking network design and local geometry in point cloud: a simple residual MLP framework[C]//International Conference on Learning Representations. Kigali: ICLR, 2022: 1-9.
[12] ZHANG H, WU C, ZHANG Z, et al. Resnest: split-attention networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans: IEEE, 2022: 2736-2746.
[13] SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE, 2015: 1-9.
[14] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 770-778.
[15] UY M A, PHAM Q H, HUA B S, et al. Revisiting point cloud classification: a new benchmark dataset and classification model on real-world data[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul: IEEE, 2019: 1588-1597.
[16] YI L, KIM V G, CEYLAN D, et al. A scalable active framework for region annotation in 3D shape collections[J]. ACM Transactions on Graphics(ToG), 2016, 35(6): 1.
[17] LI Y, BU R, SUN M, et al. Pointcnn: convolution on x-transformed points[J]. Advances in Neural Information Processing Systems, 2018, 31: 1-11.
[1] Wen Wen, Liu Ying, Cai Rui-chu, Hao Zhi-feng. Spatial-temporal Deep Regression Model for Multi-granularity Traffic Flow Prediction [J]. Journal of Guangdong University of Technology, 2023, 40(04): 1-8.
[2] Jin Yu-kai, Li Zhi-sheng, Ou Yao-chun, Zhang Hua-gang, Zeng Jiang-yi, Chen Bo-chao. Prediction and Comparative Study of PM2.5 Concentration Based on Multi-stage Clustering [J]. Journal of Guangdong University of Technology, 2023, 40(03): 17-24.
[3] Liu Dong-ning, Wang Zi-qi, Zeng Yan-jiao, Wen Fu-yan, Wang Yang. Prediction Method of Gene Methylation Sites Based on LSTM with Compound Coding Characteristics [J]. Journal of Guangdong University of Technology, 2023, 40(01): 1-9.
[4] Xu Wei-feng, Cai Shu-ting, Xiong Xiao-ming. Visual Inertial Odometry Based on Deep Features [J]. Journal of Guangdong University of Technology, 2023, 40(01): 56-60,76.
[5] Liu Hong-wei, Lin Wei-zhen, Wen Zhan-ming, Chen Yan-jun, Yi Min-qi. A MABM-based Model for Identifying Consumers' Sentiment Polarity―Taking Movie Reviews as an Example [J]. Journal of Guangdong University of Technology, 2022, 39(06): 1-9.
[6] Zhang Yun, Wang Xiao-dong. A Review and Thinking of Deep Learning with a Restricted Number of Samples [J]. Journal of Guangdong University of Technology, 2022, 39(05): 1-8.
[7] Zheng Jia-bi, Yang Zhen-guo, Liu Wen-yin. Marketing-Effect Estimation Based on Fine-grained Confounder Balancing [J]. Journal of Guangdong University of Technology, 2022, 39(02): 55-61.
[8] Yang Ji-sheng, Zhang Yun, Li Dong. A Residual Neural Network with Voting for 3D Object Detection in Point Clouds [J]. Journal of Guangdong University of Technology, 2022, 39(01): 56-62.
[9] Gary Yen, Li Bo, Xie Sheng-li. An Evolutionary Optimization of LSTM for Model Recovery of Geophysical Fluid Dynamics [J]. Journal of Guangdong University of Technology, 2021, 38(06): 1-8.
[10] Lai Jun, Liu Zhen-yu, Liu Sheng-hai. A Small Sample Data Prediction Method Based on Global Data Shuffling [J]. Journal of Guangdong University of Technology, 2021, 38(03): 17-21.
[11] Cen Shi-jie, He Yuan-lie, Chen Xiao-cong. A Monocular Depth Estimation Combined with Attention and Unsupervised Deep Learning [J]. Journal of Guangdong University of Technology, 2020, 37(04): 35-41.
[12] Zeng Bi, Ren Wan-ling, Chen Yun-hua. An Unpaired Face Illumination Normalization Method Based on CycleGAN [J]. Journal of Guangdong University of Technology, 2018, 35(05): 11-19.
[13] Yang Meng-jun, Su Cheng-yue, Chen Jing, Zhang Jie-xin. Loop Closure Detection for Visual SLAM Using Convolutional Neural Networks [J]. Journal of Guangdong University of Technology, 2018, 35(05): 31-37.
[14] Chen Xu, Zhang Jun, Chen Wen-wei, Li Shuo-hao. Convolutional Neural Network Algorithm and Case [J]. Journal of Guangdong University of Technology, 2017, 34(06): 20-26.
[15] Liu Zhen-yu, Li Jia-jun, Wang Kun. A Fingerprint Matching Localization Method Based on Deep Auto Encoder [J]. Journal of Guangdong University of Technology, 2017, 34(05): 15-21.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!