基于特征融合的深度学习目标检测算法研究

doi:10.12052/gdutxb.200147

Abstract

Abstract: Through the study of feature levels in convolutional neural networks, this paper found that high-level feature have stronger semantic information and low resolution, and low-level features have strong resolution and weaker semantic information. Aiming at these problems, a object detection algorithm based on secondary feature fusion is proposed. The algorithm reuses transitional features and performs secondary feature fusion on the basis of Feature Pyramid Networks to supplement the rich low-level feature information to the top. Finally, the average accuracy of AP, AP₅₀, and AP₇₅ on the COCO2014 data set reach 35.3%, 57.5%, and 36.6%, respectively. Compared with the unused feature fusion method and the traditional feature fusion method, the average accuracy is increased by 2.4%, 3.7% and 2.4%, which significantly improves the missed detection and the detection of small targets.

Key words: feature fusion, object detection, convolutional neural network, feature reuse

CLC Number:

TP242.6+2

Huang Jian-hang, Wang Zhen-you. A Research on Deep Learning Object Detection Algorithm Based on Feature Fusion[J].Journal of Guangdong University of Technology, 2021, 38(04): 52-58.

References

[1] HE K, GKIOXARI G, DOLLÁR P, et al. Mask R-CNN [J]. IEEE transactions on pattern analysis & machine intelligence, 2020, 42(2): 386-397.
[2] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 779-788.
[3] REN S, HE K, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6): 1137-1149.
[4] REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]//IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Honolulu: IEEE, 2017: 6517-6525.
[5] LI Y, CHEN Y, WANG N, et al. Scale-aware trident networks for object detection[C]//IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 6053-6062.
[6] BHARAT S, MAHYAR N, LARRY S D. SNIPER: efficient multi-scale training[J]. arXiv preprint arXiv: 1805.09300, 2018.
[7] LIU W, ANUELOVG D, ERHAN D, et al. SSD: Single shot multibox detector[C]//European Conference on Computer Vision. Berlin: Springer, Cham, 2016: 21-37.
[8] KONG T, YAO A, CHEN Y, et al. Hypernet: Towards accurate region proposal generation and joint object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Las Vegas: IEEE, 2016: 845-853.
[9] LIU S T, HUANG D, WANG Y H. Receptive field block net for accurate and fast object detection[J]. arXiv preprint arXiv: 1711.07767, 2017.
[10] LIN TY, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 2117-2125.
[11] LI Z, ZHOU F Q. FSSD: feature fusion single shot multibox detector[J]. arXiv preprint arXiv: 1712.00960, 2017.
[12] FU C Y, LIU W, RANGA A, et al. Dssd: deconvolutional single shot detector[J]. arXiv preprint arXiv: 1701.06659. 2017.
[13] 温捷文, 战萌伟, 李楚宏, 等. 一种加强SSD小目标检测能力的Atrous滤波器设计[J]. 计算机应用研究, 2019, 36(3): 861-865, 872.
WEN J W, ZHANM W, LI C H, et al. Design of Atrous filter to strengthen small object detection capability of SSD [J]. Application Research of Computers, 2019, 36(3): 861-865, 872.
[14] 高俊艳, 刘文印, 杨振国. 结合注意力与特征融合的目标跟踪[J]. 广东工业大学学报, 2019, 36(4): 18-23.
GAO J Y, LIU W Y, YANG Z G. Object tracking combined with attention and feature fusion [J]. Journal of Guangdong University of Technology, 2019, 36(4): 18-23.
[15] HE K, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Las Vegas: IEEE, 2016: 770-778.
[16] REDMON J, FARHADI A. YOLOV3: an incremental improvement[J]. arXiv preprint arXiv: 1804.02767, 2018.
[17] ABADI M, AGARWAL A, BARHAM P, et al. Tensorflow: large-scale machine learning on heterogeneous distributed systems[J]. arXiv preprint arXiv: 1603.04467, 2016.
[18] RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge [J]. International journal of computer vision, 2015, 115(3): 211-252.
[19] DAI J F, LI Y, HE K. R-FCN: object detection via region-based fully convolutional networks[J]. arXiv preprint arXiv: 1605.06409, 2016.

Related Articles 14

[1]	Xie Guo-bo, Lin Li, Lin Zhi-yi, He Di-xuan, Wen Gang. An Insulator Burst Defect Detection Method Based on YOLOv4-MP [J]. Journal of Guangdong University of Technology, 2023, 40(02): 15-21.
[2]	Zhang Yun, Wang Xiao-dong. A Review and Thinking of Deep Learning with a Restricted Number of Samples [J]. Journal of Guangdong University of Technology, 2022, 39(05): 1-8.
[3]	Yang Ji-sheng, Zhang Yun, Li Dong. A Residual Neural Network with Voting for 3D Object Detection in Point Clouds [J]. Journal of Guangdong University of Technology, 2022, 39(01): 56-62.
[4]	Zhang Guo-sheng, Feng Guang, Li Dong. Pose-based Oriented Object Detection Network for Aerial Images [J]. Journal of Guangdong University of Technology, 2021, 38(05): 40-47.
[5]	Ma Shao-peng, Liang Lu, Teng Shao-hua. A Lightweight Hyperspectral Remote Sensing Image Classification Method [J]. Journal of Guangdong University of Technology, 2021, 38(03): 29-35.
[6]	Xia Hao, Cai Nian, Wang Ping, Wang Han. Magnetic Resonance Image Super-Resolution via Multi-Resolution Learning [J]. Journal of Guangdong University of Technology, 2020, 37(06): 26-31.
[7]	Zhan Yin-wei, Zhu Bai-wan, Yang Zhuo. Research and Application of Vehicle Color and Model Recognition Algorithm [J]. Journal of Guangdong University of Technology, 2020, 37(04): 9-14.
[8]	Zeng Bi-qing, Han Xu-li, Wang Sheng-yu, Xu Ru-yang, Zhou Wu. Sentiment Classification Based on Double Attention Convolutional Neural Network Model [J]. Journal of Guangdong University of Technology, 2019, 36(04): 10-17.
[9]	Gao Jun-yan, Liu Wen-yin, Yang Zhen-guo. Object Tracking Combined with Attention and Feature Fusion [J]. Journal of Guangdong University of Technology, 2019, 36(04): 18-23.
[10]	Yang Meng-jun, Su Cheng-yue, Chen Jing, Zhang Jie-xin. Loop Closure Detection for Visual SLAM Using Convolutional Neural Networks [J]. Journal of Guangdong University of Technology, 2018, 35(05): 31-37.
[11]	Chen Xu, Zhang Jun, Chen Wen-wei, Li Shuo-hao. Convolutional Neural Network Algorithm and Case [J]. Journal of Guangdong University of Technology, 2017, 34(06): 20-26.
[12]	SHEN Xiao-Min， LI Bao-Jun， SUN Xu， XU Wei-Chao. Large Scale Face Clustering Based on Convolutional Neural Network [J]. Journal of Guangdong University of Technology, 2016, 33(06): 77-84.
[13]	CHEN Shi-Wen1, 2 , Cai-Nian2, Xiao-Ming-Ming3. Detection of Moving Objects Based on the Gaussian Mixture Model and the Canny Operator [J]. Journal of Guangdong University of Technology, 2011, 28(3): 87-91.
[14]	CAO Xiao-jun,PAN Bao-chang,ZHENG Sheng-lin,GAN Yan-fen . Motion Object Detection Method Based on the Characteristic Image [J]. Journal of Guangdong University of Technology, 2007, 24(2): 87-89.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

A Research on Deep Learning Object Detection Algorithm Based on Feature Fusion

HTML

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 14

Metrics

Comments

Recommended 0