| 273 | 8 | 60 |
| 下载次数 | 被引频次 | 阅读次数 |
针对铁塔摄像头视频中挖掘机等工程机械目标极为稀疏、小目标居多、背景复杂、目标遮挡等问题,提出一种改进的YOLOv5s算法,首先将数据增强方式改为mosaic-9,有效增加了小目标样本量;其次,优化了特征提取与特征融合过程,增加了一个检测尺度,提高了小目标的检测精度;最后,引入了SKNet注意力模块,使模型更加关注感兴趣区域,抑制无用信息和复杂背景的干扰,提高算法的检测准确率。实验结果表明,改进后算法AP值达到了79.63%,相对YOLOv5s的AP值提升了6.8%,同时小目标检测精度更高,漏检情况更少。
Abstract:The construction equipment targets such as excavators in the tower-based video are extremely sparse, mostly small and with complex backgrounds, and part of them is occluded. So, we proposed an improved detection algorithm based on YOLOv5s. Firstly, we changed the data augmentation mode to Mosain-9, to increase the number of small target. Then, we optimized the process of feature extraction and feature fusion,and added a detection scale to improve the detection ability of small targets and feature fusion. Finally, we introduced an attention mechanism to pay more attention to the region of interest, suppress useless information and background noise, and improve the accuracy of target detection.The experimental results show that AP value of the improved algorithm reaches 79.63%, which is 6.8% higher than that of YOLOv5s. At the same time, the detection accuracy of small targets is higher, and the missing target is less.
[1]冯笑雨.基于塔基监控图像的建设施工用地识别与空间定位方法研究[D].南京:南京师范大学,2019
[2]王文杰,何小海,卿粼波,等.改进YOLOv5的船舶检测算法及嵌入式实现[J].无线电工程,2022,52(12):2 116-2 123
[3]翟宏亮.基于轻型卷积神经网络的无人机多目标检测系统研究[J].地理空间信息,2022,20(12):81-83
[4]罗茜,赵睿,庄慧珊,等. YOLOv5与Deep-SORT联合优化的无人机多目标跟踪算法[J].信号处理,2022,38(12):2 628-2 638
[5]王一旭,肖小玲,王鹏飞,等.改进YOLOv5s的小目标烟雾火焰检测算法[J].计算机工程与应用,2023,59(1):72-81
[6]程顺生,覃驭楚,吕炎杰.城市环境下基于双目视觉的移动目标检测[J].地理空间信息,2022,20(3):7-11
[7] YoloV5源码部分注释解读(ultralytics版本)(train.py)[EB/OL].(2023-06-19)[2023-07-23]. https://github. com/ultralytics/yolov5
[8]涂海清.基于yolov5s-se和数据增强的夜间车辆目标检测[D].广州:华南理工大学,2021
[9]陈禹蒲,马晓川,李璇.基于YOLOv3锚框优化的侧扫声呐图像目标检测[J].信号处理,2022,38(11):2 359-2 371
[10] YOLOv5解读[EB/OL].(2021-12-09)[2023-10-20]. https://blog. csdn. net/zjl892209143/article/details/121833051
[11] Wang C Y,Liao H Y M,Wu Y H,et al. CSPNet:A New Backbone that Can Enhance Learning Capability of CNN[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops,2020:390-391
[12] He K,Zhang X,Ren S,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1 904-1 916
[13] Tan M,Pang R,Le Q V. Efficientdet:Scalable and Efficient Object Detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020:10 781-10 790
[14]王战涛,张策,王晓田.基于YOLOV3的改进目标检测识别算法[J].上海航天(中英文),2021,38(6):60-70
[15] Hu J,Shen L,Sun G. Squeeze-and-excitation Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Washington D. C. USA:IEEE Press,2018(7):132-141
[16]赵梦,于红,李海清,等.融合SKNet与YOLOv5深度学习的养殖鱼群检测[J].大连海洋大学学报,2022,37(2):312-319
基本信息:
中图分类号:TU60;TP183;TP391.41
引用信息:
[1]余添添,吴松,唐芝青,等.利用铁塔视频图像和改进YOLOv5的违规施工监测[J].地理空间信息,2024,22(04):45-48.
2024-04-28
2024-04-28