Algorithm for Coal Mine Rock Foreign Object Detection Based on Enhanced YOLOv8
Downloads
In addressing the challenge of suboptimal detection precision in coal mine rock foreign object detection due to complex environments and variable object scales, this paper proposes a coal mine rock foreign object detection algorithm based on YOLOv8. Initially, the incorporation of the BiFormer attention mechanism is advocated to refine the backbone network, augmenting the model's attention towards pivotal information regions, consequently enhancing localization and feature extraction capabilities. Secondly, a lightweight Content-Aware Recurrent Affine Feature Extraction (CARAFE) operator is utilized within the neck architecture to effectively capture and preserve intricate features at lower hierarchical levels. Finally, Wise-IoU v3 is adopted as the bounding box regression loss for the proposed algorithm, coupled with a prudent gradient allocation approach, thereby enhancing the model's localization capabilities. Empirical findings illustrate that compared to baseline algorithms, the proposed algorithm has fewer parameters, with an average mAP improvement of 2.5%, and a detection speed increase of 2fps/s.
Guofa W, Feng L, Yihui P, Ren. (2019). Coal mine intelligence: Core technical support for high-quality development of the coal industry. Journal of China Coal Society, 44(2), 349-357.
Guofa W. (2022). Latest Technological Advanceme-nts and Issues in Coal Mine Intelligence. Coal Scien-ce & Technology (0253-2336), 50(1).
Xiangang C, Siyin L, Peng W. (2022). Research on Coal Gangue Recognition and Localization System for Coal Gangue Sorting Robots. Coal Science & Technology (0253-2336), 50(1).
Zhang N, Donahue J, Girshick R, et al. Part-Based R-CNNs for Fine-Grained Category Detection[J]. Lect-ure Notes in Computer Science, 2014,8689(1):834-849.
He K, Zhang X, Ren S, et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2014,37(9):1904-1916.
Girshick R. Fast R-CNN[C]. IEEE International Conference on Computer Vision. Santiago, Chile, 2015.
Ren S, He K, Girshick R, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017,39(6):1137-1149.
Redmon J, Divvala S, Girshick R, et al. You Only Look Once: Unified, Real-Time Object Detection[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 2016:779-788.
Redmon J, Farhadi A. YOLO9000: Better, Faster, Stronger[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017:6517-6525.
Redmon J, Farhadi A. YOLOv3: An Incremental Improvement[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018.
Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: Optimal Speed and Accuracy of Object Detection[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition, USA, 2020.
LIU W, ANGUELOV D, D E, et al. SSD: Single Shot MultiBox Detector[J]. European Conference on Computer vision, 2016:21-37.
Duan K, Bai S, Xie L, et al. CenterNet: Keypoint Triplets for Object Detection. arXiv,2019.
Han G, Peipei Z, Zheng Y. (Year). Coal Mine Conveyor Belt Foreign Object Detection Based on Feature Enhancement and Transformer. Coal Science & Technology, 1-11.
Xiangang C, Hu L, Peng W. (2024). Coal Foreign O-bject Detection Method Based on Cross-Modal Atte-ntion Fusion. Industrial and Mining Automation (01), 57-65. doi:10.13272/j.issn.1671-251x.2023110035.
Zhengyuan C, Wei J, & Chenghui F. (2023). Intelligent Detection Method for Coal Flow Foreign Objects Based on Dual Attention Generative Adversarial Networks. Industrial and Mining Automation (12), 56-62. doi:10.13272/j.issn.1671-251x.18094.
Jun T, Jingzhao L, Qing S. (2023). Real-time Detecti-on of Foreign Objects on Belt Conveyor Based on F-aster-YOLOv7. Industrial and Mining Automation (11), 46-52+66. doi:10.13272/j.issn.1671-251x.2023020037.
Shuai H, Xu Z, Xu M (2022). Foreign Object Detection on Coal Mine Conveyor Belt Based on CBAM-YOLOv5. Journal of China Coal Society (11), 4147-4156. doi:10.13225/j.cnki.jccs.2021.1644.
Jingyi D, Rui C, Le H. Foreign Object Detection on Coal Mine Belt Conveyor. Industrial and Mining Automation (08), 77-83. doi:10.13272/j.issn.1671-251x.2021040026.
Zhiling R & Yancun Z. (2023). Research on Foreign Object Recognition in Coal Mine Belt Transportatio-n Based on Improved CenterNet Algorithm. Contro-l Engineering (04), 703-711. doi:10.14107/j.cnki.kzgc.20200792.
Zhu, L., Wang, X., Ke, Z., Zhang, W., & Lau, R. W. (2023). Biformer: Vision transformer with bi-level routing attention. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 10323-10333).
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Wang, J., Chen, K., Xu, R., Liu, Z., Loy, C. C., & Lin, D. (2019). Carafe: Content-aware reassembly of features. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 3007-3016).
Tong, Z., Chen, Y., Xu, Z., & Yu, R. (2023). Wise-I-oU: bounding box regression loss with dynamic foc-using mechanism. arXiv preprint arXiv:2301.10051.