基于YOLOv11n-GSSE的轻量化石榴成熟度识别方法研究

    A lightweight real-time pomegranate maturity detection method based on YOLOv11n-GSSE

    • 摘要:
      目的 针对复杂果园环境中石榴成熟度检测存在小目标尺度变化大、遮挡严重以及边缘设备计算资源受限等问题,构建一种兼顾检测精度与计算效率的轻量化实时检测方法。
      方法 以YOLOv11n为基础框架,构建轻量化石榴成熟度检测模型YOLOv11n-GSSE。首先,引入GhostHGNetV2作为主干网络,以减少冗余特征并提升特征提取效率;其次,融合无参数SimAM注意力机制以增强对果皮裂纹、颜色变化等细粒度特征的表达能力;最后,在特征融合阶段通过SlimNeck结构强化多尺度特征交互,并设计Efficient_Detect检测头以降低计算复杂度。同时,采用LAMP算法进行50%通道剪枝,并结合教师−助教−学生三阶段知识蒸馏策略对压缩模型进行性能恢复与优化,并在包含14 353幅真实果园图像的数据集上进行试验验证。
      结果 与SSD、Faster R-CNN、RT-DETR及YOLOv8n等模型相比,YOLOv11n-GSSE在检测精度和计算效率方面均表现更优,模型mAP0.5达到93.6%,经剪枝蒸馏后,模型mAP0.5仍保持92.8%,参数量降至3.47 M,计算量降至1.9 G。在Jetson Xavier NX平台上平均端到端推理时间约为0.011 s(约91 FPS)。可视化分析表明,改进模型对目标区域关注更集中、背景抑制能力更强;跨数据集测试中,在水稻病害数据集上的精确率、召回率和mAP0.5分别提升约3.6、3.8和2.4个百分点,表现出较好的泛化能力。
      结论 YOLOv11n-GSSE在保持较高检测精度的同时明显降低模型复杂度,在复杂果园环境中对不同成熟阶段石榴目标具有良好的识别能力,能够满足边缘设备上的实时成熟度监测需求,为果园智能管理与精准采收提供了一种可行的技术方案。

       

      Abstract:
      Objective To address the challenges of large-scale variation of small targets, severe occlusion, and limited computational resources on edge devices in pomegranate maturity detection under complex orchard environments, a lightweight real-time detection method that balanced detection accuracy and computational efficiency was developed.
      Method Based on the YOLOv11n framework, a lightweight pomegranate maturity detection model named YOLOv11n-GSSE, was developed. Firstly, GhostHGNetV2 was introduced as the backbone network to reduce redundant features while improving feature extraction efficiency. Then, the parameter-free SimAM attention mechanism was integrated to enhance the representation of fine-grained features such as peel fissures and color variations. Finally, in the feature fusion stage, the SlimNeck structure was adopted to strengthen multi-scale feature interaction, and an Efficient_Detect head was designed to further reduce computational complexity. Meanwhile, 50% channel pruning based on the LAMP algorithm was performed, and a three-stage knowledge distillation strategy (teacher-assistant-student) was applied to restore and optimize the performance of the compressed model.
      Result Compared with SSD, Faster R-CNN, RT-DETR, YOLOv8n, and other models, YOLOv11n-GSSE achieved superior detection accuracy and computational efficiency, with an mAP0.5 of 93.6%. After pruning and knowledge distillation, the compressed model still maintained an mAP0.5 of 92.8%, while the number of parameters and computational cost were reduced to 3.47 M and 1.9 G, respectively. On the Jetson Xavier NX platform, the average end-to-end inference time was approximately 0.011 s per image (about 91 FPS). Visualization analysis showed that the improved model focused more accurately on target regions and exhibited stronger background suppression capability. In cross-dataset experiments on a rice disease dataset, the precision, recall, and mAP0.5 were improved by approximately 3.6, 3.8, and 2.4 percentage points, respectively, demonstrating good generalization ability.
      Conclusion YOLOv11n-GSSE significantly reduces model complexity while maintaining high detection accuracy. The model demonstrates strong capabilities in recognizing pomegranate targets at different maturity stages under complex orchard conditions and can meet the requirements of real-time maturity monitoring on edge devices, providing a feasible technical solution for intelligent orchard management and precision harvesting.

       

    /

    返回文章
    返回