›› 2026, Vol. 50 ›› Issue (生态统计方法专题): 0-.DOI: 10.17521/cjpe.2025.0334

• •    

基于深度学习目标检测的植物物候自动识别方法对比研究——以神农架粉白杜鹃为例

贾元, 张琳, 宋创业, 赵常明, 郭虓, 祝晓光, 吴冬秀   

  1. 中国科学院植物研究所植被与环境变化重点实验室, 北京 100093 中国
    国家植物园, 北京 100093 中国
    北京天航华创科技有限公司, 北京 100085 中国
    北京航空航天大学, 北京 100191 中国
  • 收稿日期:2025-09-09 修回日期:2025-11-25 出版日期:2026-08-30
  • 基金资助:
    国家重点研发计划课题(2022YFF1300103); 中国科学院战略性先导科技专项(A类)(XDA26020102); 中国科学院野外站基础研究项目课题(KFJ-SW-YW043-4)

Comparison of deep learning-based object detection methods for automatic plant phenology recognition: A case study of Rhododendron hypoglaucum in Shennongjia

Jia Yuan, Zhang Lin, Song Chuangye, Zhao Changming, Guo Xiao, Zhu Xiaoguang, Wu Dongxiu   

  1. Key Laboratory of Vegetation and Environmental Change, Institute of Botany, Chinese Academy of Sciences 100093, China
    , China National Botanical Garden 100093, China
    , Beijing Tianhang Create Technology Co., Ltd. 100085, China
    , Beihang University 100191, China
  • Received:2025-09-09 Revised:2025-11-25 Online:2026-08-30
  • Supported by:
    Supported by the National Key Research and Development Program of China, Grant No.(2022YFF1300103); the Strategic Priority Research Program of Chinese Academy of Sciences,Grant No.(XDA26020102); the basic research project of field stations of Chinese Academy of Sciences(KFJ-SW-YW043-4)

摘要: 植物物候是生态系统响应全球气候变化的关键指示器。尽管自动成像技术能够获取海量的时序物候图像数据,但物候期的自动识别技术仍然是该领域发展的关键技术瓶颈。为解决这一问题,本研究旨在探索基于深度学习的目标检测方法,以实现对植物关键物候期的高频、自动化识别。本研究以神农架常绿阔叶混交林优势种之一的粉白杜鹃(Rhododendron hypoglaucum)为研究对象,基于2022-2025年定点自动采集的4624张物候图像,利用目标检测领域的3种代表性算法Faster R-CNN、YOLOv11和RT-DETR,分别构建物候期自动识别模型,通过对比研究筛选最优模型。结果表明,三种算法均能够识别花芽、叶芽、新叶、花、果实和黄叶等6类物候特定器官特征与数量。其中,YOLOv11 模型表现最佳,其精度为0.785、召回率为0.745,mAP50和mAP50-95分别为0.788和0.501。基于该最优模型的识别结果,我们成功实现了对粉白杜鹃花芽生长、叶芽生长、开花、新叶、果实发育与黄叶持续时间的自动判定,判定结果与人工目视解译结果高度一致。本研究证实了基于深度学习的目标检测方法在长时序植物物候原位观测中的有效性,能够可靠地获取物候关键特征与数量信息,为植物个体物候的高频、精细化、自动识别提供了一种新方法。未来,通过多目标协同监测与模型优化,有望进一步提升该方法的适用性与鲁棒性。

关键词: 物候识别, 长期监测, 深度学习, YOLOv11, 物候相机

Abstract: Plant phenology is a key indicator of how ecosystems respond to global climate change. Although automated imaging technologies can generate vast amounts of time-series phenological data, the accurate automated recognition of discrete phenological periods remains a significant methodological challenge, limiting the scalability of phenological monitoring. In this study, we focused on Rhododendron hypoglaucum, one of dominant species in the evergreen broad-leaved mixed forests of Shennongjia, China. Using 4624 automatically captured time-lapse images collected continuously from 2022 to 2025, we implemented and evaluated 3 representative object detection algorithms—Faster R-CNN, YOLOv11, and RT-DETR to develop automated recognition models for key phenophases. The performance of the models was compared to identify the most effective algorithm for establishing a high-frequency, long-term phenology recognition method. All 3 algorithms successfully detected six phenological traits: floral buds, leaf buds, flowers, new leaves, fruits, and senescent leaves. The YOLOv11 model performed best, achieving a precision of 0.785, recall of 0.745, mAP50 of 0.788, and mAP50-95 of 0.501. Based on the results of the optimal model, the phenological periods duration of R. hypoglaucum—including the duration of floral bud growth, leaf bud growth , flowering, new leaf growth, fruit growth, and yellow leaf—were automatically determined, showing a high level of consistency with manual visual interpretation. This study demonstrates that deep learning-based object detection methods can effectively realize the automated extraction of phenological traits and quantitative information from long-term, in-situ observations. It provides a novel approach for high-frequency, precise, and automated phenological recognition at the individual plant level. In the future, through multi-target collaborative monitoring and model optimization, the applicability and robustness of this method are expected to be further enhanced.

Key words: phenological recognition, long-term monitoring, deep learning, YOLOv11, phenological camera