中国农业气象 ›› 2026, Vol. 47 ›› Issue (4): 558-571.doi: 10.3969/j.issn.1000-6362.2026.04.007

• 农业生物气象栏目 • 上一篇    下一篇

基于机器学习的水稻始穗期预测方法

任义方,朱凤,陈思宁   

  1. 1.江苏省气候中心,南京 210008;2.江苏省绿色食品办公室,南京 210017;3.中国气象科学研究院,北京 100081
  • 收稿日期:2025-03-07 出版日期:2026-04-20 发布日期:2026-04-18
  • 作者简介:任义方, E-mail:renyifang2006@126.com
  • 基金资助:
    “十四五”国家重点研发计划项目“作物干旱高低温灾害预警预测与防控技术研发及集成示范”(2022YFD2300200)

Methods for Prediction Rice Initial Panicle Date Based on Machine Learning Algorithm

REN Yi-fang, ZHU Feng, CHEN Si-ning   

  1. 1.Jiangsu Climate Center, Nanjing 210008, China; 2.Jiangsu Green Food Office, Nanjing 210017; 3. Chinese Academic of Meteorological Sciences, Beijing 100081
  • Received:2025-03-07 Online:2026-04-20 Published:2026-04-18

摘要:

水稻始穗期是稻曲病、穗稻瘟、纹枯病等病害病菌侵入谷粒等器官造成稻米品质降低产量减少的关键期。为提高水稻病害防治针对性,更好满足预防为主,综合防治,绿色控害,减药增效的要求,本文以江苏单季稻为例,利用历史气象资料和水稻生育期观测资料,在分析水稻始穗期特征及其关键影响因子基础上,应用主成分分析法(PCA)、误差反向传播神经网络算法BP)、随机森林算法(RF)研究水稻始穗期的预估方法。设置4组模拟方案分别建立水稻始穗期预测模型,以决定系数、均方根误差作为评判指标,对模型精度及其普适性进行分析评价。结果表明苏北、苏中、苏南地区水稻始穗期的跨度分别在84−31日、89−918和816−920日,各区平均标准差分别为4d6d和5d;江苏各区影响水稻始穗期的关键因子基本一致,水稻始穗前3个生育期日序最为关键,播种分蘖、分蘖拔节、拔节孕穗三个生育阶段的温度类因子重要性明显大于降水和日照类因子;与基于RF算法模型相比,基于BP算法的模型模拟精度更高,且对PCA处理后消除相关性的预测因子具有更好的接纳性,对江苏各区水稻始穗期模拟预测误差均在2d以内,预测提前量在10d左右,可为准确把握水稻病害防治关键期提供技术支撑。

关键词: 水稻, 始穗期, 主成分分析, 随机森林算法, 误差反向传播神经网络算法

Abstract:

The initial panicle date of rice is a critical period during which diseasecausing bacteria, such as rice smut, rice blast disease and sheath blight, invade organs like grains, leading to reduce the rice quality and lower yield In order to improve the ability of rice disease control and better meet the requirements of "prevention first, comprehensive control, green damage control, drug reduction and efficiency", taking Jiangsu as an example, using historical meteorological data and rice growth period observation data, based on the analysis of the characteristics and key influencing factors of the beginning panicle period, four groups of simulation schemes were set up by using principal component analysis, error back propagation neural network algorithm and random forest algorithm to establish the prediction model of the beginning panicle period of rice. In addition, the coefficient of determination and root mean square error were taken as the evaluation indices to analyze and evaluate the accuracy and compatibility of the model. The results showed that: the initial panicle stages of rice in northern Jiangsu, central Jiangsu and southern Jiangsu were concentrated from August 4 to 31, August 9 to September 18 and August 16 to September 20. The average standard deviation of each region was 4d, 6d and 5d respectively. The key factors affecting the initial panicle stage of rice in Jiangsu were basically the same. The days sequence of the three growth stages before the initial panicle stage of rice was the most critical. The temperature factors was significantly more important than the precipitation and sunshine hours factors during three growth stages, from sowing to tillering, from tillering to jointing and from jointing to booting. Compared with the model based on RF algorithm, the model based on BP algorithm had higher simulation accuracy, and had better "acceptability" for the predictors with weakened correlations after PCA treatment. In addition, based on the proposed method, the simulation prediction error of rice initial panicle stage of rice epidemic in all regions of Jiangsu was within 2d, with a prediction advance of about 10d, which can provide a reference basis for accurately capturing the key period of rice disease control. 

Key words: Rice, Initial panicle stage, Principal component analysis, Random forest algorithm, Neural network algorithm