中国农业气象 ›› 2026, Vol. 47 ›› Issue (4): 572-580.doi: 10.3969/j.issn.1000-6362.2026.04.008

• • 上一篇    下一篇

基于模糊逻辑和半监督学习的冰雹概率及大小识别算法

李恒升,刘忠阳,张静怡,张莉,王倩倩,冯丹   

  1. 河南省气象探测数据中心,郑州 450003
  • 收稿日期:2025-04-27 出版日期:2026-04-20 发布日期:2026-04-18
  • 作者简介:李恒升,E-mail:1264777168@qq.com
  • 基金资助:
    河南省农业气象保障与应用技术重点实验室应用技术研究基金项目(KQ202312)

Hail Probability and Size Identification Algorithm Based on Fuzzy Logic and Semi-supervised Learning

LI Heng-sheng, LIU Zhong-yang, ZHANG Jing-yi, ZHANG Li, WANG Qian-qian, FENG Dan   

  1. Henan Meteorological Observation Data Center, Zhengzhou 450003, China
  • Received:2025-04-27 Online:2026-04-20 Published:2026-04-18

摘要:

研发冰雹发生概率和冰雹大小的分类识别产品,可提高冰雹识别准确率。基于2022年河南省92个冰雹实况观测记录以及雷达数据和探空数据,选取组合反射率(CR)、基本反射率55dBZ的高度与0℃层高度差(H0)、基本反射率45dBZ的高度与20℃层高度差(H20)、垂直液态含水量(VIL)、垂直液态含水量密度(VILD)和回波顶高(ET)、差分反射率(ZDR)、差分传播相移率(KDP)和相关系数(CC)共9个特征参量,通过融合模糊逻辑和以KNN为基础弱分类器的半监督学习(FLSTKNN)算法模型对冰雹发生概率和大小分级识别,进一步降低其对建筑、农业生产和人员安全的影响。结果表明:FLSTKNN算法模型在测试集(占数据集的20%)上的准确率达到83%,其精确率80%,召回率83%,表明对多数类样本的识别高度可靠,且F1分数接近80%的优良阈值,说明该算法模型在冰雹发生概率和冰雹大小识别中具有较好效果。

关键词: 双偏振雷达, 冰雹概率, 模糊逻辑, 半监督机器学习

Abstract:

The development of a classification product for hail occurrence probability and hail size can improve the accuracy of hail identification. Based on 92 hail observation records from Henan Province in 2022, together with radar data and sounding data, nine characteristic parameters were selected: composite reflectivity(CR), height difference between 55dBZ base reflectivity and 0°C level (H₀), height difference between 45dBZ base reflectivity and −20°C level (H20), vertical integrated liquid (VIL), vertical integrated liquid density(VILD), echo top height (ET), differential reflectivity (ZDR), specific differential phase(KDP) and correlation coefficient (CC. By integrating fuzzy logic with a Semisupervised Learning algorithm based on a weak Knearest neighbor classifier (referred to as the FLSTKNN model), the probability of hail occurrence and hail size grades were identified, thereby further reducing the impacts of hail on buildings, agricultural production, and human safety. The results show that the FLSTKNN model achieved an accuracy of 83on the test set (20of the dataset). Its precision was 80and its recall was 83%, indicating high reliability in identifying majorityclass samples. Moreover, the F1score approached the excellent threshold of 80%, demonstrating that the proposed model performs well in identifying both hail occurrence probability and hail size.

Key words: Dual?polarization radar, Hail probability, Fuzzy logic, Semi?supervised machine learning