首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于随机森林的热轧带钢质量分析与预测方法
引用本文:纪英俊,勇晓玥,刘英林,刘士新.基于随机森林的热轧带钢质量分析与预测方法[J].东北大学学报(自然科学版),2019,40(1):11-15.
作者姓名:纪英俊  勇晓玥  刘英林  刘士新
作者单位:东北大学 信息科学与工程学院,辽宁 沈阳,110819;上海宝信软件股份有限公司 大数据事业部,上海,201203
基金项目:国家重点研发计划项目(2017YFB0306401); 国家自然科学基金资助项目(61573089).
摘    要:以某钢铁企业的热轧带钢生产实际数据作为分析对象,基于改进的随机森林算法分析工艺参数与产品质量间的隐含关系,进行影响产品质量关键工艺参数的特征提取,建立热轧带钢产品缺陷预测模型.实验结果表明,对非平衡数据集进行平衡处理可以提高样本预测精度;采用CART与C4.5相结合的方法比单一方法可以进一步提升预测精度;同时根据特征的高相关与低相关特性,将互信息作为评价指标应用于特征选择,可以提升随机森林算法的分类效果.在以上三种改进策略下,热轧带钢缺陷的识别率得到明显提高.

关 键 词:热轧带钢  缺陷预测  数据驱动  特征提取  随机森林
收稿时间:2018-04-19
修稿时间:2018-04-19

Random Forest Based Quality Analysis and Prediction Method for Hot-Rolled Strip
JI Ying-jun,YONG Xiao-yue,LIU Ying-lin,LIU Shi-xin.Random Forest Based Quality Analysis and Prediction Method for Hot-Rolled Strip[J].Journal of Northeastern University(Natural Science),2019,40(1):11-15.
Authors:JI Ying-jun  YONG Xiao-yue  LIU Ying-lin  LIU Shi-xin
Institution:1. School of Information Science & Engineering, Northeastern University, Shenyang 110819, China; 2. Big Data Department, Shanghai Baosight Software Co., Ltd., Shanghai 201203, China.
Abstract:The process data of hot-rolled strips from an iron and steel enterprise were analyzed to find out the inherent relationship between process parameters and production quality by using an improved random forests algorithm. After critical features being extracted, a defect prediction model was built. According to the experiment, balancing operation can improve the prediction accuracy of the imbalanced data sets. Meanwhile, the combination of CART and C4.5 can further improve the prediction accuracy than each single method. Furthermore, in consideration of the characteristics whose features have high or low correlations with the response variable, mutual information was introduced as an evaluation criterion for feature selection. Mutual information makes great contribution to classification effect of random forest algorithm, and recognition rate of defects of hot-rolled strips is obviously improved by using three strategies.
Keywords:hot-rolled strip  defect prediction  data driven  feature selection  random forests  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号