首页 | 本学科首页   官方微博 | 高级检索  
     

MapReduce框架下基于线性回归的短期负荷预测
引用本文:吴丽珍,孔纯,陈伟. MapReduce框架下基于线性回归的短期负荷预测[J]. 兰州理工大学学报, 2021, 47(1): 97-104
作者姓名:吴丽珍  孔纯  陈伟
作者单位:兰州理工大学电气工程与信息工程学院, 甘肃兰州 730050;兰州理工大学电气工程与信息工程学院, 甘肃兰州 730050;兰州理工大学电气工程与信息工程学院, 甘肃兰州 730050
基金项目:国家自然科学基金(62063016),甘肃省基础研究创新群体项目(18JR3RA133)
摘    要:为解决负荷预测时因数据量大、数据种类繁多带来的计算速度慢、预测精度低等问题,在MapReduce并行编程框架下,提出基于小批量随机梯度下降法的线性回归模型.首先,为清理智能配电终端产生的重复数据和不良数据,提出利用自适应近邻排序算法清除重复记录的数据,并利用K均值聚类的方法剔除异常数据和记录不完整的数据,然后利用F检验...

关 键 词:大数据分析  小批量随机梯度下降  短期负荷预测  分布式并行计算  MapReduce框架
收稿时间:2019-11-11

Short-term load forecasting based on linear regression under MapReduce framework
WU Li-zhen,KONG Chun,CHEN Wei. Short-term load forecasting based on linear regression under MapReduce framework[J]. Journal of Lanzhou University of Technology, 2021, 47(1): 97-104
Authors:WU Li-zhen  KONG Chun  CHEN Wei
Affiliation:College of Electrical and Information Engineering, Lanzhou Univ. of Tech., Lanzhou 730050, China
Abstract:In order to solve problems of slow calculation speed and low prediction accuracy caused by large amount of data and various kinds of data in load forecasting, a linear regression model based on small batch random gradient descent method is proposed in this paper under the framework of MapReduce parallel programming. First of all, in order to clean up repetitive data and bad data generated by the intelligent distribution terminal, the adaptive nearest neighbor sorting algorithm is proposed to remove the repeated data, and accordingly the K-means clustering method is used to eliminate abnormal data and incomplete data. The F-test method is then employed to test whether a data set can represent the load linearly. The T-test method is further adopted to test the significance of linear relationship between the characteristic vector and the load. Finally, any characteristic vector with weak linear relationship with the load is accordingly eliminated as a result. According to the above methods, a short-term load forecasting model is established and applied to the short-term load forecasting of distribution network in Wuwei, Gansu Province. Results from the prediction show that the average absolute percentage error of the proposed short-term load forecasting model is about 2.043%, and the root mean square error takes about 3 112.62. These forecasted errors meet the requirements of load forecasting, and not all improve greatly the speed of load calculation but also shorten the time for load forecasting.
Keywords:big data analyses  mini-batch stochastic gradient descent  short-term load forecasting  distributed parallel computing  MapReduce framework  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《兰州理工大学学报》浏览原始摘要信息
点击此处可从《兰州理工大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号