首页 | 本学科首页   官方微博 | 高级检索  
     检索      

云环境中大数据挖掘的有效花费研究
引用本文:朱小栋,徐怡,魏紫钰.云环境中大数据挖掘的有效花费研究[J].上海理工大学学报,2020,42(3):247-252.
作者姓名:朱小栋  徐怡  魏紫钰
作者单位:上海理工大学管理学院,上海 200093;上海理工大学管理学院,上海 200093;上海理工大学管理学院,上海 200093
基金项目:国家自然科学基金资助项目(71771152);上海市人民政府发展研究中心“基于互联网+的上海创新发展”研究基地决策咨询研究项目(2019-YJ-L04-A);2020上海市教委高校智库内涵建设项目
摘    要:为平衡云计算资源的租用量与云环境中数据挖掘的计算结果准确率,得到最优的性价比,以监督式学习的卷积神经网络(CNN)为例,探究了CNN迭代次数与准确率的演化规律。选择经典图像数据集MNIST和文本数据集IMDB作为代表展开实验,发现在不同类型的数据集中,当CNN迭代接近最优解时,每提高很小的准确率,耗费的机时陡增,称之为长尾现象。验证在真实云环境中,当大数据挖掘的长尾现象发生且满足企业准确率需求的情况下,选择提前结束取代最高精度时结束,均可以节省大量云资源成本。研究结果对于合理运用云计算资源,降低云服务租用成本,具有实用价值与现实意义。

关 键 词:云计算资源  有效花费  卷积神经网络  长尾现象
收稿时间:2019/3/18 0:00:00

Effective cost of big data mining in cloud environment
ZHU Xiaodong,XU Yi,WEI Ziyu.Effective cost of big data mining in cloud environment[J].Journal of University of Shanghai For Science and Technology,2020,42(3):247-252.
Authors:ZHU Xiaodong  XU Yi  WEI Ziyu
Institution:Business School, University of Shanghai for Science and Technology, Shanghai 200093, China
Abstract:In order to balance the renting quantity of cloud computing resources and the accuracy of data mining in cloud, the optimum cost performance ratio is obtained. Taking the convolution neural network (CNN) as an example, the evolution patterns of the number of iterations and accuracy of CNN was explored. A lot of experiments were performed upon the image dataset MNIST and the text dataset IMDB. The results show that in different types of data sets, the machine time consumed increases sharply with a small increase in accuracy when the optimal solution is approached, which is called the long tail phenomena. Correspondingly, in the real cloud environment, when the long tail phenomenon of big data mining occurs and the accuracy is satisfied, terminating the performance of CNN in cloud in advance rather than at the convergence time can save a lot of cloud resource costs. The results have practical value and practical significance for the rational use of cloud computing resources and the reduction of cloud rental cost.
Keywords:cloud computing resources  effective cost  convolutional neural network  long tail phenomenon
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《上海理工大学学报》浏览原始摘要信息
点击此处可从《上海理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号