首页 | 本学科首页   官方微博 | 高级检索  
     检索      

图文决策融合的多模态电商垃圾评价检测
引用本文:沈学利,赵科林,李世银.图文决策融合的多模态电商垃圾评价检测[J].重庆邮电大学学报(自然科学版),2021,33(6):1038-1046.
作者姓名:沈学利  赵科林  李世银
作者单位:中国矿业大学 信息与控制工程学院,江苏 徐州221116;辽宁工程技术大学 电子与信息工程学院,辽宁 葫芦岛125105;辽宁工程技术大学 电子与信息工程学院,辽宁 葫芦岛125105;中国矿业大学 信息与控制工程学院,江苏 徐州221116
基金项目:国家自然科学基金(61772249)
摘    要:现有的电商垃圾评价检测方法大多基于对评价文本信息进行分析,难以有效检测带有图片的多模态垃圾评价,为充分利用评价的图片和文本内容,提出了基于Transformer双向编码表示(bidirectional encoder representa-tions from transformer,BERT)和宽残差网络(wide residual networks,WRN)的图文融合决策检测方法.该方法利用评价文本对经过预训练的BERT模型进行微调训练,经过表示学习分类得到文本评价类别向量,使用宽残差网络对评价图片进行特征提取和分类并输出图片类别向量,将得到的对应评价图文类别向量共同输入启发式决策融合分类器,对多模态评价整体进行预测分类.使用真实电商评价数据集进行实验表明,相比面向评价文本的分类方法,图文融合决策检测方法对多模态评价分类的精准率提高4.44%,召回率提高2.12%,Micro-F1提高3.67%,结果证实该方法能够对多模态垃圾评价进行有效检测.

关 键 词:多模态数据  垃圾评价  图文融合  预训练模型
收稿时间:2020/4/7 0:00:00
修稿时间:2021/10/22 0:00:00

Multi-modal review spam detection via image-text fusion decision
SHEN Xueli,ZHAO Kelin,LI Shiyin.Multi-modal review spam detection via image-text fusion decision[J].Journal of Chongqing University of Posts and Telecommunications,2021,33(6):1038-1046.
Authors:SHEN Xueli  ZHAO Kelin  LI Shiyin
Institution:School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, P. R. China;School of Electronics and Information Engineering, Liaoning Technical University, Huludao 125105, P. R. China
Abstract:Most of the existing review spam detection methods are based on the analysis of review text information, which is difficult to effectively detect the multi-modal review spam. The image-text fusion detecting method based on bidirectional encoder representations from transformer (BERT) and wide residual network (WRN) is proposed to make full use of the reviews with image and text content. This method uses the review text to fine-tune the pre-trained BERT model, obtains the text review category vector by representation learning classification, uses the wide residual network to extract and classify the evaluation pictures, outputs the picture category vector, and inputs the corresponding evaluation picture category vector into the heuristic decision fusion classifier. Finally, the multimodal evaluation as a whole is predicted and classified. Experiments using real e-commerce evaluation data sets show that compared with the evaluation text oriented classification method, the image text fusion decision detection method improves the accuracy of multimodal evaluation and classification by 4.44%, the recall rate by 2.12% and micro-F1 by 3.67%. The results show that this method can effectively detect multimodal review spam.
Keywords:multi-modal data  review spam  image-text fusion  pre-trained model
本文献已被 万方数据 等数据库收录!
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号