首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于短语结构和词语词性相结合的情感分类方法
引用本文:郑亚平,施 佺.基于短语结构和词语词性相结合的情感分类方法[J].南通大学学报(自然科学版),2018,17(3):1-5.
作者姓名:郑亚平  施 佺
作者单位:南通大学 计算机科学与技术学院,江苏 南通 226019
基金项目:江苏省自然科学基金项目(BK20151272);江苏省“六大人才高峰”(2014-WLW-029);江苏省“333工程”(BRA2017475);江苏省“青蓝工程”项目
摘    要:针对传统文本分类方法忽略词语间的语义特征的问题,并为了改善输入文本的表示质量,提出一种基于短语结构和词语词性相结合的情感分类方法.该方法首先通过短语结构优化分词,可以更好地提取文本特征;其次利用Word2vec工具训练词语和词性相结合的文本语料库得到词向量模型,解决了Word2vec无法识别一词多义的问题;最后通过SVM算法对文本进行情感分类.实验结果表明,该算法能够提高文本情感分类的正确性.该方法对舆情监控、股票市场行情预测和了解消费者对产品的偏好等具有较高的实用性.

关 键 词:短语结构  词性  情感分类  Word2vec  SVM

Sentiment Classification Based on Combination of Phrase Structure and Word Parts of Speech
Authors:ZHENG Yaping  SHI Quan
Institution:School of Computer Science and Technology, Nantong University, Nantong 226019, China
Abstract:A sentiment classification based on the combination of phrase structure and word parts of speech is proposed to improve the quality of the input text and the semantic features between words. This method firstly optimizes word segmentation through phrase structure to extract text features better, and then Word2vec is used to train the text corpus combined of words and parts of speech to obtain a word vector model to solve the problem that Word2vec cannot recognize the word polysemy. Finally, the text is subjected to emotional classification through SVM algorithm. The experimental results show that the algorithm can improve the correctness of text sentiment classification. This method has high practicability for monitoring public opinion, forecasting the stock market, and understanding consumers'' product preference.
Keywords:phrase structure  parts of speech  sentiment classification  Word2vec  SVM
本文献已被 CNKI 等数据库收录!
点击此处可从《南通大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《南通大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号