A multiple feature approach for disorder normalization in clinical notes |
| |
Authors: | Chen Lü Bo Chen Chaozhen Lü Likun Qiu Donghong Ji |
| |
Affiliation: | 1.School of Computer,Wuhan University,Hubei,China;2.Department of Chinese Language and Literature,Hubei University of Art and Science,Hubei,China;3.Shandong Key Lab of Language Resource Development and Application,Ludong University,Shandong,China |
| |
Abstract: | In this paper we propose a multiple feature approach for the normalization task which can map each disorder mention in the text to a unique unified medical language system (UMLS) concept unique identifier (CUI). We develop a two-step method to acquire a list of candidate CUIs and their associated preferred names using UMLS API and to choose the closest CUI by calculating the similarity between the input disorder mention and each candidate. The similarity calculation step is formulated as a classification problem and multiple features (string features, ranking features, similarity features, and contextual features) are used to normalize the disorder mentions. The results show that the multiple feature approach improves the accuracy of the normalization task from 32.99% to 67.08% compared with the MetaMap baseline. |
| |
Keywords: | |
本文献已被 CNKI SpringerLink 等数据库收录! |
|