首页 | 本学科首页   官方微博 | 高级检索  
     

PCA-FA: Applying Supervised Learning to Analyze Gene Expression Data
引用本文:翁时锋 张长水 张学工. PCA-FA: Applying Supervised Learning to Analyze Gene Expression Data[J]. 清华大学学报, 2004, 9(4): 428-434
作者姓名:翁时锋 张长水 张学工
作者单位:StateKeyLaboratoryofIntelligentTechnologyandSystems,DepartmentofAutomation,TsinghuaUniversity,Beijing100084,China
摘    要:In previous gene expression data analyses, supervised learning has mainly focused on the classification of attribute data, such as the different experimental conditions, different known classes of the same tumor and sex, However, supervised learning classification is not suitable for interval-scaled attributes such as age and survival outcome of cancer patients, For this problem, this paper proposed a new method by combining two well-known methods: principal component analysis (PCA) and Fisher analysis (FA). The method, PCA-FA, realizes supervised learning with two types of attributes (nominal attributes and intervalscaled attributes). The fuzzy FA was introduced to model the interval-scaled attributes. In this paper, an approximate linear relationship between gene expression data of lung adenocarcinoma patients and survival outcome is successfully revealed by PCA-TA.

关 键 词:监督学习 基因表达数据 主成分分析 微阵列技术 判别式信息提取

PCA-FA:Applying Supervised Learning to Analyze Gene Expression Data
WENG Shifeng,ZHANG Changshui,ZHANG Xuegong State Key Laboratory of Intelligent Technology and Systems. PCA-FA:Applying Supervised Learning to Analyze Gene Expression Data[J]. Tsinghua Science and Technology, 2004, 9(4): 428-434
Authors:WENG Shifeng  ZHANG Changshui  ZHANG Xuegong State Key Laboratory of Intelligent Technology  Systems
Affiliation:WENG Shifeng,ZHANG Changshui,ZHANG Xuegong State Key Laboratory of Intelligent Technology and Systems,Department of Automation,Tsinghua University,Beijing 100084,China
Abstract:In previous gene expression data analyses, supervised learning has mainly focused on the clas-sification of attribute data, such as the different experimental conditions, different known classes of the same tumor and sex. However, supervised learning classification is not suitable for interval-scaled attributes, such as age and survival outcome of cancer patients. For this problem, this paper proposed a new method by combining two well-known methods: principal component analysis (PCA) and Fisher analysis (FA). The method, PCA-FA, realizes supervised learning with two types of attributes (nominal attributes and interval-scaled attributes). The fuzzy FA was introduced to model the interval-scaled attributes. In this paper, an ap-proximate linear relationship between gene expression data of lung adenocarcinoma patients and survival outcome is successfully revealed by PCA-TA.
Keywords:supervised learning  gene expression data  principal component analysis  Fisher analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号