首页 / 院系成果 / 成果详情页

Hierarchical-TGDR: Combining biological hierarchy with a regularization method for multi-class classification of lung cancer samples via high-throughput gene-expression data  期刊论文  

  • 编号:
    20582ff6-8a6a-4e0b-869c-e3b60c7eadc2
  • 作者:
    Tian S., SuárezFari?as M.
  • 地址:

    [1]Division of Clinical Epidemiology, First Hospital of Jilin University, Changchun, Jilin, China.

  • 语种:
    英文
  • 期刊:
    Systems Biomedicine ISSN:2162-8130 2014 年 1 卷 4 期 (278 - 287) ; 2014
  • 收录:
  • 摘要:

    Regularization methods that simultaneously select a small set of the most relevant features and build a classifier usingthe selected features have gained much attention recently in problems of classification of "omics" data. In many multiclassclassification problems, which are of practical importance, the classes are naturally endowed with a hierarchicalstructure. However, such natural hierarchical structure is often ignored. Here, we use an existing regularization algorithm, Threshold Gradient Descent Regularization, in a hierarchical fashion, which takes advantage of natural biological structureto specifically tackle multi-class classification of microarray data. We apply this approach to one of the tasks presentedby the sbv IMPROVER Diagnostic Signature Challenge: the Lung Cancer Sub-Challenge. Gene expression data from nonsmallcell lung carcinoma were used to classify tumors into adenocarcinoma and squamous cell carcinoma subtypes, and their clinical stages (I and II). Genetic and transcriptomic differences between AC and SCC have been reported, indicating a potentially different pathological mechanism of differentiation and invasion. The results from this analysisshow that hierarchical-TGDR outperforms pairwise TGDRs in terms of predictive performance, and is substantially moreparsimonious. In conclusion, the hierarchical-TGDR approach trains classifiers in a top-down fashion by considering thenaturally existing structure within the data, reducing the number of pairwise-TGDRs to be trained. It also highlightsdifferent mechanisms of "invasion" in the two subtypes. This work suggests that incorporating known biologicalinformation into classification algorithms, such as data hierarchies, can improve the discriminative performance andbiological interpretation of this classifier.

  • 推荐引用方式
    GB/T 7714:
    Tian S. Suárez-Fari?as M., et al. Hierarchical-TGDR: Combining biological hierarchy with a regularization method for multi-class classification of lung cancer samples via high-throughput gene-expression data [J].Systems Biomedicine,2014,1(4):278-287.
  • APA:
    Tian S. Suárez-Fari?as M..(2014).Hierarchical-TGDR: Combining biological hierarchy with a regularization method for multi-class classification of lung cancer samples via high-throughput gene-expression data .Systems Biomedicine,1(4):278-287.
  • MLA:
    Tian S. Suárez-Fari?as M., et al. "Hierarchical-TGDR: Combining biological hierarchy with a regularization method for multi-class classification of lung cancer samples via high-throughput gene-expression data" .Systems Biomedicine 1,4(2014):278-287.
  • 入库时间:
    7/18/2019 11:29:22 AM
  • 更新时间:
    7/18/2019 11:29:22 AM
  • 条目包含文件:
    文件类型:PDF,文件大小:
    正在加载全文
浏览次数:8 下载次数:0
浏览次数:8
下载次数:0
打印次数:0
浏览器支持: Google Chrome   火狐   360浏览器极速模式(8.0+极速模式) 
返回顶部