Pierre Massion
Faculty Member

Machine learning models for lung cancer classification using array comparative genomic hybridization.

Aliferis CF, Hardin D, Massion PP
Proc AMIA Symp. 2002: 7-11

PMID: 12463776 · PMCID: PMC2244172

Array comparative genomic hybridization (array CGH) is a recently introduced technology that measures changes in the copy number of hundreds of genes in a single experiment. The primary goal of this study was to develop machine learning models that classify non-small cell lung cancers by histopathological type and to compare several machine learning methods on this task. DNA from the tumors of 37 patients (21 squamous carcinomas and 16 adenocarcinomas) was extracted and hybridized onto a 452-BAC-clone array. The following algorithms were used: k-nearest neighbors (KNN), decision tree induction, support vector machines, and feed-forward neural networks. Performance was measured via leave-one-out classification accuracy. The best multi-gene model found had a leave-one-out accuracy of 89.2%. Decision trees performed worse than the other methods on this task and dataset. We conclude that gene copy numbers as measured by array CGH are, collectively, an excellent indicator of histological subtype. Several interesting research directions are discussed.
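
The leave-one-out evaluation named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the data are synthetic stand-ins (random numbers, not array CGH measurements), only a plain k-nearest-neighbors classifier is shown, and the names `knn_predict`, `leave_one_out_accuracy`, `fake_sample`, the value `k=3`, and the feature count are all assumptions made for illustration. Only the class sizes (21 and 16) come from the abstract.

```python
import math
import random


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k training samples nearest to x (Euclidean distance)."""
    nearest = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))[:k]
    votes = {}
    for i in nearest:
        votes[train_y[i]] = votes.get(train_y[i], 0) + 1
    return max(votes, key=votes.get)


def leave_one_out_accuracy(X, y, k=3):
    """Hold out each sample once, fit on the remainder, and score the held-out prediction."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if knn_predict(train_X, train_y, X[i], k) == y[i]:
            correct += 1
    return correct / len(X)


# Synthetic stand-in data (assumption): 21 + 16 samples with a handful of
# made-up features, NOT real copy-number measurements.
rng = random.Random(0)
N_FEATURES = 5  # hypothetical; the study's array had 452 BAC clones


def fake_sample(shift):
    return [rng.gauss(shift, 1.0) for _ in range(N_FEATURES)]


X = [fake_sample(0.0) for _ in range(21)] + [fake_sample(1.5) for _ in range(16)]
y = ["squamous"] * 21 + ["adeno"] * 16

acc = leave_one_out_accuracy(X, y)
print(f"Leave-one-out accuracy on synthetic data: {acc:.1%}")
```

With 37 samples, each held-out prediction contributes 1/37 to the score, so the reported 89.2% is consistent with 33 of 37 tumors classified correctly (33/37 ≈ 0.892).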

MeSH Terms (8)

Algorithms; Artificial Intelligence; Carcinoma, Non-Small-Cell Lung; DNA, Neoplasm; Feasibility Studies; Humans; Lung Neoplasms; Nucleic Acid Hybridization
