

Learning diagnostic signatures from microarray data using L1-regularized logistic regression


P. Nandy, M. Unger, Kushal K. Dey, C. Zechner, H. Koeppl

Systems Biomedicine, vol. 1, no. 4, pp. 51-57

Making reliable diagnoses and predictions based on high-throughput transcriptional data has attracted immense attention in the past few years. While experimental gene profiling techniques, such as microarray platforms, are advancing rapidly, there is an increasing demand for computational methods that can handle such data efficiently. In this work we propose a computational workflow for extracting diagnostic gene signatures from high-throughput transcriptional profiling data. Our research was performed within the scope of the first IMPROVER challenge, whose goal was to extract and verify diagnostic signatures based on microarray gene expression data in four disease areas: psoriasis, multiple sclerosis, chronic obstructive pulmonary disease and lung cancer. Each disease area is handled using the same three-stage algorithm. First, the data are normalized using the robust multi-array average (RMA) procedure to account for variability among different samples and data sets. Second, because of the vast dimensionality of the profiling data, we perform a feature pre-selection based on the Wilcoxon rank-sum statistic. Third, the remaining features are used to train an L1-regularized logistic regression model, which acts as our primary classifier. Using the four data sets, we analyze the proposed method and demonstrate its use in extracting diagnostic signatures from microarray gene expression data.
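
The last two stages of the workflow described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation. The data are synthetic and stand in for already normalized expression values (RMA normalization is omitted). The function names, the number of pre-selected features (k=5), the penalty weight (lam), the step size and the proximal-gradient solver are all assumptions chosen for illustration.

```python
import math
import random

def rank_sum_z(a, b):
    """z-score of the Wilcoxon rank-sum statistic of a versus b (normal approximation)."""
    pooled = sorted([(v, 0) for v in a] + [(v, 1) for v in b])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        for k in range(i, j + 1):
            ranks[k] = (i + j) / 2 + 1  # tied values share their average rank
        i = j + 1
    w = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    n1, n2 = len(a), len(b)
    mean = n1 * (n1 + n2 + 1) / 2
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # variance without tie correction
    return (w - mean) / sd

def preselect(X, y, k):
    """Indices of the k features with the largest absolute rank-sum z-score."""
    def score(j):
        cases = [row[j] for row, label in zip(X, y) if label == 1]
        controls = [row[j] for row, label in zip(X, y) if label == 0]
        return abs(rank_sum_z(cases, controls))
    return sorted(range(len(X[0])), key=score, reverse=True)[:k]

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def soft_threshold(v, t):
    """Proximal operator of t * |v|; shrinks v toward zero."""
    return math.copysign(max(abs(v) - t, 0.0), v)

def fit_l1_logistic(X, y, lam=0.05, step=0.1, iters=300):
    """Minimize mean log-loss + lam * ||w||_1 by proximal gradient descent."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(iters):
        grad_w, grad_b = [0.0] * p, 0.0
        for row, label in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, row))) - label
            grad_b += err / n
            for j in range(p):
                grad_w[j] += err * row[j] / n
        # Gradient step on the smooth loss, then soft-thresholding for the L1 term.
        w = [soft_threshold(wj - step * gj, step * lam) for wj, gj in zip(w, grad_w)]
        b -= step * grad_b  # the intercept is not penalized
    return w, b

# Synthetic stand-in for normalized data: 40 samples, 30 "genes";
# genes 0 and 1 are shifted up in cases (label 1) and down in controls.
random.seed(0)
y = [i % 2 for i in range(40)]
X = [[random.gauss(0, 1) + ((1.5 if label else -1.5) if j < 2 else 0.0)
      for j in range(30)] for label in y]

selected = preselect(X, y, k=5)
X_sel = [[row[j] for j in selected] for row in X]
w, b = fit_l1_logistic(X_sel, y)
pred = [int(sigmoid(b + sum(wj * xj for wj, xj in zip(w, row))) >= 0.5) for row in X_sel]
accuracy = sum(p == t for p, t in zip(pred, y)) / len(y)
print(selected, [round(wj, 3) for wj in w], accuracy)
```

Features whose fitted weight is nonzero form the candidate signature. The accuracy printed here is measured on the training data only, so any real use would need held-out data or cross-validation to judge the signature and to choose k and lam.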


% Autogenerated BibTeX entry
@Article { NanEtal:2013:IFA_4619,
    author={P. Nandy and M. Unger and Kushal K. Dey and C. Zechner and H. Koeppl},
    title={{Learning diagnostic signatures from microarray data using
	  L1-regularized logistic regression}},
    journal={Systems Biomedicine},