COMPARATIVE ANALYSIS OF SHRINKAGE COVARIANCE MATRIX USING MICROARRAYS DATA

Authors

  • Suryaefiza Karjanto Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Melaka (Kampus Jasin), Melaka, Malaysia
  • Norazan Mohamed Ramli Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia
  • Nor Azura Md Ghani Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia

DOI:

https://doi.org/10.11113/jt.v78.8319

Keywords:

Hotelling'sT2, gene set analysis, shrinkage covariance matrix

Abstract

The DNA microarray technologies permit scientists to depict the expression of genes for related samples.  This relationship between genes is analysed using Hotelling’s T2 as a multivariate test statistic but the disadvantage of this test, when used in microarray studies is the number of samples is larger than the number of variables.  This study discovers the potential of the shrinkage approach to estimate the covariance matrix specifically when the high dimensionality problem happened.  Consequently, the sample covariance matrix in Hotelling’s T2 statistic is not positive definite and become singular thus cannot be inverted.  In this research, the Hotelling’s T2 statistic is combined with a shrinkage approach as an alternative estimation to estimate the covariance matrix to detect significant gene sets.  The multivariate test statistic of classical Hotelling's T2 is used to integrate the correlation when assessing changes in activity level across biological conditions.  The performances of the proposed methods were assessed using real data study.  Shrinkage covariance matrix approach indicates a better result for detection of differentially expressed gene sets as compared to other methods.

References

Alwine, J. C., Kemp, D. J., and Stark, G. R. 1992. Method for Detection of Specific RNA’s in Agarose Gels by Transfer to Diaobenzyloxymenthyl-Paper and Hybridization With DNA Probes. Proc Natl Acad Sci. 74. 5350-5353.

Liang, P., and Pardee, A. B. 1992. Differential Display of Eukaryotic Messenger RNA by Means of the Polymerase Chain Reaction. Science. 257. 967-971.

Velculescu, V. L., Zhang, B., Vogelstein., and Kinzler, K. 1995. Serial Analysis of Gene Expression. Science, 270, 484–487

Mary-Huard, T. Daudin, J. J., Robin, S., Bitton, F., Cabannes, E., and Hilson, P. 2004. Spotting Effect in Microarray Experiments. BMC Bioinformatics, 5, 63. Math. Statist. And Prob. 1. 361–379.

Karjanto, S., Ramli, N. M., Aripin, R. and Ghani, N. A. M. 2014. Improved Statistical Test using Shrinkage Covariance Matrix For Identifying Differential Gene Sets. Journal Of Applied Environmental And Biological Sciences. 1. 302-310

Schäfer J. and Strimmer K., A. 2005. Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics, Statistical Applications In Genetics And Molecular Biology. 4(1). 32.

Ledoit O. and Wolf M. 2003. Improved Estimation of the Covariance Matrix of Stock Returns with An Application to Portfolio Selection, Journal Of Empirical Finance. 10.5. 603-621.

Ledoit O. & Wolf M. 2004. Honey, I Shrunk the Sample Covariance Matrix. The Jurnal Of Portfolio Management. 31(1), 110-119.

Ledoit O. and Wolf M. 2003. A Well-Conditioned Estimator for Large Dimensional Covariance Matrices, Journal Of Multivariate Analysis. 88. 365–411.

Yates P. D. and Reimers M. A. 2009. RCMAT: A Regularized Covariance Matrix Approach to Testing Gene Sets, BMC Bioinformatics. 10. 300.

Mootha V. K., Lindgren C. M., Eriksson K.F., Subramanian A., Sihag S., Lehar L., Puigserver P., Carlsson E., Ridderstraale M., Laurila E. et al. 2003. Pgc-1 Α-Responsive Genes Involved in Oxidative Phosphorylation Are Coordinately Downregulated in Human Diabetes. Nature Genetics 34(3). 267–273.

Kong S. W., Pu W. T. and Park P.J. 2006. A Multivariate Approach for Integrating Genome-Wide Expression Data and Biological Knowledge. Bioinformatics. 22(19), 2373-2380.

Gallagher, I. J., Scheele, C., Keller, P., Nielsen, A. R., Remenyi, J., Fischer, C. P., et al. 2010. Integration of Microrna Changes in Vivo Identifies Novel Molecular Features of Muscle Insulin Resistance in Type 2 Diabetes. Genome Med. 2(2). 9-9

Downloads

Published

2016-04-18

How to Cite

COMPARATIVE ANALYSIS OF SHRINKAGE COVARIANCE MATRIX USING MICROARRAYS DATA. (2016). Jurnal Teknologi (Sciences & Engineering), 78(4-4). https://doi.org/10.11113/jt.v78.8319