Genome Feature Optimization and Coronary Artery Disease Prediction using Cuckoo Search
Keywords:Gene expression cad genes, Coronary Artery Disease, Computer Intelligence, Predictive Analysis, Classification
CVD (Cardiovascular Diseases) is among the major health ailment issue leading to millions of deaths every year. CVDs are resulting as an outcome of implications in terms of environmental and the genetic factors that result in the CVD for individuals. Phenomenal advancements that has taken place in the diagnosis solutions like usage of genomic tools are contributing towards predicting and diagnosis of heart diseases more accurately. In recent past, analyzing gene expression data, particularly using machine learning strategies to predict and classify the given unlabeled gene expression record is a generous research issue. Concerning this, a substantial requirement is feature optimization, which is since the overall genes observed in human body are closely 25000 and among them 636 are cardio vascular related genes. Hence, it complexes the process of training the machine learning models using these entire cardio vascular gene features. Hence, this manuscript is using bidirectional pooled variance strategy of ANOVA standard to select optimal features. Along the side to surpass the constraint observed in traditional classifiers, which is unstable accuracy at k-fold cross validation, this manuscript proposed a classification strategy that build upon the swarm intelligence technique called cuckoo search. The experimental study indicating that the number of optimal features those selected by proposed model is substantially low that compared to the other contemporary model that selects features using Forward Feature Selection and classifies using SVM classifier (FFS&SVM). The experimental study evinced that the proposed model, which selects feature by bidirectional pooled variance estimation and classifies using proposed classification strategy that build on cuckoo search (BPVE&CS) outperformed the selected contemporary model (FFS&SVM).
Thom T, Haase N, Rosamond W, Howard VJ, Rumsfeld J, Manolio T, Zheng ZJ, Flegal K, O'Donnell C, Kittner
S, Lloyd-Jones D. Heart disease and stroke statistics--2006 update: a report from the American Heart
Association Statistics Committee and Stroke Statistics Subcommittee. Circulation. 2006 Feb 14; 113(6):e85.
Thygesen K, Alpert JS, White HD, Jaffe AS, Apple FS, Galvani M, Katus HA, Newby LK, Ravkilde J,
Chaitman B, Clemmensen PM. Universal definition of myocardial infarctionKristian Thygesen, Joseph S. Alpert
and Harvey D. White on behalf of the Joint ESC/ACCF/AHA/WHF Task Force for the Redefinition of
Myocardial Infarction. European heart journal. 2007 Oct 1; 28(20):2525-38.
Eggers KM, Lind L, Venge P, Lindahl B. Will the universal definition of myocardial infarction criteria result in
an overdiagnosis of myocardial infarction? The American journal of cardiology. 2009 Mar 1; 103(5):588-91.
Wang Z, Luo X, Lu Y, Yang B. miRNAs at the heart of the matter. Journal of Molecular Medicine. 2008 Jul 1;
De Planell-Saguer M, Rodicio MC. Detection methods for microRNAs in clinic practice. Clinical biochemistry.
Jul 31; 46(10):869-78.
Melander O, Newton-Cheh C, Almgren P, Hedblad B, Berglund G, Engström G, Persson M, Smith JG,
Magnusson M, Christensson A, Struck J. Novel and conventional biomarkers for prediction of incident
cardiovascular events in the community. Jama. 2009 Jul 1; 302 (1):49-57.
Shah T, Casas JP, Cooper JA, Tzoulaki I, Sofat R, McCormack V, Smeeth L, Deanfield JE, Lowe GD, Rumley
A, Fowkes FG. Critical appraisal of CRP measurement for the prediction of coronary heart disease events: new
data and systematic review of 31 prospective cohorts. International journal of epidemiology. 2009 Feb 1;
Wilson PW, Pencina M, Jacques P, Selhub J, D'Agostino R, O'Donnell CJ. C - reactive protein and
Reclassification of Cardiovascular Risk in the Framingham Heart StudyCLINICAL PERSPECTIVE.
Circulation: Cardiovascular Quality and Outcomes. 2008 Nov 1; 1(2):92-7.
Pedrotty DM, Morley MP, Cappola TP. Transcriptomic biomarkers of cardiovascular disease. Progress in
cardiovascular diseases. 2012 Aug 31; 55(1):64-9.
Randi AM, Biguzzi E, Falciani F, Merlini P, Blakemore S, Bramucci E, Lucreziotti S, Lennon M, Faioni EM,
Ardissino D, Mannucci PM. Identification of differentially expressed genes in coronary atherosclerotic plaques
from patients with stable or unstable angina by cDNA array analysis. Journal of Thrombosis and Haemostasis.
Apr 1; 1(4):829-35.
Archacki SR, Angheloiu G, Tian XL, Tan FL, DiPaola N, Shen GQ, Moravec C, Ellis S, Topol EJ, Wang Q.
Identification of new genes differentially expressed in coronary artery disease by expression profiling.
Physiological genomics. 2003 Sep 29; 15(1):65-74.
Elashoff MR, Wingrove JA, Beineke P, Daniels SE, Tingley WG, Rosenberg S, Voros S, Kraus WE, Ginsburg
GS, Schwartz RS, Ellis SG. Development of a blood-based gene expression algorithm for assessment of
obstructive coronary artery disease in non-diabetic patients. BMC medical genomics. 2011 Mar 28; 4(1):26.
Kittleson MM, Shui QY, Irizarry RA, Minhas KM, Edness G, Conte JV, Parmigiani G, Miller LW, Chen Y,
Hall JL, Garcia JG. Identification of a gene expression profile that differentiates between ischemic and no
ischemic cardiomyopathy. Circulation. 2004 Nov 30; 110(22):3444-51.
Kittleson MM, Minhas KM, Irizarry RA, Shui QY, Edness G, Breton E, Conte JV, Tomaselli G, Garcia JG,
Hare JM. Gene expression analysis of ischemic and nonischemic cardiomyopathy: shared and distinct genes in
the development of heart failure. Physiological genomics. 2005 May 11; 21(3):299-307.
Min KD, Asakura M, Liao Y, Nakamaru K, Okazaki H, Takahashi T, et al. Identification of genes related to
heart failure using global gene expression profiling of human failing myocardium. Bioch and Biophy Res
Comm. 2010; 393:55–60.
Suresh R, Li X, Chiriac A, Goel K, Terzic A, Perez-Terzic C, Nelson TJ. Transcript me from circulating cells
suggests dysregulated pathways associated with long-term recurrent events following first-time myocardial
infarction. Journal of molecular and cellular cardiology. 2014 Sep 30; 74:13-21.
Liew CC, Ma J, Tang HC, Zheng R, Dempsey AA. The peripheral blood transcript me dynamically reflects
system wide biology: a potential diagnostic tool. Journal of Laboratory and Clinical Medicine. 2006 Mar 31; 147
Kazmi N, Gaunt TR. Diagnosis of coronary heart diseases using gene expression profiling; stable coronary
artery disease, cardiac ischemia with and without myocardial necrosis. PloS one. 2016 Mar 1; 11 (3):e0149475.
Yu H, Ni J, Zhao J. ACOSampling: An ant colony optimization-based under sampling method for classifying
imbalanced DNA microarray data. Neurocomputing. 2013 Feb 4; 101:309 -18.
Uzer MS, Yilmaz N, Inan O. Feature selection method based on artificial bee colony algorithm and support
vector machines for medical datasets classification. The Scientific World Journal. 2013 Jul 28; 2013.
Arafat H, Elawady RM, Barakat S, Elrashidy NM. Using rough set and ant colony optimization in feature
selection. International Journal of Emerging Trends & Technology in Computer Science (IJETTCS). 2013 Jan; 2
Taha AM, Tang AY. Bat algorithm for rough set attribute reduction. Journal of Theoretical and Applied
Information Technology. 2013 May 10; 51(1):1-8.
Kumar PG, Vijay SA, Devaraj D. A hybrid colony fuzzy system for analyzing diabetes microarray data.
InComputational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013 IEEE Symposium
on 2013 Apr 16 (pp. 104-111). IEEE.
Sathishkumar K, Thiagarasu V, Ramalingam M. An efficient artificial bee colony and fuzzy c means based
clustering gene expression data. International Journal of Innovative Research in Computer and Communication
Engineering. 2013 Jul; 1(5).
Nakamura RY, Pereira LA, Rodrigues D, Costa KA, Papa JP, Yang XS. Swarm Intelligence and Bio-Inspired
Computation: 9. Binary Bat Algorithm for Feature Selection. Elsevier Inc. Chapters; 2013 May 16.
Alshamlan H, Badr G, Alohali Y. MRMR-ABC: a hybrid gene selection algorithm for cancer classification
using microarray gene expression profiling. BioMed research international. 2015 Apr 15; 2015.
Chen KH, Wang KJ, Tsai ML, Wang KM, Adrian AM, Cheng WC, Yang TS, Teng NC, Tan KP, Chang KS.
Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization
algorithm. BMC bioinformatics. 2014 Feb 20; 15(1):49.
Shilaskar S, Ghatol A. Feature selection for medical diagnosis: Evaluation for cardiovascular diseases. Expert
Systems with Applications. 2013 Aug 31; 40(10):4146-53.
Liu H, Liu W, Liao Y, Cheng L, Liu Q, Ren X, Shi L, Tu X, Wang QK, Guo AY. CADgene: a comprehensive
database for coronary artery disease genes. Nucleic acids research. 2011 Jan 1; 39(suppl 1):D991-6.
Budak H, Ta?abat SE. A MODIFIED T-SCORE FOR FEATURE SELECTION.
Kummer O, Savoy J, Argand RE. Feature selection in sentiment analysis.
Sahoo PR, Theorems TM. Functional Equations. World Scientific; 1998.
Yang XS, Deb S. Cuckoo search via Lévy flights. InNature & Biologically Inspired Computing, 2009. NaBIC
World Congress on 2009 Dec 9 (pp. 210-214). IEEE.
Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH,
Sherman PM, Holko M, Yefanov A. NCBI GEO: archive for functional genomics data sets—update. Nucleic
acids research. 2013 Jan 1; 41(D1):D991-5.
Powers DM. Evaluation: from precision, recall and F-measure to ROC, unforcedness, nakedness and correlation.
Zarlis, M., Roslina, R., & Elviwani. (2020). Implementation of Information Extraction Algorithm for Indonesian
Research Report. APTIKOM Journal on Computer Science and Information Technologies, 5(3), 154-159.
Menakadevi, N., Nagarjuna, T., & Nehru, K. (2020). Pattern Recognition for Visually Impaired People Using
Radon Signature. APTIKOM Journal on Computer Science and Information Technologies, 5(3), 110-115.
Retrieved from https://aptikom-