Oncogenes Classification Measured by Microarray using Data Mining Computational Methods

Authors

  • Fabrício Alves Rodrigues Universidade Federal de Goiás. Jataí (GO), Brasil.
  • Laurence Rodrigues do Amaral Faculdade de Computação (FACOM). Universidade Federal de Uberlândia (UFU). Patos de Minas (MG), Brasil.

DOI:

https://doi.org/10.32635/2176-9745.RBC.2012v58n2.625

Keywords:

Computational Biology, Gene Expression, Data Mining, Medical Oncology, Databases as Topic

Abstract

Introduction: In recent decades, cancer has been given a great dimension, becoming an evident world public health problem. The World Health Organization estimates that, in 2030, 27 million cancer cases and 17 million cancer deaths can be expected. Faced with this alarming scenario, Data Mining brings methods and tools to help building more meaningful knowledge about cancer. Objective: This paper aims to apply five traditional Data Mining methods on the NCI60 dataset. The database was created with data from microarray experiments, with levels of expression of 1,000 genes grouped into nine cancer classes. Method: The methods used on this paper are: J48, Random Forest, PART , IBK and Naive Bayes, which belong to the Weka environment, very traditional in Data Mining. Due to the low number of records for some cancer classes, the validation of the results obtained by the classifiers used the 3-fold cross validation. Results: The classifier which obtained the highest accuracy was IBK, while J48 and PART classifiers drastically reduced the set of genes, building high level knowledge as trees or rules. Conclusion: The results of this study can be used as tools aiming at assisting cancer cure research and may be used in the classification of new cases or to further improve understanding of gene/gene and gene/cancer relations.

 

Downloads

Download data is not yet available.

Published

2012-06-29

How to Cite

1.
Rodrigues FA, Amaral LR do. Oncogenes Classification Measured by Microarray using Data Mining Computational Methods. Rev. Bras. Cancerol. [Internet]. 2012 Jun. 29 [cited 2024 Jul. 22];58(2):241-9. Available from: https://rbc.inca.gov.br/index.php/revista/article/view/625

Issue

Section

ORIGINAL ARTICLE