Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
A Prostate Cancer Model Build by a Novel SVM-ID3 Hybrid Feature Selection Method Using Both Genotyping and Phenotype Data from dbGaP
Download
index.pdf
Date
2014-03-20
Author
Yucebas, Sait Can
Aydın Son, Yeşim
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
214
views
91
downloads
Cite This
Through Genome Wide Association Studies (GWAS) many Single Nucleotide Polymorphism (SNP)-complex disease relations can be investigated. The output of GWAS can be high in amount and high dimensional, also relations between SNPs, phenotypes and diseases are most likely to be nonlinear. In order to handle high volume-high dimensional data and to be able to find the nonlinear relations we have utilized data mining approaches and a hybrid feature selection model of support vector machine and decision tree has been designed. The designed model is tested on prostate cancer data and for the first time combined genotype and phenotype information is used to increase the diagnostic performance. We were able to select phenotypic features such as ethnicity and body mass index, and SNPs those map to specific genes such as CRR9, TERT. The performance results of the proposed hybrid model, on prostate cancer dataset, with 90.92% of sensitivity and 0.91 of area under ROC curve, shows the potential of the approach for prediction and early detection of the prostate cancer.
Subject Keywords
Genome-wide association
,
Support vector machines;
,
Body-mass index
,
Risk
,
Classification
,
Metaanalysis
,
Diagnosis
,
Genetics
,
Density
,
Antigen
URI
https://hdl.handle.net/11511/30805
Journal
PLOS ONE
DOI
https://doi.org/10.1371/journal.pone.0091404
Collections
Graduate School of Informatics, Article
Suggestions
OpenMETU
Core
A novel SVM-ID3 Hybrid Feature Selection Method to Build a Disease Model for Melanoma using Integrated Genotyping and Phenotype Data from dbGaP
Aydın Son, Yeşim (2014-09-03)
The relations between Single Nucleotide Polymorphism (SNP) and complex diseases are likely to be non-linear and require analysis of the high dimensional data. Previous studies in the field mostly focus on genotyping and effects of various phenotypes are not considered. To fill this gap a hybrid feature selection model of support vector machine and decision tree has been designed. The designed method is tested on melanoma. We were able to select phenotypic features such as moles and dysplastic nevi, and SNPs...
A Hybrid feature selection model for genome wide association studies
Yücebaş, Sait Can; Baykal, Nazife; Aydın Son, Yeşim; Department of Health Informatics (2013)
Through Genome Wide Association Studies (GWAS) many SNP-complex disease relations have been investigated so far. GWAS presents high amount – high dimensional data and relations between SNPs, phenotypes and diseases are most likely to be nonlinear. In order to handle high volume-high dimensional data and to be able to find the nonlinear relations, data mining approaches are needed. A hybrid feature selection model of support vector machine and decision tree has been designed. This model also combines the gen...
An integrative approach to structured snp prioritization and representative snp selection for genome-wide association studies
Üstünkar, Gürkan; Aydın Son, Yeşim; Weber, Gerhard Wilhelm; Department of Information Systems (2011)
Single Nucleotide Polymorphisms (SNPs) are the most frequent genomic variations and the main basis for genetic differences among individuals and many diseases. As genotyping millions of SNPs at once is now possible with the microarrays and advanced sequencing technologies, SNPs are becoming more popular as genomic biomarkers. Like other high-throughput research techniques, genome wide association studies (GWAS) of SNPs usually hit a bottleneck after statistical analysis of significantly associated SNPs, as ...
A multi-layered graphical model of the relation among SNPS, GENES, and pathways based on subgraph search
Ersoy, Gökhan; Aydın Son, Yeşim; Can, Tolga; Department of Bioinformatics (2015)
The analysis of Single Nucleotide Polymorphisms (SNPs) through Genome Wide Association Studies (GWAS) presents great potential for describing disease loci and gaining insight into the underlying etiology of diseases. Recently described combined p-value approach allows identification of associations at gene and pathway level. The integrated programs like METU-SNP produce simple lists of either SNP id/gene id/pathway title and their p-values and significance status or SNP id/disease id/pathway information. In...
Identification and analysis of genomic regions with large between-population differentiation in humans
Myles, S.; Tang, K.; Somel, Mehmet; Green, R. E.; Kelso, J.; Stoneking, M. (Wiley, 2008-01-01)
The primary aim of genetic association and linkage studies is to identify genetic variants that contribute to phenotypic variation within human populations. Since the overwhelming majority of human genetic variation is found within populations, these methods are expected to be effective and can likely be extrapolated from one human population to another. However, they may lack power in detecting the genetic variants that contribute to phenotypes that differ greatly between human populations. Phenotypes that...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
S. C. Yucebas and Y. Aydın Son, “A Prostate Cancer Model Build by a Novel SVM-ID3 Hybrid Feature Selection Method Using Both Genotyping and Phenotype Data from dbGaP,”
PLOS ONE
, pp. 0–0, 2014, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/30805.