Predicting the disease of Alzheimer (AD) with SNP biomarkers and clinical data based decision support system using data mining classification approaches

Download
2012
Erdoğan, Onur
Single Nucleotide Polymorphisms (SNPs) are the most common DNA sequence variations where only a single nucleotide (A, T, C, G) in the human genome differs between individuals. Besides being the main genetic reason behind individual phenotypic differences, SNP variations have the potential to exploit the molecular basis of many complex diseases. Association of SNPs subset with diseases and analysis of the genotyping data with clinical findings will provide practical and affordable methodologies for the prediction of diseases in clinical settings. So, there is a need to determine the SNP subsets and patients’ clinical data which is informative for the prediction or the diagnosis of the particular diseases. So far, there is no established approach for selecting the representative SNP subset and patients’ clinical data, and data mining methodology that is based on finding hidden and key patterns over huge databases. This approach have the highest potential for extracting the knowledge from genomic datasets and to select the number of SNPs and most effective clinical features for diseases that are informative and relevant for clinical diagnosis. In this study we have applied one of the widely used data mining classification methodology: “decision tree” for associating the SNP Biomarkers and clinical data with the Alzheimer’s disease (AD), which is the most common form of “dementia”. Different tree construction parameters have been compared for the optimization, and the most efficient and accurate tree for predicting the AD is presented.

Suggestions

Predicting the Disease of Alzheimer With SNP Biomarkers and Clinical Data Using Data Mining Classification Approach: Decision Tree
Erdogan, Onur; Aydın Son, Yeşim (2014-09-03)
Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data which is informative for the diagnosis. Data mining approaches have the highest potential for extracting the knowledge from genomic datasets and selecting the representative SNPs as well as most effec...
Integration of METU-SNP databases via RDF for PI_SNP web service
Gedikoğlu, Ceyhun; Aydın Son, Yeşim; Çarkacıoğlu, Levent; Department of Bioinformatics (2014)
Single Nucleotide Polymorphism (SNP) is a variation which occurs after a nucleotide mutates between members of a species or paired chromosomes in DNA sequence. SNP data is especially important for identifying genetic variations underlying complex diseases. The need for collection and service of this data under a standard format and globally normalized and structured metadata that houses the structured SNP data is becoming more important while recent advances in high-throughput genotyping technologies are re...
Evaluating Oxygen Tensions Related to Bone Marrow and Matrix for MSC Differentiation in 2D and 3D Biomimetic Lamellar Scaffolds
Sayin, Esen; Baran, Erkan Turker; Elsheikh, Ahmed; Mudera, Vivek; Cheema, Umber; Hasırcı, Vasıf Nejat (2021-04-01)
The physiological O-2 microenvironment of mesenchymal stem cells (MSCs) and osteoblasts and the dimensionality of a substrate are known to be important in regulating cell phenotype and function. By providing the physiologically normoxic environments of bone marrow (5%) and matrix (12%), we assessed their potential to maintain stemness, induce osteogenic differentiation, and enhance the material properties in the micropatterned collagen/silk fibroin scaffolds that were produced in 2D or 3D. Expression of ost...
Stability of a recombinant plasmid carrying alpha-amylase gene in Bacillus subtilis
Gültekin, Güzin Candan; Bayındırlı, Alev (1998-03-01)
Plasmid pSB6 is a streptococcal recombinant plasmid carrying the alpha-amylase gene of Bacillus amyloliquefaciens and the chloramphenicol resistance gene. The segregational and structural instabilities of this plasmid were examined under non-selective conditions in Bacillus subtilis. These instabilities were modelled according to a kinetic expression derived from the difference in the growth between plasmid-bearing and plasmid-free cells. This plasmid showed slight segregational instability and much higher ...
Prediction of protein subcellular localization based on primary sequence data
Ozarar, M; Atalay, Mehmet Volkan; Atalay, Rengül (2003-01-01)
This paper describes a system called prediction of protein subcellular localization (P2SL) that predicts the subcellular localization of proteins in eukaryotic organisms based on the amino acid content of primary sequences using amino acid order. Our approach for prediction is to find the most frequent motifs for each protein (class) based on clustering and then to use these most frequent motifs as features for classification. This approach allows a classification independent of the length of the sequence. ...
Citation Formats
O. Erdoğan, “Predicting the disease of Alzheimer (AD) with SNP biomarkers and clinical data based decision support system using data mining classification approaches,” M.S. - Master of Science, Middle East Technical University, 2012.