Mixed effects models for time series gene expression data

Date

2011

Author

Erkan, İbrahim

Experimental factors such as cell type and treatment may affect the expression levels of individual genes differently. Expression levels are quantitative measurements obtained from microarrays, and they may be collected at a few unevenly spaced time points with replicates. The aim of this study is to take cell type, treatment, and short time series attributes into account and to draw inferences about their effects on individual genes. A mixed effects model (LME) was proposed to model the gene expression data, and the performance of the model was validated by a simulation study. Realistic data sets were generated that preserve the structure of the real-life sample data studied by Nymark et al. (2007). The predictive performance of the model was evaluated with measures such as accuracy, sensitivity, and specificity, and was compared with that of a competing method, Limma (Smyth, 2004). Both methods were also compared on real-life data. Simulation results showed that the predictive performance of LME is as high as 99% and that its False Discovery Rate (FDR) is as low as 0.4%, whereas Limma has an FDR of at least 32%. Moreover, LME has almost 99% predictive capability on the continuous time parameter, whereas Limma reaches only about 67% and cannot handle continuous independent variables.
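
The performance measures named in the abstract have standard definitions in terms of confusion counts. The following is a minimal sketch of those definitions, not the thesis's own code: the function name and the counts are hypothetical and chosen only for illustration, with "positive" taken to mean a gene called differentially expressed.

```python
def classification_measures(tp, fp, tn, fn):
    """Standard accuracy, sensitivity, specificity and FDR from confusion counts.

    tp/fp: genes called significant that truly are / are not affected.
    tn/fn: genes called non-significant that truly are not / are affected.
    """
    def ratio(numerator, denominator):
        # Return NaN instead of raising when a measure is undefined.
        return numerator / denominator if denominator else float("nan")

    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "sensitivity": ratio(tp, tp + fn),   # true positive rate
        "specificity": ratio(tn, tn + fp),   # true negative rate
        "fdr": ratio(fp, tp + fp),           # false discoveries among all calls
    }


# Hypothetical counts for illustration only; these are not results from the thesis.
measures = classification_measures(tp=90, fp=10, tn=880, fn=20)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

In a simulation study the true status of every gene is known, so these counts can be tallied directly for each method under comparison.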

Subject Keywords

Gene expression, Microarray data, Modified maximum likelihood (MML)

URI

http://etd.lib.metu.edu.tr/upload/12613913/index.pdf
https://hdl.handle.net/11511/20813

Collections

Graduate School of Natural and Applied Sciences, Thesis

Suggestions

Partially Observable Gene Regulatory Network Control Without a Boundary on Horizon Erdogdu, Utku; Polat, Faruk; Alhajj, Reda (2012-11-09) Gene regulatory networks (GRNs) govern the protein transcription process in the cell and interactions among genes play a vital role in determining the biosynthesis rate of proteins. By using intervention techniques discovered by biological research it is possible to control a GRN, thus promoting or demoting the expression rate of a certain gene. In this work, this control task is studied in a partially observable setting where interventions lack perfect knowledge of the expression level of all genes. Moreov...
Comparing Clustering Techniques for Real Microarray Data Purutçuoğlu Gazi, Vilda (2012-08-29) The clustering of genes detected as significant or differentially expressed provides useful information to biologists about the functions and functional relationships of genes. There are various types of clustering methods that can be applied to genomic data. These are mainly divided into two groups, namely, hierarchical and partitional methods. In this paper, as the novelty, we perform a detailed clustering analysis for the recently collected boron microarray dataset to investigate biologically more interes...
Mathematical Modeling and Approximation of Gene Expression Patterns Yılmaz, Fatih; Öktem, Hüseyin Avni (2004-09-03) This study concerns modeling, approximation and inference of gene regulatory dynamics on the basis of gene expression patterns. The dynamical behavior of gene expressions is represented by a system of ordinary differential equations. We introduce a gene-interaction matrix with some nonlinear entries, in particular, quadratic polynomials of the expression levels to keep the system solvable. The model parameters are determined by using optimization. Then, we provide the time-discrete approximation of our time...
Clustering of short time-course gene expression data with dissimilar replicates Cinar, Ozan; İlk Dağ, Özlem; İyigün, Cem (2018-04-01) Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under any condition such as a disease of interest. The information from these experiments can be enriched by following the expression levels through time and biological replicates. The purpose of this study is to propose an algorithm which clusters the genes with respect to the similarities between their behaviors through time. The algorithm is also aimed at highlighting the genes w...
Integer linear programming based solutions for construction of biological networks Eren Özsoy, Öykü; Can, Tolga; Department of Health Informatics (2014) Inference of gene regulatory or signaling networks from perturbation experiments and gene expression assays is one of the challenging problems in bioinformatics. Recently, the inference problem has been formulated as a reference network editing problem and it has been shown that finding the minimum number of edit operations on a reference network in order to comply with perturbation experiments is an NP-complete problem. In this dissertation, we propose linear programming based solutions for reconstruction o...

Citation Formats

İ. Erkan, “Mixed effects models for time series gene expression data,” Ph.D. - Doctoral Program, Middle East Technical University, 2011.