A simulation study on the comparison of methods for the analysis of longitudinal count data

İnan, Gül
The longitudinal nature of the measurements and the counting process that generates the responses require regression models for longitudinal count data (LCD) to account for phenomena such as within-subject association and overdispersion. A common problem in longitudinal studies is missing data, which complicates the analysis further. Missingness can be handled with missing data techniques; however, both the amount of missingness and the missingness mechanism affect how well these techniques perform. Among the regression models for LCD, this thesis focuses on the Log-Log-Gamma marginalized multilevel model (Log-Log-Gamma MMM) and the random-intercept model. The performance of the models is compared via a simulation study under three missing data mechanisms (missing completely at random, missing at random conditional on observed data, and missing not at random), two missingness percentages (10% and 20%), and four missing data techniques (complete case analysis and subject, occasion, and conditional mean imputation). The simulation study shows that, although the mean absolute error and mean square error values of the Log-Log-Gamma MMM are larger than those of the random-intercept model, both regression models yield comparable results. The results also confirm that the amount of missingness and the missingness mechanism strongly influence the performance of the missing data techniques under both regression models. Furthermore, occasion mean imputation generally performs worst, whereas conditional mean imputation outperforms both occasion and subject mean imputation and gives results comparable to complete case analysis.
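The simpler missing data techniques named in the abstract can be illustrated with a minimal sketch. It is not the thesis's simulation code: the function names, the toy `counts` matrix (rows are subjects, columns are occasions, `None` marks a missing response), and the cell-wise MCAR generator are all assumptions made for illustration. Conditional mean imputation is omitted because it depends on a fitted regression model that the abstract does not specify.

```python
import random
from statistics import mean


def make_mcar(data, rate, seed=0):
    """Copy `data`, setting each cell to None with probability `rate`.

    Deletion ignores both observed and unobserved values, which is the
    defining property of missing completely at random (MCAR).
    """
    rng = random.Random(seed)
    return [[None if rng.random() < rate else y for y in row] for row in data]


def complete_cases(data):
    """Complete case analysis: keep only subjects with no missing response."""
    return [row for row in data if all(y is not None for y in row)]


def subject_mean_impute(data):
    """Replace each missing value with the mean of that subject's observed values."""
    out = []
    for row in data:
        observed = [y for y in row if y is not None]
        fill = mean(observed) if observed else None  # nothing to average
        out.append([fill if y is None else y for y in row])
    return out


def occasion_mean_impute(data):
    """Replace each missing value with the mean of that occasion's observed values."""
    n_occasions = len(data[0])
    col_means = []
    for j in range(n_occasions):
        observed = [row[j] for row in data if row[j] is not None]
        col_means.append(mean(observed) if observed else None)
    return [
        [col_means[j] if y is None else y for j, y in enumerate(row)]
        for row in data
    ]


# Hypothetical toy data: 3 subjects measured on 4 occasions.
counts = [
    [2, 3, None, 5],
    [0, None, 1, 1],
    [4, 4, 6, 6],
]

print(complete_cases(counts))
print(subject_mean_impute(counts))
print(occasion_mean_impute(counts))
```

Subject mean imputation borrows information only within a subject, while occasion mean imputation borrows only across subjects at the same time point, which is one plausible reason the two can behave differently when within-subject association is strong.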


A marginalized multilevel model for bivariate longitudinal binary data
Inan, Gul; İlk Dağ, Özlem (Springer Science and Business Media LLC, 2019-06-01)
This study considers the analysis of bivariate longitudinal binary data. We propose a model based on the marginalized multilevel model framework. The proposed model consists of two levels: the first level links the marginal mean of the responses to covariates through a logistic regression model, and the second level includes subject/time-specific random intercepts within a probit regression model. The covariance matrix of multiple correlated time-specific random intercepts for each subject is assumed to ...
A simulation study on marginalized transition random effects models for multivariate longitudinal binary data
Yalçınöz, Zerrin; İlk Dağ, Özlem; Department of Statistics (2008)
In this thesis, a simulation study is conducted and a statistical model is fitted to the simulated data. The data are assumed to represent the satisfaction of customers who withdraw their salary from a particular bank. They are longitudinal data with a bivariate binary response, assumed to be collected from 200 individuals at four different time points. In such data sets, two types of dependence (the dependence within subject measurements and the dependence between responses) are important and these are...
A comparative study of autoregressive neural network hybrids
Taşkaya Temizel, Tuğba (2005-06-01)
Many researchers have argued that combining many models for forecasting gives better estimates than single time series models. For example, a hybrid architecture comprising an autoregressive integrated moving average model (ARIMA) and a neural network is a well-known technique that has recently been shown to give better forecasts by taking advantage of each model's capabilities. However, this assumption carries the danger of underestimating the relationship between the model's linear and non-linear componen...
Vardar Acar, Ceren (2018-04-30)
In this study, we propose an algorithm to generate a correlated random walk converging to fractional Brownian motion, with Hurst parameter H ∈ [1/2, 1]. The increments of this random walk are simulated from a Bernoulli distribution with proportion p, whose density is constructed using the link between the correlation of multivariate Gaussian random variables and the correlation of their dichotomized binary variables. We prove that the normalized sum of trajectories of this proposed random walk yields a Gaussian p...
A simplified procedure for estimating the inelastic drift demands on frame structures
Ay, Bekir Özer (2008-10-17)
Citation Formats
G. İnan, “A simulation study on the comparison of methods for the analysis of longitudinal count data,” M.S. - Master of Science, Middle East Technical University, 2009.