Sample size determination for logistic regression

Motrenko, Anastasiya
Strijov, Vadim
Weber, Gerhard Wilhelm
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regression model parameters as a multivariate variable, propose to estimate the sample size using the distance between parameter distribution functions on cross-validated data sets. Herewith, the authors give a new contribution to data mining and statistical learning, supported by applied mathematics.


Estimating parameters of a multiple autoregressive model by the modified maximum likelihood method
TÜRKER BAYRAK, ÖZLEM; Akkaya, Ayşen (Elsevier BV, 2010-02-15)
We consider a multiple autoregressive model with non-normal error distributions, the latter being more prevalent in practice than the usually assumed normal distribution. Since the maximum likelihood equations have convergence problems (Puthenpura and Sinha, 1986) [11], we work Out modified maximum likelihood equations by expressing the maximum likelihood equations in terms of ordered residuals and linearizing intractable nonlinear functions (Tiku and Suresh, 1992) [8]. The solutions, called modified maximu...
A powerful approach for effective finding of significantly differentially expressed genes
Abul, Osman; Alhajj, Reda; Polat, Faruk (Institute of Electrical and Electronics Engineers (IEEE), 2006-07-01)
The problem of identifying significantly differentially expressed genes for replicated microarray experiments is accepted as significant and has been tackled by several researchers. Patterns from Gene Expression (PaGE) and q-values are two of the well-known approaches developed to handle this problem. This paper proposes a powerful approach to handle this problem. We first propose a method for estimating the prior probabilities used in the first version of the PaGE algorithm. This way, the problem definitio...
An algorithm to analyze stability of gene-expression patterns
Gebert, J; Latsch, M; Pickl, SW; Weber, Gerhard Wilhelm; Wunschiers, R (Elsevier BV, 2006-05-01)
Many problems in the field of computational biology consist of the analysis of so-called gene-expression data. The successful application of approximation and optimization techniques, dynamical systems, algorithms and the utilization of the underlying combinatorial structures lead to a better understanding in that field. For the concrete example of gene-expression data we extend an algorithm, which exploits discrete information. This is lying in extremal points of polyhedra, which grow step by step, up to a...
Analysis of variance and linear contrasts in experimental design with generalized secant hyperbolic distribution
Yilmaz, Yidiz E.; Akkaya, Ayşen (Elsevier BV, 2008-07-01)
We consider one-way classification model in experimental design when the errors have generalized secant hyperbolic distribution. We obtain efficient and robust estimators for block effects by using the modified maximum likelihood estimation (MML) methodology. A test statistic analogous to the normal-theory F statistic is defined to test block effects. We also define a test statistic for testing linear contrasts. It is shown that test statistics based on MML estimators are efficient and robust. The methodolo...
Estimation and hypothesis testing in BIB design and robustness
Tiku, Moti L.; ŞENOĞLU, BİRDAL (Elsevier BV, 2009-07-01)
Modified maximum likelihood estimators of the unknown parameters in a BIB design under non-normality of error distributions are obtained. They are shown to be more efficient and robust than the traditional least squares estimators. A test statistic for testing a linear contrast among treatment effects is developed. A real life example is given.
Citation Formats
A. Motrenko, V. Strijov, and G. W. Weber, “Sample size determination for logistic regression,” JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, pp. 743–752, 2014, Accessed: 00, 2020. [Online]. Available: