Probabilistic distance clustering adjusted for cluster size

İyigün, Cem
The probabilistic distance clustering method of [1] works well if the cluster sizes are approximately equal. We modify that method to deal with clusters of arbitrary size and for problems where the cluster sizes are themselves unknowns that need to be estimated. In the latter case, our method is a viable alternative to the expectation-maximization (EM) method.


A binomial noised model for cluster validation
Toledano-Kitai, Dvora; Avros, Renata; Volkovich, Zeev; Weber, Gerhard Wilhelm; Yahalom, Orly (IOS Press, 2013-01-01)
Cluster validation is the task of estimating the quality of a given partition of a data set into clusters of similar objects. Normally, a clustering algorithm requires a desired number of clusters as a parameter. We consider the cluster validation problem of determining the optimal ("true") number of clusters. We adopt the stability testing approach, according to which, repeated applications of a given clustering algorithm provide similar results when the specified number of clusters is correct. To implemen...
Linear contrasts in experimental design with non-identical error distributions
Senoglu, B; Tiku, ML (Wiley, 2002-01-01)
Estimation of linear contrasts in experimental design, and testing their assumed values, is considered when the error distributions from block to block are not necessarily identical. The normal-theory solutions are shown to have low efficiencies as compared to the solutions presented here.
Autoregressive models with short-tailed symmetric distributions
Akkaya, Ayşen (Informa UK Limited, 2008-01-01)
Symmetric short-tailed distributions do indeed occur in practice but have not received much attention particularly in the context of autoregression. We consider a family of such distributions and derive the modified maximum likelihood estimators of the parameters. We show that the estimators are efficient and robust. We develop hypothesis-testing procedures.
Multiple linear regression model with stochastic design variables
İslam, Muhammed Qamarul (Informa UK Limited, 2010-01-01)
In a simple multiple linear regression model, the design variables have traditionally been assumed to be non-stochastic. In numerous real-life situations, however, they are stochastic and non-normal. Estimators of parameters applicable to such situations are developed. It is shown that these estimators are efficient and robust. A real-life example is given.
Minimum variance quadratic unbiased estimation for the variance components in simple linear regression with onefold nested error
Gueven, Ilgehan (Informa UK Limited, 2006-01-01)
The explicit forms of the minimum variance quadratic unbiased estimators (MIVQUEs) of the variance components are given for simple linear regression with onefold nested error. The resulting estimators are more efficient as the ratio of the initial variance components estimates increases and are asymptotically efficient as the ratio tends to infinity.
Citation Formats
A. BEN-ISRAEL and C. İyigün, “Probabilistic distance clustering adjusted for cluster size,” PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, pp. 603–621, 2008, Accessed: 00, 2020. [Online]. Available: