A computational approach to nonparametric regression: bootstrapping CMARS method

Download
2015-10-01
Yazici, Ceyda
Yerlikaya-Ozkurt, Fatma
Batmaz, İnci
Bootstrapping is a computer-intensive statistical method which treats the data set as a population and draws samples from it with replacement. This resampling method has wide application areas especially in mathematically intractable problems. In this study, it is used to obtain the empirical distributions of the parameters to determine whether they are statistically significant or not in a special case of nonparametric regression, conic multivariate adaptive regression splines (CMARS), a statistical machine learning algorithm. CMARS is the modified version of the well-known nonparametric regression model, multivariate adaptive regression splines (MARS), which uses conic quadratic optimization. CMARS is at least as complex as MARS even though it performs better with respect to several criteria. To achieve a better performance of CMARS with a less complex model, three different bootstrapping regression methods, namely, random-X, fixed-X and wild bootstrap are applied on four data sets with different size and scale. Then, the performances of the models are compared using various criteria including accuracy, precision, complexity, stability, robustness and computational efficiency. The results imply that bootstrap methods give more precise parameter estimates although they are computationally inefficient and that among all, random-X resampling produces better models, particularly for medium size and scale data sets.
MACHINE LEARNING

Suggestions

A computational approach to nonparametric regression: bootstrapping cmars method
Yazıcı, Ceyda; Batmaz, İnci; Department of Statistics (2011)
Bootstrapping is a resampling technique which treats the original data set as a population and draws samples from it with replacement. This technique is widely used, especially, in mathematically intractable problems. In this study, it is used to obtain the empirical distributions of the parameters to determine whether they are statistically significant or not in a special case of nonparametric regression, Conic Multivariate Adaptive Regression Splines (CMARS). Here, the CMARS method, which uses conic quadr...
CMARS: a new contribution to nonparametric regression with multivariate adaptive regression splines supported by continuous optimization
Weber, Gerhard-Wilhelm; Batmaz, İnci; Köksal, Gülser; Taylan, Pakize; Yerlikaya-Ozkurt, Fatma (2012-01-01)
Regression analysis is a widely used statistical method for modelling relationships between variables. Multivariate adaptive regression splines (MARS) especially is very useful for high-dimensional problems and fitting nonlinear multivariate functions. A special advantage of MARS lies in its ability to estimate contributions of some basis functions so that both additive and interactive effects of the predictors are allowed to determine the response variable. The MARS method consists of two parts: forward an...
An fMRI segmentation method under markov random fields for brain decoding
Aksan, Emre; Yarman Vural, Fatoş Tunay; Department of Computer Engineering (2015)
In this study, a specially tailored segmentation method for partitioning the fMRI data into a set of "homogenous" regions with respect to a predefined cost function is proposed. The proposed method, referred as f-MRF, employs univariate and multivariate fMRI data analysis techniques under Markov Random Fields to estimate the segments by resolving a mixture density. The univariate approach helps identifying activation pattern of a voxel independently from other voxels. In order to capture local interactions ...
EVALUATING THE CMARS PERFORMANCE FOR MODELING NONLINEARITIES
Batmaz, İnci; Kartal-Koc, Elcin; Köksal, Gülser (2010-02-04)
Multivariate Adaptive Regression Splines (MARS) is a very popular nonparametric regression method particularly useful for modeling nonlinear relationships that may exist among the variables. Recently, we developed CMARS method as an alternative to backward stepwise part of the MARS algorithm. Comparative studies have indicated that CMARS performs better than MARS for modeling nonlinear relationships. In those studies, however, only main and two-factor interaction effects were sufficient to model the nonline...
An Approach for determining process economy parameters of multivariate loss functions
Özkan, Gökçe; Köksal, Gülser; Department of Industrial Engineering (2016)
The aim of this study is to provide an effective method for determining parameters of multivariate loss functions, which are related with process economics. The loss functions are widely used in product and process design and other quality engineering applications. Although there are several studies about different types of loss functions, there is a lack of studies on determining cost matrix parameters of these functions. For this purpose, we propose a method based on multi-objective decision making tools....
Citation Formats
C. Yazici, F. Yerlikaya-Ozkurt, and İ. Batmaz, “A computational approach to nonparametric regression: bootstrapping CMARS method,” MACHINE LEARNING, pp. 211–230, 2015, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/40591.