Parameter estimation in generalized partial linear models with conic quadratic programming

Download
2010
Çelik, Gül
In statistics, regression analysis is a technique, used to understand and model the relationship between a dependent variable and one or more independent variables. Multiple Adaptive Regression Spline (MARS) is a form of regression analysis. It is a non-parametric regression technique and can be seen as an extension of linear models that automatically models non-linearities and interactions. MARS is very important in both classification and regression, with an increasing number of applications in many areas of science, economy and technology. In our study, we analyzed Generalized Partial Linear Models (GPLMs), which are particular semiparametric models. GPLMs separate input variables into two parts and additively integrates classical linear models with nonlinear model part. In order to smooth this nonparametric part, we use Conic Multiple Adaptive Regression Spline (CMARS), which is a modified form of MARS. MARS is very benefical for high dimensional problems and does not require any particular class of relationship between the regressor variables and outcome variable of interest. This technique offers a great advantage for fitting nonlinear multivariate functions. Also, the contribution of the basis functions can be estimated by MARS, so that both the additive and interaction effects of the regressors are allowed to determine the dependent variable. There are two steps in the MARS algorithm: the forward and backward stepwise algorithms. In the first step, the model is constructed by adding basis functions until a maximum level of complexity is reached. Conversely, in the second step, the backward stepwise algorithm reduces the complexity by throwing the least significant basis functions from the model. In this thesis, we suggest not using backward stepwise algorithm, instead, we employ a Penalized Residual Sum of Squares (PRSS). We construct PRSS for MARS as a Tikhonov Regularization Problem. We treat this problem using continuous optimization techniques which we consider to become an important complementary technology and alternative to the concept of the backward stepwise algorithm. Especially, we apply the elegant framework of Conic Quadratic Programming (CQP) an area of convex optimization that is very well-structured, hereby, resembling linear programming and, therefore, permitting the use of interior point methods. At the end of this study, we compare CQP with Tikhonov Regularization problem for two different data sets, which are with and without interaction effects. Moreover, by using two another data sets, we make a comparison between CMARS and two other classification methods which are Infinite Kernel Learning (IKL) and Tikhonov Regularization whose results are obtained from the thesis, which is on progress.

Suggestions

CMARS: a new contribution to nonparametric regression with multivariate adaptive regression splines supported by continuous optimization
Weber, Gerhard-Wilhelm; Batmaz, İnci; Köksal, Gülser; Taylan, Pakize; Yerlikaya-Ozkurt, Fatma (2012-01-01)
Regression analysis is a widely used statistical method for modelling relationships between variables. Multivariate adaptive regression splines (MARS) especially is very useful for high-dimensional problems and fitting nonlinear multivariate functions. A special advantage of MARS lies in its ability to estimate contributions of some basis functions so that both additive and interactive effects of the predictors are allowed to determine the response variable. The MARS method consists of two parts: forward an...
ROBUST CONIC GENERALIZED PARTIAL LINEAR MODELS USING RCMARS METHOD - A ROBUSTIFICATION OF CGPLM
Ozmen, Ayse; Weber, Gerhard Wilhelm (2012-08-08)
GPLM is a combination of two different regression models each of which is used to apply on different parts of the data set. It is also adequate to high dimensional, non-normal and nonlinear data sets having the flexibility to reflect all anomalies effectively. In our previous study, Conic GPLM (CGPLM) was introduced using CMARS and Logistic Regression. According to a comparison with CMARS, CGPLM gives better results. In this study, we include the existence of uncertainty in the future scenarios into CMARS a...
Parameter estimation in generalized partial linear models with Tikhanov regularization
Kayhan, Belgin; Karasözen, Bülent; Department of Scientific Computing (2010)
Regression analysis refers to techniques for modeling and analyzing several variables in statistical learning. There are various types of regression models. In our study, we analyzed Generalized Partial Linear Models (GPLMs), which decomposes input variables into two sets, and additively combines classical linear models with nonlinear model part. By separating linear models from nonlinear ones, an inverse problem method Tikhonov regularization was applied for the nonlinear submodels separately, within the e...
Robust conic quadratic programming applied to quality improvement -a robustification of CMARS
Özmen, Ayşe; Weber, Gerhard Wilhelm; Batmaz, İnci; Department of Scientific Computing (2010)
In this thesis, we study and use Conic Quadratic Programming (CQP) for purposes of operational research, especially, for quality improvement in manufacturing. In previous works, the importance and benefit of CQP in this area became already demonstrated. There, the complexity of the regression method Multivariate Adaptive Regression Spline (MARS), which especially means sensitivity with respect to noise in the data, became penalized in the form of so-called Tikhonov regularization, which became expressed and...
Analyses of Two Different Regression Models and Bootstrapping
Gökalp Yavuz, Fulya (Springer, Berlin, Heidelberg, 2011-09-02)
Regression methods are used to explain the relationship between a single response variable and one or more explanatory variables. Graphical methods are generally the first step and are used to identify models that can be explored to describe the relationship. Although linear models are frequently used and they are user friendly, many important associations are not linear and require considerably more analytical effort. This study is focused on such nonlinear models. To perform statistical inference in this ...
Citation Formats
G. Çelik, “Parameter estimation in generalized partial linear models with conic quadratic programming,” M.S. - Master of Science, Middle East Technical University, 2010.