Examining parallelization in kernel regression

Date

2023-11-04

Author

Oltulu, Orçun
Gökalp Yavuz, Fulya

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

88
views

0
downloads

For a few decades, parallelization in statistical computing has been an increasing trend, and researchers have put significant effort into converting or adjusting known statistical methods and algorithms in parallel. The main reasons for the transition to parallel processes are the rapid growth in the size and the volume of data and the accelerated hardware developments. Divide and (re)combine (DnR) is one of the parallelization methods that allows the existing data or method to be implemented by dividing it into smaller pieces. It is possible to use the DnR method in most regression methods to reveal the relationship between the data. Although several libraries have been created in existing programming languages for many regression methods, such an approach is not yet used for kernel regression. However, it should be kept in mind that the kernel regression calculation method takes a relatively long time. For this reason, parallelization would be a handy strategy to decrease the calculation time in kernel regression. In this study, we aim to demonstrate how time efficiency is achieved using DnR methods for kernel regression with the help of several parallelization strategies in R. The results indicate that the computation time can be reduced proportionally with a trade-off between time and accuracy.

URI

https://hdl.handle.net/11511/108648

Journal

SOFT COMPUTING

DOI

https://doi.org/10.1007/s00500-023-09285-4

Collections

Department of Statistics, Article

Citation Formats

O. Oltulu and F. Gökalp Yavuz, “Examining parallelization in kernel regression,” SOFT COMPUTING, pp. 0–0, 2023, Accessed: 00, 2024. [Online]. Available: https://hdl.handle.net/11511/108648.