StreamMARS: A Streaming Multivariate Adaptive Regression Splines Algorithm

2019-12-14
Computers and internet have become inevitable parts of our life in the 1990s, and afterwards, bulk of data are started being recorded in digital platforms automatically. To extract meaningful patterns from such data computational methods are developed in data mining and machine learning domains. Multivariate adaptive regression splines (MARS) is one such method successfully applied to off-line static data for prediction. In about last ten years, we face with the big data problem due to the steady increase in the size of the data. Streaming data is a kind of big data collected from sensor networks, production processes, twitter messages etc. Algorithms processing this type of data should consider both memory and time limitations as well as its changing nature with time. We develop a streaming version of a powerful predictive method MARS for estimating model parameters on-line in a temporarily adaptive manner using forgetting factors. Performance of the algorithm developed is tested on simulated data with different dimensions in static, abrupt and smoothly changing environments; as well as on real-life datasets, and also, compared with those of some benchmarking methods such as sliding windows. Results show that StreamMARS is a promising algorithm for predicting streaming big data.
The 13th International Conference on Computational and Financial Econometrics (CFE 2019) (14-16 December 2019)

Suggestions

Opportunities and barriers of architect led design build projects
Deniz, Ayça; Elias Özkan, Soofia Tahira; Department of Building Science in Architecture (2012)
From past to today, technological developments have resulted in new systems in parallel with digital age. Innovations have been started to be replaced with the traditional solutions. Standardizations have also started to be renewed in accordance with the high technology and complexity of the projects. Under these circumstances, design and construction activities have been separated in the construction industry. As a result, alternative project delivery systems have been developed and selecting the right del...
FSOLAP: A Fuzzy Logic-based Spatial OLAP Framework for Spatial-Temporal Analytics and Querying
Keskin, Sinan; Yazıcı, Adnan; Department of Computer Engineering (2023-1-3)
Nowadays, with the rise in sensor technology, the amount of spatial and temporal data increases day by day. Fast, effective, and accurate analysis and prediction of collected data have become more essential than ever. Spatial Online Analytical Processing (SOLAP) emerged to perform data mining on spatial and temporal data that naturally contains the hierarchical structure used in many complex applications. In addition, uncertainty and fuzziness are inherently essential elements of data in many complex data a...
FSOLAP: A fuzzy logic-based spatial OLAP framework for effective predictive analytics
Keskin, Sinan; Yazıcı, Adnan (2023-03-01)
Nowadays, with the rise in sensor technology, the amount of spatial and temporal data increases day by day. Fast, effective, and accurate analysis and prediction of collected data have become more essential than ever. Spatial Online Analytical Processing (SOLAP) emerged to perform data mining on spatial and temporal data that naturally contains the hierarchical structure used in many complex applications. In addition, uncertainty and fuzziness are inherently essential elements of data in many complex data a...
Workplace cysberslacking: an investigation based on the theory of planned behavior
Koç, Yasemin Doğa; Toker, Yonca; The Department of Psychology (2020)
The use of computers and mobiles at workspaces has increased dramatically in the last decade. Employees’ access to the Internet is inevitable and mostly required within working hours. Cyberslacking is a phenomenon that describes the non-work-related behavior conducted in the workplace by using the Internet. The effects of Cyberslacking behaviors in the workplace are still controversial. Literature suggests that Cyberslacking can be a facilitator of both positive and negative workplace behaviors. Thus, it is...
Template based image watermarking in the fractional fourier domain
Gökozan, Tolga; Akar, Gözde; Department of Electrical and Electronics Engineering (2005)
One of the main features of digital technology is that the digital media can be duplicated and reproduced easily. However, this allows unauthorized and illegal use of information, i.e. data piracy. To protect digital media against illegal attempts a signal, called watermark, is embedded into the multimedia data in a robust and invisible manner. A watermark is a short sequence of information, which contains owner2s identity. It is used for evidence of ownership and copyright purposes. In this thesis, we use ...
Citation Formats
İ. Batmaz, “StreamMARS: A Streaming Multivariate Adaptive Regression Splines Algorithm,” London, UK, 2019, p. 71, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/72160.