Predicting user activity in video games
DOI:
https://doi.org/10.34185/1562-9945-5-161-2025-09Keywords:
data analysis, forecasting, Gradient Boosting, LinearSVR, ARIMA, SARIMA, ARIMAX, SARIMAXAbstract
This paper describes the results of the implementation of a full cycle of solving the problem of predicting the number of online players in video games in a month based on his-torical and current data.
Within the framework of data mining, several forecasting models were tested, including Gradient Boosting (XGBoost, LightGBM), LinearSVR, and the SARIMA and SARIMAX time series models. According to the results of preliminary testing, it was found that the SARIMA and SARIMAX models provide the lowest average error rates (about 20–22%), while other methods had a significantly higher error (about 40%), which does not allow them to be effec-tively used to solve the problem.
The final testing of the SARIMA and SARIMAX models on the example of three games with different levels of popularity allowed us to conclude that, despite the slight lag of SARIMAX in terms of the average MAPE value, this model provides more stable forecasting results, better adapts to the characteristics of different types of games, and more accurately approaches real online values. Thus, the most optimal approach to predicting the future of online video games was the SARIMAX model, which demonstrates high accuracy, stability and flexibility.
The results obtained may be useful for further application in the gaming industry for the purpose of planning server loads, marketing campaigns and general monitoring of the dy-namics of interest in the product.
References
General information for games on the Steam platform.
URL: https://www.kaggle.com/datasets/nikdavis/steam-store-games (date of access: 28.04.2025).
Reviews for games on the Steam platform.
URL: https://www.kaggle.com/datasets/najzeko/steam-reviews-2021/data (date of access: 28.04.2025).
Online and pricing history for games on the Steam platform.
URL: https://data.mendeley.com/datasets/ycy3sy3vj2/1 (date of access: 28.04.2025). 12. PostgreSQL. URL: https://www.postgresql.org/ (date of access: 28.04.2025).
Gradient Boosting. URL: https://en.wikipedia.org/wiki/Gradient_boosting (date of access: 28.04.2025).
XGBoost. URL: https://xgboost.readthedocs.io/en/release_3.0.0/ (date of access: 28.04.2025).
LightGBM. URL: https://lightgbm.readthedocs.io/en/stable/ (date of access: 28.04.2025).
LinearSVR. URL: https://habr.com/ru/articles/802185/ (date of access: 28.04.2025). (S)ARIMA(X).
URL: https://habr.com/ru/articles/477206/ (date of access: 28.04.2025).
Downloads
Published
Issue
Section
License
Copyright (c) 2026 System technologies

This work is licensed under a Creative Commons Attribution 4.0 International License.









