Machine Learning Regression Model: Exploring Regression Algorithms for Mercedes-Benz Price Prediction

Authors

  • Ridho Sholehurrohman Universitas Lampung
  • Muhaqiqin Universitas Lampung
  • Igit Sabda Ilman Universitas Lampung
  • Agung Pambudi Universitas Lampung
  • Wartariyus Universitas Lampung
  • Joko Triloka Institut Informatika dan Bisnis Darmajaya
  • Handoyo Widi Nugroho Universitas Lampung

DOI:

https://doi.org/10.35194/mji.v18i1.6476

Keywords:

Ensemble Methods, Machine Learning, Mercedes-Benz Price, Price Prediction, Regression

Abstract

Predicting luxury car prices, such as Mercedes-Benz, remains challenging due to multiple interacting variables, including model, ratings, and market conditions. This study compares six regression algorithms, Linear Regression, Random Forest, Gradient Boosting, XGBoost, K-Nearest Neighbors, and AdaBoost, to identify the most effective model for Mercedes-Benz price prediction. A Kaggle dataset of 10,432 records was preprocessed through cleaning, removal of missing values (resulting in 10,307 records), One-Hot Encoding for categorical variables, and standardization of numerical features using StandardScaler, then split into 80% training and 20% testing data. Model performance was evaluated using MSE, RMSE, and R². Random Forest achieved the best performance (R² = 0.97; RMSE: $3,917), followed closely by Gradient Boosting (R² = 0.96; RMSE: $4,359) and XGBoost (R² = 0.96; RMSE: $4,305). Linear Regression achieved a similar R² (0.96) but higher errors (RMSE: $4,767), while AdaBoost (R² = 0.95; RMSE: $4,897) and KNN (R² = 0.90; RMSE: $5,657) showed lower performance. These findings confirm that ensemble methods, particularly Random Forest, significantly outperform traditional and distance-based approaches for luxury car price prediction. This study provides a comprehensive comparative framework for automotive pricing analytics, with future research directions including additional features, hyperparameter tuning, and integration of external market factors to further enhance prediction accuracy.

References

[1] Vaneesha K H, Srinivas V, Abhishek V, and Sujay Srinivas, "Comparative Analysis of Machine Learning Algorithms for Used Car Price Prediction," International Journal of Current Science Research and Review, vol. 7, no. 9, pp. 7220-7228, Sep. 2024, doi: 10.47191/ijcsrr/V7-i9-39.

[2] H. Chen, "Car Price Prediction Based on Multiple Machine Learning Models," in Proceedings of the 2nd International Conference on Data Analysis and Machine Learning - DAML, SciTePress, 2025, pp. 92-95, doi: 10.5220/0013509000004619.

[3] R. Chen, "Car Price Prediction Using Machine Learning," in Proceedings of the 1st International Conference on E-commerce and Artificial Intelligence (ECAI 2024), 2024, pp. 536-541, doi: 10.5220/0013270100004568.

[4] Z. Cheng, J. Liu, and H. Zhang, "Predicting car prices using Gradient Boosting machine and decision trees," Journal of Computer Science, vol. 29, no. 3, pp. 112-120, 2018, doi: 10.1016/j.jocs.2018.08.012.

[5] F. R. Amik, A. Lanard, A. Ismat, and S. Momen, "Application of Machine Learning Techniques to Predict the Price of Pre-Owned Cars in Bangladesh," Information, vol. 12, no. 12, p. 514, 2021, doi: 10.3390/info12120514.

[6] J. Yang, J. Kim, H. Ryu, J. Lee, and C. Park, “Predicting Car Rental Prices: A Comparative Analysis of Machine Learning Models,” Electronics, vol. 13, no. 12, p. 2345, Jun. 2024, doi: 10.3390/electronics13122345.

[7] S. Mirasçı and A. Aksoy, “Data-Driven Purchasing Strategies: Price Prediction Models and Strategy Development,” Expert Systems with Applications, vol. 266, p. 125986, 2025, doi: 10.1016/j.eswa.2025.125986

[8] S. Yılmaz and İ. H. Selvi, "Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market," Sakarya University Journal of Computer & Information Sciences, vol. 6, no. 2, pp. 140-148, Aug. 2023, https://doi.org/10.35377/saucis...1309103.

[9] G. P. Raj, G. George, et al., "Enhanced Used Car Price Prediction Using Machine Learning: A Comparative Study of Regression Models," in *2025 International Conference on Advances in Modern Age Technologies for Health and Engineering Science (AMATHE)*, Shivamogga, India, 2025, https://doi.org/10.1109/AMATHE65477.2025.11081288.

[10] T. Qian, "Used Car Price Prediction by Using XGBoost," BCP Business & Management, vol. 44, pp. 62-68, Apr. 2023, doi: 10.54691/bcpbm.v44i.4794.

[11] R. Gayathri, S. U. Rani, L. Čepová, M. Rajesh, and K. Kalita, "A Comparative Analysis of Machine Learning Models in Prediction of Mortar Compressive Strength," Processes, vol. 10, no. 7, p. 1387, Jul. 2022, doi: 10.3390/pr10071387.

[12] I. Fayyaz, G. G. Md. N. Ali, and S. S. Khairunnesa, "Advanced Feature Engineering and Machine Learning Techniques for High Accurate Price Prediction of Heterogeneous Pre-Own Cars," Vehicles, vol. 7, no. 3, p. 94, 2025, doi: 10.3390/vehicles7030094.

[13] J. He, "Predicting Vehicle Prices Using Machine Learning: A Case Study with Linear Regression," in Proceedings of the 5th International Conference on Signal Processing and Machine Learning, 2024, pp. 35-42, doi: 10.54254/2755-2721/99/20251746.

[14] L. M. Soegianto, A. T. Hinandra, P. A. Suri, and M. Fajar, "Comparison of Model Performance on Housing Business Using Linear Regression, Random Forest Regressor, SVR, and Neural Network," in Procedia Computer Science, Elsevier B.V., 2024, pp. 1139-1145, doi: 10.1016/j.procs.2024.10.343.

[15] D. Chicco, M. J. Warrens, and G. Jurman, "The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation," PeerJ Computer Science, vol. 7, p. e623, 2021, doi: 10.7717/peerj-cs.623.

[16] R. J. Hyndman and G. Athanasopoulos, Forecasting: Principles and Practice, 3rd ed. OTexts, 2021.

[17] S. Bird, E. Klein, and E. Loper, Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O'Reilly Media, 2009.

[18] A. Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 3rd ed. O'Reilly Media, 2022, ISBN: 9781098125974.

[19] D. A. Shofiana, M. Caniadi, R. Sholehurohman, and Aristoteles, "Decision Tree Algorithms in Water Quality Classification: A Comparative Study of Random Forest, XGBoost, and C5.0," Science and Technology Indonesia, vol. 10, no. 4, pp. 999-1011, 2025, https://doi.org/10.26554/sti.2025.10.4.999-1011.

Downloads

Published

2026-06-29

How to Cite

Sholehurrohman, R., Muhaqiqin, Sabda Ilman, I., Pambudi, A., Wartariyus, Triloka, J., & Widi Nugroho, H. (2026). Machine Learning Regression Model: Exploring Regression Algorithms for Mercedes-Benz Price Prediction. Media Jurnal Informatika, 18(1), 142–152. https://doi.org/10.35194/mji.v18i1.6476

Similar Articles

<< < 1 2 3 > >> 

You may also start an advanced similarity search for this article.