Random Forest Analysis for Predicting the Probability of Earthquake in Indonesia
Abstract
This research identifies earthquake risk zones in Indonesia by applying the Random Forest algorithm to predict the probability of earthquakes. The algorithm was selected for its capacity to handle large, complex, and non-linear data, which is common in seismic studies. A predictive model is constructed from historical earthquake data and geographic coordinates. The primary objective is to evaluate how effectively the Random Forest algorithm predicts earthquake probabilities across different regions of Indonesia. The results indicate that the highest likelihood of earthquakes occurs in Maluku at 24.77%, followed by Sulawesi at 18.68% and Nusa Tenggara at 18.34%. The Random Forest model achieved an accuracy of 90.78%, demonstrating its effectiveness in forecasting earthquake probabilities. These findings are expected to provide valuable insights for the government and stakeholders in developing improved disaster mitigation strategies in Indonesia. Furthermore, the methods used in this study can be applied to predict the probabilities of other types of natural disasters in different regions. Future research could focus on using larger datasets and examining the specific regions from which the data is collected.
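As an illustration of the kind of workflow the abstract describes, the following is a minimal sketch of training a Random Forest on geographic coordinates and reporting held-out accuracy and class probabilities. It is not the authors' pipeline: the abstract does not state the exact features, target definition, or train/test split, so the synthetic catalogue, the binary "active" label, the 80/20 split, and the function names `make_synthetic_catalogue` and `fit_and_evaluate` are all assumptions made for this example. It uses scikit-learn's `RandomForestClassifier`.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def make_synthetic_catalogue(n_events=1000, seed=0):
    """Return (X, y): random coordinates and a made-up binary label.

    This stands in for a real earthquake catalogue and is purely illustrative.
    """
    rng = np.random.default_rng(seed)
    lat = rng.uniform(-11.0, 6.0, n_events)   # rough latitude span of Indonesia
    lon = rng.uniform(95.0, 141.0, n_events)  # rough longitude span of Indonesia
    # Invented rule, not real seismicity: eastern points are labelled
    # "active" (1) more often than western points.
    p_active = np.where(lon > 120.0, 0.9, 0.1)
    y = (rng.uniform(size=n_events) < p_active).astype(int)
    return np.column_stack([lat, lon]), y


def fit_and_evaluate(seed=0):
    """Fit a Random Forest on the synthetic data and score it on a held-out set."""
    X, y = make_synthetic_catalogue(seed=seed)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed, stratify=y
    )
    model = RandomForestClassifier(n_estimators=100, random_state=seed)
    model.fit(X_train, y_train)
    accuracy = accuracy_score(y_test, model.predict(X_test))
    return model, accuracy


if __name__ == "__main__":
    model, accuracy = fit_and_evaluate()
    print(f"held-out accuracy: {accuracy:.4f}")
    # Class probabilities for one hypothetical location given as (lat, lon).
    print(model.predict_proba(np.array([[-3.2, 130.1]])))
```

With a real catalogue, the synthetic generator would be replaced by the recorded events and whatever target the study defines; the accuracy printed here reflects only the invented labelling rule and says nothing about the 90.78% reported above.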