OPTIMASI DETEKSI SPAM EMAIL DENGAN RANDOM FOREST DAN RANDOM SEARCH
DOI:
https://doi.org/10.32897/sobat.2025.7.1.5288Keywords:
Hyperparameter-Tuning, Random Forest, Randomized Search Classifier, Deteksi Spam, EmailAbstract
Deteksi spam pada email merupakan komponen penting dalam sistem keamanan dan penyaringan informasi digital. Untuk meningkatkan akurasi dalam klasifikasi spam, digunakan algoritma Random Forest Classifier yang dikenal karena stabilitas dan keandalannya dalam memproses data dengan banyak fitur. Penelitian ini menerapkan teknik hyperparameter tuning menggunakan metode Random Search Cross-Validation untuk mengoptimalkan kinerja model dengan menemukan kombinasi parameter terbaik. Random Search Cross-Validation memungkinkan proses pencarian parameter yang efisien tanpa harus menguji seluruh kemungkinan seperti pada Grid Search Cross-Validation. Hasil evaluasi menunjukkan bahwa gabungan antara model Random Forest dan Hyperparameter-Tuning menggunakan metode Random Search Cross-Validation memiliki performa yang lebih tinggi dibandingkan model default, terutama dalam hal akurasi dan presisi deteksi spam. Sistem ini dapat membantu pengguna maupun organisasi dalam mengelola email secara lebih efektif dan aman.
References
S. T. Ibrahim, O. B. Adjunct Lecturer, and O. H. Part Time Lecturer, “Spam Email Detection Scheme Based On Random Forest Algorithm,” LAUJCI, 2023. (Online). Available: Www.Laujci.Lautech.Edu.Ng
M. A. Bouke, A. Abdullah, M. T. Abdullah, S. A. Zaid, H. El Atigh, And S. H. Alshatebi, “A Lightweight Machine Learning-Based Email Spam Detection Model Using Word Frequency Pattern,” Journal Of Information Technology And Computing, Vol. 4, No. 1, Pp. 15–28, Jun. 2023, Doi: 10.48185/Jitc.V4i1.653.
C. Beaman Craigbeaman, “Anomaly Detection In Emails Using Machine Learning And Header Information.”
P. Probst, M. Wright, And A.-L. Boulesteix, “Hyperparameters And Tuning Strategies For Random Forest,” Feb. 2019, Doi: 10.1002/Widm.1301.
R. Ageng, R. Faisal, And S. Ihsan, “Random Forest Machine Learning For Spam Email Classification,” Journal Of Dinda Data Science, Information Technology, And Data Analytics, Vol. 4, No. 1, Pp. 8–13, 2024, (Online). Available: Http://Journal.Ittelkom-Pwt.Ac.Id/Index.Php/Dinda
T. O. Omotehinwa And D. O. Oyewola, “Hyperparameter Optimization Of Ensemble Models For Spam Email Detection,” Applied Sciences (Switzerland), Vol. 13, No. 3, Feb. 2023, Doi: 10.3390/App13031971.
M. Sahami, S. Dumais, D. Heckerman, And E. Horvitz, “A Bayesian Approach To Filtering Junk E-Mail,” 1998. (Online). Available: Www.Aaai.Org
B. Klimt And Y. Yang, “The Enron Corpus: A New Dataset For Email Classification Research.” (Online). Available: Http://Www-2.Cs.Cmu.Edu/~Enron/.
N. Al-Shanableh Mazen, S. Alzyoud, And E. Nashnush, “ENHANCING EMAIL SPAM Detection Through Ensemble Enhancing Email Spam Detection Through Ensemble Machine Learning: A Comprehensive Evaluation Of Machine Learning: A Comprehensive Evaluation Of Model Integration And Performance Model Integration And Performance Part Of The Management Information Systems Commons.” (Online). Available: Https://Scholarworks.Lib.Csusb.Edu/Ciima
A. Kosmopoulos, G. Paliouras, and I. Androutsopoulos, “Adaptive Spam Filtering Using Only Naive Bayes Text Classifiers.” (Online). Available: http://www.aueb.gr/users/ion/
M. A. Ghani and A. Subekti, “Email Spam Filtering Dengan Algoritma Random Forest,” IJCIT (Indonesian Journal on Computer and Information Technology, vol. 3, no. 2, pp. 216–221, 2018.
J. Bergstra, J. B. Ca, and Y. B. Ca, “Random Search for Hyper-Parameter Optimization Yoshua Bengio,” 2012. (Online). Available: http://scikit-learn.sourceforge.net.
G. M. Sai, K. 1#, K. Eswar, T. 2#, D. Harshavardhan, and R. 3#, “Study of SPAM Email Detection.”