Downloads
Download


This work is licensed under a Creative Commons Attribution 4.0 International License.
Research on Optimization of Naive Bayes Algorithm for Spam Filtering
With the rapid development of the Internet, email as an important communication tool faces increasingly severe spam problems. This paper proposes a series of optimization and improvement solutions targeting several key issues in the application of traditional Naive Bayes algorithm for spam filtering. The research first systematically analyzes the theoretical foundation of Naive Bayes algorithm in text classification and its applicability in spam identification, thoroughly investigating main factors affecting algorithm performance such as data imbalance, feature selection, and high-dimensional sparsity. Building on this foundation, the paper presents comprehensive improvement strategies from four dimensions: data preprocessing, feature selection, model structure, and parameter optimization, including innovative solutions like improved information gain feature selection methods, dynamic weight adjustment mechanisms, and semi-Naive Bayes models. Through comparative experimental validation, the optimized algorithm achieves significant performance enhancement in classification, effectively reducing misjudgment rates while improving identification capability for new types of spam. This research not only provides more effective technical solutions for spam filtering but also offers valuable references for extending Naive Bayes algorithm to other text classification tasks.
References
- Han X. Application Research of Bayesian Optimization in Spam Filtering. Journal of Xuzhou Institute of Technology (Natural Sciences Edition) 2023; 38(2): 77–83.
- Peng G. A Review of Research on Spam Filtering Based on Naive Bayes Algorithm. Computer Knowledge and Technology 2020; 16(14): 244–245+247.
- Long J, Shen XH, Ding XJ, et al. Quantum Genetic Algorithm Optimized Weighted Naive Bayes Composite Language Text Classification. Journal of University of Jinan (Natural Science Edition) 2022; 36(2): 136–141.
- Ma YM, Wu DF. Research on Audit Data Sharing Mechanism Based on Blockchain and Naive Bayes Algorithm. Chinese Certified Public Accountant 2025; (05): 57–64.
- Ma YL. Intelligent Classification Method of Power Grid Construction Resources Based on Naive Bayes Algorithm. Popular Utilization of Electricity 2024; 39(09): 66–67.
- Zeng JQ. Research on Improved Instance Weighted Naive Bayes Algorithm. Master’s Thesis. East China University of Technology, Chongqing, China, 2024.
- Zhang K. Design of Data Filtering Engine for Networked Audit System Based on Naive Bayes Algorithm. Electronic Technology 2024; 53(05): 208–209.
Supporting Agencies
- Funding: This research received no external funding.