Downloads

Xu, T. . (2024). Credit Risk Assessment Using a Combined Approach of Supervised and Unsupervised Learning. Journal of Computational Methods in Engineering Applications, 4(1), 1–12. https://doi.org/10.62836/jcmea.v4i1.040105

Credit Risk Assessment Using a Combined Approach of Supervised and Unsupervised Learning

In the financial industry, credit scoring is a crucial tool for assessing credit risk. The study aims to enhance the accuracy and reliability of credit scoring by combining supervised and unsupervised learning methods. We propose an integrated model that combines Kohonen's Self-Organizing Maps (SOM) with the Random Forest algorithm to provide a more comprehensive analysis of credit card user data. Key features for model training were identified through feature selection and extraction. Experimental results show that the integrated model improved the AUC from 0.82 to 0.89, increased user satisfaction from a score of 3.8 to 4.35, and boosted usage rates by 12.5%. Additionally, the integrated model significantly enhanced the discrimination and prediction accuracy of user credit risk. These findings indicate that the combination of unsupervised learning with Kohonen's Self-Organizing Maps and supervised learning with Random Forest can effectively improve the accuracy of credit scoring, providing financial institutions with a more reliable tool for credit risk assessment.

credit risk assessment; integrated model; supervised and unsupervised learning; predictive performance

References

  1. Thudumu S, Branch P, Jin J, Singh J. A Comprehensive Survey of Anomaly Detection Techniques for High Dimensional Big Data. Journal of Big Data 2020; 7: 1–30. DOI: https://doi.org/10.1186/s40537-020-00320-x
  2. Golbayani P, Florescu I, Chatterjee R. A Comparative Study of Forecasting Corporate Credit Ratings Using Neural Networks, Support Vector Machines, and Decision Trees. The North American Journal of Economics and Finance 2020; 54: 101251. DOI: https://doi.org/10.1016/j.najef.2020.101251
  3. Wang G, Ma J, Huang L, Xu K. Two Credit Scoring Models Based on Dual Strategy Ensemble Trees. Knowledge-Based Systems 2012; 26: 61–68. DOI: https://doi.org/10.1016/j.knosys.2011.06.020
  4. Liu L. A Self-Learning BP Neural Network Assessment Algorithm for Credit Risk of Commercial Bank. Wireless Communications & Mobile Computing 2022; 2022(2): 1–10. DOI: https://doi.org/10.1155/2022/9650934
  5. Angelini E, Di Tollo G, Roli A. A Neural Network Approach for Credit Risk Evaluation. The Quarterly Review of Economics and Finance 2008; 48(4): 733–755. DOI: https://doi.org/10.1016/j.qref.2007.04.001
  6. Bao W, Lianju N, Yue K. Integration of Unsupervised and Supervised Machine Learning Algorithms for Credit Risk Assessment. Expert Systems with Applications 2019; 128: 301–315. DOI: https://doi.org/10.1016/j.eswa.2019.02.033
  7. Yao Y. Application of Artificial Intelligence in Smart Cities: Current Status, Challenges and Future Trends. International Journal of Computer Science and Information Technology 2024; 2(2): 324–333.
  8. Yao Y. Digital Government Information Platform Construction: Technology, Challenges and Prospects. International Journal of Social Sciences and Public Administration 2024; 2(3): 48–56. DOI: https://doi.org/10.62051/ijsspa.v2n3.06
  9. Demirgüç-Kunt A, Klapper LF. Measuring Financial Inclusion: The Global Findex Database. World Bank Policy Research Working Paper No. 6025 2012. DOI: 10.1596/1813-9450-6025. DOI: https://doi.org/10.1596/1813-9450-6025
  10. Xia Y, Liu S, Yu Q, Deng L, Zhang Y, Su H, Zheng K. Parameterized Decision-Making with Multi-Modal Perception for Autonomous Driving. 2023. arXiv:2312.11935.
  11. Zhang Y, Yang K, Wang Y, Yang P, Liu X. Speculative ECC and LCIM Enabled NUMA Device Core. In Proceedings of the 2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS), Chengdu, China, 7–9 July 2023. DOI: https://doi.org/10.1109/ISCTIS58954.2023.10213102
  12. Qiu L, Liu M. Innovative Design of Cultural Souvenirs Based on Deep Learning and CAD. Computer-Aided Design and Applications 2024. DOI:10.14733/cadaps.2024.S14.237-251 DOI: https://doi.org/10.14733/cadaps.2024.S14.237-251
  13. Liu M, Li Y. Numerical Analysis and Calculation of Urban Landscape Spatial Pattern. In Proceedings of the 2nd International Conference on Intelligent Design and Innovative Technology (ICIDIT 2023), Chengdu, China, 4–6 August 2023.
  14. Jamshidi EJ, Yusup Y, Kayode JS, Kamaruddin MA. Detecting Outliers in a Univariate Time Series Dataset Using Unsupervised Combined Statistical Methods: A Case Study on Surface Water Temperature. Ecological Informatics 2022; 69: 101672. DOI: https://doi.org/10.1016/j.ecoinf.2022.101672
  15. Óskarsdóttir M, Bravo C, Sarraute C, Vanthienen J, Baesens B. The Value of Big Data for Credit Scoring: Enhancing Financial Inclusion Using Mobile Phone Data and Social Network Analytics. Applied Soft Computing 2019; 74: 26–39. DOI: https://doi.org/10.1016/j.asoc.2018.10.004
  16. Demir S, Şahin EK. Liquefaction Prediction with Robust Machine Learning Algorithms (SVM, RF, and XGBoost) Supported by Genetic Algorithm-Based Feature Selection and Parameter Optimization from the Perspective of Data Processing. Environmental Earth Sciences 2022; 81(18): 459. DOI: https://doi.org/10.1007/s12665-022-10578-4
  17. Shaikhina T, Lowe D, Daga S, Briggs D, Higgins R, Khovanova N. Decision Tree and Random Forest Models for Outcome Prediction in Antibody Incompatible Kidney Transplantation. Biomedical Signal Processing and Control 2019; 52: 456–462. DOI: https://doi.org/10.1016/j.bspc.2017.01.012
  18. Huijben IA, Nijdam AA, Overeem S, Van Gilst MM, Van Sloun R. Som-Cpc: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series. In Proceedings of the International Conference on Machine Learning 2023, 23–29 July 2023, Hawaii, USA.
  19. Norton M, Uryasev S. Maximization of Auc and Buffered Auc in Binary Classification. Mathematical Programming 2019; 174: 575–612. DOI: https://doi.org/10.1007/s10107-018-1312-2