ML-based Approach for Credit Risk Assessment Using Parallel Calculations

Hentosh, L.; Tsikalo, Y.; Kustra, N.; Kutucu, H.

ML-based Approach for Credit Risk Assessment Using Parallel Calculations

dc.contributor.author	Hentosh, L.
dc.contributor.author	Tsikalo, Y.
dc.contributor.author	Kustra, N.
dc.contributor.author	Kutucu, H.
dc.date.accessioned	2024-09-29T16:22:40Z
dc.date.available	2024-09-29T16:22:40Z
dc.date.issued	2022
dc.department	Karabük Üniversitesi	en_US
dc.description	3rd International Workshop on Computational and Information Technologies for Risk-Informed Systems, CITRisk 2022 -- 12 January 2023 -- Virtual, Online -- 189474	en_US
dc.description.abstract	In banks and other credit organizations, the task of credit scoring often arises when making decisions on granting loans. The last one consists of making a reasoned decision based on information about the applicant, whether she should be granted a loan, and, if so, on what terms. This paper proposes the application of parallel calculations of the Random forest algorithm when solving the credit scoring task. This approach made it possible to reduce the time of model training and dataset processing significantly. Expectedly, when applying less data, the resulting acceleration and efficiency worsen. Using only 2500 entries, the execution time of the sequential algorithm is less than the parallel algorithm. The developed software was tested on three different processors: 4-core, 8-core, and 12-core, to evaluate the parallelization quality of data pre-processing. The classification algorithm is computationally complex and time-consuming, so we obtained practically the same acceleration for processing 5000 and 10000 records. With this amount of data, the 12-core processor gave the biggest gain in time when working with 12 threads. As a result, it is possible to have an acceleration of more than 6. This efficiency indicator of the proposed parallel algorithm can be significantly improved by varying the number of threads and considering the current trends in developing the multi-core architecture of computing systems. Also, using data without pre-processing, the following evaluation metrics were obtained: AUC=0.9 and Precision=0.845, and using data after pre-processing, these metrics were: AUC=0.86, Precision=0.89. © 2022 Copyright for this paper by its authors.	en_US
dc.identifier.endpage	173	en_US
dc.identifier.issn	1613-0073
dc.identifier.scopus	2-s2.0-85163841243	en_US
dc.identifier.scopusquality	N/A	en_US
dc.identifier.startpage	161	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.14619/10212
dc.identifier.volume	3422	en_US
dc.indekslendigikaynak	Scopus	en_US
dc.language.iso	en	en_US
dc.publisher	CEUR-WS	en_US
dc.relation.ispartof	CEUR Workshop Proceedings	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	acceleration	en_US
dc.subject	classification task	en_US
dc.subject	Credit scoring	en_US
dc.subject	parallel algorithm	en_US
dc.subject	Random forest	en_US
dc.title	ML-based Approach for Credit Risk Assessment Using Parallel Calculations	en_US
dc.type	Conference Object	en_US

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

ML-based Approach for Credit Risk Assessment Using Parallel Calculations

Dosyalar

Koleksiyon