Hyperparameters tuning of random forest with harmony search in credit scoring

Correct identification of defaulters and non-defaulters in the lending industry is a crucial task for financial institutions. Credit scoring is a tool utilized for credit granting decisions. Recently, Random Forest (RF) is actively researched in credit scoring due to two main benefits, i.e. non-para...

Full description

Saved in:
Bibliographic Details
Main Authors: Goh, Rui Ying, Lee, Lai Soon, Adam, Mohd. Bakri
Format: Article
Language:English
Published: Academy of Sciences Malaysia 2019
Online Access:http://psasir.upm.edu.my/id/eprint/80123/1/Hyperparameters%20tuning%20of%20random%20forest%20with%20harmony%20search%20in%20credit%20scoring.pdf
http://psasir.upm.edu.my/id/eprint/80123/
https://www.akademisains.gov.my/asmsj/article/hyperparameters-tuning-of-random-forest-with-harmony-search-in-credit-scoring/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Correct identification of defaulters and non-defaulters in the lending industry is a crucial task for financial institutions. Credit scoring is a tool utilized for credit granting decisions. Recently, Random Forest (RF) is actively researched in credit scoring due to two main benefits, i.e. non-parametric flexibility to account for various data patterns with good classification ability and the computed features importance that can explain the attributes. Hyperparameters tuning is a necessary procedure to ensure good performance of a RF. This paper proposes the use of a metaheuristic, Harmony Search (HS), to form a hybrid HS-RF to conduct hyperparameters tuning. A Modified HS (MHS) is also proposed, forming MHS-RF, for effective yet efficient search of the RF hyperparameters. Along with parallel computing, MHS-RF effectively reduces the computational efforts of the hyperparameters tuning procedure. The proposed hybrid models are benchmarked with standard statistical models on the Lending Club peer-to-peer lending dataset. The computational results show that a well-tuned RF have better performance than statistical models, with MHS-RF reported the best performance yet being the most efficient in hyperparameters tuning of RF.