Alternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platform

dc.contributor.author Dağ, Hasan
dc.contributor.author Kiyakoğlu, Burhan Yasin
dc.contributor.author Rezaeinazhad, Arash Mohammadian
dc.contributor.author Korkmaz, Halil Ergun
dc.contributor.author Dağ, Hasan
dc.contributor.other Management Information Systems
dc.date.accessioned 2021-02-19T18:52:27Z
dc.date.available 2021-02-19T18:52:27Z
dc.date.issued 2019
dc.description.abstract With the bloom of financial technology and innovations aiming to deliver a high standard of financial services, banks and credit service companies, along with other financial institutions, use the most recent technologies available in a variety of ways from addressing the information asymmetry, matching the needs of borrowers and lenders, to facilitating transactions using payment services. In the long list of FinTechs, one of the most attractive platforms is the Peer-to-Peer (P2P) lending which aims to bring the investors and borrowers hand in hand, leaving out the traditional intermediaries like banks. The main purpose of a financial institution as an intermediary is of controlling risk and P2P lending platforms innovate and use new ways of risk assessment. In the era of Big Data, using a diverse source of information from spending behaviors of customers, social media behavior, and geographic information along with traditional methods for credit scoring prove to have new insights for the proper and more accurate credit scoring. In this study, we investigate the machine learning techniques on big data platforms, analyzing the credit scoring methods. It has been concluded that on a HDFS (Hadoop Distributed File System) environment, Logistic Regression performs better than Decision Tree and Random Forest for credit scoring and classification considering performance metrics such as accuracy, precision and recall, and the overall run time of algorithms. Logistic Regression also performs better in time in a single node HDFS configuration compared to a non-HDFS configuration. en_US
dc.identifier.citationcount 3
dc.identifier.doi 10.1109/UBMK.2019.8907113 en_US
dc.identifier.endpage 734 en_US
dc.identifier.isbn 978-172813964-7
dc.identifier.scopus 2-s2.0-85076215629 en_US
dc.identifier.scopusquality N/A
dc.identifier.startpage 731 en_US
dc.identifier.uri https://hdl.handle.net/20.500.12469/3960
dc.identifier.wos WOS:000609879900138 en_US
dc.identifier.wosquality N/A
dc.institutionauthor Hindistan, Yavuz Selim en_US
dc.institutionauthor Kiyakoğlu, Burhan Yasin en_US
dc.institutionauthor Rezaeinazhad, Arash Mohammadian en_US
dc.institutionauthor Korkmaz, Halil Ergun en_US
dc.institutionauthor Daǧ, Hasan en_US
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.journal UBMK 2019 - Proceedings, 4th International Conference on Computer Science and Engineering en_US
dc.relation.publicationcategory Kitap Bölümü - Uluslararası en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.scopus.citedbyCount 5
dc.subject Big data en_US
dc.subject Credit Risk Scoring en_US
dc.subject Crowd-funding en_US
dc.subject Hadoop en_US
dc.subject Machine Learning en_US
dc.subject P2P en_US
dc.subject Peer-to-Peer lending en_US
dc.title Alternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platform en_US
dc.type Book Part en_US
dc.wos.citedbyCount 4
dspace.entity.type Publication
relation.isAuthorOfPublication e02bc683-b72e-4da4-a5db-ddebeb21e8e7
relation.isAuthorOfPublication.latestForDiscovery e02bc683-b72e-4da4-a5db-ddebeb21e8e7
relation.isOrgUnitOfPublication ff62e329-217b-4857-88f0-1dae00646b8c
relation.isOrgUnitOfPublication.latestForDiscovery ff62e329-217b-4857-88f0-1dae00646b8c

Files