Random forest implementation for classification analysis: default predictions applied to Italian companies

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Casarin, Roberto it_IT
dc.contributor.author Tramontin, Davide <1992> it_IT
dc.date.accessioned 2020-07-12 it_IT
dc.date.accessioned 2020-09-24T12:05:13Z
dc.date.available 2020-09-24T12:05:13Z
dc.date.issued 2020-07-30 it_IT
dc.identifier.uri http://hdl.handle.net/10579/17720
dc.description.abstract The growing importance of big data and the increased environment complexity have led to an increase in the implementation machine learning algorithms, given their ability to efficiently deal with entangled situations. This study contributes to the framework regarding the application of random forests and other machine learning algorithms. Specifically, the topic of research is company failure and probability of default. The major impact that the firm’s default has on businesses, markets, and societies, underlines the importance of developing models which predict the probability of default. This research attempts to address this topic with two purposes: create an accurate binary model to classify companies in Defaulted and Non-Defaulted; identify the most important predictors in order to understand the links between the financial ratios considered and the companies’ status. Random forests’ ability to deal with big data sets and with various and diverse predictors have led to choosing this algorithm to analyze the topic of research. Building on a literature review of decision trees, random forests, company failure, and the models which predict the probability of default, this study’s analysis is constructed through several experiments which permit to tune the model appropriately and construct the final model which provide the highest accuracy. Through its cross-sectional analysis, this research confirms random forests’ strong stability and its consistent performance. The final model generated performs well, and identifies in the coverage of fixed assets, gross profit, net working capital, cost of debt, debt to equity ratio, leverage, solvency ratio, and return on assets, the most important default predictors. Finally, the results and methods applied have been jointly used to extend the purpose of this research. In order to permit further development of this study and of research on random forest and machine learning, an R programming code which permits to reproduce the computations carried out is provided. Importantly, the designed function is applicable to any data set to permit the analysis of different topics as well and provides a visual representation of the results through a Shiny App, permitting an easier interpretation of results. it_IT
dc.language.iso en it_IT
dc.publisher Università Ca' Foscari Venezia it_IT
dc.rights © Davide Tramontin, 2020 it_IT
dc.title Random forest implementation for classification analysis: default predictions applied to Italian companies it_IT
dc.title.alternative Random Forest Implementation for Classification Analysis: Default Predictions Applied to Italian Companies it_IT
dc.type Master's Degree Thesis it_IT
dc.degree.name Global development and entrepreneurship it_IT
dc.degree.level Laurea magistrale it_IT
dc.degree.grantor Dipartimento di Economia it_IT
dc.description.academicyear 2019/2020 - Sessione Estiva it_IT
dc.rights.accessrights openAccess it_IT
dc.thesis.matricno 860126 it_IT
dc.subject.miur SECS-P/07 ECONOMIA AZIENDALE it_IT
dc.description.note it_IT
dc.degree.discipline it_IT
dc.contributor.co-advisor it_IT
dc.date.embargoend it_IT
dc.provenance.upload Davide Tramontin (860126@stud.unive.it), 2020-07-12 it_IT
dc.provenance.plagiarycheck Roberto Casarin (r.casarin@unive.it), 2020-07-27 it_IT


Files in this item

This item appears in the following Collection(s)

Show simple item record