Training Convolutional Neural Networks with Imbalanced Datasets: an Empirical Evaluation on the WildCat Dataset

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Vascon, Sebastiano it_IT
dc.contributor.author Mahammadli, Rafail <1996> it_IT
dc.date.accessioned 2023-02-27 it_IT
dc.date.accessioned 2023-05-23T12:50:43Z
dc.date.available 2023-05-23T12:50:43Z
dc.date.issued 2023-03-16 it_IT
dc.identifier.uri http://hdl.handle.net/10579/22812
dc.description.abstract This thesis aims to exploit deep learning and computer vision techniques in wildlife monitoring. In zoology, wildlife monitoring is a crucial task for keeping track of species movement patterns and demographics. One technique to track wild animals consists of capturing images with spatio-temporal tags to later recover species movements in space and time. Such images are usually acquired through camera traps in the wild or from veterinarians or zoologists when wild animals get injured or found dead. This thesis focuses on the latter case, and uses convolutional neural networks (CNN) to automatically classify three categories of cats: European Wildcat, Domestic Cats and Hybrid Cats. The architectures employed here are Deep Residual Learning Architecture (Resnet 18/50) and Very Deep Convolutional Networks Architecture (VGG16). The aforementioned CNNs models are trained with a dataset of images, named wildcat dataset, collected from various researchers around the globe and made available to me during the internship at Fuorisentiero. Despite image classification being a task in which we saw significant advances thanks to deep convolutional architecture, classes imbalance is still a problem that considerably affects their performances. The wildcat dataset belongs to the category of imbalanced sets, with a class distribution strongly biased toward the wildcat class. In this work, we explored different methodologies to deal with datasets' classes imbalances in the context of CNN. We considered, analyzed and evaluated the following methods: Over/Under-sampling and Cost Sensitive Loss. We further report other methods in the literature, such as the Synthetic Minority Oversampling Technique(SMOTE). The experiments showed that such methods are of primary importance for CNN's performance, being able to considerably improve their accuracy. it_IT
dc.language.iso it_IT
dc.publisher Università Ca' Foscari Venezia it_IT
dc.rights © Rafail Mahammadli, 2023 it_IT
dc.title Training Convolutional Neural Networks with Imbalanced Datasets: an Empirical Evaluation on the WildCat Dataset it_IT
dc.title.alternative it_IT
dc.type Master's Degree Thesis it_IT
dc.degree.name Informatica - computer science it_IT
dc.degree.level Laurea magistrale it_IT
dc.degree.grantor Dipartimento di Scienze Ambientali, Informatica e Statistica it_IT
dc.description.academicyear 2021/2022 - appello sessione straordinaria it_IT
dc.rights.accessrights closedAccess it_IT
dc.thesis.matricno 877254 it_IT
dc.subject.miur it_IT
dc.description.note it_IT
dc.degree.discipline it_IT
dc.contributor.co-advisor it_IT
dc.date.embargoend it_IT
dc.provenance.upload Rafail Mahammadli (877254@stud.unive.it), 2023-02-27 it_IT
dc.provenance.plagiarycheck Sebastiano Vascon (sebastiano.vascon@unive.it), 2023-03-06 it_IT


Files in this item

This item appears in the following Collection(s)

Show simple item record