Information retrieval and extraction from forums, complaints and technical reviews

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Orlando, Salvatore it_IT
dc.contributor.author Quintavalle, Bruno <1966> it_IT
dc.date.accessioned 2019-05-31 it_IT
dc.date.accessioned 2020-02-24T06:38:48Z
dc.date.available 2020-02-24T06:38:48Z
dc.date.issued 2019-07-18 it_IT
dc.identifier.uri http://hdl.handle.net/10579/15586
dc.description.abstract Complaints and technical reviews often describe complex problems, most of the times in very articulated ways. Over that kind of corpora, we are considering here three classical tasks: Information Retrieval, Text Classification and Information Extraction. In this context however, these tasks should take into special consideration the structure of the sentence, with special attention to verbal phrases, as complaints are usually descriptions of actions that have been performed whilst they shouldn’t (or the other way around). We want to leverage results from traditional NLP tasks like Semantic Role Labeling and Dependency Parsing, but also to employ the most recent advances in the field of Word and Sentence Embedding. Moreover, Semantic Web technologies should be employed when background knowledge is required. In order to deal with these three heterogeneous approaches, a particular implementation of the SPARQL query language has been developed. It provides a language for template extraction that seamlessly mixes the state of the art of the above-mentioned tasks. Its main difference from SPARQL is the ability to deal with similarity and uncertainty. However, its syntax is strictly the same, simplifying the integration with OWL ontologies and allowing its use as an endpoint for other engines in a federated query context. The case studies illustrated here focuses mainly on problems related to telecommunication companies, using publicly available corpora and forums threads extracted from the web. However, the language has been designed to be used in any context that requires extracting information from user generated corpora of complex technical descriptions. it_IT
dc.language.iso en it_IT
dc.publisher Università Ca' Foscari Venezia it_IT
dc.rights © Bruno Quintavalle, 2019 it_IT
dc.title Information retrieval and extraction from forums, complaints and technical reviews it_IT
dc.title.alternative it_IT
dc.type Doctoral Thesis it_IT
dc.degree.name Informatica it_IT
dc.degree.level Dottorato di ricerca it_IT
dc.degree.grantor Dipartimento di Scienze Ambientali, Informatica e Statistica it_IT
dc.description.academicyear Dottorati, 2018/2019, sessione ESTIVA_01-07-19 it_IT
dc.description.cycle 31
dc.degree.coordinator Focardi, Riccardo
dc.location.shelfmark D001964
dc.location Venezia, Archivio Università Ca' Foscari, Tesi Dottorato
dc.rights.accessrights openAccess it_IT
dc.thesis.matricno 761617 it_IT
dc.format.pagenumber XI, 98 p.
dc.subject.miur INF/01 INFORMATICA it_IT
dc.description.note it_IT
dc.degree.discipline it_IT
dc.contributor.co-advisor it_IT
dc.date.embargoend it_IT
dc.provenance.upload Bruno Quintavalle (761617@stud.unive.it), 2019-05-31 it_IT
dc.provenance.plagiarycheck Salvatore Orlando (orlando@unive.it), 2019-07-01 it_IT


Files in this item

This item appears in the following Collection(s)

Show simple item record