Information retrieval and extraction from forums, complaints and technical reviews

dc.contributor.advisor	Orlando, Salvatore	it_IT
dc.contributor.author	Quintavalle, Bruno <1966>	it_IT
dc.date.accessioned	2019-05-31	it_IT
dc.date.accessioned	2020-02-24T06:38:48Z
dc.date.available	2020-02-24T06:38:48Z
dc.date.issued	2019-07-18	it_IT
dc.identifier.uri	http://hdl.handle.net/10579/15586
dc.description.abstract	Complaints and technical reviews often describe complex problems, most of the times in very articulated ways. Over that kind of corpora, we are considering here three classical tasks: Information Retrieval, Text Classification and Information Extraction. In this context however, these tasks should take into special consideration the structure of the sentence, with special attention to verbal phrases, as complaints are usually descriptions of actions that have been performed whilst they shouldn’t (or the other way around). We want to leverage results from traditional NLP tasks like Semantic Role Labeling and Dependency Parsing, but also to employ the most recent advances in the field of Word and Sentence Embedding. Moreover, Semantic Web technologies should be employed when background knowledge is required. In order to deal with these three heterogeneous approaches, a particular implementation of the SPARQL query language has been developed. It provides a language for template extraction that seamlessly mixes the state of the art of the above-mentioned tasks. Its main difference from SPARQL is the ability to deal with similarity and uncertainty. However, its syntax is strictly the same, simplifying the integration with OWL ontologies and allowing its use as an endpoint for other engines in a federated query context. The case studies illustrated here focuses mainly on problems related to telecommunication companies, using publicly available corpora and forums threads extracted from the web. However, the language has been designed to be used in any context that requires extracting information from user generated corpora of complex technical descriptions.	it_IT
dc.language.iso	en	it_IT
dc.publisher	Università Ca' Foscari Venezia	it_IT
dc.rights	© Bruno Quintavalle, 2019	it_IT
dc.title	Information retrieval and extraction from forums, complaints and technical reviews	it_IT
dc.title.alternative		it_IT
dc.type	Doctoral Thesis	it_IT
dc.degree.name	Informatica	it_IT
dc.degree.level	Dottorato di ricerca	it_IT
dc.degree.grantor	Dipartimento di Scienze Ambientali, Informatica e Statistica	it_IT
dc.description.academicyear	Dottorati, 2018/2019, sessione ESTIVA_01-07-19	it_IT
dc.description.cycle	31
dc.degree.coordinator	Focardi, Riccardo
dc.location.shelfmark	D001964
dc.location	Venezia, Archivio Università Ca' Foscari, Tesi Dottorato
dc.rights.accessrights	openAccess	it_IT
dc.thesis.matricno	761617	it_IT
dc.format.pagenumber	XI, 98 p.
dc.subject.miur	INF/01 INFORMATICA	it_IT
dc.description.note		it_IT
dc.degree.discipline		it_IT
dc.contributor.co-advisor		it_IT
dc.date.embargoend		it_IT
dc.provenance.upload	Bruno Quintavalle (761617@stud.unive.it), 2019-05-31	it_IT
dc.provenance.plagiarycheck	Salvatore Orlando (orlando@unive.it), 2019-07-01	it_IT