Query log based techniques to improve the performance of a web search engine

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Orlando, Salvatore
dc.contributor.author Broccolo, Daniele <1984> it_IT
dc.date.accessioned 2014-06-30T07:25:34Z
dc.date.available 2014-06-30T07:25:34Z
dc.date.issued 2014-03-18
dc.identifier.uri http://hdl.handle.net/10579/4635
dc.description.abstract Every user leaves traces of her/his behaviour when she/he surfs the Web. All the usage data generated by users is stored in logs of several web applications, and such logs can be used to extract useful knowledge for enhancing and improving performance of online services. Also Search Engines (SEs) store usage information in so-called query logs, which can be used in different ways to improve the SE user experience. In this thesis we focus on improving the performance of a SE, in particular its effectiveness and efficiency, through query log mining. We propose to enhance the performance of SEs by discussing a novel Query Recommender System. We prove that is possible to decrease the length of a user's query session by unloading the SE of part of the queries that the user submits in order to refine his initial search. This approach helps the user find what she/he is searching in a shorter period of time, while at the same time decreasing the number of queries that the SE must process, and thus decreasing the overall server load. We also discuss how to enhance the SE efficiency by optimizing the use of its computational resources. The knowledge extracted from a query log is used to dynamically adjust the query processing method by adapting the pruning strategy to the SE load. In particular query logs permit to build a regressive model used to predict the response time for any query, when different pruning strategies are applied during query processing. The prediction is used to ensure a minimum quality of service when the system is heavily loaded, by trying to process the various enqueued queries by a given deadline. Our study also addresses the problem of the effectiveness of query results by comparing their quality when dynamic pruning is adopted to reduce the query processing times. Finally, we also study how response times and results vary when, in presence of high loads, processing is either interrupted after a fixed time threshold elapses or dropped completely. Moreover, we introduce a novel query dropping strategy based on the same query performance predictors discussed above. it_IT
dc.language.iso eng it_IT
dc.publisher Università Ca' Foscari Venezia it
dc.rights © Daniele Broccolo, 2014 it_IT
dc.subject Web search engine it_IT
dc.subject Information retrieval it_IT
dc.subject Query log mining it_IT
dc.subject Web mining it_IT
dc.subject Efficiency it_IT
dc.subject Query recommender system it_IT
dc.subject Query difficulty prediction it_IT
dc.title Query log based techniques to improve the performance of a web search engine it_IT
dc.type Doctoral Thesis en
dc.degree.name Informatica it_IT
dc.degree.level Dottorato di ricerca it
dc.degree.grantor Scuola di dottorato in Scienze e tecnologie (SDST) it_IT
dc.description.academicyear 2014 it_IT
dc.description.cycle 25 it_IT
dc.degree.coordinator Focardi, Riccardo
dc.location.shelfmark D001355 it
dc.location Venezia, Archivio Università Ca' Foscari, Tesi Dottorato it
dc.rights.accessrights openAccess it_IT
dc.thesis.matricno 955758 it_IT
dc.format.pagenumber XVI, 115 p. it_IT
dc.subject.miur INF/01 INFORMATICA it_IT
dc.description.tableofcontent 0) Introduction 1) Web Search 2) Distributed Architectures 3) On Query Logs 4) Search Shortcut 5) Load-Sensitive Selective Pruning 6) Highly Loaded Search Engine it_IT
dc.identifier.bibliographiccitation Broccolo, Daniele. "Query log based techniques to improve the performance of a web search engine", Università Ca’ Foscari Venezia, tesi di dottorato, 25. ciclo, 2014 it_IT


Files in this item

This item appears in the following Collection(s)

Show simple item record