Tese

Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases

Sem resumo.

Autor principal: Oliveira, Péricles Silva de
Outros Autores: http://lattes.cnpq.br/9559422309114150
Grau: Tese
Idioma: por
Publicado em: Universidade Federal do Amazonas 2017
Assuntos:
Acesso em linha: http://tede.ufam.edu.br/handle/tede/5806
id oai:https:--tede.ufam.edu.br-handle-:tede-5806
recordtype dspace
spelling oai:https:--tede.ufam.edu.br-handle-:tede-58062017-08-23T05:04:04Z Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases Oliveira, Péricles Silva de Silva, Altigran Soares da http://lattes.cnpq.br/9559422309114150 http://lattes.cnpq.br/3405503472010994 Keyword-search Match graph Relational database Ranking Candidate networks CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO Sem resumo. Several systems proposed for processing keyword queries over relational databases rely on the generation and evaluation of Candidate Networks (CNs), i.e., networks of joined database relations that, when processed as SQL queries, provide a relevant answer to the input keyword query. Although the evaluation of CNs has been extensively addressed in the literature, problems related to efficiently generating meaningful CNs have received much less attention. To generate useful CNs is necessary to automatically locating, given a handful of keywords, relations in the database that may contain relevant pieces of information, and determining suitable ways of joining these relations to satisfy the implicit information need expressed by a user when formulating her query. In this thesis, we present two main contributions related to the processing of Candidate Networks. As our first contribution, we present a novel approach for generating CNs, in which possible matchings of the query in database are efficiently enumerated at first. These query matches are then used to guide the CN generation process, avoiding the exhaustive search procedure used by current state-of-art approaches. We show that our approach allows the generation of a compact set of CNs that leads to superior quality answers, and that demands less resources in terms of processing time and memory. As our second contribution, we initially argue that the number of possible Candidate Networks that can be generated by any algorithm is usually very high, but that, in fact, only very few of them produce answers relevant to the user and are indeed worth processing. Thus, there is no point in wasting resources processing useless CNs. Then, based on such an argument, we present an algorithm for ranking CNs, based on their probability of producing relevant answers to the user. This relevance is estimated based on the current state of the underlying database using a probabilistic Bayesian model we have developed. By doing so we are able do discard a large number of CNs, ultimately leading to better results in terms of quality and performance. Our claims and proposals are supported by a comprehensive set of experiments we carried out using several query sets and datasets used in previous related work and whose results we report and analyse here. 2017-08-22T19:40:44Z 2017-04-28 Tese OLIVEIRA, Péricles Silva de. Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases. 2017. 78 f. Tese (Doutorado em Informática) - Universidade Federal do Amazonas, Manaus, 2017. http://tede.ufam.edu.br/handle/tede/5806 por Acesso Aberto http://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Universidade Federal do Amazonas Instituto de Computação Brasil UFAM Programa de Pós-graduação em Informática
institution TEDE - Universidade Federal do Amazonas
collection TEDE-UFAM
language por
topic Keyword-search
Match graph
Relational database
Ranking Candidate networks
CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO
spellingShingle Keyword-search
Match graph
Relational database
Ranking Candidate networks
CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO
Oliveira, Péricles Silva de
Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
topic_facet Keyword-search
Match graph
Relational database
Ranking Candidate networks
CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO
description Sem resumo.
author_additional Silva, Altigran Soares da
author_additionalStr Silva, Altigran Soares da
format Tese
author Oliveira, Péricles Silva de
author2 http://lattes.cnpq.br/9559422309114150
author2Str http://lattes.cnpq.br/9559422309114150
title Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
title_short Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
title_full Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
title_fullStr Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
title_full_unstemmed Generation and Ranking of Candidate Networks of Relations for Keyword Search over Relational Databases
title_sort generation and ranking of candidate networks of relations for keyword search over relational databases
publisher Universidade Federal do Amazonas
publishDate 2017
url http://tede.ufam.edu.br/handle/tede/5806
_version_ 1831969521646174208
score 11.753735