Exploring the Data Wilderness through Examples

Davide Mottin, Matteo Lissandrini, Themis Palpanas, Yannis Velegrakis


Exploration is one of the primordial ways to accrue knowledge about the world and its nature. As we accumulate, mostly automatically, data at unprecedented volumes and speed, our datasets have become complex and hard to understand. In this context exploratory search provides a handy tool for progressively gather the necessary knowledge by starting from a tentative query that hopefully leads to answers at least partially relevant and that can provide cues about the next queries to issue. An exploratory query should be simple enough to avoid complicate declarative languages (such as SQL) and mechanisms, and at the same time retain the flexibility and expressiveness of such languages. Recently, we have witnessed a rediscovery of the so called example-based methods, in which the user, or the analyst circumvent query languages by using examples as input. This shift in semantics has led to a number of methods receiving as query a set of example members of the answer set. The search system then infers the entire answer set based on the given examples and any additional information provided by the underlying database. In this tutorial, we present an excursus over the main example-based methods for exploratory analysis. We show how different data types require different techniques, and present algorithms that are specifically designed for relational, textual, and graph data. We conclude by providing a unifying view of this query-paradigm and identify new exciting research directions.

The tutorial has be presented on Sunday, June 30th, at SIGMOD'19 See the official program.

The content of this tutorial has been expanded in a book:
Find out more!

You can also visit the official website: data-exploration.ml


Exploring the Data Wilderness through Examples.”
Proceedings of the 42Nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2019 (pp. 1411-1412).

 author = {Mottin, Davide and Lissandrini, Matteo and Velegrakis, Yannis and Palpanas, Themis},
 title = {Exploring the Data Wilderness Through Examples},
 booktitle = {Proceedings of the 2019 International Conference on Management of Data},
 series = {SIGMOD '19},
 year = {2019},
 isbn = {978-1-4503-5643-5},
 location = {Amsterdam, Netherlands},
 pages = {2031--2035},
 numpages = {5},
 url = {http://doi.acm.org/10.1145/3299869.3314031},
 doi = {10.1145/3299869.3314031},
 acmid = {3314031},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {data exploration, database usability, example-based methods, exploratory search, graph exploration},