EXTENDED PRELIMINARY PROGRAM

All times are in Eastern European Summer Time (EEST) time zone.

24 August, 2021

  Room 1019 Room 1018
8:30-9:00 Registration
9:00-11:00
Doctoral Consortium
Nicolas Ringuet Challenges in lifelong pathways recommendation

Xu Teng and Goce Trajcevski Semantically diverse constrained queries

Dániel Varga, János Márk Szalai-Gindl and Sándor Laki The descriptiveness of feature descriptors with reduced dimensionality

Vladimir Ivković and Ivan Luković An Approach to Validation of Business-Oriented Smart Contracts Based on Process Mining
CAoNS
Johannes Kastner and Peter M. Fischer Scalable and Explainable User Role Detection in Social Media

Georgios Stoupas and Antonis Sidiropoulos Multi-dimensional Ranking via Majorization

Jennifer Neumann and Peter M. Fischer Inferring Missing Retweets in Twitter Information Cascades
11:00-11:30 Coffee Break
11:30-12:30
SIMPDA
Robert Wrembel Data Warehouse and Data Lake Technologies: Selected Challenges still to be Researched

Maurice Van Keulen Process innovations for breast cancer care

Gabriel Marques Tavares and Sylvio Barbon Junior Process Mining Encoding via Meta-Learning for an Enhanced Anomaly Detection

Anahita Farhang Ghahfarokhi, Gyunam Park, Alessandro Berti and Wil M.P. van der Aalst OCEL: A Standard for Object-Centric Event Logs
CAoNS
Keynote talk by Ginestra Bianconi
12:30-13:30 Lunch
13:30-15:30
DOING*
Genoveva Vargas-Solar, José-Luis Zechinelli-Martini, Javier A. Espinosa-Oviedo, and Luis M. Vilches-Blázquez LACLICHEV: Exploring the History of Climate Change in Latin America within Newspapers Digital Collections

Ana Sodré, Dimmy Magalhães, Luis Floriano, Aurora Pozo, Carmem Hara and Sidnei Machado COVID-19 Portal: Machine Learning Techniques Applied to the Analysis of Judicial Processes Related to the Pandemic

Ciro Medeiros, Umberto Costa, and Martin A. Musicante Standard Matching-Choice Expressions for Defining Path Queries in Graph Databases
MADEISD
Sidra Aslam, Michael Mrissa A RESTful Privacy-aware and Mutable Decentralized Ledger

Tomasz Dziubich, Jan Cychnerski Segmentation quality refinement in large-scale medical image dataset with crowd-sourced annotations

Jan Cychnerski and Tomasz Dziubich Process of medical dataset construction for machine learning – multifield study and guidelines

William Steingartner, Valerie Novitzka Natural Semantics for Domain-Specific Language
15:30-16:00 Coffee Break
16:00-17:30
DOING
Ciro Medeiros, Martin Musicante and Mirian Halfeld-Ferrari The Formal-Language-Constrained Graph Minimization Problem

Tatiane Lautert, Nádia P. Kozievitch, Ismael Villanueva-Miranda and Monika Akbar Public Health Units – Exploratory Analysis for Decision Support

Rufat Babayev and Lena Wiese Interpreting Decision-Making Process for Multivariate Time Series Classification
MegaData
Sadi Alawadi, Victor R. Kebande, Yuji Dong, Joseph Bugeja, Jan A. Persson and Carl Magnus Olsson A Federated Interactive Learning IoT-based Health Monitoring Platform

Iver Toft Tomter and Weihai Yu Augmenting SQLite for Local-First Software

Keynote talk by Dr Srijith Rajamohan Machine Learning Pipelines
18:00- Reception

* – starts at 14:00

25 August, 2021

  Room 1019
8:30-9:00 Registration
9:00-9:15 Opening
9:15-10:15 Keynote 1: Divesh Srivastava
Topic: Towards High-Quality Big Data: A Focus on Time
(Session chair: Ladjel Bellatreche)
10:15-10:45 Coffee break
10:45-12:30
Session 1 Patterns and Events (Session chair: Boris Novikov)
Witold Andrzejewski and Pawel Boinski Maximal Mixed-Drove Co-occurrence Patterns

Dickson Odhiambo Owuor and Anne Laurent Efficiently mining large gradual patterns using chunked storage layout

Yihong Zhang, Masumi Shirakawa, and Takahiro Hara A General Method for Event Detection on Social Media

Siraj Mohammed, Fekade Getahun and Richard Chbeir 5W1H Aware Framework for Representing and Detecting Real Events from Multimedia Digital EcoSystem
12:30-13:30 Lunch
13:30-14:50
Session 2 Social Media and Text Mining (Session chair: Yannis Manolopoulos)
Room 1018
Short paper session 1 Database Internals and Processes
(Session chair: Ahmed Awad)
Abderrazek Azri, Cecile Favre, Nouria Harbi, Jerome Darmont, and Camille Nous MONITOR: A Multimodal Fusion Framework to Assess Message Veracity in Social Networks

Pegdwende N. Sawadogo, Jerome Darmont, and Camille Nous Joint Management and Analysis of Textual Documents and Tabular Data within the AUDAL Data Lake

Markus Endres, Lena Rudenko, and Dominik Groninger Aggregation and Summarization of Thematically Similar Twitter Microblog Messages
Daniel Lindner, Alexander Löser, and Jan Kossmann Learned What-If Cost Models for Autonomous Clustering

Martin Kappel, Stefan Jablonski, and Stefan Schonig Cost-sensitive Predictive Business Process Monitoring

Gajendra Doniparthi, Timo Mühlhaus, and Stefan Deßloch A Hybrid Data Model and Flexible Indexing for Interactive Exploration of Large-scale Bio-Science Data

Alexis I. Aspauza Lescano and Robson L. F. Cordeiro Relational Conditional Set Operations
14:50-15:20 Coffee break
15:20-16:30 Keynote 2: Sanjay Chawla
Topic: A perspective on prescriptive and reinforcement learning
(Session chair: Panagiotis Karras )
16:30-18:00 Steering Committee meeting (Chair: Yannis Manolopoulos)
18:00-19:00 Excursion in University of Tartu Art Museum
19:00-22:00 Conference dinner

26 August, 2021

  Room 1019
8:30-9:00 Registration
9:00-10:15 Keynote 3: Dirk Draheim
Topic: Data exchange for Digital Government: Where are we heading?
(Session chair: Marlon Dumas)
10:15-10:45 Coffee break
10:45-12:30
Session 3 Indexes, Queries, and Constraints
(Session chair: Divesh Srivastava)
Kevin Wellenzohn, Luka Popovic, Michael Böhlen, and Sven Helmer Inserting Keys into the Robust Content-and-Structure (RCAS) Index

Chiara Forresi, Matteo Francia, Enrico Gallinucci, and Matteo Golfarelli Optimizing execution plans in a multistore

Stefan Brass and Mario Wenzel Integrity Constraints for Microcontroller Programming in Datalog

Maksim Goman Chance Constraint as a Basis for Probabilistic Query Model
12:30-13:30 Lunch
13:30-14:50
Session 4 High-dimensional Data and Data Streams
(Session chair: Dirk Draheim)
Room 1018
Short paper session 2 Complex Data (Session chair: Raimundas Matulevičius)
Arnab Chakrabarti, Abhijeet Das, Michael Cochez, Christoph Quix Unsupervised Feature Selection for Efficient Exploration of High Dimensional Data

Annabelle Gillet, Eric Leclercq, and Nadine Cullot MuLOT: Multi-level optimization of the canonical polyadic tensor decomposition at large-scale

Vasile-Marian Scuturici, Benjamin Chazelle, Pierre-Loic Maisonneuve, Ammar Mechouche, Jean-Marc Petit From Large Time Series to Patterns Movies: Application to Airbus Helicopters Flight Data
Wenqin Dong, Eric W Lee, Vicki Stover Hertzberg, Roy L Simpson, and Joyce C Ho GASP: Graph-based Approximate Sequential Pattern Mining for Electronic Health Records

Andrius Barauskas, Agnė Brilingaitė, Linas Bukauskas, Vaida Čeikutė, Alminas Čivilis, and Simonas Šaltenis Semi-Synthetic Data and Testbed for Long-Distance E-Vehicle Routing

Raj Ratn Pranesh, Mehrdad Farokhnejad, Ambesh Shekhar, and Genoveva Vargas-Solar Looking for COVID-19 misinformation in multilingual social media texts

Ilia Triapitcin, Ajantha Dahanayake, and Bernhard Thalheim Semantic Discovery from Sensors and Image Data for Real-Time Spatio-Temporal Emergency Monitoring
14:50-15:20 Coffee break
15:20-17:00
Session 5 Data Integration (Session chair: Riccardo Tommasini)
Laıs Soares Caldeira, Guilherme Dal Bianco, and Anderson A. Ferreira Experimental Evaluation among Reblocking Techniques Applied to the Entity Resolution

Tobias Zeimetz and Ralf Schenkel FiLiPo: A Sample Driven Approach for Finding Linkage Points between RDF Data and APIs

Jing Zhang, Bonggun Shin, Jinho D. Choi, and Joyce C Ho SMAT: An attention-based deep learning solution to the automation of schema matching

Souad Ghazouani, Anis Tissaoui, and Richard Chbeir Towards a Cloud-WSDL Metamodel: A new extension of WSDL for Cloud Service Description
17:00-17:15 Closing