icafe – informal team seminar

Upcoming Talks

Title:  TBA

Speaker: TBA

Place and Time: TBA

Abstract: TBA

Former Talks

Title:  Web Bias Monitoring

Speaker: Théo JAMMES-BEUVE, Thomas LE FLOCH and Olivier MEYER

Place and Time: Rennes – Lipari (F202) and 30 June 2020

Title:  The Thing’in platform

Speaker: Maria Massri

Place and Time: Rennes -Aurigny(D165) and 13 March 2020


The Thing’in platform (www.thinginthefuture.com) is an open platform initiated by Orange, which proposes to operators, object manufacturers, object owners and service developers to cooperate together in the birth of the web of things. This platform allows to understand the context in which each object evolves thanks to the repository of connected objects coming from different universes. The evolution of an object results in the evolution of its relationships and interactions with other objects over time which can be naturally handled by a temporal and graph-oriented data storage.
Although graph databases have extensively found applications in the relationship centered era, a time-version support is seldom provided. For instance, current systems capture the most recently updated snapshot of the underlying graph, whilst the analysis and prediction of temporal behaviors imply the persistence of every graph element’s history. Since physical deletions are forbidden in such a scenario, the outgrowing data volume is a crippling restriction steering the interest in this area towards the optimization of the persistent storage. In this PhD thesis, we are aiming to deliver a storage and querying system that is capable of optimizing both space and query’s computation time costs.
Tomorrow, I will be presenting the rationale behind this work, the anterior academic work posited in this area with its limitations and possible solutions.

Title:  Public transportation

Speaker:  Gauthier Lyan

Place and Time: Rennes -Aurigny(D165) and 03 March 2020


“Nowadays, climate change has become an actual issue to address for both scientists and politicians. If the former can prove to the later that there actually are solutions to reduce the impact of human activities on the climate, the later cannot easily act without appropriate tools that facilitate choices on what to act on.
Public transportation systems are wide and complex, involving many stakeholders and heterogeneous factors that have an impact on their efficiency, hence global impact. We will propose a software approach that offer the possibility to study public transportation systems both in temporal and spatial dimensions, offering predictions of commercial speed in known and unknown environment, based on historical data and available exogenous data. The purpose of this research is to enable decision-makers to make better decisions about public transportation.”

We will discuss the data sources we already/should have, and if our assumptions about them make sense or not.

We will discuss the framework we are being imagining.

Title:  Privacy and ethical issues of AI in legal systems

Speaker:  Louis Béziaud

Place and Time: Rennes -Aurigny(D165) and 18 February 2020

Title:  From databases to artificial intelligence

Speaker:  Zoltan Miklos

Place and Time: Rennes -Aurigny(D165) and 11 February 2020


Repetition for the HDR presentation.

Title:  Modeling uncertainty and inaccuracy on data from crowdsourcing platforms: MONITOR

Speaker:  Constance Thierry

Place and Time: Rennes -Aurigny(D165) and 21 January 2020 (visio from Lannion)


Repetition for the EGC2020 presentation.

Links: Paper on HAL / EGC2020

Title:  Building metro map of scientific topics using hierarchy alignments

Speaker:  Ian Jeantet

Place and Time: Rennes -Aurigny(D165) and 14 January 2020


Presentation of the my joint work done with the Griffith University during my mobility in Australia. I’ll explain how we ended up to build metro maps of scientific topics to study the evolution of science through time.

Title:  Feedback from the Shonan Meeting on Crowdsourcing/Future of Work

Speaker:  David Gross-Amblard

Place and Time: Rennes -Aurigny(D165) and 17 December 2019

Title:  Crowdsourcing the database course with HEADWORK

Speaker:  Adrien Wacquet  (2019 Summer Internship)

Place and Time: Rennes -Aurigny(D165) and 03 December 2019

Title:  Web crawler & and the DIFFIX attack

Speaker:  Antonin Voyez

Place and Time: Rennes -Aurigny(D165) and 19 November 2019


Presentation of a web crawler made for the PROFILE project and a short presentation of the current work done for my upcoming thesis with ENEDIS : linear reconstruction applied to the DIFFIX system.

Title:  The anonymization of personal data: myth, limits, and successes

Speaker:  Tristan Allard,  Joris Duguépéroux, Tompoariniaina Andriamilanto

Place and Time: Rennes -Oleron(A008) and 12 March 2019

Link: Privacy Games @ Festival des Libertés Numériques 

Title: Overlapping hierarchical clustering

Speaker:  Ian Jeantet

Place and Time: Rennes -Oleron(A008) and 12 February 2019


Agglomerative clustering methods have been widely used by many research communities to explore hierarchical structures in their data. The produced cluster hierarchies contribute to understanding the hierarchical structures that are present in complex data. However the agglomerative methods necessarily result in a tree structure, where one has to make a split decision too early in the construction process, that can affect the conclusions one can make about the obtained hierarchical structure. In various settings, one needs a richer hierarchical structure to describe the clusters of the data. Moreover, clusters might also overlap. In this paper, We propose a framework that enables to compute hierarchical structures represented as directed acyclic graphs rather than trees. Our bottom-up method creates clusters with density-based merging criteria, such that the various clusters can overlap.

Title: Integrating uncertain data using user feedback in crowdsourcing applications

Speaker:  Marion Tommasi

Place and Time: Rennes -Oleron(A008) and  22 January 2019


Crowdsourcing applications are used in many domains to perform tasks which are difficult for computers or to gather knowledge using a crowd of people. To execute a task in a crowdsourcing application, human workers by performing some micro-tasks and the resulting data is integrated into the system to proceed with the completion of the global task. However, the data provided by workers is uncertain as human workers can make mistakes or eventually intentionally give a wrong result. We want to use the feedback of other workers to evaluate the trust in the data at any time of the workflow. Ultimately, we want to use this trust to have a workflow which adapts itself depending on the data ant the perceived trust in it to improve data quality. I will first present a model for crowdsourcing applications then present the model for user feedback.

Title: Data-Centric workflow for Complex Crowdsourcing Applications

Speaker:  Rituraj Singh

Place and Time: Rennes -Oleron(A008) and 15 January 2019


Crowdsourcing has emerged as a major paradigm for accomplishing work by paying a small sum of money and alluring the worker whole across the globe. However, the targeted tasks at crowdsourcing platforms are relatively simple, uncomplicated and are independent. In this work, we propose a novel data-centric workflow model for the design of complex crowdsourcing tasks with dependencies. The model allows orchestration of simple tasks, handles data and crowd workers, allows concurrency, and in addition provides high-level constructs allowing decomposition of complex tasks into orchestrations of simpler subtasks. We first define the syntax and semantics of the model, and then consider its formal properties, starting with the question of termination of a complex workflow (i.e., whether a system has non-terminating runs). Unsurprisingly termination is undecidable even for the simplest models. However, upon restrictions that are sensible in the context of crowdsourcing (namely that a crowd worker only has a bounded number of contributions in a workflow ), termination becomes decidable. We then extend the termination question to address the correctness of a workflow, i.e. the question of whether a terminating workflow always satisfies a constraint depicted in terms of the relation between the input of the workflow and its output.

Comments are closed.