Thomas Liebig

Thomas Liebig
Materna SE · Artificial Intelligence Unit

Dr.

About

81
Publications
33,465
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
936
Citations
Citations since 2017
42 Research Items
666 Citations
2017201820192020202120222023020406080100
2017201820192020202120222023020406080100
2017201820192020202120222023020406080100
2017201820192020202120222023020406080100
Introduction
Thomas Liebig is head of Data Analytics & AI at Materna SE. Previously he worked at the TU Dortmund University – Artificial Intelligence Unit after receiving a Ph.D. in AI/ML from the University of Bonn and Fraunhofer IAIS.
Additional affiliations
January 2013 - present
Technische Universität Dortmund
Position
  • PostDoc Position
October 2010 - October 2012
University of Bonn
Position
  • Researcher
November 2006 - December 2012
Fraunhofer Institute for Intelligent Analysis and Information Systems IAIS
Position
  • Researcher

Publications

Publications (81)
Preprint
Our study reveals new theoretical insights into over-smoothing and feature over-correlation in deep graph neural networks. We show the prevalence of invariant subspaces, demonstrating a fixed relative behavior that is unaffected by feature transformations. Our work clarifies recent observations related to convergence to a constant state and a poten...
Preprint
Over-squashing and over-smoothing are two critical issues, that limit the capabilities of graph neural networks (GNNs). While over-smoothing eliminates the differences between nodes making them indistinguishable, over-squashing refers to the inability of GNNs to propagate information over long distances, as exponentially many node states are squash...
Chapter
Popular graph neural networks are shallow models, despite the success of very deep architectures in other application domains of deep learning. This reduces the modeling capacity and leaves models unable to capture long-range relationships. The primary reason for the shallow design results from over-smoothing, which leads node states to become more...
Chapter
Sustainability is the current global challenge. This is reflected in the demand for healthy food and CO neutrality. These challenges can be met with the industrial cultivation of algae: Algae can be used as food supplements, nutraceuticals, pharmaceuticals, fuel, CO sinks, and obtain high relative yield density per area. Current limitations in thei...
Preprint
Full-text available
Data privacy and decentralised data collection has become more and more popular in recent years. In order to solve issues with privacy, communication bandwidth and learning from spatio-temporal data, we will propose two efficient models which use Differential Privacy and decentralized LSTM-Learning: One, in which a Long Short Term Memory (LSTM) mod...
Preprint
Support Vector Machines have been successfully used for one-class classification (OCSVM, SVDD) when trained on clean data, but they work much worse on dirty data: outliers present in the training data tend to become support vectors, and are hence considered "normal". In this article, we improve the effectiveness to detect outliers in dirty training...
Preprint
Forecasting future states of sensors is key to solving tasks like weather prediction, route planning, and many others when dealing with networks of sensors. But complete spatial coverage of sensors is generally unavailable and would practically be infeasible due to limitations in budget and other resources during deployment and maintenance. Current...
Preprint
Full-text available
Data protection regulations like the GDPR or the California Consumer Privacy Act give users more control over the data that is collected about them. Deleting the collected data is often insufficient to guarantee data privacy since it is often used to train machine learning models, which can expose information about the training data. Thus, a guaran...
Preprint
Full-text available
There has been a growing interest in Machine Unlearning recently, primarily due to legal requirements such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act. Thus, multiple approaches were presented to remove the influence of specific target data points from a trained model. However, when evaluating the succes...
Preprint
Popular graph neural networks are shallow models, despite the success of very deep architectures in other application domains of deep learning. This reduces the modeling capacity and leaves models unable to capture long-range relationships. The primary reason for the shallow design results from over-smoothing, which leads node states to become more...
Article
While probabilistic graphical models are a central tool for reasoning under uncertainty in AI, they are in general not as expressive as deep neural models, and inference is notoriously hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs) capture joint distributions and ensure tractable inference, but still lack...
Article
Full-text available
The development and implementation of artificial intelligence (AI) applications in health care contexts is a concurrent research and management question. Especially for hospitals, the expectations regarding improved efficiency and effectiveness by the introduction of novel AI applications are huge. However, experiences with real-life AI use cases a...
Chapter
Due to IoT and Industry 4.0, more and more data is collected by sensor nodes, which send their data to a central data lake. This approach results in high data traffic and privacy risk, which we want to address in this paper. Therefore we use an existing Learning from Label Proportions (LLP) algorithm, to use the decentralized properties and extend...
Preprint
Occupied truck parking lots regularly cause hazardous situations. Estimation of current parking lot state could be utilized to provide drivers parking recommendations. In this work, we highlight based on a simulation scenario, how sparse observations, as obtained by a mobile application could be utilized to estimate parking lot occupancy. Our simul...
Chapter
We study the applicability of blockchain technology for distributed event detection under resource constraints. Therefore we provide a test-suite with several promising consensus methods (Proof-of-Work, Proof-of-Stake, Distributed Proof-of-Work, and Practical Proof-of-Kernel-Work). This is the first work analyzing the communication costs of blockch...
Article
Full-text available
The exploitation of vehicles as mobile sensors acts as a catalyst for novel crowdsensing-based applications such as intelligent traffic control and distributed weather forecast. However, the massive increases in Machine-type Communication (MTC) highly stress the capacities of the network infrastructure. With the system-immanent limitation of resour...
Preprint
Full-text available
Bayesian networks are a central tool in machine learning and artificial intelligence, and make use of conditional independencies to impose structure on joint distributions. However, they are generally not as expressive as deep learning models and inference is hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs)...
Preprint
Full-text available
The exploitation of vehicles as mobile sensors acts as a catalyst for novel crowdsensing-based applications such as intelligent traffic control and distributed weather forecast. However, the massive increases in Machine-type Communication (MTC) highly stress the capacities of the network infrastructure. With the system-immanent limitation of resour...
Preprint
Full-text available
We study the applicability of blockchain technology for distributed event detection under resource constraints. Therefore we provide a test-suite with several promising consensus methods (Proof-of-Work, Proof-of-Stake, Distributed Proof-of-Work, and Practical Proof-of-Kernel-Work). This is the first work analyzing the communication costs of blockch...
Preprint
With the proliferation of electric vehicles, the electrical distribution grids are more prone to overloads. In this paper, we study an intelligent pricing and power control mechanism based on contextual bandits to provide incentives for distributing charging load and preventing network failure. The presented work combines the microscopic mobility s...
Preprint
Full-text available
While the development of fully autonomous vehicles is one of the major research fields in the Intelligent Transportation Systems (ITSs) domain, the upcoming longterm transition period - the hybrid vehicular traffic - is often neglected. However, within the next decades, automotive systems with heterogeneous autonomy levels will share the same road...
Article
Full-text available
The nature of contemporary spatial data infrastructures lies in the provision of geospatial information in an on-demand fashion. Although recent applications identified the need to react to real-time information in a time-critical way, research efforts in the field of geospatial Internet of Things in particular have identified substantial gaps in t...
Conference Paper
Full-text available
With the proliferation of electric vehicles, the electrical distribution grids are more prone to overloads. In this paper, we study an intelligent pricing and power control mechanism based on contextual bandits to provide incentives for distributing charging load and preventing network failure. The presented work combines the microscopic mobility s...
Preprint
Full-text available
The nature of contemporary Spatial Data Infrastructures lies in the provision of geospatial information in an on-demand fashion. Though recent applications identified the need to react to real-time information in a time-critical way. In particular, research efforts in the field of geospatial Internet of Things have identified substantial gaps in th...
Preprint
Full-text available
While cars were only considered as means of personal transportation for a long time, they are currently transcending to mobile sensor nodes that gather highly up-to-date information for crowdsensing-enabled big data services in a smart city context. Consequently, upcoming 5G communication networks will be confronted with massive increases in Machin...
Article
Full-text available
Upcoming Intelligent Traffic Control Systems (ITSCs) will base their optimization processes on crowdsensing data obtained for cars that are used as mobile sensor nodes. In conclusion, public cellular networks will be confronted with massive increases in Machine-Type Communication (MTC) and will require efficient communication schemes to minimize th...
Article
Full-text available
Upcoming 5G-based communication networks will be confronted with huge increases in the amount of transmitted sensor data related to massive deployments of static and mobile Internet of Things (IoT) systems. Cars acting as mobile sensors will become important data sources for cloud-based applications like predictive maintenance and dynamic traffic f...
Conference Paper
Autonomous robots need to perceive and represent their environments and act accordingly. Using simultaneous localization and mapping (SLAM) methods, robots can build maps of the environment which are efficient for localization and path planning as long as the environment remains unchanged. However, facility logistics environments are not static bec...
Conference Paper
Full-text available
Situation-aware route planning gathers increasing interest. The proliferation of various sensor technologies in smart cities allows the incorporation of real-time data and its predictions in the trip planning process. We present a system for individual multi-modal trip planning that incorporates predictions of future public transport delays in rout...
Conference Paper
Full-text available
Urban areas are increasingly subject to congestions. Most navigation systems and algorithms that avoid these congestions consider drivers independently and can, thus, cause novel congestions at unexpected places. Pre-computation of optimal trips (Nash equilibrium) could be a solution to the problem but is due to its static nature of no practical re...
Conference Paper
Full-text available
Route planning makes direct use of geographic data and provides beneficial recommendations to the public. In real-world the schedule of transit vehicles is dynamic and delays in the schedules occur. Incorporation of these dynamic schedule changes in multi-modal route computation is difficult and requires a lot of computational resources. Our approa...
Conference Paper
In this demo we present INSIGHT, a system that provides traffic event detection in Dublin by exploiting Big Data and Crowdsourcing techniques. Our system is able to process and analyze input from multiple heterogeneous urban data sources.
Chapter
Full-text available
Since the last decades the availability and granularity of location-based data has been rapidly growing. Besides the proliferation of smartphones and location-based social networks, also crowdsourcing and voluntary geographic data led to highly granular mobility data, maps and street networks. In result, location-aware, smart environments are creat...
Chapter
Full-text available
Proliferation of pervasive devices capturing sensible data streams, e.g. mobility records, raise concerns on individual privacy. Even if the data is aggregated at a central server, location data may identify a particular person. Thus, the transmitted data must be guarded against re-identification and an un-trusted server. This paper overcomes limit...
Article
Full-text available
Modern traffic management should benefit from the diverse sensors, smart phones, and social networks data that offer the potential of enhanced services. In disaster scenarios, it is no longer guaranteed that a central server and reliable communication is always available. This motivates a distributed computing setting with restricted communication....
Article
Full-text available
Individual multi-modal trip planning is a major task in transportation science. With increasing availability of new means of transportation personal constraints (e.g. elevator phobia or fear of flying) and preferences (e.g. train over bus) gain higher impact. Existing trip planners are mostly based on static time-tables and roadnetwork data. Furthe...
Conference Paper
Full-text available
We present a novel approach for event recognition in massive streams of heterogeneous data driven by privacy policies and big data event processing. New technologies in mobile computing combined with sensing infrastructures distributed in a city or country are generating massive, poly-structured spatio-temporal data. With a view on emergencies and...
Conference Paper
We give an overview of an intelligent urban traffic management system. Complex events related to congestions are detected from heterogeneous sources involving fixed sensors mounted on intersections and mobile sensors mounted on public transport vehicles. To deal with data veracity, sensor disagreements are resolved by crowdsourcing. To deal with da...
Article
Full-text available
Analysis of people's movements represented by continuous sequences of spatio-temporal data tuples have received lots of attention in recent years. The focus of those studies was mostly GPS data recorded on a constant sample rate. However, the creation of intelligent location-aware models and environments also requires reliable localization in indoo...
Conference Paper
Full-text available
We consider a city where induction-based vehicle count sensors are installed at some, but not all street junctions. Each sensor regularly outputs a count and a saturation value. We first use a discrete time Gauss-Markov model based on historical data to predict the evolution of these saturation values, and then a Gaussian Process derived from the s...
Conference Paper
Full-text available
Smart route planning gathers increasing interest as cities become crowded and jammed. We present a system for individual trip planning that incorporates future traffic hazards in routing. Future traffic conditions are computed by a Spatio-Temporal Random Field based on a stream of sensor readings. In addition, our approach estimates traffic flow in...
Conference Paper
Full-text available
Situation dependent route planning gathers increasing interest as cities become crowded and jammed. We present a system for individual trip planning that incorporates future traffic hazards in routing. Future traffic conditions are computed by a Spatio-Temporal Random Field based on a stream of sensor readings. In addition, our approach estimates t...
Article
Full-text available
With the emergence of the mobile app ecosystem, user location data has escaped the grip of the tightly regulated telecommunication industry and is now being collected at unprecedented scale and accuracy by mobile advertising, platform, and app providers. This position paper is based on discussions of the authors at the Dagstuhl seminar on Mobility...
Conference Paper
Full-text available
Understanding the spatial and temporal aspects of activities in urban regions is one of the key challenges for the emerging fields of urban computing and emergency management as it provides indispensable insights on the quality of services in urban environments and helps to describe the socio-dynamics of urban districts. This work presents a novel...
Conference Paper
Full-text available
Pedestrian quantity estimation receives increasing attention and has important applications, e.g. in location evaluation and risk analysis. In this work, we focus on pedestrian quantity estimation for event monitoring. We address the problem (1) how to estimate quantities for unmeasured locations, and (2) where to place a bounded number of sensors...
Conference Paper
Full-text available
In street-based mobility mining, traffic volume estimation receives increasing attention as it provides important applications such as emergency support systems, quality-of-service evaluation and billboard placement. In many real world scenarios, empirical measurements are usually sparse due to some constraints. On the other hand, pedestrians gener...
Article
Full-text available
Continuing advances in modern data acquisition techniques result in rapidly growing amounts of geo-referenced data about moving objects and in emergence of new data types. We define episodic movement data as a new complex data type to be considered in the research fields relevant to data analysis. In episodic movement data, position measurements ma...
Conference Paper
Full-text available
Recently evolved Bluetooth tracking technology is currently applied to extract individual pathways, movement patterns or to rank popularity of locations by their visitor quantities. To utilize this technology for the creation of location aware intelligent environments, the next steps are to come up with microscopic traffic values. This work propose...
Conference Paper
Emergence of Bluetooth tracking technology for event monitoring is currently applied to extract individual pathways, movement patterns or to rank popularity of locations by their visitor quantities. The next steps are to achieve short term movement predictions, to understand people's motivations and to come up with microscopic traffic values. This...
Conference Paper
Full-text available
In recent times, consumer research at major social events received significant interest by organizing companies. Understanding the movements and motivations of the customers enables new business strategies and is needed to minimize the risk of investment. The spatiotemporal complexity of major events poses high demands on survey and analytical meth...
Conference Paper
During the past years the first tools for visual analysis of trajectory data appeared. Considering the growing sizes of trajectory collections, one important task is to ensure user interactivity during data analysis. In this paper we present a fast, model-based visualization approach for the analysis of location dependencies in large trajectory col...
Conference Paper
Full-text available
Traffic routes through a street network contain patterns and are no random walks. Such patterns exist for instance along streets or between neighbouring street segments. The extraction of these patterns is a challenging task due to the enormous size of city street networks, the large number of required training data and the unknown distribution of...