Dr. Jakob Suchan
Constructor University Bremen gGmbH · Computer Science and Electrical Engineering

About

56
Publications
12,704
Reads
368
Citations
Introduction
My research lies at the intersection of AI, Computer Vision, and Human-Centred Computing. In particular, I am interested in how we can develop general methods and tools that enable autonomous systems to abstract, understand, reason about, and learn from multimodal (human) interactions, with the aim of assisting humans in their everyday personal and professional tasks.

Publications (56)
Conference Paper
Full-text available
We present a computational framework for the grounding and semantic interpretation of dynamic visuo-spatial imagery consisting of video and eye-tracking data. Driven by cognitive film studies and visual perception research, we demonstrate key technological capabilities aimed at investigating attention & recipient effects vis-a-vis the motion pictur...
Conference Paper
Full-text available
We present a system for generating and understanding dynamic and static spatial relations in robotic interaction setups. Robots describe an environment of moving blocks using English phrases that include spatial relations such as "across" and "in front of". We evaluate the system in robot-robot interactions and show that the system can robustly...
Conference Paper
Full-text available
We propose a hybrid architecture for systematically computing robust visual explanation(s) encompassing hypothesis formation, belief revision, and default reasoning with video data. The architecture consists of two tightly integrated synergistic components: (1) (functional) answer set programming based abductive reasoning with space-time tracklets...
Conference Paper
Full-text available
We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking (in the backdrop of autonomous driving). A general method for online visual sensemaking using answer set programming is systematically formalised and fully implemented. The method integrates state of the art in (deep learning bas...
Article
Full-text available
How do the limits of high-level visual processing affect human performance in naturalistic, dynamic settings of (multimodal) interaction where observers can draw on experience to strategically adapt attention to familiar forms of complexity? In this backdrop, we investigate change detection in a driving context to study attentional allocation aimed...
Article
The new DLR Institute of Systems Engineering for Future Mobility (DLR SE) opened its doors at the beginning of 2022. As the new DLR institute emerged from the former OFFIS Division Transportation, it can draw on more than 30 years of experience in the research field of safety-critical systems. With the transition to the German Aerospace Center (DLR...
Chapter
Full-text available
We address computational cognitive vision and perception at the interface of language, logic, cognition, and artificial intelligence. The chapter presents general methods for the processing and semantic interpretation of dynamic visuospatial imagery with a particular emphasis on the ability to abstract, learn, and reason with cognitively rooted str...
Conference Paper
The study of event perception emphasizes the importance of visuospatial attributes in everyday human activities and how they influence event segmentation, prediction and retrieval. Attending to these visuospatial attributes is the first step toward event understanding, and therefore correlating attentional measures to such attributes would help to...
Presentation
Full-text available
Presentation of the conference paper "Challenges in Achieving Explainability for Cooperative Transportation Systems" at the local hub of the "Second International Workshop on Requirements Engineering for Explainable Systems (RE4ES)"
Thesis
Perceptual sensemaking of dynamic visual imagery, e.g., involving semantic grounding, explanation, and learning, is central to a range of tasks where artificial intelligent systems have to make decisions and interact with humans. Towards this, commonsense characterisations of space and motion encompassing spatio-temporal relations, motion patterns,...
Conference Paper
Full-text available
We position recent and emerging research in cognitive vision and perception addressing three key questions: (1) What kind of relational abstraction mechanisms are needed to perform (explainable) grounded inference --e.g., question-answering, qualitative generalisation, hypothetical reasoning-- relevant to embodied multimodal interaction? (2) How ca...
Conference Paper
We position ongoing research aimed at developing a general framework for structured spatio-temporal learning from multimodal human behavioural stimuli. The framework and its underlying general, modular methods serve as a model for the application of integrated (neural) visuo-auditory processing and (semantic) relational learning foundations for app...
Article
We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking in the backdrop of autonomous driving. A general neurosymbolic method for online visual sensemaking using answer set programming (ASP) is systematically formalised and fully implemented. The method integrates state of the art in...
Article
Full-text available
We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking in the backdrop of autonomous driving. A general neurosymbolic method for online visual sensemaking using answer set programming (ASP) is systematically formalised and fully implemented. The method integrates state of the art in...
Preprint
We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking in the backdrop of autonomous driving. A general neurosymbolic method for online visual sensemaking using answer set programming (ASP) is systematically formalised and fully implemented. The method integrates state of the art in...
Conference Paper
Full-text available
We develop a human-centred cognitive model of visuospatial complexity in everyday, naturalistic driving conditions. With a focus on visual perception, the model incorporates quantitative, structural, and dynamic attributes identifiable in the chosen context; the human-centred basis of the model lies in its behavioural evaluation with human subject...
Preprint
We develop a human-centred, cognitive model of visuospatial complexity in everyday, naturalistic driving conditions. With a focus on visual perception, the model incorporates quantitative, structural, and dynamic attributes identifiable in the chosen context; the human-centred basis of the model lies in its behavioural evaluation with human subject...
Conference Paper
Full-text available
Semantic interpretation of dynamic visuospatial imagery calls for a general and systematic integration of methods in knowledge representation and computer vision. Towards this, we highlight research articulating & developing "deep semantics", characterised by the existence of declarative models --e.g., pertaining to "space and motion"-- and correspond...
Conference Paper
Full-text available
Within the autonomous driving domain, there is now a clear need and tremendous potential for hybrid solutions (e.g., integrating semantics, learning, visual computing) towards fulfilling essential legal and ethical responsibilities involving explainability (e.g., for diagnosis), human-centred AI (e.g., interaction design), and industrial standardis...
Preprint
We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking (in the backdrop of autonomous driving). A general method for online visual sensemaking using answer set programming is systematically formalised and fully implemented. The method integrates state of the art in (deep learning ba...
Conference Paper
Full-text available
High-level semantic interpretation of (dynamic) visual imagery calls for general and systematic methods integrating techniques in knowledge representation and computer vision. Towards this, we position "deep semantics", denoting the existence of declarative models --e.g., pertaining to "space and motion"-- and corresponding formalisation and methods s...
Preprint
Full-text available
MINDS. MOVEMENT. MOVING IMAGE. consists of two symposia held as part of the 7th International Conference on Spatial Cognition (ICSC 2018), September 10-14, 2018, Rome (Italy). Convener: Mehul Bhatt, www.icsc-rome.org. Symposium 1: Spatial Cognition and the Built Environment. Symposium 2: Visuo-Auditory Perception and the Moving Image. Speakers a...
Chapter
We present ASP Modulo ‘Space-Time’, a declarative representational and computational framework to perform commonsense reasoning about regions with both spatial and temporal components. Supported are capabilities for mixed qualitative-quantitative reasoning, consistency checking, and inferring compositions of space-time relations; these capabilities...
Preprint
We present ASP Modulo 'Space-Time', a declarative representational and computational framework to perform commonsense reasoning about regions with both spatial and temporal components. Supported are capabilities for mixed qualitative-quantitative reasoning, consistency checking, and inferring compositions of space-time relations; these capabilities...
Article
Full-text available
We propose a hybrid architecture for systematically computing robust visual explanation(s) encompassing hypothesis formation, belief revision, and default reasoning with video data. The architecture consists of two tightly integrated synergistic components: (1) (functional) answer set programming based abductive reasoning with space-time tracklets...
Conference Paper
Full-text available
We propose a deep semantic characterization of space and motion categorically from the viewpoint of grounding embodied human-object interactions. Our key focus is on an ontological model that would be amenable to formalisation from the viewpoint of commonsense knowledge representation, relational learning, and qualitative reasoning about space and mot...
Conference Paper
We propose a deep semantic characterisation of space and motion categorically from the viewpoint of grounding embodied human-object interactions. Our key focus is on an ontological model that would be amenable to formalisation from the viewpoint of commonsense knowledge representation, relational learning, and qualitative reasoning about space and mot...
Conference Paper
Full-text available
We present a commonsense, qualitative model for the semantic grounding of embodied visuo-spatial and locomotive interactions. The key contribution is an integrative methodology combining low-level visual processing with high-level, human-centred representations of space and motion rooted in artificial intelligence. We demonstrate practical applicab...
Article
We present a commonsense theory of space and motion for representing and reasoning about motion patterns in video data, to perform declarative (deep) semantic interpretation of visuo-spatial sensor data, e.g., from object tracking, eye tracking, and movement trajectories. The theory has been implemented within constraint logic programming t...
Conference Paper
In this paper we present a novel framework and full implementation of probabilistic spatial reasoning within a Logic Programming context. The crux of our approach is extending Probabilistic Logic Programming (based on distribution semantics) to support reasoning over spatial variables via Constraint Logic Programming. Spatial reasoning is formulate...
Conference Paper
Full-text available
We present an inductive spatio-temporal learning framework rooted in inductive logic programming. With an emphasis on visuo-spatial language, logic, and cognition, the framework supports learning with relational spatio-temporal features identifiable in a range of domains involving the processing and interpretation of dynamic visuo-spatial imagery....
Article
Full-text available
We present an inductive spatio-temporal learning framework rooted in inductive logic programming. With an emphasis on visuo-spatial language, logic, and cognition, the framework supports learning with relational spatio-temporal features identifiable in a range of domains involving the processing and interpretation of dynamic visuo-spatial imagery....
Chapter
Full-text available
This paper presents a computational model of the processing of dynamic spatial relations occurring in an embodied robotic interaction setup. A complete system is introduced that allows autonomous robots to produce and interpret dynamic spatial phrases (in English) given an environment of moving objects. The model unites two separate research strand...
Conference Paper
Full-text available
Evidence-based design (EBD) for architecture involves the study of post-occupancy behaviour of building users with the aim to provide an empirical basis for improving building performance [Hamilton and Watkins 2009]. Within EBD, the high-level, qualitative analysis of the embodied visuo-locomotive experience of representative groups of building use...
Conference Paper
This research is driven by visuo-spatial perception focussed cognitive film studies, where the key emphasis is on the systematic study and generation of evidence that can characterise and establish correlates between principles for the synthesis of the moving image, and its cognitive (e.g., embodied visuo-auditory, emotional) recipient effects on o...
Article
The evidence-based analysis of people's navigation and wayfinding behaviour in large-scale built-up environments (e.g., hospitals, airports) encompasses the measurement and qualitative analysis of a range of aspects including people's visual perception in new and familiar surroundings, their decision-making procedures and intentions, the affordance...
Article
Full-text available
We present a general theory and corresponding declarative model for the embodied grounding and natural language based analytical summarisation of dynamic visuo-spatial imagery. The declarative model --encompassing spatio-linguistic abstractions, image schemas, and a spatio-temporal feature based language generator-- is modularly implemented within...
Conference Paper
Full-text available
This paper presents a computational model of the processing of dynamic spatial relations occurring in an embodied robotic interaction setup. A complete system is introduced that allows autonomous robots to produce and interpret dynamic spatial phrases (in English) given an environment of moving objects. The model unites two separate research strand...
Conference Paper
Full-text available
We propose a commonsense theory of space and motion for the high-level semantic interpretation of dynamic scenes. The theory provides primitives for commonsense representation and reasoning with qualitative spatial relations, depth profiles, and spatio-temporal change; these may be combined with probabilistic methods for modelling and hypothesising...
Article
Full-text available
We position a narrative-centred computational model for high-level knowledge representation and reasoning in the context of a range of assistive technologies concerned with "visuo-spatial perception and cognition" tasks. Our proposed narrative model encompasses aspects such as "space, events, actions, change, and interaction" from the viewpoin...
Article
Full-text available
We construe smart meeting cinematography with a focus on professional situations such as meetings and seminars, possibly conducted in a distributed manner across socio-spatially separated groups. The basic objective in smart meeting cinematography is to interpret professional interactions involving people, and automatically produce dynamic recordin...
Conference Paper
Spatial assistance systems designed to empower people in smart environments need to perceive their operational environment, recognize activities performed in the environment, and reason about the observed information in order to plan a course of action. Activities performed by humans are spatio-temporal interactions between a subject, objects, and...
