About
24
Publications
318
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
62
Citations
Introduction
Current institution
Additional affiliations
April 2024 - April 2025
March 2021 - March 2024
Education
October 2021 - October 2024
October 2019 - September 2021
October 2016 - October 2019
Publications
Publications (24)
Legal texts are regulations specified in a general and abstract way: For specific situations, they determine the (legal) consequences of the situation on the basis of certain conditions that must or must not be met. Large Language Models (LLMs) are powerful tools for analyzing texts, but are prone to hallucinations and do not provide an explanation...
An agent providing an information retrieval service may work with a corpus of text documents. The documents in the corpus may contain annotations such as Subjective Content Descriptions (SCD) -- additional data associated with different sentences of the documents. Each SCD is associated with multiple sentences of the corpus and has relations among...
Let us consider the following scenario: A human is working with a corpus of text documents. In this corpus, the human needs to know documents with similar content and highlight relevant locations in the retrieved documents. An information system displaying the contents of the corpus and providing an information retrieval agent will help the human....
An agent in pursuit of a task may work with a corpus containing text documents. To perform information retrieval on the corpus, the agent may need annotations—additional data associated with the documents. Subjective Content Descriptions (SCDs) provide additional location-specific data for text documents. SCDs can be estimated without additional su...
Inference in dynamic probabilistic models is a complex task involving expensive operations. In particular, for Hidden Markov Models, the whole state space has to be enumerated for advancing in time. Even states with negligible probabilities are considered, resulting in computational inefficiency and increased noise due to the propagation of unlikel...
An agent providing an information retrieval service may work with a corpus of text documents. The documents in the corpus may contain annotations such as Subjective Content Descriptions (SCDs) — additional data associated with different sentences of the documents. Each SCD is associated with multiple sentences of the corpus and has relations among...
The growing practice of archiving research data in repositories reflects an upward trend. However, storing data in an RDR (Research Data Repository) does not guarantee that the archived data will always be readily reusable, even if this fulfils the FAIR (Findable, Accessible, Interoperable Reusable) principles. To ensure sustainable RDM (Research D...
Eine Vielzahl der im allgemeinen Sprachgebrauch als „KI-System“ bezeichneten Programme verarbeitet natürliche Sprache. Zum einen wird häufig natürliche Sprache als Medium für die Aufgabenbeschreibung eingesetzt und zum anderen sind die zu verarbeitenden Daten (sowohl Trainingsdaten als auch zB zu durchsuchende Textdokumente) und die generierten Aus...
Humanities scholars can use Large Language Models (LLMs) to simplify text analysis and pattern recognition. Fine-tuning LLMs for specific humanities tasks can be challenging due to limited training data. However, in the humanities exists a growing number of information systems with research data which can be used for this purpose. This article outl...
An agent in pursuit of a task may work with a corpus containing text documents. To perform information retrieval on the corpus, the agent internally maintains a model of the documents in the corpus. This model may contain annotations such as Subjective Content Descriptions (SCD)—additional data associated with different sentences of documents. In o...
This paper addresses the possibility to use a variety of sophisticated Artificial Intelligence (AI) driven technologies with a humanoid service robot without the need to have a deeper knowledge of the respective AI technology. Application interfaces are used that are capable to exchange data between the robot and highly complex AI services running...
Conceptually, an agent perceives its environment through sensors, builds a set of models, and then uses these models to select an appropriate action to fulfill its goals. As long as an agent is embodied by a robot, even humans that are not familiar with the concept of an agent, are more likely aware of the presence of an individual, independent of...
An agent in pursuit of a task may work with a corpus of documents with linked subjective content descriptions. Performing the task of document retrieval for a user or aiming to extend its own corpus, an agent so far relies on similarity measures to identify related documents. However, similarity may not be appropriate if looking for new information...
An agent pursuing a task may work with a corpus of documents as a reference library. Subjective content descriptions (SCDs) provide additional data that add value in the context of the agent’s task. In the pursuit of documents to add to the corpus, an agent may come across new documents where content text and SCDs from another agent are interleaved...
An agent in pursuit of a task may work with a reference library containing documents associated with additional data that provide location-specific explanations about the content. Faced with a new document, an agent has to decide whether to include the new document in its reference library. Basing the decision on words, topics, or entities has show...
A reference library can be described as a corpus of an individual composition of documents. Over time, the corpus might grow because an agent decides to extend its corpus with additional documents, e.g., new publications, or new articles. Existing approaches use topic modelling techniques to compare documents with each other within the same corpus...
An agent in pursuit of a task may work with a corpus of documents with linked subjective content descriptions. Faced with a new document, an agent has to decide whether to include that document in its corpus or not. Basing the decision on only words, topics, or entities, has shown to not lead to a balanced performance for varying documents. Therefo...