
Kevin Maik Bönisch- Bachelor of Science
- Researcher at Goethe University Frankfurt
Kevin Maik Bönisch
- Bachelor of Science
- Researcher at Goethe University Frankfurt
Annotating and visualizing large corpora by means of NLP pipelines, including LLMs, RAG and fine-tuning.
About
7
Publications
344
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
7
Citations
Introduction
Annotating and visualizing large corpora by means of NLP pipelines, including LLMs, RAG and fine-tuning.
Current institution
Publications
Publications (7)
The annotation and exploration of large text corpora, both automatic and manual, presents significant challenges across multiple disciplines, including linguistics, digital humanities, biology, and legal science. These challenges are exacerbated by the heterogeneity of processing methods, which complicates corpus visualization, interaction, and int...
We introduce a retrieval approach leveraging Support Vector Regression (SVR) ensembles, bootstrap aggregation (bagging), and embedding spaces on the German Dataset for Legal Information Retrieval (GerDaLIR). By conceptualizing the retrieval task in terms of multiple binary needle-in-a-haystack subtasks, we show improved recall over the baselines (0...
We present Viki LibraRy, a dynamically built library in virtual reality (VR) designed to visualise hypertext systems, with an emphasis on collaborative interaction and spatial immersion. Viki LibraRy goes beyond traditional methods of text distribution by providing a platform where users can share, process, and engage with textual information. It o...
We present HyperCausal, a 3D hypertext visualization framework for exploring causal inference in generative Large Language Models (LLMs). HyperCausal maps the generative processes of LLMs into spatial hypertexts, where tokens are represented as nodes connected by probability-weighted edges. The edges are weighted by the prediction scores of next to...
The constitution of multiple documents has so far been studied essentially as a process in which a single learner consults a number (of segments) of different documents in the context of the task at hand in order to construct a mental model for the purpose of completing the task. As a result of this research focus, the constitution of multiple docu...
As governments worldwide continue to release vast amounts of textual information, the need for efficient and insightful tools to extract, interpret and present this data has become increasingly critical. Towards solving this issue, we present the BUNDESTAG-MINE: an environment that periodically retrieves pertinent data from the German parliament, p...
We present Viki LibraRy, a VR-based system for generating and exploring online information as a spatial hypertext. It creates a virtual library based on Wikipedia in which Rooms are used to make data available via a RESTful backend. In these Rooms, users can browse through all articles of the corresponding Wikipedia category in the form of Books. I...