Sven Schmeier

Sven Schmeier
Deutsches Forschungszentrum für Künstliche Intelligenz | DFKI · Language Technology

PhD

About

43
Publications
5,712
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
352
Citations
Citations since 2017
12 Research Items
71 Citations
201720182019202020212022202305101520
201720182019202020212022202305101520
201720182019202020212022202305101520
201720182019202020212022202305101520

Publications

Publications (43)
Preprint
Full-text available
Scientific publications about machine learning in healthcare are often about implementing novel methods and boosting the performance - at least from a computer science perspective. However, beyond such often short-lived improvements, much more needs to be taken into consideration if we want to arrive at a sustainable progress in healthcare. What do...
Preprint
Full-text available
Many people share information in social media or forums, like food they eat, sports activities they do or events which have been visited. This also applies to information about a person's health status. Information we share online unveils directly or indirectly information about our lifestyle and health situation and thus provides a valuable data r...
Conference Paper
Full-text available
Many people share information in social media or forums, like food they eat, sports activities they do or events which have been visited. This also applies to information about a person's health status. Information we share online unveils directly or indirectly information about our lifestyle and health situation and thus provides a valuable data r...
Article
Full-text available
Human agents in technical customer support provide users with instructional answers to solve a task that would otherwise require a lot of time, money, energy, physical costs. Developing a dialogue system in this domain is challenging due to the broad variety of user questions. Moreover, user questions are noisy (for example, spelling mistakes), red...
Preprint
In many societies alcohol is a legal and common recreational substance and socially accepted. Alcohol consumption often comes along with social events as it helps people to increase their sociability and to overcome their inhibitions. On the other hand we know that increased alcohol consumption can lead to serious health issues, such as cancer, car...
Conference Paper
Full-text available
In many societies alcohol is a legal and common recreational substance and socially accepted. Alcohol consumption often comes along with social events as it helps people to increase their sociability and to overcome their inhibitions. On the other hand we know that increased alcohol consumption can lead to serious health issues, such as cancer, car...
Chapter
Full-text available
Die meisten Gesprachsversuche mit Migranten, die kein Deutsch oder Englisch sprechen, enden mit Handen und Füsen - und Frust. Das Deutsche Forschungszentrum für Kunstliche Intelligenz (DFKI) hat in Zusammenarbeit mit seiner Spin-off Firma Yocoy eine App entwickelt, die Immigranten aus arabischen Landern den Dialog beispielsweise mit Behorden, auf d...
Chapter
Full-text available
Um MS-Erkrankten und ihren Angehörigen die Möglichkeit zu bieten, sich miteinander zu vernetzen, und das in einem geschutzten Rahmen, hat die DMSG 2015 die Initiative zur Entwicklung einer verbandsinternen Online-Plattform ergriffen und vorangetrieben. Hier haben Mitglieder die Moglichkeit, sich überregional kennenzulernen und zwar unter Berücksich...
Article
Full-text available
Der Beitrag beschreibt den gegenwärtigen Forschungsstand im Hinblick auf die Analyse von Emotionswörtern in Texten. Neben einer Einführung in die grundlegenden Verfahren werden auch die relevanten psychologischen Konzepte und damit der theoretische Hintergrund erläutert und ein Ausblick auf Verfahren maschinellen Lernens bei der Emotionsanalyse in...
Conference Paper
Elderly people often need support in everyday situations -- e.g. common daily life activities like taking care of house and garden, or caring for an animal are often not possible without a larger support circle. However, especially in larger western cities, local social networks may not be very tight, friends may have moved away or died, and the tr...
Conference Paper
The proportion of elderly people in German society has been increasing for decades. As a result Germany, and other industrial countries as well, are currently facing large demographic changes in terms of age structure and population size, changes that will only increase in the future. Furthermore, especially in bigger cities, the traditional family...
Article
Full-text available
"Qualitative" is a python toolkit for ranking and selection of sentence-level output by different MT systems using Quality Estimation. The toolkit implements a basic pipeline for annotating the given sentences with black-box features. Consequently, it applies a machine learning mechanism in order to rank data based on models pre-trained on human pr...
Chapter
Yochina is a mobile application for crosslingual and cross-cultural understanding. The core of the demonstrated app supports dialogues between English and Chinese and German and Chinese. The dialogue facility is connected with interactive language guides, culture guides and country guides. The app is based on a generic framework enabling such novel...
Conference Paper
We present MobEx, a mobile touchable application for exploratory search on the mobile web. The system has been implemented for operation on a tablet computer, i.e. an Apple iPad, and on a mobile device, i.e. Apple iPhone or iPod touch. Starting from a topic issued by the user the system collects web snippets that have been determined by a standard...
Conference Paper
Full-text available
Human translators are the key to evaluating machine translation (MT) quality and also to addressing the so far unanswered question when and how to use MT in professional translation workflows. Usually, human judgments come in the form of ranking outputs of different translation systems and recently, post-edits of MT output have come into focus. Thi...
Article
Customer support departments of large companies are often faced with large amounts of customer requests about the same issue. These requests are usually answered by using preformulated text blocks. However, choosing the right text from a large number of text blocks can be challenging for the customer support agent, especially when the text blocks a...
Conference Paper
Full-text available
We present a mobile touchable application for guided exploration of web content and online topic graph extraction that has been successfully implemented on a tablet, i.e. an Apple iPad, and on a mobile device/phone, i.e. Apple iPhone or iPod. Starting from a user’s search query a set of web snippets is collected by a standard search engine in a fir...
Article
The coexistence of Western and Eastern languages and cultures poses a true challenge for global mobility and communication in business and personal life. In this paper, we will describe mobile software applications that are built on top of multilingual and crosslingual technologies for overcoming language barriers, e.g., between English and Chinese...
Chapter
Full-text available
In the following, we present an approach using interactive topic graph extraction for the exploration of web content. The initial information request, in the form of a query topic description, is issued online by a user to the system. The topic graph is then constructed from N web snippets that are produced by a standard search engine. We consider...
Conference Paper
In 2008, the percentage of people with a migration background in Germany had already reached more than 15% (12 Million people). Among that 15%, the ratio of seniors aged 50 years or older was 30% [1]. In most cases, their competence of the German language is adequate for dealing with everyday situations. However sometimes in emergency or medical si...
Conference Paper
Full-text available
We present a mobile touchable application for online topic graph extraction and exploration of web content. The system has been implemented for operation on an iPad. The topic graph is constructed from N web snippets which are determined by a standard search engine. We consider the extraction of a topic graph as a specific empirical collocation ext...
Conference Paper
Full-text available
We present a mobile touchable application for online topic graph extraction and exploration of web content. The system has been implemented for operation on a tablet computer, i.e. an Apple iPad, and on a mobile device, i.e. Apple iPhone or iPod touch. The topics are extracted from web snippets which are determined by a standard search engine. We c...
Conference Paper
Full-text available
In this paper we present the digital library assistant (DiLiA). The system aims at augmenting the search in digital libraries in several dimensions. In the project advanced information visualisation methods are developed for user controlled interactive search. The interaction model has been designed in a way that it is transparent to the user and e...
Conference Paper
Full-text available
This paper presents preliminary results of our current research project DiLiA (Digital Library Assistant). The goals of the project are are twofold. One goal of the project is the development of domain-independent information extraction methods. The other goal is the development of information visualization methods that interactively support resear...
Conference Paper
Full-text available
This paper describes SPMED, a system for robust and accurate linguistic parsing of medical documents which is used in several industrial products. The basic design criterion of the system is of providing a set of basic powerful, robust, and generic linguistic knowledge sources and modules which can easily customized for processing different tasks i...
Conference Paper
Full-text available
In this paper, we present an unsupervised hybrid text-mining approach to automatic acquisition of domain relevant terms and their relations. We deploy the TFIDF-based term classification method to acquire domain relevant single-word terms. Further, we apply two strategies in order to learn lexico-syntatic patterns which indicate paradigmatic and do...
Article
Full-text available
The classification of texts is one of the most important cross-application technologies in information management. It is relevant to many tasks, including text filtering, information retrieval, and information extraction.
Article
Full-text available
In this paper, we present an unsupervised hybrid text- mining approach to automatic acquisition of domain relevant terms and their relations. We deploy the TFIDF- based term classification method to acquire domain relevant terms. Further, we apply two strategies in order to learn lexico-syntatic patterns which indicate paradigmatic and domain relev...
Article
Full-text available
Customer care in technical domains is increasingly based on e-mail communication, allowing for the reproduction of approved solutions. Identifying the customer's problem is often time-consuming, as the problem space changes if new products are launched. This paper describes a new approach to the classification of e-mail requests based on shallow te...
Article
Full-text available
Appointment scheduling is a problem faced daily by many individuals and organizations. Cooperating agent systems have been developed to partially automate this task. In order to extend the circle of participants as far as possible we advocate the use of natural language transmitted by e-mail. We describe COSMA, a fully implemented German language s...
Conference Paper
Appointment scheduling is a problem faced daily by many individuals and organiza- tions. Cooperating agent systems have been developed to partially automate this task. In order to extend the circle of par- ticipants as far as possible we advocate the use of natural language transmitted by e- mail. We describe COSMA, a fully imple- mented German lan...
Article
The design of systems according to the metaphor of a society of cooperating autonomous intelligent agents has become very attractive recently. This is mainly due to a new technological level of world wide interconnection that has matured in the scientific community and is now going to make its way into business, public and private life: Internet-ba...
Article
Full-text available
In this paper, we present first results we achieved and experiences we had combining shallow text processing methods with machine learning tools. In two research projects, where DFKI and industrial partners are involved, Ger- man real world texts have to be classified into several predefined categories. We will point out that decisions concerning q...
Article
AnswerBus (http://www.answerbus.com/, (2,3)) is a Web-based open-domain Question-Answering (QA) system. It successfully uses NLP/IR techniques and reaches very high correct answer rate. Although it is not designed for TREC, it still correctly answers over 70% of TREC-8 questions with Web resources. The question remains whether the techniques for a...

Network

Cited By

Projects

Projects (5)
Archived project
The research work in DEEPLEE, which is carried out in the Language Technology research departments in Saabrücken and Berlin, builds on DFKI's expertise in the areas of "deep learning" (DL) and "language technology" (LT) and develops it further. They aim for profound improvements of DL approaches in LT by focusing on four central, open research topics: Modularity in DNN architectures Use of external knowledge DNNs with explanation functionality Machine Teaching Strategies for DNNs