Conference Paper

An Analysis of the Applicability of VoiceXML as Basis for a Dialog Control Flow in Industrial Interaction Management

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract

Industry 4.0 (I4.0) looks to enable intelligent production by connecting and evaluating data. The asset administration shell, the Industry 4.0 specification of a digital twin describes various concepts to realize this data exchange. One part of the asset administration shell is the I4.0-language, which intends to standardize complex interactions between machines by using interaction protocols. VoiceXML is a W3C-standard from the field of interactive voice response, which has been established for several years. In this paper we analyze to which extend a future I4.0 interaction manager could derive VoiceXML-concepts to the asset administration shell. For this purpose, parallels between VoiceXML and the I4.0-language are shown and compared by implementing a selected interaction protocol from the VDI 2193 using VoiceXML.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

Conference Paper
Platform Industry 4.0 conceptualizes Industry 4.0 as a smart factory that is composed of smart production processes, smart entities and smart products working in tandem to achieve larger goals, namely enabling flexible manufacturing, efficient resource utilization, optimal decision making, catering to individual preferences and creation of new services Smart entities are digital representations of physical or logical objects that can communicate with the external world and take independent decisions. These digital representations can choose different application-layer protocols (ALPs) such as MQTT or HTTP or CoAP for the exchange of messages with the external world. This requires addressing the concept of Application layer interoperability. In this paper, we review the past works by different authors that propose middleware architectures for enabling the exchange of messages in such a multi-protocol environment. Next, we argue for a better alternative and propose new architecture coined as Registry Infrastructure Component (RIC). RIC is a complex software component that maintains a registry, different ALP plugins. The digital representations have to register their ALP endpoint information with the RIC initially and later can post all the outbound messages to any of the RIC ALP plugins. The RIC takes the responsibility of transporting the message to the target recipient. Apart from registration with the RIC, the digital representations would have to communicate only in the I4.0 language.
Conference Paper
Full-text available
Eine Verwaltungsschale ist eine digitale Repräsentanz eines Assets in einem I4.0-System. Die grundsätzliche Struktur und die wesentlichen Elemente der Verwaltungsschale konnten bereits identifiziert werden. Jedoch gibt es I4.0-Szenarien, die eine gewisse Autonomie und Entscheidungsfähigkeit der Interaktionspartner erfordern. Es ist auffällig, dass bisherige Strukturen der Verwaltungsschale nicht die Frage klären, an welcher Stelle eine I4.0-Komponente Entscheidungen trifft, obwohl erst diese Eigenschaft die I4.0-Komponenten autark interagieren lässt. Außerdem wurde bisher nicht besprochen, welche Entität die Interaktionen und die potentiell dafür notwendigen Entscheidungen und Regeln beheimatet. In diesem Beitrag wird das Konzept und eine Struktur einer aktiven Verwaltungsschale vorgestellt und ihre beispielhafte Ausprägung diskutiert.
Article
Full-text available
This paper proposes a probabilistic framework for spoken dialog management using dialog examples. To overcome the complexity prob- lems of the classic partially observable Mar- kov decision processes (POMDPs) based dialog manager, we use a frame-based belief state representation that reduces the complexi- ty of belief update. We also used dialog ex- amples to maintain a reasonable number of system actions to reduce the complexity of the optimizing policy. We developed weather in- formation and car navigation dialog system that employed a frame-based probabilistic framework. This framework enables people to develop a spoken dialog system using a prob- abilistic approach without complexity prob- lem of POMDP.
Article
Full-text available
The environment has been recognized as an explicit and exploitable element to design multi-agent systems (MAS). It can be assigned a number of responsibilities that would be more difficult to design with the sole notion of agents. To support the engineering of these responsibilities, we identify a set of mechanisms that offer solutions to software designers. We describe the mechanisms, their usage in representative projects, and potential opportunities for further research and applications. The purpose of this article is to clarify the notion of environment in terms of mechanisms, from their abstract description to their practical exploitation. Mechanisms are expected to provide agent-based software designers with a set of design elements to build MAS that take advantage of the environment.
Article
Full-text available
Algebraic graph transformation has been promoted by several authors as a means to specify model transformations. This paper explores how we can specify graph transformation-based rules for a classical problem of transforming from sequence diagrams to state machines. The specification of the transformation rules is based on the concrete syntax of sequence diagrams and state machines. We introduce tailored transformation support for sequence diagrams and a novel graphical operator to match and transform combined fragments.
Article
Full-text available
This paper surveys recent research on programming languages and development tools for Multi-Agent Systems. It starts by addressing programming languages (declarative, imperative, and hybrid), followed by integrated development environments, and finally platforms and frameworks. To illustrate each of these categories, some systems were chosen based on the extent to which European researchers have contributed to their development. The current state of these systems is described and, in some cases, indications of future directions of research are given.
Conference Paper
Full-text available
Introduction Within the ATAL community, the belief-desire-intention (BDI) model has come to be possibly the best known and best studied model of practical reasoning agents. There are several reasons for its success, but perhaps the most compelling are that the BDI model combines a respectable philosophical model of human practical reasoning, (originally developed by Michael Bratman [1]), a number of implementations (in the IRMA architecture [2] and the various PRS-like systems currently available [7]), several successful applications (including the now-famous fault diagnosis system for the space shuttle, as well as factory process control systems and business process management [8]), and finally, an elegant abstract logical semantics, which have been taken up and elaborated upon widely within the agent research community [14, 16]. However, it could be argued that the BDI model is now becoming somewhat dated: the principles of the architecture were established in the mid-1980s
Article
Spoken Dialogue Technology provides extensive coverage of spoken dialogue systems, ranging from the theoretical underpinnings of the study of dialogue through to a detailed look at a number of well-established methods and tools for developing spoken dialogue systems. The book enables students and practitioners to design and test dialogue systems using several available development environments and languages, including the CSLU toolkit, VoiceXML, SALT, and XHTML+ voice. This practical orientation is usually available otherwise only in reference manuals supplied with software development kits. The latest research in spoken dialogue systems is presented along with extensive coverage of the most relevant theoretical issues and a critical evaluation of current research prototypes. A dedicated web site containing supplementary materials, code, links to resources will enable readers to develop and test their own systems (). Previously such materials have been difficult to track down, available only on a range of disparate web sites and this web site provides a unique and useful reference source which will prove invaluable.
Article
Information exchange requires a common semantic understanding of the meaning of the exchanged data by the involved communication partners. Properties with a predefined standardized meaning are one pillar of such an information exchange. Another pillar is the exchange of information about relationships between semantic entities by an appropriate system description meta model. In this paper we focus on the information exchange concerning property value expressions. Within the last years domain specific lists of properties have been standardized. The development was driven by the need to exchange information about industrial products electronically. The standardized lists allow an unambiguous characterization of products. Results of this comprehensive work are for example the international standard IEC61360CDD, IEC61987 and the property library eClass. An intuitive and general solution to deal with property information is crucial for future system design. One recent approach is the so called property value statement model. In this paper the basic concepts of this approach are reviewed. Additionally methods to store, exchange and to reason about property information in a formalized and general way are discussed. The presented concepts fit to the standardized property definition models and provides a formal system for a meta data based information handling in the digital factory.
Article
Several dialogue management (DM) architectures and conversational speech for dialogue systems are presented. Basic types of DM systems include dialogue grammars and frames, plan-based and collaborative systems, and conversational games theory. DM architectures include SmartKom, Trindi, WITAS, CONVERSE, COMIC, agent-based dialogue management, and DM and automatic speech recognition (ASR) language modeling. All data collection tasks should be tailored for the conversational scenario under consideration as each scenario can present different properties. It is shown in the multimodal dialogue system that turn taking can usually be achieved by a fusion of gesture, gaze, and intonation. Intonation within the speech signal informs the dialogue manager when new information is introduced into the current conversation. By placing established emotion detection methods within the recursive nature of conversation we can consider discourse as the exploitation of the shared set of interaction affordances.
Article
The contract net protocol has been developed to specify problem-solving communication and control for nodes in a distributed problem solver. Task distribution is affected by a negotiation process, a discussion carried on between nodes with tasks to be executed and nodes that may be able to execute those tasks. The specification of the protocol is presented, and its use in the solution of a problem in distributed sensing is demonstrated. The utility of negotiation as an interaction mechanism is discussed. It can be used to achieve different goals, such as distributing control and data to avoid bottlenecks and enabling a finer degree of control in making resource allocation and focus decisions than is possible with traditional mechanisms.
Article
The article delves in the conception and functioning of VoiceXML for web-based distributed conversational applications. VoiceXML as a language replaces the familiar Hypertext Markup Language interpreter with a VoiceXML interpreter, and the mouse and keyboard with the human voice. VoiceXML is an emerging standard Extensible Markup Language-based markup language for distributed web-based voice services. VoiceXML 1.0 was developed by the VoiceXML Forum, which released it in March 2000, and was accepted by the World Wide Web Consortium (W3C) two months later as the basis for developing a W3C dialogue markup language. VoiceXML contains a menu and a form. It enables a dialogue in which a user selects an item from a menu. The dialogue uses the menu element to carry out further instructions. The form is VoiceXML's basic dialogue unit, describing a set of inputs needed from the user to complete a transaction between the browser and a server. The grammars in VoiceXML are specified using the Java Speech Grammar Format. VoiceXML has been built on the same distributed model that has already proved so successful for visual web-based services.
Voicexml: Strategies and Techniques for Effective Voice Application Development with Voicexml
  • C Sharma
  • J Kunins
C. Sharma and J. Kunins, Voicexml: Strategies and Techniques for Effective Voice Application Development with Voicexml. USA: John Wiley & Sons, Inc., 2001.
Verwaltungsschale in der Praxis
Plattform Industrie 4.0, "Verwaltungsschale in der Praxis," Bundesministerium für Wirtschaft und Energie (BMWi), Diskussionspapier, April 2019.
Where Do We Go from Here?
  • R Pieraccini
  • J M Huerta
R. Pieraccini and J. M. Huerta, Where Do We Go from Here? Dordrecht: Springer Netherlands, 2008, pp. 1-24.
Error handling in spoken dialogue systems
  • skantze
G. Skantze, "Error handling in spoken dialogue systems," Ph.D. dissertation, KTH Computer Science and Communication, 2007.
Demonstrator I4.0-Language
  • belyaev
Belyaev, Alexander and Diedrich, Christian, "Demonstrator I4.0-Language," Otto-von-Guericke-University Magdebrug, Technical report, Juli 2019.
Weiterentwicklung des Interaktionsmodells für Industrie 4.0-Komponenten
Plattform Industrie 4.0, "Weiterentwicklung des Interaktionsmodells für Industrie 4.0-Komponenten," Bundesministerium für Wirtschaft und Energie (BMWi), Diskussionspapier, November 2016.
Überprüfung der Beschreibungsfähigkeit semantischer Inter-aktionsprotokolle der I4.0-Sprache mittels VoiceXML Standard
  • F Böhm
F. Böhm, "Überprüfung der Beschreibungsfähigkeit semantischer Interaktionsprotokolle der I4.0-Sprache mittels VoiceXML Standard," Bachelor's thesis, Otto-von-Guericke-Universität Magdeburg, 2020.
DOM -Living Standard
Web Hypertext Application Technology Working Group (WHATWG), "DOM -Living Standard," Last Updated 20 April 2020, 2020. [Online]. Available: https://dom.spec.whatwg.org/
Überprüfung der Beschreibungsfähigkeit semantischer Inter-aktionsprotokolle der I4.0-Sprache mittels VoiceXML Standard
  • böhm