About
84
Publications
24,557
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
4,946
Citations
Introduction
Current institution
Publications
Publications (84)
We describe a translation from a fragment of SUMO (SUMO-K) into higher-order set theory. The translation provides a formal semantics for portions of SUMO which are beyond first-order and which have previously only had an informal interpretation. It also for the first time embeds a large common-sense ontology into an interactive theorem proving syst...
We describe a translation from a fragment of SUMO (SUMO-K) into higher-order set theory. The translation provides a formal semantics for portions of SUMO which are beyond first-order and which have previously only had an informal interpretation. It also for the first time embeds a large common-sense ontology into a very secure interactive theorem p...
As with any software, the challenges of developing large andmanually-created axiomatizations in an expressive logic suchas first order logic with equality can be very different fromthose found in comparatively small theories. We present someof the tools and practices that have supported development ofa logical theories with tens of thousands of sta...
We attempt to answer the question of which kind of logical language should be chosen to represent the semantics of a broad selection of natural language sentences, and how prevalent different kinds of sentences are that require different levels of logical expressiveness. We examine these requirements for representing the semantics of text in logic...
Ontology matching is a research area aimed at finding ways to make different ontologies interoperable. Solutions to the problem have been proposed from different disciplines, including databases, natural language processing, and machine learning. The role of foundational ontologies for ontology matching is an important one, as they provide a well-f...
SUMOjEdit is a programmer’s text editor for the SUO-KIF language and SUMO http://www.ontologyportal.org theory. Modern procedural programming is done in a text editor with tool support. Development of ontologies and taxonomies has often been done in graphical editors, leading many developers to employ only logics of very limited expressiveness that...
PyRes is a complete theorem prover for classical first-order logic. It is not designed for high performance, but to clearly demonstrate the core concepts of a saturating theorem prover. The system is written in extensively commented Python, explaining data structures, algorithms, and many of the underlying theoretical concepts. The prover implement...
We discuss ontological modeling of legal terminology in SUMO (Pease, 2001) in combination with the lexico-semantic database WordNet (Fellbaum, 1998). Formal systems that allow for automated semantic interpretation of law supported by lexical resources can provide solutions to many tasks related to legal reasoning. We wish to formalize legal issues...
The present work addresses the challenge of integrating low-level information with high-level knowledge (known as semantic gap) that exists in content-based image retrieval by introducing an approach to describe images by means of spatial relations. The proposed approach is called Image Retrieval using Region Analysis (IRRA) and relies on decomposi...
This paper presents the process of manually establishing alignments between domain and foundational ontologies. The ontologies from the OAEI Conference track have been aligned to the SUMO foundational ontology. The Conference dataset is one of the most used dataset in ontology matching evaluation and has been extended in several versions. However,...
We develop a new method of discourse analysis using speech act theory and formal ontology. The method constitutes an attempt to make discourse analysis more formal and repeatable. We apply the method to a corpus of bi-lingual, interpreted legal dialogue, focusing on the speech act of clarification and its component acts. While discourse analysis is...
Large formalizations carry the risk of inconsistency, and hence may lead to instances of spurious reasoning. This paper describes a new approach and tool that automatically probes large first-order axiomatizations for inconsistency, by selecting subsets of the axioms centered on certain function and predicate symbols, and handling the subsets to a...
One reason why there is a lack of cross-references between articles on knowledge representation in the Cognitive and the Information Sciences is that cognitive scientists are interested in descriptive models of how people reason whereas information scientists are interested in prescriptive models to help people reason. Formal ontologies such as the...
We discuss the development of a multilingual lexicon linked to the Suggested Upper Merged Ontology (SUMO) formal ontology. The ontology as well as the lexicon have been expressed in Web Ontology Language (OWL), as well as their original formats, for use on the semantic web and in linked data. We describe the Open Multilingual Wordnet (OMW), a multi...
The Suggested Upper Merged Ontology (SUMO) is a large, comprehensive ontology stated in higher-order logic. It has co-evolved with a development environment called the Sigma Knowledge Engineering Environment (SigmaKEE). A large and important subset of SUMO can be expressed in first-order logic with equality. SigmaKEE has integrated different reason...
Psychoinformatics is an emerging discipline that uses tools from the information sciences to organize psychological data. This article supports that objective by proposing a framework for constructing cognition ontologies by using WordNet, FrameNet, and the Suggested Upper Merged Ontology (SUMO). The first section describes the major characteristic...
Sigma is an open source environment for the development of logical theories. It has been under development and regular release for nearly a decade, and has been the principal environment under which the open source Suggested Upper Merged Ontology (SUMO) has been created. We discuss its features and evolution, and explain why it is an appropriate en...
This article addresses the automation of higher-order aspects in expressive ontologies such as the suggested upper merged ontology SUMO. Evidence is provided that modern higher-order automated theorem provers like LEO-II can be fruitfully employed for the task. A particular focus is on embedded formulas (formulas as terms), which are used in SUMO,...
An ontology is a conceptual model of what "exists" in a domain, brought into machine-interpretable form by means of expressions in formal logic. Since ontologies are content theories about the kinds, properties and relations of objects, story generators can use them as concept dictionaries for actions and events in a story world. This paper present...
ontology defined in first-order logic with some higher-order extensions [1]. The project began in the year 2000. Each version has been released open source and publicly from the start, which provides a unique record of the construction of a formal ontology. While initially just an upper ontology, it now encompasses a wide variety of domains, and so...
Sigma[1,2] is an open source environment for the development of logical theories. It has been under development and regular release for nearly a decade, and has been the principal environment under which the SUMO[3] has been created. We discuss its features and evolution, and explain why it is an appropriate environment for the development of expre...
The Controlled English to Logic (CELT) system translates a restricted English grammar to expressions in formal logic. The
logic statements use terms from a large formal ontology, the Suggested Upper Merged Ontology (SUMO), giving each resulting
statement a wealth of deep meaning, similar in kind if not in degree to capturing the meaning a human ass...
We report on the application of higher-order automated theorem proving in ontology reasoning. Concretely, we have integrated the Sigma knowledge engineering environment and the Suggested Upper-Level Ontology (SUMO) with the higher-order theorem prover LEO-II. The basis for this integration is a translation from SUMO’s SUO-KIF representations into t...
Automatic story generation systems require a body of commonsense knowledge about the basic relationships between concepts we find everyday in our world in order to produce interesting narratives that describe human actions and world events. This paper presents an ongoing work that investigates the use of Suggested Upper Merged Ontology (SUMO) to re...
The relation between ontologies and language is currently at the forefront of natural language processing (NLP). Ontologies, as widely used models in semantic technologies, have much in common with the lexicon. A lexicon organizes words as a conventional inventory of concepts, while an ontology formalizes concepts and their logical relations. A sha...
The Suggested Upper Merged Ontology (SUMO) has provided the TPTP problem library with problems that have large numbers of axioms, of which typically only a few are needed to prove any given conjecture. The LTB division of the CADE ATP System Competition tests the performance of ATP systems on these types of problems. The SUMO problems were used in...
Reasoning with embedded formulas is relevant for the SUMO ontology but there is limited automation support so far. We investigate whether higher-order automated theorem provers are applicable for the task. Moreover, we point to a challenge that we have revealed as part of our experiments: modal operators in SUMO are in conflict with Boolean extensi...
In this paper, we introduce the Filipino word-net project (FilWordNet). Filipino is the na-tional language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited number of computational linguistics resources. Creating the Filipino wordnet can be seen as the first step to ena...
Rightscom (a UK-based media and rights consultancy), is working with Articulate Software (a formal ontology consultancy) and with another system developer to create a large-scale metadata integration and transaction management system, founded on an ontology-based metamodel. Previous versions of this system have utilized lightweight schema and conve...
Ontologies are becoming more and more popular as background knowledge for intelligent applications. Up to now, there has been a schism between manually assembled, highly axiomatic ontologies and large, automatically constructed knowledge bases. This paper discusses how the two worlds can be brought together by combining the high-level axiomatizatio...
This paper describes the integration of the ATP support of the TPTPWorld into the Sigma Knowledge Engineering Environment. The result is an interactive knowledge based reasoning environment, with strong knowledge management features, and access to modern state of the art ATP systems for reasoning over knowledge bases.
Previous CASC competitions have focused on proving difficult problems on small numbers of axioms. However, typical reasoning applications for expert systems rely on knowledge bases that have large numbers of axioms of which only a small number may be relevant to any given query. We have created a category in the new LTB division of CASC to test thi...
This paper mirrors my invited talk at PACLIC22. It describes a call for a renewed emphasis in work on the logical semantics of languages. It lists some of the computational components needed for symbolic interpretations of language, and of automated reasoning within those semantics. It details existing components that meet those needs and provides...
We present results of our work on using first order theorem proving to reason over a large ontology (the Suggested Upper Merged Ontology - SUMO), and methods for making SUMO suitable for first order theorem proving. We describe the methods for translating into standard first order format, as well as optimizations that are intended to improve infere...
We report on the current status of the Arabic WordNet project and in particular on the contents of the database, the lexicographer and user interfaces, the Arabic WordNet browser, linking to the SUMO ontology, the Arabic word spotter, and techniques for semi-automatically extending Arabic WordNet. The central focus of the presentation is on the sem...
The authors have recently developed a new approach to performing a Delphi study that does not involve the use of sequential “rounds” and as a result, greatly improves the efficiency of the process and shortens the time to perform such studies. This paper describes this process, RT Delphi, and illustrates its use in a decision-making application dra...
Deductive reasoning with natural language requires combining lexical resources with the world knowledge provided by ontologies. In this paper we describe the connection of FrameNet - a lexicon for English - to the Suggested Upper Merged Ontology (SUMO). We express general-domain links between FrameNet Semantic Types (ST) and SUMO classes in SUO- KI...
The English word ontology, together with its counterparts in many languages, has made a breathtaking career during the last decades especially in information science, but also in other disciplines. Since its definitions vary considerably within and especially across disciplines, and since this volume, although clearly focused on linguistic matters,...
We discuss a restricted natural language understanding system and a proposed extension to it, which is a corpus of phrases. The Controlled English to Logic Translation (CELT) system allows users to make statements in a domain-independent, restricted English grammar that have a clear formal semantics and that are amenable to machine processing. CELT...
Ontologies are becoming extremely useful tools for sophisticated software engineering. Designing applications, databases, and knowledge bases with reference to a common ontology can mean shorter development cycles, easier and faster integration with other software and content, and a more scalable product. Although ontologies...
Analogy-based hypothesis generation is a promising technique for knowledge discovery. However, some hypotheses generated are nonsensical. This paper describes a two-phased method to increase the quality of analogy reasoning. The first phase employs an established approach to generate hypotheses through similarity matching. The second phase utilizes...
Summary form only given. The suggested upper merged ontology (SUMO) is a large, free, upper ontology in first order logic. It is increasingly being used as a resource in natural language understanding research. SUMO has been used as the basis for an interchange language for morphosyntactics, to resolve the meaning of terms in Web search, to express...
Ontologies provide advantages of knowledge reusability, sharing, and greater robustness when used to build large knowledge-based applications. Unfortunately, translating between English statements and a specific ontology requires skill in knowledge engineering and an understanding of formal logic and the ontology itself. A knowledge engineer must b...
MIDAS is a set of tools which allow a designer to specify the physical and functional characteristics of a complex system such as an aircraft cockpit, and analyze the system with regard to human performance. MIDAS allows for a number of static analyses such as military standard reach and fit analysis, display legibility analysis, and vision polars....
Abstract: We discuss the development of an environment for formal knowledge engineering. The Sigma system is an advance over previously developed systems in that it integrates a number of modern ontology development tools, which has motivated a number of research issues. Primary components include an ontology browsing and editing environment, a fir...
Knowledge Management is most necessary and valuable in a collaborative and distributed environment. A problem with commercial knowledge management tools is that they do not understand at a deep level the content that they are managing. In this paper we discuss the System for Collaborative Open Ontology Production (SCOOP), which manipulates logic ex...
In this paper we discuss the development and application of a large formal ontology to the semantic web. The Suggested Upper Merged Ontology (SUMO) (Niles & Pease, 2001) (SUMO, 2002) is a "starter document" in the IEEE Standard Upper Ontology effort. This upper ontology is extremely broad in scope and can serve as a semantic foundation for search,...
We address the development of a practical theory of semiotics within the context of the IEEE Standard Upper Ontology effort. The theory has the form of a set of predicates and other terms that are axiomatized in first order logic. We argue that this theory is sufficient for representing many of the subtleties of semantic representation that are lik...
The IEEE Standard Upper Ontology (IEEE, 2001) is an effort to create a large, general-purpose, formalontology. The ontology will be an open standard that can be reused for both academic and commercialpurposes without fee, and it will be designed to support additional domain-specific ontologies. Theeffort is targeted for use in automated inference,...
This effort focused on the development and integration of large knowledge bases and knowledge based tools. The project significantly advanced the state of the art in ontology and knowledge base development. Of particular scientific interest was a set of formal experiments that were conducted and described in technical reports and academic publicati...
The Suggested Upper Merged Ontology (SUMO) is an upper level
ontology that has been proposed as a starter document for The
Standard Upper Ontology Working Group, an IEEE-sanctioned working
group of collaborators from the fields of engineering, philosophy,
and information science. The SUMO provides definitions for
general-purpose terms and acts as a...
Introduction Research and applications in computer science are creating the need for precise definitions of the concepts that make up our world. Web searching is handicapped by the limitations of specifying search criteria in terms of keywords rather than concepts. Automated natural language understanding, both oral and written, is severely limited...
In this article we present an overview of knowledge-based systems. We begin by discussing the basic characteristics of knowledge base rules and present an example comparing a toy knowledge base to an equivalent procedural version of the same code. We then discuss the applications for which it makes sense to use knowledge-based technology and the ad...
This paper describes the design and development efforts associated with the Signal Loss Advisor (SLA) SLA is a command decision aid that is used to diagnose the cause for loss of an acoustic signal. The system utilizes several AI technologies. SLA has been implemented in a blackboard architecture. Within that architecture, there are two components;...
Abstract: Multi-modal and heterogeneous logic reasoning is of increasing importance within the AI community. The GIS based ArcView COA Sketcher (ArCS) sketch and translation tool developed under DARPA's High Performance Knowledge Bases program is an example of an enabling tool towards that goal. Army Course of Action (COA) sketches can be drawn and...
A variety of techniques within logic programming have developed for treating negated formulae. These essentially involve constructibility restrictions which are imposed upon complexity classes of quantifieational formulae, and which thereby determine models and model classes of computer programs declaratively regarded. These construetibility restri...
One factor that affects the rate of knowledge base construction is the availability and reuse of prior knowledge in ontologies and domain-specific knowledge bases. This paper reports an empirical study of reuse performed in the first year of the High Performance Knowledge Bases (HPKB) initiative. The study shows that some kinds of prior knowledge h...
The Core Plan Representation (Pease 1997) is a schema for the representation of plan information which is independent of any particular method of plan construction. It has been developed as an ontology and implemented as an object oriented design. Now in its second phase, the generic plan design has been specialized for several areas of military pl...
: The Core Plan Representation (CPR) is an effort to construct a general plan ontology. It has been developed to support the common plan representation of military planning, workflow and simulation. The CPR will be specialized to meet the needs of more specific subsets of planning applications. This paper reports on our progress. Pease and Carrico...
This document is issued as a request for comments. The first draft of this document was issued in September of 1996. Comments were received and incorporated and a second version was prepared and released. The second version was reviewed by the community and suggested changes were limited indicating that the design might be considered "mature". The...
This document is issued as a request for comments. The first draft of this document was issued in September of 1996. Comments were received and incorporated and a second version was prepared and released. The second version was reviewed by the community and suggested
Now completing its first year, the High-Performance Knowledge Bases Project promotes technology for developing very large, flexible, and reusable knowledge bases. The project is supported by the Defense Advanced Research Projects Agency and includes more than 15 contractors in universities, research laboratories, and companies; The evaluation of th...
The purpose of this ongoing research and development program is to develop software tools which enable the rapid development, upgrading, and maintenance of embedded real-time artificial intelligence systems. The goals of this phase of the research were to investigate the feasibility of developing software tools which automatically translate expert...
: One factor that affects the rate of knowledge base construction is the availability and reuse of prior knowledge in ontologies and domain-specific knowledge bases. This paper reports an empirical study of reuse performed in the first year of the High Performance Knowledge Bases (HPKB) initiative. The study shows that some kinds of prior knowledge...
Arabic is the official language of hundreds of millions of people in twenty Middle East and northern African countries, and is the religious language of all Muslims of various ethnicities around the world. Surprisingly little has been done in the field of computerised language and lexical resources. It is therefore motivating to develop an Arabic (...
This paper introduces a recently initiated project that focuses on building a lexical resource for Modern Standard Arabic based on the widely used Princeton WordNet for English (Fellbaum, 1998). Our aim is to develop a linguistic resource with a deep formal semantic foundation in order to capture the richness of Arabic as described in Elkateb (2005...
It is estimated that 1-2 exabytes of data is now being generated each year, almost all of it in purely digital form (Lyman et. ai. 2000). Properly structured, this information could form a global knowledge base. Currently however, this information exists in many different forms, many of which are only suitable for human consumption, and which are l...
The Core Plan Representation (CPR) (Pease and Carrico, 1996) is an effort to develop a plan representation which supports the representation needs of many different planning systems. It is being developed for the DARPA Joint Task Force Advanced Technology Demonstration (JTF ATD) (Hayes-Roth, 1995), (Hayes-Roth Erman, 1994), (Carrico, 1996). The goa...