About
69
Publications
14,195
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
461
Citations
Introduction
Skills and Expertise
Current institution
Additional affiliations
December 2018 - present
July 2005 - December 2018
Publications
Publications (69)
BACKGROUND
Background: Blockchains offer a promising new distributed technology to address the challenges of data standardization, system interoperability, security, privacy, and accessibility for all data. However, integrating pervasive computing with blockchain’s ability to store privacy-protected mHealth data while providing HIPAA compliance is...
Background:
Integrating pervasive computing with blockchain's ability to store privacy-protected mobile health (mHealth) data while providing Health Insurance Portability and Accountability Act (HIPAA) compliance is a challenge. Patients use a multitude of devices, apps, and services to collect and store mHealth data. We present the design of an i...
This paper presents the effects of a technology-based engagement platform, referred to as IMPRES (Interactive Mobile Platform for Retaining and Engaging Students). IMPRES is a mobile application, backed by a set of data analysis services, which seeks to engage and retain its users to enhance their professional development. The primary goal of the a...
Many different implementations of Blockchain currently exist, each offering its own implementation of Bitcoin's original proposal. It is difficult to compare these separate implementations for design choices, performance benchmarks or application suitability without a common basic design framework. Using the concept of design spaces, and using comm...
Enterprise resource planning systems (ERP) are the most essential and critical information systems that are widely used to build all the functions of a business into a unified and efficient enterprise system. Although the essence of ERP systems is easy to understand, students, especially those who do not have technical experience, have difficulty a...
Major Indian cities experience traffic choking from early morning, and commuters face extreme difficulty in reaching their destinations. Many surveys and empirical research results concluded with loads of suggestions for the government and transport agencies in India in terms of reducing traffic congestion and raising the commuters’ satisfaction. I...
Product identification is a crucial success factor for any business that involves tangible assets. The success or failure of a business depends on accurately tracking these assets through their life cycle, from manufacturing to sale and post-sale activities. Barcodes have been the technology of choice since their first use on Wrigley's gum in 1974....
This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration...
XML is one of the most widely accepted data representation languages in today's Internet-dominated Computing. While most XML data on the net today use commonly known structures, the power of XML lies in the ability to develop application-specific structures and models. XER (Extensible Entity Relationship) is a conceptual modeling methodology that u...
This article introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration...
This article presents DSQL, a conservative extension of SQL, as an ad-hoc query language for XML. The development of DSQL follows the theoretical foundations of first order logic, and uses common query semantics already accepted for SQL. DSQL represents a core subset of XQuery that lends well to optimization techniques, while at the same time allow...
This chapter presents DSQL, a conservative extension of SQL, as an ad-hoc query language for XML. The development of DSQL follows the theoretical foundations of first order logic, and uses common query semantics already accepted for SQL. DSQL represents a core subset of XQuery that lends well to query optimization techniques; while at the same time...
The surge of RFID technology and ensuing competition in the RFID vendor market has increased the complexity of integration
of RFID into current business processes. To address this problem, we present FlexRFID, a modeling template composed of state
indicators, conditions, a simulation engine, and a device-independent deployment architecture for rapi...
China ranks first in hard coal production worldwide. Unfortunately, Chinese coal mines are also known
to be the world’s deadliest and have recorded the largest number of fatal accidents and the highest death
tolls. Radio frequency identification (RFID) seems to provide a potential solution to this human safety
issue. Supported by the Chinese Coal B...
This article proposes a data warehouse integration technique that combines data and documents from different underlying documents and database design approaches. The well-defined and structured data such as relational, object-oriented and object relational data, semi-structured data such as XML, and unstructured data such as HTML documents are inte...
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
This article introduces a measurement ontology for applications to Semantic Web applications, specifically for emerging domains such as microarray analysis. The Semantic Web is thenext-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proc...
FlexRFID is our solution to the general problem of resource time tracking, the problem of tracking the time a resource is utilized. This architecture is based on the premise that all RTT applications have some items of interest in common. One such item is the identity of the resources being tracked, and a second is a set of locations that mark the...
The use of pilot studies to evaluate the economic justification of technology projects is common in practice. The pilot studies play even greater role in the projects affecting customer interactions with the product/service offerings since perception and/or reaction of customers is captured and analyzed through such studies. Yet, many times the met...
Radio Frequency IDentification (RFID) innovations have gained considerable attention in the last several years and attracted the attention of organizations across different industries. RFID technologies have found applications in diverse areas such as package tracking, inventory management, baggage handling, school attendance, and logistics. Prior...
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
This paper introduces a measurement ontology for applications to semantic Web applications, specifically for emerging domains such as microarray analysis. The semantic Web is the next-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proce...
Entity-Relationship (ER) diagrams are an established way of doing data modeling. In this paper, we report our experience with exploring the use of PVS to formally specify and reason with ER data models. Working with a text-book example, we rely on PVS's theory interpretation mechanism to verify the correctness of the mapping across various levels o...
RFID installations consist of a large number of interdependent and interconnected components that function together as a unit for proper functionality and operations of the system. These components include tags, readers, antennas, networks, computing systems, power supplies, and peripherals, accessories and other mechanical systems that are control...
Access control for semi-structured data is nontrivial, as witnessed by the number of access control approaches in literature.
Recently, a case has been made for expressing access constraints at finer levels of granularity on data nodes and extending
constraints to structural relationships. In this paper, we introduce a rewrite-based approach for ac...
XML plays an important role as the standard language for representing structured data for the traditional Web, and hence many
Web-based knowledge management repositories store data and documents in XML. If semantics about the data are formally represented
in an ontology, then it is possible to extract knowledge: This is done as ontology definitions...
Sharing data between organizations is challenging because it is difficult to ensure that those consuming the data accurately interpret it. The promise of the next generation WWW, the semantic Web, is that semantics about shared data will be represented in ontologies and available for automatic and accurate machine processing of data. Thus, there is...
When describing web services, one of the obvious aspects that need representing is 'Quality of Service' (QoS), the capability of a web service to meet an acceptable level of service as per factors such as availability and accessibility. However, too much of a focus on developing functional QoS ontologies has led to an over-emphasis on representing...
This paper introduces a measurement ontology for applications to semantic Web applications, specifically for emerging domains such as microarray analysis. The semantic Web is the next-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proce...
We present a user-centered database query language called QBT (Query By Templates) for user communication with databases containing complex structured data, such as data stored in the Extensible Markup Language (XML). XML has revolutionized data storage as well as representation and transfer methods in today's internet applications. The growing pop...
We present ACXESS (Access Control for XML with Enhanced Security Specifications), a system for specifying and enforcing enhanced security constraints on XML via virtual "security views" and query rewrites. ACXESS is the first system that bears the capability to specify and enforce complicated security policies on both subtrees and structural relati...
Because of its success, XML is increasingly used in many dierent application areas, and is moving towards the center of applications, evolving from an exchange format to the native data format of application components. These developments suggest that similar to other core areas of application design, XML should be designed conceptually before the...
XML is gaining predominance as the standard for data representation and exchange. Access con-trol for XML data is nontrivial as witnessed from the number of access control models presented in literature. Existing models provide the ability to extend access control to data as well as structure and enforce the specified access control via view materi...
We present SWAP (Semantic Web application pyramid), a framework for incorporating ontologies in data-oriented semantic Web applications. We have implemented this framework with a measurement ontology for a quality management Web service. This quality management Web service is built on top of a set of XML Web services implementing agents representin...
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
We present SWAP (Semantic Web application pyramid), a framework for incorporating ontologies in data-oriented semantic Web applications. We have implemented this framework with a measurement ontology for a quality management Web service. This quality management Web service is built on top of a set of XML Web services implementing agents representin...
Being able to express and enforce role-based access control on XML data is a critical component of XML data management. However, given the semi-structured nature of XML, this is non-trivial, as access control can be applied on the values of nodes as well as on the structural relationship between nodes. In this context, we adopt and extend a graph e...
We present Circle, a classification algorithm based on the priciples of boolean function minimization. This classification process uses a recursive method to generate a set of implicants (or rules). The novelty of this algorithm is in the fact that the rules generated contain information about not only presence of features, but also their absence i...
We present a new application for experimental biologists, the Curation Alignment Tool for Protein Analysis (CATPA), that allows for the efficient and effective creation, storage, management, querying, and visualizing of experimentally curated protein families. Protein families in general include paralogs having diverged from a duplication and ortho...
Information provenance is a mechanism for tracing and verifying sources of information. In software development, provenance can be seen in two dimensions: (a) traceability among different versions of the same artifact and (b) traceability among various artifacts across system lifecycle. Maintaining the provenance, including the history of changes a...
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
Can a system designed primarily for the purpose of database- type storage and retrieval be used for information-retrieval tasks? This was one of the questions that led us to participate in the INEX 2004 initiative. DocBase, a prototype database system developed initially for SGML, and adapted to work with XML, was used for the purpose of answering...
Starting with the 2003 academic year, the advanced placement (AP) exams in computer science conducted by the College Board moved from a C++ to a Java based curriculum. In order to assist high school computer science teachers with the transition from C++ to Java, the ACM together with the College Board conceived of Java engagement for teacher traini...
The concept of thumbnails is common in image representation. A thumbnail is a highly compressed version of an image that provides a small, yet complete visual representation to the human eye. We propose the adaptation of the concept of thumbnails to the domain of documents, whereby a thumbnail of any document can be generated from its semantic cont...
We present a novel classifier based upon principles of logic-theoretic Boolean function minimization. The classifier, called Circle, recursively produces a set of implicants (or rules). The implicant set contains information not only about the presence of features, but also about their absence in determining class values. Thus, Circle's implicant s...
XML is universally recognized as the standard document format for the purpose of inter change and device independent presentation Literature shows many e orts towards the devel opment of formal and conceptual models for XML although no commonly accepted model exists as yet In this paper we present a survey of some formal and conceptual modeling tec...
XML is one of the premier formats for data representation and interchange. Many organizations are starting to store data in XML and using XML as an intermediate format for publication and use of these documents. Most database systems have support for storing data in XML or internally representing XML data for storage. However, XML does not have a s...
SQL has been the result of years of query language research, and has many desirable properties. We introduce DSQL - a language
that is based on a theoretical foundation of a declarative language (document calculus) and an equivalent procedural language
(document algebra). The outcome of this design is a language that looks and feels like SQL, yet i...
SQL has been the result of years of query language research, and has many desirable properties. We introduce DSQL – a language that is based on a theoretical foundation of a declarative language (document calculus) and an equivalent procedural language (document algebra). The outcome of this design is a language that looks and feels like SQL, yet i...
A definition for a document type within an organization represents an organizational norm about the way the organizational actors represent products and supporting evidence of organizational processes. Generating a good organization-specific document structure is, therefore, important since it can capture a shared understanding among the organizati...
Case Based Reasoning systems have gained immense popularity over the recent years as problem-solving tools. Most case based reasoning systems, however, are developed essentially from scratch using proprietary systems and applications on a limited number of platforms. Although methods have been proposed to describe the structure of a case based reas...
With the advent of the World Wide Web (WWW) and the increased use of electronic documents in almost all aspects of computing, the problems of management of and systematic information retrieval from electronic documents have become highly pertinent. Information retrieval (IR) techniques allow us to retrieve documents based on keywords, but often the...
Standard Generalized Markup Language (SGML) has been widely accepted as a standard for document representation. The strength of SGML lies in the fact that it embeds logical structural information in documents while preserving a human-readable form. This structural information in SGML documents allows processing of these documents using database tec...
The WWW has a great potential of evolving into a globally distributed digital document library. The primary use of such a library is to retrieve information quickly and easily. Because of the size of these libraries, simple keyword searches often result in too many matches. More complex searches involving boolean expressions are difficult to formul...
The purpose of this query language is to capture feasible and interesting queries on SGML databases. The main property of this query language is that it acts on SGML documents directly and produces SGML documents as output. This approach is different from the current database systems that support SGML document databases. In current systems, queryin...
This is a summary of research activities so far performed in the Structured Document Database project. The project involves building a database environment for storing, querying and updating structured documents - in particular, documents encoded in SGML (Standard Generalized Markup Language) [ISO86]. The project has three major components - (i) de...
Have you ever been frustrated by how inadequate SGML databases are in terms of searching or querying your documents? With the current state of the art, you will easily be able to search for a word, phrase, or keywords in the whole document. Some systems let you perform approximate searches or regular expression searches. Even fewer systems let you...
One of the most exciting applications of SGML which has emerged in the recent years is its use in document databases. The structural information embedded in SGML documents makes it possible to query SGML documents and extract information in an automatic manner; however, this querying process has not been standardized. As a result, different SGML da...
A method for implementing a structured document database system is presented. The present-day systems dealing with structured or tagged documents have not been able to produce capabilities that even simple database systems possess - the ability to query the database based on the various properties of the database. Research in this area also has not...
In this paper, we describe the characteristics of two different query languages designed to query XML data: DSQL, a declarative SQL like language and XQuery, a procedural language that is fast becoming the defacto language for XML querying. We then describe the design of an experiment aimed at comparing the accuracy and efficiency of the query form...
Abstract Data Modeling,frameworks,like the Entity-Relationship (ER) approach are usually specified using graphical and natural language,representations. This limits the ability to formally,express and verify the consistency,of constraints on data models. The use of mathematical,notation makes,the specification precise, but also complex and tedious...