Arijit Sengupta

Arijit Sengupta
  • Ph.D. Computer Science
  • Head of Faculty at Florida International University

About

69
Publications
14,195
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
461
Citations
Current institution
Florida International University
Current position
  • Head of Faculty
Additional affiliations
December 2018 - present
Florida International University
Position
  • Head of Faculty
July 2005 - December 2018
Wright State University
Position
  • Head of Faculty

Publications

Publications (69)
Preprint
BACKGROUND Background: Blockchains offer a promising new distributed technology to address the challenges of data standardization, system interoperability, security, privacy, and accessibility for all data. However, integrating pervasive computing with blockchain’s ability to store privacy-protected mHealth data while providing HIPAA compliance is...
Article
Full-text available
Background: Integrating pervasive computing with blockchain's ability to store privacy-protected mobile health (mHealth) data while providing Health Insurance Portability and Accountability Act (HIPAA) compliance is a challenge. Patients use a multitude of devices, apps, and services to collect and store mHealth data. We present the design of an i...
Article
Full-text available
This paper presents the effects of a technology-based engagement platform, referred to as IMPRES (Interactive Mobile Platform for Retaining and Engaging Students). IMPRES is a mobile application, backed by a set of data analysis services, which seeks to engage and retain its users to enhance their professional development. The primary goal of the a...
Research
Full-text available
Many different implementations of Blockchain currently exist, each offering its own implementation of Bitcoin's original proposal. It is difficult to compare these separate implementations for design choices, performance benchmarks or application suitability without a common basic design framework. Using the concept of design spaces, and using comm...
Article
Full-text available
Enterprise resource planning systems (ERP) are the most essential and critical information systems that are widely used to build all the functions of a business into a unified and efficient enterprise system. Although the essence of ERP systems is easy to understand, students, especially those who do not have technical experience, have difficulty a...
Article
Major Indian cities experience traffic choking from early morning, and commuters face extreme difficulty in reaching their destinations. Many surveys and empirical research results concluded with loads of suggestions for the government and transport agencies in India in terms of reducing traffic congestion and raising the commuters’ satisfaction. I...
Chapter
Product identification is a crucial success factor for any business that involves tangible assets. The success or failure of a business depends on accurately tracking these assets through their life cycle, from manufacturing to sale and post-sale activities. Barcodes have been the technology of choice since their first use on Wrigley's gum in 1974....
Chapter
This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration...
Article
XML is one of the most widely accepted data representation languages in today's Internet-dominated Computing. While most XML data on the net today use commonly known structures, the power of XML lies in the ability to develop application-specific structures and models. XER (Extensible Entity Relationship) is a conceptual modeling methodology that u...
Article
This article introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration...
Article
This article presents DSQL, a conservative extension of SQL, as an ad-hoc query language for XML. The development of DSQL follows the theoretical foundations of first order logic, and uses common query semantics already accepted for SQL. DSQL represents a core subset of XQuery that lends well to optimization techniques, while at the same time allow...
Chapter
This chapter presents DSQL, a conservative extension of SQL, as an ad-hoc query language for XML. The development of DSQL follows the theoretical foundations of first order logic, and uses common query semantics already accepted for SQL. DSQL represents a core subset of XQuery that lends well to query optimization techniques; while at the same time...
Article
Full-text available
The surge of RFID technology and ensuing competition in the RFID vendor market has increased the complexity of integration of RFID into current business processes. To address this problem, we present FlexRFID, a modeling template composed of state indicators, conditions, a simulation engine, and a device-independent deployment architecture for rapi...
Article
Full-text available
China ranks first in hard coal production worldwide. Unfortunately, Chinese coal mines are also known to be the world’s deadliest and have recorded the largest number of fatal accidents and the highest death tolls. Radio frequency identification (RFID) seems to provide a potential solution to this human safety issue. Supported by the Chinese Coal B...
Chapter
This article proposes a data warehouse integration technique that combines data and documents from different underlying documents and database design approaches. The well-defined and structured data such as relational, object-oriented and object relational data, semi-structured data such as XML, and unstructured data such as HTML documents are inte...
Chapter
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
Chapter
This article introduces a measurement ontology for applications to Semantic Web applications, specifically for emerging domains such as microarray analysis. The Semantic Web is thenext-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proc...
Article
FlexRFID is our solution to the general problem of resource time tracking, the problem of tracking the time a resource is utilized. This architecture is based on the premise that all RTT applications have some items of interest in common. One such item is the identity of the resources being tracked, and a second is a set of locations that mark the...
Article
Full-text available
The use of pilot studies to evaluate the economic justification of technology projects is common in practice. The pilot studies play even greater role in the projects affecting customer interactions with the product/service offerings since perception and/or reaction of customers is captured and analyzed through such studies. Yet, many times the met...
Article
Radio Frequency IDentification (RFID) innovations have gained considerable attention in the last several years and attracted the attention of organizations across different industries. RFID technologies have found applications in diverse areas such as package tracking, inventory management, baggage handling, school attendance, and logistics. Prior...
Chapter
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
Chapter
Full-text available
This paper introduces a measurement ontology for applications to semantic Web applications, specifically for emerging domains such as microarray analysis. The semantic Web is the next-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proce...
Article
Full-text available
Entity-Relationship (ER) diagrams are an established way of doing data modeling. In this paper, we report our experience with exploring the use of PVS to formally specify and reason with ER data models. Working with a text-book example, we rely on PVS's theory interpretation mechanism to verify the correctness of the mapping across various levels o...
Article
Full-text available
RFID installations consist of a large number of interdependent and interconnected components that function together as a unit for proper functionality and operations of the system. These components include tags, readers, antennas, networks, computing systems, power supplies, and peripherals, accessories and other mechanical systems that are control...
Conference Paper
Full-text available
Access control for semi-structured data is nontrivial, as witnessed by the number of access control approaches in literature. Recently, a case has been made for expressing access constraints at finer levels of granularity on data nodes and extending constraints to structural relationships. In this paper, we introduce a rewrite-based approach for ac...
Article
Full-text available
XML plays an important role as the standard language for representing structured data for the traditional Web, and hence many Web-based knowledge management repositories store data and documents in XML. If semantics about the data are formally represented in an ontology, then it is possible to extract knowledge: This is done as ontology definitions...
Article
Full-text available
Sharing data between organizations is challenging because it is difficult to ensure that those consuming the data accurately interpret it. The promise of the next generation WWW, the semantic Web, is that semantics about shared data will be represented in ontologies and available for automatic and accurate machine processing of data. Thus, there is...
Article
Full-text available
When describing web services, one of the obvious aspects that need representing is 'Quality of Service' (QoS), the capability of a web service to meet an acceptable level of service as per factors such as availability and accessibility. However, too much of a focus on developing functional QoS ontologies has led to an over-emphasis on representing...
Article
Full-text available
This paper introduces a measurement ontology for applications to semantic Web applications, specifically for emerging domains such as microarray analysis. The semantic Web is the next-generation Web of structured data that are automatically shared by software agents, which apply definitions and constraints organized in ontologies to correctly proce...
Article
Full-text available
We present a user-centered database query language called QBT (Query By Templates) for user communication with databases containing complex structured data, such as data stored in the Extensible Markup Language (XML). XML has revolutionized data storage as well as representation and transfer methods in today's internet applications. The growing pop...
Conference Paper
Full-text available
We present ACXESS (Access Control for XML with Enhanced Security Specifications), a system for specifying and enforcing enhanced security constraints on XML via virtual "security views" and query rewrites. ACXESS is the first system that bears the capability to specify and enforce complicated security policies on both subtrees and structural relati...
Article
Full-text available
Because of its success, XML is increasingly used in many dierent application areas, and is moving towards the center of applications, evolving from an exchange format to the native data format of application components. These developments suggest that similar to other core areas of application design, XML should be designed conceptually before the...
Article
Full-text available
XML is gaining predominance as the standard for data representation and exchange. Access con-trol for XML data is nontrivial as witnessed from the number of access control models presented in literature. Existing models provide the ability to extend access control to data as well as structure and enforce the specified access control via view materi...
Article
We present SWAP (Semantic Web application pyramid), a framework for incorporating ontologies in data-oriented semantic Web applications. We have implemented this framework with a measurement ontology for a quality management Web service. This quality management Web service is built on top of a set of XML Web services implementing agents representin...
Chapter
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
Chapter
We present SWAP (Semantic Web application pyramid), a framework for incorporating ontologies in data-oriented semantic Web applications. We have implemented this framework with a measurement ontology for a quality management Web service. This quality management Web service is built on top of a set of XML Web services implementing agents representin...
Conference Paper
Full-text available
Being able to express and enforce role-based access control on XML data is a critical component of XML data management. However, given the semi-structured nature of XML, this is non-trivial, as access control can be applied on the values of nodes as well as on the structural relationship between nodes. In this context, we adopt and extend a graph e...
Conference Paper
Full-text available
We present Circle, a classification algorithm based on the priciples of boolean function minimization. This classification process uses a recursive method to generate a set of implicants (or rules). The novelty of this algorithm is in the fact that the rules generated contain information about not only presence of features, but also their absence i...
Conference Paper
Full-text available
We present a new application for experimental biologists, the Curation Alignment Tool for Protein Analysis (CATPA), that allows for the efficient and effective creation, storage, management, querying, and visualizing of experimentally curated protein families. Protein families in general include paralogs having diverged from a duplication and ortho...
Conference Paper
Information provenance is a mechanism for tracing and verifying sources of information. In software development, provenance can be seen in two dimensions: (a) traceability among different versions of the same artifact and (b) traceability among various artifacts across system lifecycle. Maintaining the provenance, including the history of changes a...
Article
The process of conceptual design is independent of the final platform and the medium of implementation, and is usually in a form that is understandable and usable by managers and other personnel who may not be familiar with the low-level implementation details, but have a major influence in the development process. Although a strong design phase is...
Conference Paper
Full-text available
Can a system designed primarily for the purpose of database- type storage and retrieval be used for information-retrieval tasks? This was one of the questions that led us to participate in the INEX 2004 initiative. DocBase, a prototype database system developed initially for SGML, and adapted to work with XML, was used for the purpose of answering...
Conference Paper
Full-text available
Starting with the 2003 academic year, the advanced placement (AP) exams in computer science conducted by the College Board moved from a C++ to a Java based curriculum. In order to assist high school computer science teachers with the transition from C++ to Java, the ACM together with the College Board conceived of Java engagement for teacher traini...
Conference Paper
Full-text available
The concept of thumbnails is common in image representation. A thumbnail is a highly compressed version of an image that provides a small, yet complete visual representation to the human eye. We propose the adaptation of the concept of thumbnails to the domain of documents, whereby a thumbnail of any document can be generated from its semantic cont...
Conference Paper
Full-text available
We present a novel classifier based upon principles of logic-theoretic Boolean function minimization. The classifier, called Circle, recursively produces a set of implicants (or rules). The implicant set contains information not only about the presence of features, but also about their absence in determining class values. Thus, Circle's implicant s...
Article
Full-text available
XML is universally recognized as the standard document format for the purpose of inter change and device independent presentation Literature shows many e orts towards the devel opment of formal and conceptual models for XML although no commonly accepted model exists as yet In this paper we present a survey of some formal and conceptual modeling tec...
Article
Full-text available
XML is one of the premier formats for data representation and interchange. Many organizations are starting to store data in XML and using XML as an intermediate format for publication and use of these documents. Most database systems have support for storing data in XML or internally representing XML data for storage. However, XML does not have a s...
Conference Paper
SQL has been the result of years of query language research, and has many desirable properties. We introduce DSQL - a language that is based on a theoretical foundation of a declarative language (document calculus) and an equivalent procedural language (document algebra). The outcome of this design is a language that looks and feels like SQL, yet i...
Conference Paper
Full-text available
SQL has been the result of years of query language research, and has many desirable properties. We introduce DSQL – a language that is based on a theoretical foundation of a declarative language (document calculus) and an equivalent procedural language (document algebra). The outcome of this design is a language that looks and feels like SQL, yet i...
Article
Full-text available
A definition for a document type within an organization represents an organizational norm about the way the organizational actors represent products and supporting evidence of organizational processes. Generating a good organization-specific document structure is, therefore, important since it can capture a shared understanding among the organizati...
Article
Full-text available
Case Based Reasoning systems have gained immense popularity over the recent years as problem-solving tools. Most case based reasoning systems, however, are developed essentially from scratch using proprietary systems and applications on a limited number of platforms. Although methods have been proposed to describe the structure of a case based reas...
Article
Full-text available
With the advent of the World Wide Web (WWW) and the increased use of electronic documents in almost all aspects of computing, the problems of management of and systematic information retrieval from electronic documents have become highly pertinent. Information retrieval (IR) techniques allow us to retrieve documents based on keywords, but often the...
Article
Full-text available
Standard Generalized Markup Language (SGML) has been widely accepted as a standard for document representation. The strength of SGML lies in the fact that it embeds logical structural information in documents while preserving a human-readable form. This structural information in SGML documents allows processing of these documents using database tec...
Conference Paper
The WWW has a great potential of evolving into a globally distributed digital document library. The primary use of such a library is to retrieve information quickly and easily. Because of the size of these libraries, simple keyword searches often result in too many matches. More complex searches involving boolean expressions are difficult to formul...
Article
Full-text available
The purpose of this query language is to capture feasible and interesting queries on SGML databases. The main property of this query language is that it acts on SGML documents directly and produces SGML documents as output. This approach is different from the current database systems that support SGML document databases. In current systems, queryin...
Article
Full-text available
This is a summary of research activities so far performed in the Structured Document Database project. The project involves building a database environment for storing, querying and updating structured documents - in particular, documents encoded in SGML (Standard Generalized Markup Language) [ISO86]. The project has three major components - (i) de...
Article
Have you ever been frustrated by how inadequate SGML databases are in terms of searching or querying your documents? With the current state of the art, you will easily be able to search for a word, phrase, or keywords in the whole document. Some systems let you perform approximate searches or regular expression searches. Even fewer systems let you...
Article
Full-text available
One of the most exciting applications of SGML which has emerged in the recent years is its use in document databases. The structural information embedded in SGML documents makes it possible to query SGML documents and extract information in an automatic manner; however, this querying process has not been standardized. As a result, different SGML da...
Article
Full-text available
A method for implementing a structured document database system is presented. The present-day systems dealing with structured or tagged documents have not been able to produce capabilities that even simple database systems possess - the ability to query the database based on the various properties of the database. Research in this area also has not...
Article
Full-text available
In this paper, we describe the characteristics of two different query languages designed to query XML data: DSQL, a declarative SQL like language and XQuery, a procedural language that is fast becoming the defacto language for XML querying. We then describe the design of an experiment aimed at comparing the accuracy and efficiency of the query form...
Article
Full-text available
Abstract Data Modeling,frameworks,like the Entity-Relationship (ER) approach are usually specified using graphical and natural language,representations. This limits the ability to formally,express and verify the consistency,of constraints on data models. The use of mathematical,notation makes,the specification precise, but also complex and tedious...

Network

Cited By