
Wang-Pin Hsiung- NEC Laboratories America
Wang-Pin Hsiung
- NEC Laboratories America
About
48
Publications
5,030
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,058
Citations
Current institution
Publications
Publications (48)
With the growth of 5G, Internet of Things (IoT), edge computing and cloud computing technologies, the infrastructure (compute and network) available to emerging applications (AR/VR, autonomous driving, industry 4.0, etc.) has become quite complex. There are multiple tiers of computing (IoT devices, near edge, far edge, cloud, etc.) that are connect...
Identification of people with elevated body temperature can reduce or dramatically slow down the spread of infectious diseases like COVID-19. We present a novel fever-screening system, F3S, that uses edge machine learning techniques to accurately measure core body temperatures of multiple individuals in a free-flow setting. F3S performs real-time s...
A swap-based multitenant database load balancing (SWAT) invention finds an optimal swap set that eliminates hotspots, and finds a sequencing and parallelization of swap executions for the multitenant database.
Supporting an online transaction processing (OLTP) workload in a scalable and elastic fashion is a challenging task. Recently, a new breed of scalable systems have shown significant throughput gains by limiting consistency to small units of data called “entity-groups” (e.g., a user's account information stored together with all her emails in an onl...
Entity-group based new SQL systems achieve scalability and consistency at the same time by using a key-value store as the storage layer and limiting each transaction's boundary to a collection of data (called an entity-group). Examples of such systems are Google's Megastore, NEC's Partiqle, and LinkedIn's Espresso. Application developers of such sy...
Query scheduling, a fundamental problem in database management systems, has recently received a renewed attention, perhaps in part due to the rise of the "database as a service" (DaaS) model for database deployment. While there has been a great deal of work investigating different scheduling algorithms, there has been comparatively little work inve...
There has been a great interest in exploiting the cloud as a platform for database as a service. As with other cloud-based services, database services may enjoy cost efficiency through consolidation: hosting multiple databases within a single physical server. Aggressive consolidation, however, may hurt the service quality, leading to SLA violation...
Multitenant databases achieve cost efficiency through the consolidation of multiple small tenants. However, performance isolation is an inherent problem in multitenant databases due to resource sharing among the tenants. That is, a bursty workload from a co-located tenant, i.e., a noisy neighbor, may affect the performance of the other tenants shar...
The demo features Partiqle, a SQL engine over key-value stores as a relational alternative for the recent procedural approaches to support OLTP workloads elastically. Based on our microsharding framework [12], it employs a declarative specification, called transaction classes, of constraints applied on the transactions in a workload. We demonstrate...
The limitation of natural parallelization comes from data management. Ever increasing need for scalability and new data and applications create new challenges for relational database systems. Different business needs and workloads may be satisfied with diverse set of data management capabilities. More importantly the business needs also evolve over...
ABSTRACT Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the flrst step of this object extraction process, identifles a set of Web page segments, each of which represents an individ- ual object (e.g., a product). State-of-the-art methods su‐ce for simple search, but they often fai...
An XML publish/subscribe system needs to filter a large number of queries over XML streams. Most existing systems only consider filtering the simple XPath statements. In this paper, we focus on filtering of the more complex generalized-tree-pattern (GTP) queries. Our filtering mechanism is based on a novel Tree-of-Path (TOP) encoding scheme, which...
Monitoring moving objects is one of the key application domains for sensor networks. In the absence of cooperative objects and devices attached to these objects, target tracking algorithms have to be used for monitoring. In this paper, we present that many of the applications of moving object monitoring systems could be addressed with low-frequency...
Detecting complex patterns in event streams, i.e., complex event processing (CEP), has become increasingly important for modern enterprises to react quickly to critical situations. In many practical cases business events are generated based on pre-defined business logics. Hence constraints, such as occurrence and order constraints, often hold among...
An XML publish/subscribe system needs to filter a large number of queries over XML streams. Most existing systems only consider filtering the simple XPath statements. In this paper, we focus on filtering of the more complex Generalized-Tree-Pattern (GTP) queries. Our filtering mechanism is based on a novel Tree-of-Path (TOP) encoding scheme, which...
Efficient data acquisition in WSNs has attracted significant interest. For example, TinyDB [2] introduced query dissemination and data aggregation trees. Later, a probabilistic model of the physical world is used in [1]. Recently, [3] argues that probabilistic models of the physical world used in acquisition may miss outliers and introduces spatio-...
Continuous join queries (CJQ) are needed for correlating data from multiple streams. One fundamental problem for processing such queries is that since the data streams are infinite, this would require the join operator to store infinite states and eventually run out of space. Punctuation semantics has been proposed to specifically address this prob...
For e-commerce Web service applications, cache should be applicable especially to product information services. However, practical e-commerce use cases have limited cacheability: (1) a message may contain management components, which reduce reusability of cached messages; (2) content typically consists of composite objects, which makes it harder to...
XML message filtering problem involves searching for instances of a given, potentially large, set of patterns in a continuous stream of XML messages. Since the messages arrive continuously, it is essential that the filtering rate matches the data arrival rate. Therefore, the given set of filter patterns needs to be indexed appropriately to enable r...
Tree pattern matching is one of the most fundamental tasks for XML query processing. Holistic twig query processing techniques [4, 16] have been developed to minimize the intermediate results, namely, those root-to-leaf path matches that are not in the final twig results. However, useless path matches cannot be completely avoided, especially when t...
Response time is a key differentiation point among electronic commerce (e-commerce) applications. For many e-commerce applications, web pages are created dynamically based on the current state of a business stored in database systems. To improve the response time, many e-commerce websites deploy caching solutions for acceleration of content deliver...
For most Web-based applications, contents are created dynamically based on the current state of a business, such as product prices and inventory, stored in database systems. These applications demand personalized content and track user behavior while maintaining application integrity. Many of such practices are not compatible with Web acceleration...
For most Web-based applications, contents are created dynamically based on the current state of a business, such as product prices and inventory, stored in database systems. These applications demand personalized content and track user behavior while maintaining application integrity. Many of such practices are not compatible with Web acceleration...
This technical description has highlighted many features of CachePortal II using an e-commerce site built using some of most popular components on the market, such as Oracle DBMS and BEA WebLogic. It shows a real and useful application of integrating database and Web technologies. CachePortal II Beta version is currently available for J2EE complien...
This technical description has highlighted many features of Cache Portal II using an e-commerce site built using some of most popular components on the market, such as Oracle DBMS and BEA web logic. This chapter explores a real and useful application of integrating database and Web technologies. Cache Portal II Beta version is currently available f...
Both response time and content freshness are essential to e-commerce applications on the Web. One option to achieve good response time is to build a high performance Web site by deploying the state of art IT infrastructures with large network and server capacities. With such a system architecture, freshness of the content delivered is limited by th...
Wide-area database replication technologies and the availability of content delivery networks allow Web applications to be hosted and served from powerful data centers. This form of application support requires a complete Web application suite to be distributed along with the database replicas. A major advantage of this approach is that dynamic con...
Ultrahigh storage densities of up to 1 Tb/in2. or more can be achieved by using local-probe techniques to write, read back, and erase data in very thin polymer films. The thermomechanical scanning-probe-based data-storage concept, internally ...
Response time is essential to many Web applications. Consequently, many database-driven Web applications rely on data centers that host applications and database contents. Such IT infrastructure enables generation of requested pages at locations much closer to the end-users, thus reducing network latency. However, it incurs additional complexity as...
Response time is a key differentiation point among electronic commerce (e-commerce) applications. For many e-commerce applications, Web pages are created dynamically based on the current state of a business stored in database systems. To improve the response time, many e-commerce Web sites deploy caching solutions for acceleration of content delive...
Response time is a key differentiation among electronic commerce (e-commerce) applications. For many e-commerce applications, Web pages are created dynamically based on the current state of a business stored in database systems. Recently, the topic of Web acceleration for database-driven Web applications has drawn a lot of attention in both the res...
Response time is a key differentiation point among electronic commerce (e-commerce) applications. For many e-commerce applications, Web pages are created dynamically based on the current business state, stored in database systems. The architecture of database-driven e-commerce Web sites are more complex than that of typical Web sites. It requires i...
In today's multitiered application architectures, clients do not access data stored in the databases directly. Instead, they use applications which in turn invoke the DBMS to generate the relevant content. Since executing application programs may require significant time and other resources, it is more advantageous to cache application results in a...
The fast growing demand for e-commerce brings a unique set of challenges to build a high performance e-commerce Web site both
in technical terms and in business terms. To ensure the fast delivery of fresh dynamic content and engineer highly scalable
e-commerce Web sites for special events or peak times continuously put heavy pressures on IT staffs...
p r orm n s y r nt t on mon ont nt prov rs. n us n slow owns t m or w s t s mon-str t t ? ulty t t omp n s try n to s l to l r mount o w tr ?. n solut on to t s pro l m s to stor w ont nt t s rv r-s n-l v ry to t n us rs. ow v r, or m ny-omm r s t s, w p s r r t yn m lly s on t ur-r nt st t o us n ss pro ss s, r pr s nt n ppl t on s rv rs n t s s....
We present a vision for a comprehensive data management platform in the cloud. The envisioned system, called CloudDB, would provide data management capabilities as a service to transparently and efficiently support diverse application workloads with identifiable SLA guarantees and end-to-end system management functions. The system will be able to e...