Li Zha's research while affiliated with Chinese Academy of Sciences and other places
What is this page?
This page lists the scientific contributions of an author, who either does not have a ResearchGate profile, or has not yet added these contributions to their profile.
It was automatically created by ResearchGate to create a record of this author's body of work. We create such pages to advance our goal of creating and maintaining the most comprehensive scientific repository possible. In doing so, we process publicly available (personal) data relating to the author as a member of the scientific community.
If you're a ResearchGate member, you can follow this page to keep up with this author's work.
If you are this author, and you don't want us to display this page anymore, please let us know.
Publications (49)
This paper presents CirroData, a high-performance SQL-on-Hadoop system designed for Big Data analytics workloads. CirroData is a home-grown enterprise-level online analytical processing (OLAP) system backed by more than seven years of research and development (R&D), and we share with the community our design details on how to achieve high performance in Cirr...
Deep Learning has recently been gaining popularity. From the micro-architecture field to the upper-layer end applications, a lot of research work has been proposed in the literature to advance the knowledge of Deep Learning. Deep Learning benchmarking is one such hot spot in the community. There are a number of Deep Learning benchmarks available...
With the popularity of cloud computing, decoupled compute-storage architecture has become a trend. While being able to independently scale compute and storage results in large cost savings and more flexibility, this architecture also increases the latency of data access, reducing the performance. To solve this problem, Alluxio was proposed. Alluxio...
In recent years, most Web 2.0/3.0 applications have been built on top of distributed systems which allow data to be modeled as Distributed Ordered Tables (DOTs) such as Apache HBase. To analyze the stored data, SQL-like range queries over a DOT are fundamental requirements. However, range queries over existing DOT implementations are highly ineffic...
Given a message, cascade prediction aims to predict the individuals who will potentially retweet it. Most existing methods either exploit demographical, structural, and temporal features for prediction, or explicitly rely on particular information diffusion models. Recently, researchers attempt to design fully data-driven methods for cascade predic...
In the original MapReduce model, reduce tasks need to fetch output data of map tasks in the manner of “pull”. However, reduce tasks which are occupying reduce slots cannot start executing until all the corresponding map tasks are completed. It forms the dependence between map and reduce tasks, which is called the coupled relationship in this paper....
Cloud database usually refers to a database based on the cloud computing technology. However, as far as we know, pre-existing solutions of cloud database cannot integrate the data from multi-sourced heterogeneous databases, only supplying an isolated homogeneous database cluster. This paper presents a new implementation approach for cloud database:...
MPI has been widely used in High Performance Computing. In contrast, such efficient communication support is lacking in the field of Big Data Computing, where communication is realized by time consuming techniques such as HTTP/RPC. This paper takes a step in bridging these two fields by extending MPI to support Hadoop-like Big Data Computing jobs,...
In the MapReduce model, reduce tasks need to fetch output data of map tasks in the manner of “pull”. However, reduce tasks which are occupying reduce slots cannot start to compute until all the corresponding map tasks are completed. It forms the dependence between map and reduce tasks, which is called the coupled relationship in this paper. The cou...
In the cloud age, heterogeneous application modes on large-scale infrastructures pose challenges for resource utilization and manageability in data centers. Many resource and runtime management systems have been developed or have evolved to address these challenges and related problems from different perspectives. This paper tries to identify the m...
In this paper, we propose an online search system based on a Key-Value Store which aims to provide real-time k-NN (k-Nearest Neighbor) search in large-scale high-dimensional vector spaces. Through an improved indexing method based on KD-tree, the vector space is divided into a number of fixed-size heaps, so that only vectors of a specified heap need to do k-...
MapReduce is gaining increasing popularity as a parallel programming model for large-scale data processing. We find however some traditional MapReduce platforms have a poor performance in terms of cluster resource utilization since the traditional multi-phase parallel model and some existing schedule policies used in the cluster environment have so...
Hadoop has shown great power in processing vast data in parallel. Hive, the database on Hadoop, enables more experts to process relational data by providing an SQL-like interface. However, Hive does not provide an efficient approach for join, a common but expensive operator in relational databases. Due to the importance of join, this paper proposes a n...
The Message Passing Interface (MPI) standard and its implementations (such as MPICH and OpenMPI) have been widely used in the high-performance computing area to provide an efficient communication infrastructure. This paper investigates whether MPI can be adapted to the data intensive computing area to substantially speed up Hadoop and MapReduce app...
Research on the basic model, architecture, key technologies and system implementation is a hot topic of cloud computing in the academic community and IT industry. By analyzing the characteristics of two existing types of cloud computing systems, including vertical cloud system and hybrid cloud system, three important and challenging issues of building...
In large organizations or IDCs, different departments always occupy and maintain dedicated resources to satisfy their own or their customers' heterogeneous application loads. This situation easily makes infrastructure management repetitive and inefficient. Even worse, it is difficult to share the resources owned by different departments even w...
Research on the basic model, architecture, key technologies and system implementation is a hot topic of cloud computing in the academic community and IT industry. By analyzing the characteristics of two existing types of cloud computing systems, including vertical cloud system and hybrid cloud system, three important and challenging issues of building...
MapReduce clusters are emerging as a solution for data-intensive scalable computing. The open source implementation Hadoop has already been adopted for building clusters containing thousands of nodes. Such cloud infrastructure is used to process many different jobs that depend on different hardware resources, such as memory, CPU, Disk I/O an...
Massive-scale distributed databases like Google’s BigTable and Yahoo!’s PNUTS can be modeled as a Distributed Ordered Table, or DOT, which partitions data regions and supports range queries on key. Multi-dimensional range queries on DOTs are fundamental requirements; however, none of the existing schemes work well when considering three critical issues:...
Usability is an important feature of Grids or Clouds to end users, who may not be computer professionals but need to use massive machines to compute their jobs. For meeting various computing or management requirements, heterogeneous clusters with diverse Distributed Resource Management Systems (D-RMS) and applications are needed to supply computing...
Analyzing the factors that affect Web Service invocation performance is a hot topic. Among these factors, service interface complexity is a key one investigated by much research work. However, these studies mainly analyze the impact on performance of primitives, some simple data structures like mesh interface object, or arrays of them. For the com...
In a virtual cluster based Cloud Computing environment, the sharing of infrastructure introduces two problems on user management: usability and security. Meanwhile, we observe that most conventional user management frameworks in the network environment are not fit for the scale expansion and interconnection of dynamic virtualization environment. In...
Grid technology has been widely accepted for distributed resource sharing and high performance computing across multiple administrative domains. In this paper, we analyze the security issues in grid computing environments, and propose a security framework for VegaGOS[1] which is a Service Oriented Architecture middleware developed for the China Natio...
Theoretically, invoking web services from multi-language clients is no longer a problem thanks to XML-based interface descriptions in WSDL, but the reality is not so good. Some implementation level difficulties still exist when invoking web services from clients in different programming languages. These difficulties are caused by involving complex data s...
Classifying computing systems is a basic component of computer science, directly impacting the design, programming and optimization of computer systems artifacts. The past 20 years have seen many network computing applications. However, an effective classification of network computing systems has not emerged in the academia. This negatively impacte...
MapReduce is an important programming model for building data centers containing tens of thousands of nodes. In a practical data center of that scale, it is a common case that I/O-bound jobs and CPU-bound jobs, which demand different resources, run simultaneously in the same cluster. In the MapReduce framework, parallelization of these two kinds of...
This paper reviews the programming landscape for parallel and network computing systems, focusing on four styles of concurrent programming models, and example languages/libraries. The four styles correspond to four scales of the targeted systems. At the smallest coprocessor scale, Single Instruction Multiple Thread (SIMT) and Compute Unified Device...
This paper presents OverFly, a large-scale publish-subscribe system developed to support our grid middleware, the Vega grid operating system (GOS for short). OverFly differs from most existing publish-subscribe systems in three ways. First, it introduces a message store at the client-side which enables clients to be more resilient to intermittent system...
Message-based debugging facilities for Web or Grid Services are separated from an infrastructure of source level debugging and can work in a self-identifying and coexisting mode within a normal services container. In this paper, we discuss problems for services debugging and approaches we take. We present the operational model and context inspectio...
The shell is one of the most important user interfaces for HPC users in scientific computing. This paper presents a login shell for computing grids, called GShell, which provides an integrated and uniform environment for constructing, running and managing grid applications. Besides the conventional functionality of a shell, GShell provides grid-level functionality incl...
As we enter the 21st century, a profound transformation is emerging in the field of computer science and technology, and this is also true for the subfield of computer systems. The main characteristic of this transformation is the leap from man-computer symbiosis to man-cyber-physical society (a tri-world of people, computers, and things). This raises...
Virtual Organizations (VOs) are dynamic collections of individuals, institutions, and resources. VOs are widely accepted in grid and other distributed computing environments. Previous VO research produces several separate tools to provide part of the functionality including user registration, user mapping, authentication, authorization, and VO mana...
In this paper, we first introduce some issues that are encountered in building a service debugger and briefly describe our approach to addressing them. Next, we outline some debugging modes and components of a simple composite debugger. Then, we mainly describe its message-based front-end and back-end, which are a co-existing, self-identifying, and...
This paper presents VegaBench, a benchmark tool, and its methodology of evaluation for service oriented grid system software. The aim of the VegaBench is to bring forth a set of benchmarks for both end users and the system software developers to measure grid system software. In order to do this in an organized and flexible way, we provide some inte...
The main obstacles to grid file management come from the fact that grid file resources are typically stored in heterogeneous and distributed environment and accessed through various protocols. In this paper, we propose a grid file management system called Vega [1][2] Hotfile2 for data-intensive application in widely distributed systems and grid en...
The China National Grid project developed and deployed a suite of grid system software called CNGrid Software. This paper presents the features and implementation of the software suite from the viewpoints of grid system deployment, grid application developers, grid resource providers, grid system administrators, and the end users.
A grid is an open, distributed system characterized by scalability, dynamism and autonomy. Organizing large-scale, disordered resources and providing sharing and coordination services are key issues that must be solved when constructing a grid in an Internet environment. To address the current shortcomings of grid platforms, the VEGA research team utilizes trad...
This paper describes a security mechanism for grid systems, and analyzes the security requirements, security policy, and security management of grid systems, as well as some security problems in grid computing systems. Our mechanism consists of mutual authentication, agora based authorization, access control and secure communication. We also describe an implementation of t...
Hotfile is a user level file management system. It wraps GridFTP and GASS or any other file transfer protocol compatible with hotfile structure into a unified vegafile protocol. Based on virtual grid file layer and a set of basic grid file operations, users can access grid file without knowing the physical transport protocol of the file. Test resul...
Recent work towards a standard based, service oriented Grid represents the convergence of distributed computing technologies from e-Science and e-Commerce communities. A global service management system in such context is required to achieve efficient resource sharing and collaborative service provisioning. This paper presents VEGA system software,...
Zhiwei Xu, Wei Li, Li Zha, [...], Donghua Liu
In this paper, we contrast four approaches for Grid computing, and discuss a computer systems approach in detail. This approach views a Grid as a distributed computer system, and its main concerns are systems abstractions and constructs, such as the Grid equivalents of computer architecture, address space, process, device, file system, user/develop...
An approach is presented that directly modifies the binary code of a program with ELF-Rewriter and distributes the computation load across the whole cluster system without touching the source code. The ELF-Rewriter injects a stub code segment into the host program, responsible for the communication between host program and Task Dispatcher, and host p...
In China, CNGrid software is one of the first middleware-level grid software suites aiming at distributed resource sharing and application integration. It is built with SOA technology and composed of Vega GOS, GriShield and GriDaEn, which are responsible for implementing the resource sharing mechanism, security and file management, respectively. In CNGrid softwa...
Citations
... In this paper, the Aadhaar data analysis is carried out on different distributed computing frameworks mainly MapReduce [1], Hive [3] and Apache Spark [4] on top of Hadoop. ...
... For example, MLPerf [50] is a comprehensive benchmark for measuring ML inference performance across a spectrum of use cases. Architecture-oriented DNN benchmarks [39], [32], [38] target on analyzing the architectural features of DNNs on computing systems of different sizes. MDLBench [39], Embench [32] and AIoTBench [23] are representative benchmarks that characterize the features of different AI models on edge or mobile devices while NNBench-X [57], GNNMark [46] target on acceleration hardware design for different DNNs. ...
... Many large-scale distributed computing organizations that need to store and maintain continuous amounts of data deploy distributed storage systems, such as HDFS [4,5], GFS [2,3] (which were mentioned above), Ceph [12], Microsoft Azure [13,14], Amazon S3 [15], Alluxio [16] etc., which comprise multiple nodes, often organized into groups called racks. Currently, most of these systems write and store large data as blocks of fixed size, which are distributed almost evenly among the system's nodes using random block placement or load balancing policies. ...
... Here, the "Spark+HBase" scheme performs the worst which is mainly because HBase is not a good storage engine for processing complex queries with a large number of data accesses. HBase is mainly designed for point query and some range queries with the help of additional index techniques [17]. Compared with "Spark+Hive", CirroData has a better distributed query execution plan, and the tasks are scheduled in a more load-balanced manner. ...
... For example, when executing an UPDATE statement, MariaDB updates a table's secondary indexes in the same transaction as the table rows [91]. However, in systems that implement asynchronous (lazy) derived state maintenance policies [111,130,135] derived state can become stale with respect to corpus. ...
... [30] models the information propagation as heat diffusion in a high dimensional latent space through which the node's representation is learned. Following this work, [31,32] make modifications to improve the performance. The introduction of the deep learning method makes the use of network topology, temporal order and other features much more convenient, giving rise to a quick shift in designing the model. ...
... Sometimes not all mobile have cameras as well. Some devices can play video and some cannot play video [42]. Quality assurance of mobile apps deals with high-quality apps is produced based on standards and deliver high-quality products line for the consumers. ...
Reference: Quality assurance for mobile applications
... After acquiring data and understanding the data structure, the process which distinguish the standard analytic project and big data project is data split process which applying the concept of map and reduce. In map and reduce concept, the master distributes the idle task to the slave [25]. A large file which is infeasible for standard processing tools is distribute to several files. ...
... Since the stage is set to have research experience of resource discovery, it is imperative for libraries to provide information with tailor-made expansive representation of library collections and relevant content. Finding a way to combine the best of the just-in-case and Library automation just-in-time technologies using the pre-harvested, pre-massaged and pre-indexed approach of search engines to meet changing user expectations is the current challenge for librarians and their vendors (Walker, 2009). According to a report by the Research Information Network (2008) on building research tools from the physical artefacts and resources of museums, "technological developments offer opportunities for cross searching, for making records findable by Google and other search engines, for linking to associated documentation, and for integrating museum catalogues with other resources, such as library catalogues. ...
... The research of Grid-Cloud based IT platform mainly includes three sub-fields, system architecture, software mechanism and resource allocation. A great deal of existing research concerning software mechanism that will be necessary to bring Grid and Cloud to fruition is underway [9-18]. Thereby, we are able to abstract away implementation details and focus on the scalable service system architecture and the effective Grid-Cloud resource allocation. ...
Reference: Grid-Cloud: IT platform for Service Science