Liutong Xu’s research while affiliated with Beijing University of Posts and Telecommunications and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (24)


A PaaS based metadata-driven ETL framework
  • Article

September 2011

·

256 Reads

·

2 Citations

Liutong Xu

·

Jia Liao

·

Ruixue Zhao

·

Knowledge discovery has often used as a background application to motivate many technical problems in ETL research. However, traditional ETL tools face new challenges include tremendous amount of data and limitation of computing ability and so on. Meanwhile, MapReduce parallel computing model has been widely used in recent years. In This paper, we first analyze the problems of existing ETL tools and propose a metadatadriven ETL service model, and then summarize the types of metadata and their application scopes. Based on this metadata-driven ETL service model, we put forward a concrete ETL framework combined ETL with MapReduce algorithm framework and provided as PaaS to meet the requirements. Afterwards, many significant services are also discussed. At last, we illustrate some strategies for advancing the flexibility, extensibility of the framework and promote the reusability of ETL components and ETL application. In conclusion, practices have proved that the model and the framework proposed in this paper have advantages that open-source or commercial ETL tools do not have and can deal the problem of processing large scale data.


A management platform for Eucalyptus-based IaaS

September 2011

·

82 Reads

·

3 Citations

Infrastructure-as-a-service (IaaS), it is typically a platform virtualization environment as a service, which deploys IT infrastructure as virtual machines in a provider’s data center. There are many open source cloud platforms such as Eucalyptus, Open Nebula, oVirt, Enomaly ECP etc, that can help to transform enterprises’ or organizations’ existing IT infrastructure into a private or hybrid cloud. With the growing popularity of IaaS, resource monitoring and VM (Virtual Machine) monitoring has became more and more important in cloud management. But all those open source cloud platforms cannot satisfy the requirement of cloud environment monitoring and management completely. In this paper, we present a novel IaaS management platform based on Eucalyptus, which uses Libvirt management library, SNMP (simple network management protocol) and VMM (Virtual Machine Monitor) to get more abilities to manage and monitor the cloud. This platform provides some new features which can manage and monitor resources and VM status in the cloud environment and give a good user experience.


MRData: A MapReduce-Based Tool for Heterogeneous Data Integration

August 2010

·

15 Reads

·

3 Citations

As the volume of data increasing sharply and the relationship among different data sources becoming intricately, how to integrate mass data sources and how to find latent information from the integrated data is a matter of urgency. At present, industry tends to adopt distributed computing model to solve the integration of massive data. Aiming at getting the valuable and in-depth information, visualization is a critical step in data analysis and data mining. We design a tool called MRData for heterogeneous data integration which has two features: 1) parallel data processing based on Hadoop which is a distributed platform; 2) visual analysis. And at last, experiments verify the efficiency of MRData.


Thematic Maps Service for GIS Based Urban Pipe Network Visualization System

June 2009

·

22 Reads

Liutong Xu

·

Xiaoting Fu

·

Hao Lin

·

[...]

·

Urban pipe network pipelines are criss-crossing and have 3D (three-dimension) distribution. Plenty of attribute information and spatial data are involved. In order to display it in a more intuitive way, UPNVS (urban pipe network visualization system) is designed. The thematic maps service is a part of it. In this paper, we propose the thematic maps service, upload and edit data service, information attention service and 3D viewing service. While 3D display service concentrates on displaying simple map and unique value map. It implements the transit from 2D thematic maps to 3D and reflects the different property values of pipelines with different colors. This application enhances thematic maps service lively and visibility. Finally, we plug it to our UPNVS which can show the patterns and trends easily and provide the information for decision-making. At the same time, the results can also be applied to other visualization system and 3D thematic maps.


A Service Stack for 3D Visualization of GIS Based Urban Pipe Network

June 2009

·

38 Reads

·

1 Citation

This paper studies the functional services of the 3D visualization of urban pipe network (UPN). In accordance with the characteristics of the UPN and its demands for 3D visualization. The division of the visualization services of the 3D visualization of UPN is researched, then a structure of services division named service stack is propose in this paper, which is applicable to the 3D visualization of UPN. The service stack provides a unified standard for the development of the 3D visualization of UPN, simplifies the process of the system development, and provides the system with good compatibility, scalability and reusability. Key technologies about the service stack are also discussed in this paper. Finally, based on the service stack, a prototype of urban pipe network 3D visualization system is implemented.


Spatial Data Model for Visualization System of GIS Based Urban Pipe Network

May 2009

·

41 Reads

·

3 Citations

Since planar graphics can not meet the needs of precisely reflecting real spatial relationship between pipelines, it is necessary to build a 3D visualization system of urban pipe network. 3D spatial data model is the basis of data representation and spatial visualization. Based on the analysis of the features of urban pipe network and the requirements of the 3D model, a spatial data model for Urban Pipe Network Visualization System (UPNVS) is designed and organized by spatial metadata. The advantage of the spatial metadata is discussed in the paper. The application of the model in a real project shows that it is effective.



Parallel Algorithm for Enumerating Maximal Cliques in Complex Network

October 2008

·

50 Reads

·

22 Citations

Studies in Computational Intelligence

Efficient enumeration of all maximal cliques in a given graph has many applications in the filed of Graph Theory, Social Network Analysis, Bioinformatics and etc. Recent researches indicate that many networks in our world are complex networks involving massive data. Being as the complete sub-graph, a maximal clique can represent a group of friends who all hang around together. It can also be used to find common sub-topologies in a set of protein structures. However, the large scale of real networks and the exponentially increasing computation time of the clique enumeration problem make most of the existing algorithms unsuitable in the real-world scenarios. Therefore, we present a parallel algorithm Peamc (Parallel Enumeration of All Maximal Cliques) which exploits several new and effective techniques to enumerate all maximal cliques in large-scale complex networks. Experimental results on true-life networks with up to 20 million vertices and 50 million edges show that Peamc can find all the maximal cliques with high efficiency and scalability.


DMGrid: A Data Mining System Based on Grid Computing

October 2008

·

16 Reads

Lecture Notes in Computer Science

Researchers in the field of data mining now confront a common problem that data mining tasks are time-consuming in that these tasks have to process large-scale datasets. Grid computing focuses on integrating distributed, heterogeneous and idle computers from the Internet to be a service system with high performance. Thus, it is possible to take advantage of grid computing to provide high performance computation capability to effectively reduce task durations. Here, we have successfully developed DMGrid, a grid handling data mining applications. In DMGrid, it not only considers efficient parallel computing as a crucial aspect, but also takes into account dynamic resource configuration. Unlike many existing data mining grids, DMGrid also provides an engine to execute the algorithm flow specified in an application. Moreover, it offers application execution monitoring. At last, we perform experiments and design two applications: Customer Churning Analysis and Customer Value Analysis through which the feasibility of DMGrid is validated.


SDMA: A Service-based Architecture for Data Mining Applications

August 2008

·

29 Reads

·

3 Citations

In the field of data mining, most of necessary resources, such as datasets, computing resources, might be distributed in different places of the world, which impedes scientists to do research successfully. Service science advocates every available resource as a service and focuses on their collaboration to settle a problem. Here, we propose a service-based architecture for data mining applications, including configuration service, service engine, monitor service, analysis service, visualization service, computing service and data & algorithm provision service. The first 5 services focus on 5 specific aspects of data mining requirements. A demo system has also been developed to validate the feasibility of our architecture.


Citations (15)


... Such a situation will lead to CPU slice schedule among these algorithms and make their efficiency declined. Due to heterogeneous computing resources in the grid environment, computing nodes with different performance results to different load-dealing performances [5]. Here, select workers C1, C2, C3, three nodes with performance distinction, to perform load experiments. ...

Reference:

Implementation of a Data Mining System Based on Grid Computing
Betweenness research in telecom social network
  • Citing Conference Paper
  • December 2006

Dynamics of Continuous, Discrete and Impulsive Systems Series B: Applications and Algorithms

... Some IaaS systems are replicating the same capabilities offered by public providers like Amazon AWS; examples include Nimbus, Eucalyptus, OpenStack and OpenNebula [9, 10]. Others may only suitable for services of one cloud like [11], or for multiple services from multiple providers, like ‗Karlsruhe Open Application for cLoud Administration' [12, 13]. There is also an open source, cross-platform, cloud management system called Scalr; provides server management and auto-scaling disaster recovery, where the manager is able to scale a virtual infrastructure according to the load based on RAM, disk, CPU, network or date [14]. ...

A management platform for Eucalyptus-based IaaS
  • Citing Article
  • September 2011

... Basic research data of urban water environment are complex and large, with obvious spatial characteristics, which require establishing an appropriate approach to obtain the necessary information (Varduhn et al. 2015). As a geographic spatial database-based information system, Geographic Information System (GIS) has been extensively applied to the process of analysis, management and decision-making in urban water environment (Ferrer et al. 2012;Mair et al. 2012;Yang et al. 2009). In this study, the network analysis techniques of GIS including Geometric Network tool and Utility Network Analyst tool were coupled with other theories to establish an approach for the tracking and tracing analyses of the UWPDS. ...

Spatial Data Model for Visualization System of GIS Based Urban Pipe Network
  • Citing Article
  • May 2009

... Networks provide a powerful framework for modeling the structure and dynamics of complex systems by representing them as nodes connected by edges [1,2]. This framework finds broad application across diverse fields, including social networks [3][4][5], transportation systems [6][7][8], finance [9][10][11], and neuroscience [12][13][14]. The modern science of networks seeks to unravel both the structural and functional aspects of these systems, where nodes represent fundamental units and edges denote their interactions [15]. ...

Community detection in large-scale social networks
  • Citing Article
  • Full-text available
  • August 2007

... Kavitha Ranganathan et al. [11] and Xu, L et al [12] describe scheduling considerations in data grid environment, they describe a scheduling frame work which both address the job and data scheduling. Their simulation results show that job scheduling and data scheduling handled with separately for achieving better performance. ...

A Strategy for Data Replication in Data Grids
  • Citing Chapter
  • January 2005

... Prior works on parallel MCE have largely focused on distributed memory algorithms [37,51,55,60]. There are a few works on shared-memory parallel algorithms [16,34,65]. However, these algorithms do not scale to larger graphs due to memory or computational bottlenecks -either the algorithms miss out significant pruning opportunities as in [16], or they need to generate a large number of non-maximal cliques as in [34,65]. ...

Parallel Algorithm for Enumerating Maximal Cliques in Complex Network
  • Citing Chapter
  • October 2008

Studies in Computational Intelligence

... There are also numerous studies that develop parallel algorithms for MCE [38], [39], [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53]. These algorithms extend the techniques that are originally proposed for sequential algorithms and aim to balance and reduce the computation overheads to achieve higher degrees of parallelism. ...

A Parallel Algorithm for Enumerating All Maximal Cliques in Complex Network

... Such as in scientific computing, physics, biology, astronomy, oceans, atmosphere, manufacturing and so on [2][3][4], and its desktop operation interface has been implemented well [5][6][7]. Combining computing grid and data grid for data-intensive computing applications has become a trend now [8][9][10][11]. However, to meet time requirement of data-intensive computing applications, how to accelerate transmission is the key problem faced by data grid [12]. ...

Qos-based Algorithm for Job Allocation and Scheduling in Data Grid
  • Citing Conference Paper
  • November 2006

... Furthermore, the Web services are dynamically linked using Business Process Execution Language for Web service (BPEL4WS) to construct a desired data mining process depending on user's requirement. Chen et al. [11] have proposed architecture of metadata service of data mining based on Common Warehouse Metamodel (CWM). These authors divided the service into two types which provide service respectively by analyzing the feature of data service and the services needed by the data mining system. ...

The Design of Data Mining Metadata Web Service Architecture Based on JDM in Grid Environment
  • Citing Conference Paper
  • September 2006

... Researchers have started to conduct research for SMEspecific cloud-based information services architectures by combining the characteristics of cloud computing with the actual needs and requirements of SMEs. Wang [81] proposes a cloud-based information services architecture for SMEs by adapting a high-level market-oriented cloud architecture developed by Buyya et al. [8]. This cloud-based information services architecture includes four main components: (1) Users/SMEs, who can submit their service requests to a selected cloud computing platform for processing; (2) Service level agreements (SLA) Resource Allocator, which is used as the interface between external users and SMEs [8]; (3) Virtual machines (VMs), which provide users with remote access to a pool of physical computers via the Internet and allow users to reserve a ''virtual computer'' with the desired operating system and applications; (4) Physical Machines, which are used to build a data center with numerous computing resources to meet specific needs and requirements of SMEs. ...

GOI-Based Information Service Architecture in Grid
  • Citing Conference Paper
  • December 2006