P.J. Garcia’s research while affiliated with University of Castilla-La Mancha and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (10)


Towards Modeling Interconnection Networks of Exascale Systems with OMNeT plus
  • Conference Paper

February 2013

·

53 Reads

·

41 Citations

·

·

P.J. Garcia

·

F.J. Quiles

One of the objectives of the decade for High-Performance Computing systems is to reach the exascale level of computing power before 2018, hence this will require strong efforts in their design. In that sense, High-speed low-latency interconnection networks are essential elements for exascale HPC systems. Indeed, the performance of the whole system depends on that of the interconnection network. In order to develop and test new techniques, suited to exascale HPC systems, software-based networks simulators are commonly used. As developing a network simulator from scratch is a difficult task, several platforms help the developers, OMNeT++ being one of the most popular. In this paper, we propose a new generic network simulator, exploiting the features of the OMNeT++ framework. The proposed tool is the first step to model HPC high-performance interconnection networks of exascale HPC systems: the message switching layer, routing and arbitration algorithms and buffer organizations have been modeled according to the current and expected characteristics of these systems. In addition, the tool has been designed so that it is possible to simulate networks of large size. Simulation results, validated against real systems, show the accuracy of the model.


Cost-Effective Congestion Management for Interconnection Networks Using Distributed Deterministic Routing

December 2010

·

33 Reads

·

12 Citations

·

P.J. Garcia

·

F.J. Quiles

·

[...]

·

J. Duato

The Interconnection networks are essential elements in current computing systems. For this reason, achieving the best network performance, even in congestion situations, has been a primary goal in recent years. In that sense, there exist several techniques focused on eliminating the main negative effect of congestion: the Head of Line (HOL) blocking. One of the most successful HOL blocking elimination techniques is RECN, which can be applied in source routing networks. FBICM follows the same approach as RECN, but it has been developed for distributed deterministic routing networks. Although FBICM effectively eliminates HOL blocking, it requires too much resources to be implemented. In this paper we present a new FBICM version, based on a new organization of switch memory resources, that significantly reduces the required silicon area, complexity and cost. Moreover, we present new results about FBICM, in network topologies not yet analyzed. From the experiment results we can conclude that a far less complex and feasible FBICM implementation can be achieved by using the proposed improvements, while not losing efficiency.


Combining Efficiency and Scalability on Congestion Management for Interconnection Networks

December 2008

·

18 Reads

·

1 Citation

Congestion has been an open issue in lossless interconnection networks for the last twenty years. In this environment, as packets can not be dropped, congestion may lead to a dramatic degradation of network performance. Although congestion was traditionally avoided by overdimensioning the network, this solution is no longer valid due to cost and power consumption concerns, that tend to reduce the size of the network in current systems. The restriction on network size makes the system working closer to the network saturation point, thus increasing congestion probability. Therefore, an effective congestion management mechanism is essential in order to avoid network performance degradation. However, except the recently proposed RECN mechanism, none of the congestion management strategies proposed in the literature had been able to achieve, at the same time, the required efficiency and scalability levels demanded by emerging systems. In this paper we analyze the difficulties in han-dling congestion on lossless interconnection networks, identifying the key points for the design of an efficient and scalable congestion management technique. We also describe how these ideas have been implemented in the RECN design. Moreover, we also show that RECN is fully compatible with the Advanced Switching standard, allowing efficient and scalable congestion management in real systems.


Decongestants for clogged networks

December 2007

·

23 Reads

IEEE Potentials

Interconnection networks are a key element in a wide variety of systems: massive parallel processors, local and system area networks, clusters of PCs and workstations, and Internet Protocol routers. They are essential to high performance in the form of high-bandwidth communications, with low latency, "quality of service" (guaranteed service levels), efficient switching, and flexibility of network topology, as embodied in Myrinet, InfiniBand, Quadrics, Advanced Switching, and similar interconnects. But, despite all the advances that modem interconnects offer, congestion is a growing problem as "lossless" interconnection networksrdquo those that do not allow data packets to be discarded" come to the fore.


Efficient, Scalable Congestion Management for Interconnection Networks

October 2006

·

32 Reads

·

47 Citations

IEEE Micro

Compared to the overdimensioned designs of the past, current interconnection networks operate closer to the point of saturation and run a higher risk of congestion. Among proposed strategies for congestion management, only the regional explicit congestion notification (RECN) mechanism achieves both the required efficiency and the scalability that emerging systems demand



3D wavelet compression by message passing on a Myrinet cluster
  • Conference Paper
  • Full-text available

February 2001

·

32 Reads

·

7 Citations

Canadian Conference on Electrical and Computer Engineering

We present a parallel algorithm for lossless compression. The algorithm is based on the 3D wavelet transform (3-D WT). The system under study consists of a parallel implementation of a wavelet compression software running on a cluster of eight nodes linked by a high performance local area network, Myrinet. The parallel implementation is based on the standard message passing interface (MPI). We have used three implementations of the MPI standard: MPI-BIP, MPICH, and LAM. Experimental results are reported based on these three implementations. We provide performance results of this parallel system for the compression video sequences. Some bugs in the efficiency for TCP implementation are reported and resolved for this system

Download

Performance study of NFS over Myrinet-based clusters for parallel multimedia applications

February 2001

·

11 Reads

·

5 Citations

Canadian Conference on Electrical and Computer Engineering

The Network File System (NFS) is an open standard that implements a client/server architecture enabling file sharing over heterogeneous computer systems from PCs to mainframes. NFS is currently being upgraded to better meet the performance and reliability requirements of a large variety of applications, ranging from database queries in a LAN cluster to multimedia applications via the Internet. The main objective of our work is to analyze the performance of NFS as the file system support for parallel image processing applications. We undertake the performance study of NFS over a parallel-processing platform used to accelerate the compression process of digital video. The experimental platform consists of eight nodes connected by a high-speed network (Myrinet). We use NFS to allow a set of processors to gain access to the uncompressed video sequences stored in the NFS-based file server. We have carried out a first set of experiments to evaluate the performance of NFS in terms of I/O response time. Our first results have shown the existence of communication bottlenecks and scalability problems in the system. We have identified some critical system parameters and analyzed their impact over the overall system response time. In a second set of experiments, we explored the performance of the system by changing the cluster topology


The need of multicast predictive NFS servers for high-speed networks used as parallel multimedia platforms

January 2001

·

15 Reads

·

5 Citations

In this paper we have carried out an experimental study of the I/O latency of a distributed file system over a parallel multimedia platform. Our experimental platform uses NFS (Network File System) to support an application characterized by its high processing and storage requirements. We evaluate the overall system by varying several NFS parameters as well as the underlying network topology and data access patterns. We have also examined the relevance of two other important file management access mechanisms: caching and prefetching. Our experimental results have allowed us to identify the major system parameters affecting the performance of NFS. We then propose the use of a predictive data access mechanisms. Under this mechanism, the file server should detect the similarities among the most recent I/O requests incoming from a set of cooperating clients. Once having identified the access pattern, the file server will be able to multicast the required data even before receiving explicit I/O data requests from the clients. Under this predictive mechanism, the system performance should improve as the number of explicit I/O requests directed to the file server decreases


Citations (7)


... Congestion control techniques usually limit packet injection as soon as the network presents signs of congestion. There are different ways of diagnosing these signs and techniques to avoid congestion, based on global knowledge like in [18], distributed like RECN [3] or based on information of the local router buffers like LBR [5]. The torus network of the IBM BlueGene/L [1] includes a mechanism that works prioritizing in-transit traffic-we call this IPR (Intransit Priority Restriction). ...

Reference:

Full-system simulation of distributed memory parallel computers using Simics
Combining Efficiency and Scalability on Congestion Management for Interconnection Networks
  • Citing Article
  • December 2008

... In more detail, in order to get results for this evaluation we have performed both simulations and experiments with real IB hardware, using a framework which integrates IB control software, IB-based hardware and OMNeT++based simulators [30]. Basically, in the context of this study we have extended a previously proposed methodology [18,31], adding support for managing and using SL2VL tables to the flit-level IB simulator contributed by Mellanox Technologies™ [18], and to an in-house packet-level technology-agnostic interconnection networks simulator [32]. ...

Towards Modeling Interconnection Networks of Exascale Systems with OMNeT plus
  • Citing Conference Paper
  • February 2013

... In the literature we can found several publications on performance improving of MPEG standards. Most of these works focus their improvements on parallelizing the distribution of data among the processors by either distributing different partitions of the same frame (spatial parallelism) or different GoPs (Groups of Pictures/Frames) to the various processors (temporal parallelism) [2,9], but it is usual to consider the codification of each frame as a sequential process. ...

The need of multicast predictive NFS servers for high-speed networks used as parallel multimedia platforms
  • Citing Conference Paper
  • January 2001

... Based on this, an information compression model for AQM design is proposed to deal with the heterogeneous RTT and time-varying traffic load in [19], achieving low latency and high scalability. To deal with head of line (HOL) blocking, regional explicit congestion notification (RECN) for source deterministic routing networks and flow-based implicit congestion management (FBICM) designed for distributed deterministic routing networks are proposed in [20]. A deterministic latency congestion control system is provided in [21] that integrates rate control, multiqueue management, and per-hop delay correction mechanisms, reducing the delay jitter by 90%. ...

Cost-Effective Congestion Management for Interconnection Networks Using Distributed Deterministic Routing
  • Citing Article
  • December 2010

... A diskless HPC system typically employs a read-only NFS-based root file system. While this solution provides some benefits, it is severely limited by the scalability of NFS servers [6]. Although the new NFSv4 [1] has shown the ability to combine multiple file system operations into one request, it is still based on a high-overhead protocol. ...

Performance study of NFS over Myrinet-based clusters for parallel multimedia applications
  • Citing Conference Paper
  • February 2001

Canadian Conference on Electrical and Computer Engineering

... Furthermore, it has the added provisions of highly scalable compression, which is mostly coveted in modern communications over heterogeneous channels like the Internet [2] . Successful application of 3- D-DWT has been reported in the literature in emerging fields like medical image compression [3], hyper-spectral and space image compression [4], etc. Software-based approaches are experimented to combat the huge computational complexity and memory requirement associated with 3-D-DWT realization [5], [6]. Though the processor speed of modern computers soars high at the order of GHz, data fetching and communicating with external memories consume several T states, making the computation quite slower at the end. ...

3D wavelet compression by message passing on a Myrinet cluster

Canadian Conference on Electrical and Computer Engineering

... Furthermore, they are also used to optimise quality of routing for some topologies [8]. Finally, they are also often used by queueing schemes for congestion avoidance and management as well [33,91]. VCs are largely out of the scope of this thesis, and simple links will be modelled, though there will be several references to VCs. ...

Efficient, Scalable Congestion Management for Interconnection Networks
  • Citing Article
  • October 2006

IEEE Micro