Dominic Battré’s research while affiliated with Technische Universität Berlin and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (40)


Figure 1: The logical routing tree (b) as inferred from the physical routing (a) based on end-to-end measurements
Figure 2: Observed loss rates against background traffic for different types of virtualization
Figure 3: Observed packet interarrival times against background traffic for different types of virtualization
Figure 4: Average RTTs against background traffic for different types of virtualization
Figure 5: The general idea of sandwich probing

+1

Evaluation of Network Topology Inference in Opaque Compute Clouds through End-to-End Measurements
  • Conference Paper
  • Full-text available

July 2011

·

129 Reads

·

23 Citations

Dominic Battré

·

Natalia Frejnik

·

·

[...]

·

Daniel Warneke

Modern Infrastructure-as-a-Service (IaaS) clouds offer an unprecedented flexibility and elasticity in terms of resource provisioning through the use of hardware virtualization. However, for the cloud customer, this virtualization also introduces an opaqueness which imposes serious obstacles for data-intensive distributed applications. In particular, the lack of network topology information, i.e. information on how the rented virtual machines are physically interconnected, can easily cause network bottlenecks as common techniques to exploit data locality cannot be applied. In this paper we study to what extent the underlying network topology of virtual machines inside an IaaS cloud can be inferred based on end-to-end measurements. Therefore, we experimentally evaluate the impact of hardware virtualization on the measurable link characteristics packet loss and delay using the popular open source hypervisors KVM and XEN. Afterwards, we compare the accuracy of different topology inference approaches and propose an extension to improve the inference accuracy for typical network structures in datacenters. We found that common assumptions for end-to-end measurements do not hold in presence of virtualization and that RTT-based measurements in Para virtualized environments lead to the most accurate inference results.

Download

Inferring Network Topologies in Infrastructure as a Service Cloud

May 2011

·

13 Reads

·

2 Citations

Infrastructure as a Service (IaaS) clouds are gaining increasing popularity as a platform for distributed computations. The virtualization layers of those clouds offer new possibilities for rapid resource provisioning, but also hide aspects of the underlying IT infrastructure which have often been exploited in classic cluster environments. One of those hidden aspects is the network topology, i.e. the way the rented virtual machines are physically interconnected inside the cloud. We propose an approach to infer the network topology connecting a set of virtual machines in IaaS clouds and exploit it for data-intensive distributed applications. Our inference approach relies on delay-based end-to-end measurements and can be combined with traditional IP-level topology information, if available. We evaluate the inference accuracy using the popular hyper visors KVM as well as XEN and highlight possible performance gains for distributed applications.



Detecting bottlenecks in parallel DAG-based data flow programs

December 2010

·

68 Reads

·

24 Citations

In recent years, several frameworks have been introduced to facilitate massively-parallel data processing on shared-nothing architectures like compute clouds. While these frameworks generally offer good support in terms of task deployment and fault tolerance, they only provide poor assistance in finding reasonable degrees of parallelization for the tasks to be executed. However, as billing models of clouds enable the utilization of many resources for a short period of time for the same cost as utilizing few resources for a long time, proper levels of parallelization are crucial to achieve short processing times while maintaining good resource utilization and therefore good cost efficiency. In this paper, we present and evaluate a solution for detecting CPU and I/O bottlenecks in parallel DAG-based data flow programs assuming capacity constrained communication channels. The detection of bottlenecks represents an important foundation for manually or automatically scaling out and tuning parallel data flow programs in order to increase performance and cost efficiency.


A hybrid approach to modeling end-to-end delay in P2P networks

October 2010

·

19 Reads

·

2 Citations

Abstract The quality of the underlying network model constitutes a pivotal basis for simulating the performance of distributed systems. We present an efficient network model based on measured data and exhibiting realistic network behavior by providing location- and load-dependent latency, jitter and packet loss samples for connections between arbitrary hosts world-wide, which makes it especially well suited for the simulation and evaluation of P2P streaming systems. The model features only linear computation and memory requirements with respect to the number of hosts. It benefits from the finding that network delay/jitter/packet loss can be conveniently divided into a location- and distance-dependent but (from the user's perspective) stateless backbone part and a stateful local Internet access part. The backbone characteristics are modeled using Global Network Positioning (GNP) by embedding all hosts into an n-dimensional delay space and interpolating delays from their distance, before adding regional jitter. The local characteristics are obtained from two queues modeling the send buffers within the local up-link, e.g. DSL modem, and down-link, e.g. broadband remote access server (BRAS). The results show that the simulated delay behavior under varying load much closer approximates real network delay measurements than other models without topology modeling, such as OverSim's Simple-Underlay while still requiring only moderate resources. Both the GNP and the queuing components were integrated into the OMNeT++/INET Framework so that they can be easily used in P2P models.


Fig. 3. 
Fig. 6. Re-Negotiation of Agreements (Symmetric signaling on the Negotiation and Agreement Layer. Both parties implement the WS-Agreement Negotiation and WS-Agreement port types. Moreover, both parties have their own instance of the original agreement. After the negotiation process, the responder of the original agreement creates the re-negotiated agreement.)
A Proposal for WS-Agreement Negotiation

October 2010

·

2,473 Reads

·

27 Citations

The Web Services Agreement specification defines a normative language to formulate Service Level Agreements and a basic protocol to expose service-level descriptions, validate service-level requests, and come to an agreement. This protocol, often called ''take-it-or-leave-it'', allows a service provider and a service consumer to decide whether to accept or reject a service offer. Although this approach is sufficient for a number of use cases, others exist with requirements for multi-step negotiation or the adaptation of an existing agreement. In this paper, we describe the Web Services Agreement Negotiation protocol, a proposal by the Open Grid Forum to extend the existing specification. This proposal is the result of combining various research activities that have been conducted to define protocols for negotiating service levels or to supersede the existing ''take-it-or-leave-it'' protocol. The main characteristics of this proposal are the multi-round negotiation capability, re-negotiation capability, and compliance with the original specification.


Figure 3: The example PACT program Figure 3 shows a simplified, graphical representation of the PACT program for the query. Selection and projection are implemented using Map PACTs. The join is expressed using a Match PACT. The Super-Key Output Contract states that the keys produced by the user code are superkeys of the input keys. Finally, the Reduce PACT performs the aggregation.
Massively Parallel Data Analysis with PACTs on Nephele.

September 2010

·

167 Reads

·

47 Citations

Proceedings of the VLDB Endowment

Large-scale data analysis applications require processing and analyzing of Terabytes or even Petabytes of data, particularly in the areas of web analysis or scientific data management. This trend has been discussed as "web-scale data management" in a panel at VLDB 2009. Formerly, parallel data processing was the domain of parallel database systems. Today, novel requirements like scaling out to thousands of machines, improved fault-tolerance, and schema free processing have made a case for new approaches.


Figure 4: Two different execution strategies for the PACT program
Figure 5: PACT components
Figure 6: Compiling a PACT program  
Nephele/PACTs: A programming model and execution framework for web-scale analytical processing

June 2010

·

676 Reads

·

261 Citations

We present a parallel data processor centered around a programming model of so called Parallelization Contracts (PACTs) and the scalable parallel execution engine Nephele [18]. The PACT programming model is a generalization of the well-known map/reduce programming model, extending it with further second-order functions, as well as with Output Contracts that give guarantees about the behavior of a function. We describe methods to transform a PACT program into a data flow for Nephele, which executes its sequential building blocks in parallel and deals with communication, synchronization and fault tolerance. Our definition of PACTs allows to apply several types of optimizations on the data flow during the transformation. The system as a whole is designed to be as generic as (and compatible to) map/reduce systems, while overcoming several of their major weaknesses: 1) The functions map and reduce alone are not sufficient to express many data processing tasks both naturally and efficiently. 2) Map/reduce ties a program to a single fixed execution strategy, which is robust but highly suboptimal for many tasks. 3) Map/reduce makes no assumptions about the behavior of the functions. Hence, it offers only very limited optimization opportunities. With a set of examples and experiments, we illustrate how our system is able to naturally represent and efficiently execute several tasks that do not fit the map/reduce model well.


Lessons Learned from Implementing WS-Agreement

March 2010

·

17 Reads

·

5 Citations

WS-Agreement describes a protocol and structure for creating and representing service level agreements. In order to remain domain independent, the authors of the WS-Agreement specification have provided many extension points for domain specific content. This creates high degrees of freedoms for programmers to implement the specification. Many attempts to do this have been made in the past. In this paper, we explain what we have learned from our own and other projects’ attempts of implementing WS-Agreement. The paper presents a set of guidelines how the features of WS-Agreement can be used in a sound way that allows transferring large parts of the WS-Agreement logic into a generic and domain-independent WS-Agreement framework.


A service level agreement layer for the D-Grid infrastructure

January 2010

·

18 Reads

·

2 Citations

D-Grid is a German national grid initiative founded by the German Federal Ministry of Education and Research. It provides a frame for the collaboration of researchers across Germany and offers access to distributed services and resources. The SLA4D-Grid project will enhance D-Grid by creating a service level agreement (SLA) layer which is transparently integrated into D-Grid and accessible for the D-Grid community.


Citations (28)


... Because of scalability issues we and several other research groups investigate the use of peer-to-peer based data stores for managing RDF data. Examples for such peer-to-peer based approaches include our BabelPeers [14,15,16,17,18,19,20,21,22,23,24,94,95] implementation, Atlas [99,103,111,113,114], RDFPeers [47], GridVine [5,60], and several others. ...

Reference:

Efficient Query Processing in DHT-based RDF Stores
Query processing in DHT based RDF stores
  • Citing Article
  • April 2009

網際網路技術學刊

... RDFPeers provides range queries and a publish/subscribe interface over the stored data. PeerThing [64] is another approach we developed at PC 2 . It consists of an unstructured p2p network for resource discovery in the Grid, where each node in the network runs a local Description Logic reasoner. ...

BabelPeers: P2P based semantic grid resource discovery
  • Citing Article
  • January 2008

... As a consequence of this rapid growth, the need to collect the large amounts of information produced and make it available to researchers worldwide has ended with a number of MS databases. 22,23 The Illinois Bio-Grid Mass Spectrometry Database (IBG-MSD) 22 is a public database of curated and annotated empirically derived mass spectra of peptides. The goal of the database is to address the need for a pubic database of mass spectrometry data and implement a useful web interface that will allow researchers to access the data and perform a variety of tasks based on their individual needs. ...

A Batch Import Module for an Empirically Derived Mass Spectral Database

... In summary, the outcome of the SLA4D-Grid project was an implemented SLA Management System and a monitoring solution, both applicable for the D-Grid community. Regarding SLAs, the project has developed an optimally adapted SLA layer of the SLA4D-Grid service negotiation and orchestration [108]. The project also contributed to the WS-Agreement Negotiation specification of the Open Grid Forum and provided an implementation for D-Grid through an adapted version of WSAG4J, a Java implementation of WS-Agreement and WS-Agreement Negotiation. ...

A service level agreement layer for the D-Grid infrastructure
  • Citing Article
  • January 2010

... Several works have dealt with contract establishment. WS-Agreement specification has been extended to support the dynamic nature of SLAs by allowing the possibility of SLA renegotiation at run time [13] [14] [15]. However, support for specifying guarantees on transactional properties has not been addressed. ...

Extending WS-agreement for dynamic negotiation of service level agreements

... 2) How to manage the negotiation session with a candidate service provider and find a mutually acceptable solution within a limited time. The existing SLA negotiation specification [15] models the bilateral offer exchanging process. But the scale, intermittent availability, and mobility of geographically distributed service providers are not considered. ...

WS-Agreement Negotiation Version 1.0

... As a result, many national and international grid initiatives and specific grid infrastructure for certain user communities emerged and are still maintained and extended. Well known examples are the European projects PRACE 6 and DEISA 7 , but also grid-based infrastructures like EGEE 8 , EGI 9 , FutureGrid 10 , and Tera-Grid 11 . These grid infrastructures are currently in production and serve the computing demands of researchers around the globe. ...

Implementing WS-Agreement in a Globus Toolkit 4.0 Environment
  • Citing Chapter
  • January 2008

... The IRRIIS project (Balducelli et al., 2008) is also improving CI dependability through middleware improved technology and simulation in synthetic environments. There has been work on fault and sabotage tolerance using an SLA for grids (Hovestadt, 2005;Naqvi et al., 2008), which SERSCIS builds on to create resilient service-oriented infrastructures. ...

Using Sla Based Approach To Handle Sabotage Tolerance In The Grids
  • Citing Chapter
  • January 2008

... These network designs can be seen as a hybrid between the early distributed systems and the early P2P networks, trying to balance the various tradeoffs and thus, offering extensive flexibility and adaptability to build any kind of application over it. Thus, there has been a tremendous amount of research over structured P2P networks, e.g., there is work on relational one-time and continuous query processing [39-41, 17, 42, 43] and RDF query processing [44][45][46][47]. ...

Towards Parallel Processing of RDF Queries in DHTs
  • Citing Conference Paper
  • August 2009

Lecture Notes in Computer Science

... In this paper, we consider the monitoring overhead as an aggregate of all these financial costs. In the Cloud-to-Thing continuum [32], the availability and capacity of resources, namely compute, storage and connectivity, decrease when moving from Cloud to Things. Typically, IoT edge gateways, located near IoT devices, are small devices with limited processing, storage, and connectivity capabilities. ...

Detecting bottlenecks in parallel DAG-based data flow programs
  • Citing Conference Paper
  • December 2010