Giovanni Matteo Fumarola’s research while affiliated with Microsoft and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (4)


Towards Geo-Distributed Machine Learning
  • Article

March 2016

·

113 Reads

·

51 Citations

Ignacio Cano

·

Markus Weimer

·

Dhruv Mahajan

·

[...]

·

Giovanni Matteo Fumarola

Latency to end-users and regulatory requirements push large companies to build data centers all around the world. The resulting data is "born" geographically distributed. On the other hand, many machine learning applications require a global view of such data in order to achieve the best results. These types of applications form a new class of learning problems, which we call Geo-Distributed Machine Learning (GDML). Such applications need to cope with: 1) scarce and expensive cross-data center bandwidth, and 2) growing privacy concerns that are pushing for stricter data sovereignty regulations. Current solutions to learning from geo-distributed data sources revolve around the idea of first centralizing the data in one data center, and then training locally. As machine learning algorithms are communication-intensive, the cost of centralizing the data is thought to be offset by the lower cost of intra-data center communication during training. In this work, we show that the current centralized practice can be far from optimal, and propose a system for doing geo-distributed training. Furthermore, we argue that the geo-distributed approach is structurally more amenable to dealing with regulatory constraints, as raw data never leaves the source data center. Our empirical evaluation on three real datasets confirms the general validity of our approach, and shows that GDML is not only possible but also advisable in many scenarios.


Preemption-aware planning on big-data systems

February 2016

·

12 Reads

ACM SIGPLAN Notices

Recent developments in Big Data frameworks are moving towards reservation based approaches as a mean to manage the increasingly complex mix of computations, whereas preemption techniques are employed to meet strict jobs deadlines. Within this work we propose and evaluate a new planning algorithm in the context of reservation based scheduling. Our approach is able to achieve high cluster utilization while minimizing the need for preemption that causes system overheads and planning mispredictions.


Preemption-aware planning on big-data systems

February 2016

·

13 Reads

ACM SIGPLAN Notices

Recent developments in Big Data frameworks are moving towards reservation based approaches as a mean to manage the increasingly complex mix of computations, whereas preemption techniques are employed to meet strict jobs deadlines. Within this work we propose and evaluate a new planning algorithm in the context of reservation based scheduling. Our approach is able to achieve high cluster utilization while minimizing the need for preemption that causes system overheads and planning mispredictions.


Figure 1: Task and job runtime distribution.
Figure 2: Ideal operational point of alternative scheduling approaches.
Figure 3: Mercury resource management lifecycle.  
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
  • Conference Paper
  • Full-text available

January 2015

·

251 Reads

·

163 Citations

Download

Citations (2)


... speedup over two state-of-the-art DML systems over WAN. Cano et al. [22] proposed a Geo-DML system that offset the generally communication-intensive nature of ML algorithms by employing and extending communication-sparse ones. The experimental evaluation indicated that their approach could outperform other state-of-the-art systems by several orders of magnitude when measuring X-DC transfers, as well as respect stricter sovereignty constraints. ...

Reference:

Placement of parameter server in wide area network topology for geo-distributed machine learning
Towards Geo-Distributed Machine Learning
  • Citing Article
  • March 2016

... Other databases such as Umbra [40] and Microsoft SQL Server [7] directly manage the machine's CPU cores and allocate a time-share to each query using stride scheduling. Big data systems such as Hadoop and Spark rely on a resource negotiator such as YARN [39], Kubernetes [12], or Mercury [25] to fairly allocate specified requested resources. Finally, cloud providers such as Google [6], Oracle [8], and Amazon AWS [4] structure their pricing models based on resource reservations for at least one system resource, including CPU, memory, and/or storage resources. ...

Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters