Figure - uploaded by Rustem Islamov
Data sets used in the experiments with the number of worker nodes n used in each case.

Source publication
Preprint
Recent advances in distributed optimization have shown that Newton-type methods with proper communication compression mechanisms can guarantee fast local rates and low communication cost compared to first-order methods. We discover that the communication cost of these methods can be further reduced, sometimes dramatically so, with a surprisingly si...

Contexts in source publication

Context 1
... Table 2 for more detailed description. Theoretical parameters were used for gradient type methods: vanilla gradient descent (GD), , where ω = m/K − 1. ...