Diederik Kingma’s research while affiliated with University of Toronto and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (1)


Adam: A Method for Stochastic Optimization
  • Article

December 2014

·

75,666 Reads

·

135,132 Citations

Diederik Kingma

·

Jimmy Ba

We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions. The method is straightforward to implement and is based an adaptive estimates of lower-order moments of the gradients. The method is computationally efficient, has little memory requirements and is well suited for problems that are large in terms of data and/or parameters. The method is also ap- propriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The method exhibits invariance to diagonal rescaling of the gradients by adapting to the geometry of the objective function. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. We demonstrate that Adam works well in practice when experimentally compared to other stochastic optimization methods.

Citations (1)


... Since the last decade, various algorithms for training ANN models have been invented, including Levenberg Marquardt, Stochastic Gradient Descent, Bayesian Regularization, and Adaptive Moment Estimation (Adam). Notably, Adam is the method published by Kingma and Ba (2014), which has been successfully applied in a wide range of studies (Pandey et al., 2022;Zhang et al., 2019;Rozante et al., 2023). Therefore, we have decided to employ this algorithm in this study. ...

Reference:

Enhancing Understanding of 3D Rectangular Tunnel Heading Stability in c-φ Soils with Surcharge Loading: A Comprehensive FELA Analysis Using Three Stability Factors and Machine Learning
Adam: A Method for Stochastic Optimization
  • Citing Article
  • December 2014