Tag: Blockwise Gradient Normalization

Research

A Robust Adaptive Stochastic Gradient Method for Deep Learning

This paper proposes an adaptive learning rate algorithm, which utilizes stochastic curvature information of the loss function for automatically tuning the learning rates.