Wiki: Improving NN: Assignment 4


#1

Beyond SGD: Gradient Descent with Momentum and Adaptive Learning Rate

Recommended reading: An overview of gradient descent optimization algorithms