SGD的各种变体公式:Momentum, Nesterov, Cumulative, AdaGrad, AdaDelta, AdaM, FTRL, FTML ...