Update '*var' according to the AdaMax algorithm.
tf.raw_ops.ApplyAdaMax(
    var,
    m,
    v,
    beta1_power,
    lr,
    beta1,
    beta2,
    epsilon,
    grad,
    use_locking=False,
    name=None
)
m_t <- beta1 * m_{t-1} + (1 - beta1) * g

v_t <- max(beta2 * v_{t-1}, abs(g))

variable <- variable - learning_rate / (1 - beta1^t) * m_t / (v_t + epsilon)
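The update rule can be traced by hand with a small pure-Python sketch. This is an illustration only, not the op's implementation: `adamax_step` is a hypothetical helper, it works on plain lists of floats rather than tensors, and it returns new values instead of mutating `var`, `m`, and `v` in place. It assumes `beta1_power` holds `beta1^t` for the current step `t` and that `lr` plays the role of `learning_rate` in the formula. `use_locking` and `name` are not modeled.

```python
def adamax_step(var, m, v, grad, beta1_power, lr, beta1, beta2, epsilon):
    """Hypothetical helper: one AdaMax update over lists of floats.

    Returns new (var, m, v) lists; the real op updates its inputs in place.
    """
    new_var, new_m, new_v = [], [], []
    for x, m_prev, v_prev, g in zip(var, m, v, grad):
        # m_t <- beta1 * m_{t-1} + (1 - beta1) * g
        m_t = beta1 * m_prev + (1.0 - beta1) * g
        # v_t <- max(beta2 * v_{t-1}, abs(g))
        v_t = max(beta2 * v_prev, abs(g))
        # variable <- variable - lr / (1 - beta1^t) * m_t / (v_t + epsilon)
        x_t = x - lr / (1.0 - beta1_power) * m_t / (v_t + epsilon)
        new_var.append(x_t)
        new_m.append(m_t)
        new_v.append(v_t)
    return new_var, new_m, new_v


# First step (t = 1), so beta1_power = beta1 ** 1.
beta1, beta2 = 0.9, 0.999
new_var, new_m, new_v = adamax_step(
    var=[1.0], m=[0.0], v=[0.0], grad=[0.5],
    beta1_power=beta1 ** 1, lr=0.002,
    beta1=beta1, beta2=beta2, epsilon=1e-8,
)
```

With zero-initialized `m` and `v`, the bias correction `1 / (1 - beta1^t)` cancels the `(1 - beta1)` factor in `m_t` on the first step, so the variable moves by roughly `lr` in the direction opposite to the gradient's sign.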
Returns | |
---|---|
A mutable Tensor. Has the same type as var. |