public final class BlockLSTMGrad
Computes the LSTM cell backward propagation for the entire time sequence.
This implementation is to be used in conjunction with LSTMBlock.
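The gradient op consumes values saved by the forward pass (i, cs, f, o, ci, co, h). As a rough illustration of what those values are, the following plain-Java sketch runs one time step for a single unit and a single input feature, assuming the standard peephole LSTM equations. The class `LstmForwardSketch`, its `Step` holder, and the gate order i, ci, f, o inside `w` are assumptions made for this example; it does not use the TensorFlow API and is not the kernel's actual implementation (forget bias and cell clipping are omitted).

```java
/** Illustrative only: one LSTM time step for a single unit and a single input feature. */
final class LstmForwardSketch {

    /** Values a forward pass would save for the backward pass (hypothetical holder). */
    static final class Step {
        double i, ci, f, o, cs, co, h;
    }

    static double sigmoid(double z) {
        return 1.0 / (1.0 + Math.exp(-z));
    }

    /**
     * w holds {weight for x, weight for hPrev} for each gate, in the assumed
     * order i, ci, f, o; b holds one bias per gate in the same order.
     */
    static Step forward(double x, double csPrev, double hPrev,
                        double[][] w, double[] b,
                        double wci, double wcf, double wco,
                        boolean usePeephole) {
        // Pre-activations computed from the concatenated [x, hPrev] input.
        double[] z = new double[4];
        for (int g = 0; g < 4; g++) {
            z[g] = x * w[g][0] + hPrev * w[g][1] + b[g];
        }
        Step s = new Step();
        // Peephole terms let the gates see the cell state directly.
        double peepI = usePeephole ? csPrev * wci : 0.0;
        double peepF = usePeephole ? csPrev * wcf : 0.0;
        s.i = sigmoid(z[0] + peepI);        // input gate
        s.ci = Math.tanh(z[1]);             // cell input
        s.f = sigmoid(z[2] + peepF);        // forget gate
        s.cs = s.ci * s.i + csPrev * s.f;   // cell state before the tanh
        double peepO = usePeephole ? s.cs * wco : 0.0;
        s.o = sigmoid(z[3] + peepO);        // output gate
        s.co = Math.tanh(s.cs);             // cell state after the tanh
        s.h = s.co * s.o;                   // output
        return s;
    }

    public static void main(String[] args) {
        double[][] w = {{0.5, 0.1}, {0.3, -0.2}, {0.4, 0.2}, {-0.1, 0.6}};
        double[] b = {0.0, 0.0, 0.0, 0.0};
        Step s = forward(1.0, 0.2, 0.1, w, b, 0.05, 0.05, 0.05, true);
        System.out.printf("i=%.4f ci=%.4f f=%.4f o=%.4f cs=%.4f co=%.4f h=%.4f%n",
                s.i, s.ci, s.f, s.o, s.cs, s.co, s.h);
    }
}
```

The real op works on whole tensors over the full time sequence; the scalar version only shows how the saved quantities relate to one another.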
Public Methods
Return type | Method | Description |
---|---|---|
Output<T> | bGrad() | The gradient for b to be back-propped. |
static <T extends Number> BlockLSTMGrad<T> | create(Scope scope, Operand<Long> seqLenMax, Operand<T> x, Operand<T> csPrev, Operand<T> hPrev, Operand<T> w, Operand<T> wci, Operand<T> wcf, Operand<T> wco, Operand<T> b, Operand<T> i, Operand<T> cs, Operand<T> f, Operand<T> o, Operand<T> ci, Operand<T> co, Operand<T> h, Operand<T> csGrad, Operand<T> hGrad, Boolean usePeephole) | Factory method to create a class wrapping a new BlockLSTMGrad operation. |
Output<T> | csPrevGrad() | The gradient of cs_prev to be back-propped. |
Output<T> | hPrevGrad() | The gradient of h_prev to be back-propped. |
Output<T> | wGrad() | The gradient for w to be back-propped. |
Output<T> | wcfGrad() | The gradient for wcf to be back-propped. |
Output<T> | wciGrad() | The gradient for wci to be back-propped. |
Output<T> | wcoGrad() | The gradient for wco to be back-propped. |
Output<T> | xGrad() | The gradient of x to be back-propped. |
Public Methods
public static BlockLSTMGrad<T> create (Scope scope, Operand<Long> seqLenMax, Operand<T> x, Operand<T> csPrev, Operand<T> hPrev, Operand<T> w, Operand<T> wci, Operand<T> wcf, Operand<T> wco, Operand<T> b, Operand<T> i, Operand<T> cs, Operand<T> f, Operand<T> o, Operand<T> ci, Operand<T> co, Operand<T> h, Operand<T> csGrad, Operand<T> hGrad, Boolean usePeephole)
Factory method to create a class wrapping a new BlockLSTMGrad operation.
Parameters
Parameter | Description |
---|---|
scope | current scope |
seqLenMax | Maximum time length actually used by this input. Outputs are padded with zeros beyond this length. |
x | The sequence input to the LSTM, shape (timelen, batch_size, num_inputs). |
csPrev | Value of the initial cell state. |
hPrev | Initial output of cell (to be used for peephole). |
w | The weight matrix. |
wci | The weight matrix for input gate peephole connection. |
wcf | The weight matrix for forget gate peephole connection. |
wco | The weight matrix for output gate peephole connection. |
b | The bias vector. |
i | The input gate over the whole time sequence. |
cs | The cell state before the tanh over the whole time sequence. |
f | The forget gate over the whole time sequence. |
o | The output gate over the whole time sequence. |
ci | The cell input over the whole time sequence. |
co | The cell after the tanh over the whole time sequence. |
h | The output h vector over the whole time sequence. |
csGrad | The current gradient of cs. |
hGrad | The gradient of h vector. |
usePeephole | Whether to use peephole weights. |
Returns
- a new instance of BlockLSTMGrad
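To make the roles of csGrad and hGrad more concrete, here is a minimal plain-Java sketch of the backward pass for one time step, one unit, and one input feature, with peephole connections left out. It assumes the standard LSTM equations and a gate order of i, ci, f, o in `w`; the class `LstmBackwardSketch` and its array layouts are hypothetical and do not reflect the operation's internal implementation or the TensorFlow API.

```java
/** Illustrative only: backward pass of one LSTM time step, single unit, no peephole. */
final class LstmBackwardSketch {

    static double sigmoid(double z) {
        return 1.0 / (1.0 + Math.exp(-z));
    }

    /** Forward step; returns {i, ci, f, o, cs, co, h}. Gate order i, ci, f, o is assumed. */
    static double[] forward(double x, double csPrev, double hPrev, double[][] w, double[] b) {
        double[] z = new double[4];
        for (int g = 0; g < 4; g++) {
            z[g] = x * w[g][0] + hPrev * w[g][1] + b[g];
        }
        double i = sigmoid(z[0]);
        double ci = Math.tanh(z[1]);
        double f = sigmoid(z[2]);
        double o = sigmoid(z[3]);
        double cs = ci * i + csPrev * f;
        double co = Math.tanh(cs);
        return new double[] {i, ci, f, o, cs, co, co * o};
    }

    /**
     * Backward step; returns {xGrad, csPrevGrad, hPrevGrad, bGrad[0..3]}.
     * csGrad and hGrad are the upstream gradients of the loss with respect to cs and h.
     */
    static double[] backward(double csPrev, double[][] w, double[] fwd,
                             double csGrad, double hGrad) {
        double i = fwd[0], ci = fwd[1], f = fwd[2], o = fwd[3], co = fwd[5];
        // Total gradient reaching cs: the direct term plus the path through h = tanh(cs) * o.
        double dcs = csGrad + hGrad * o * (1.0 - co * co);
        // Gradients with respect to the four gate pre-activations.
        double[] dz = {
            dcs * ci * i * (1.0 - i),       // input gate (sigmoid)
            dcs * i * (1.0 - ci * ci),      // cell input (tanh)
            dcs * csPrev * f * (1.0 - f),   // forget gate (sigmoid)
            hGrad * co * o * (1.0 - o)      // output gate (sigmoid)
        };
        double xGrad = 0.0;
        double hPrevGrad = 0.0;
        for (int g = 0; g < 4; g++) {
            xGrad += dz[g] * w[g][0];
            hPrevGrad += dz[g] * w[g][1];
        }
        // Without peepholes, cs depends on csPrev only through the forget gate product.
        double csPrevGrad = dcs * f;
        return new double[] {xGrad, csPrevGrad, hPrevGrad, dz[0], dz[1], dz[2], dz[3]};
    }
}
```

In this scalar setting the bias gradient for each gate equals its pre-activation gradient, and the weight gradients would be that same value multiplied by x or hPrev. A finite-difference check against `forward` is a simple way to confirm the formulas.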