TensorFlow is back at Google I/O on May 14! Register now

tf.Variable

tf.Variable(
    initial_value=None,
    trainable=None,
    validate_shape=True,
    caching_device=None,
    name=None,
    variable_def=None,
    dtype=None,
    import_scope=None,
    constraint=None,
    synchronization=tf.VariableSynchronization.AUTO,
    aggregation=tf.compat.v1.VariableAggregation.NONE,
    shape=None,
    experimental_enable_variable_lifting=True
)

Used in the notebooks

Used in the guide	Used in the tutorials
Migrating model checkpoints Introduction to gradients and automatic differentiation Introduction to Variables Advanced automatic differentiation Better performance with tf.function	Scalable model compression Custom training loop with Keras and MultiWorkerMirroredStrategy Neural style transfer Learned data compression Learnable Distributions Zoo

A variable maintains shared, persistent state manipulated by a program.

The Variable() constructor requires an initial value for the variable, which can be a Tensor of any type and shape. This initial value defines the type and shape of the variable. After construction, the type and shape of the variable are fixed. The value can be changed using one of the assign methods.

v = tf.Variable(1.)
v.assign(2.)
<tf.Variable ... shape=() dtype=float32, numpy=2.0>
v.assign_add(0.5)
<tf.Variable ... shape=() dtype=float32, numpy=2.5>

The shape argument to Variable's constructor allows you to construct a variable with a less defined shape than its initial_value:

v = tf.Variable(1., shape=tf.TensorShape(None))
v.assign([[1.]])
<tf.Variable ... shape=<unknown> dtype=float32, numpy=array([[1.]], ...)>

Just like any Tensor, variables created with Variable() can be used as inputs to operations. Additionally, all the operators overloaded for the Tensor class are carried over to variables.

w = tf.Variable([[1.], [2.]])
x = tf.constant([[3., 4.]])
tf.matmul(w, x)
<tf.Tensor:... shape=(2, 2), ... numpy=
  array([[3., 4.],
         [6., 8.]], dtype=float32)>
tf.sigmoid(w + x)
<tf.Tensor:... shape=(2, 2), ...>

When building a machine learning model it is often convenient to distinguish between variables holding trainable model parameters and other variables such as a step variable used to count training steps. To make this easier, the variable constructor supports a trainable=<bool> parameter. tf.GradientTape watches trainable variables by default:

with tf.GradientTape(persistent=True) as tape:
  trainable = tf.Variable(1.)
  non_trainable = tf.Variable(2., trainable=False)
  x1 = trainable * 2.
  x2 = non_trainable * 3.
tape.gradient(x1, trainable)
<tf.Tensor:... shape=(), dtype=float32, numpy=2.0>
assert tape.gradient(x2, non_trainable) is None  # Unwatched

Variables are automatically tracked when assigned to attributes of types inheriting from tf.Module.

m = tf.Module()
m.v = tf.Variable([1.])
m.trainable_variables
(<tf.Variable ... shape=(1,) ... numpy=array([1.], dtype=float32)>,)

This tracking then allows saving variable values to training checkpoints, or to SavedModels which include serialized TensorFlow graphs.

Variables are often captured and manipulated by tf.functions. This works the same way the un-decorated function would have:

v = tf.Variable(0.)
read_and_decrement = tf.function(lambda: v.assign_sub(0.1))
read_and_decrement()
<tf.Tensor: shape=(), dtype=float32, numpy=-0.1>
read_and_decrement()
<tf.Tensor: shape=(), dtype=float32, numpy=-0.2>

Variables created inside a tf.function must be owned outside the function and be created only once:

class M(tf.Module):
  @tf.function
  def __call__(self, x):
    if not hasattr(self, "v"):  # Or set self.v to None in __init__
      self.v = tf.Variable(x)
    return self.v * x
m = M()
m(2.)
<tf.Tensor: shape=(), dtype=float32, numpy=4.0>
m(3.)
<tf.Tensor: shape=(), dtype=float32, numpy=6.0>
m.v
<tf.Variable ... shape=() dtype=float32, numpy=2.0>

See the tf.function documentation for details.

Args
`initial_value`	A `Tensor`, or Python object convertible to a `Tensor`, which is the initial value for the Variable. The initial value must have a shape specified unless `validate_shape` is set to False. Can also be a callable with no argument that returns the initial value when called. In that case, `dtype` must be specified. (Note that initializer functions from init_ops.py must first be bound to a shape before being used here.)
`trainable`	If `True`, GradientTapes automatically watch uses of this variable. Defaults to `True`, unless `synchronization` is set to `ON_READ`, in which case it defaults to `False`.
`validate_shape`	If `False`, allows the variable to be initialized with a value of unknown shape. If `True`, the default, the shape of `initial_value` must be known.
`caching_device`	Note: This argument is only valid when using a v1-style `Session`. Optional device string describing where the Variable should be cached for reading. Defaults to the Variable's device. If not `None`, caches on another device. Typical use is to cache on the device where the Ops using the Variable reside, to deduplicate copying through `Switch` and other conditional statements.
`name`	Optional name for the variable. Defaults to `'Variable'` and gets uniquified automatically.
`variable_def`	`VariableDef` protocol buffer. If not `None`, recreates the Variable object with its contents, referencing the variable's nodes in the graph, which must already exist. The graph is not changed. `variable_def` and the other arguments are mutually exclusive.
`dtype`	If set, initial_value will be converted to the given type. If `None`, either the datatype will be kept (if `initial_value` is a Tensor), or `convert_to_tensor` will decide.
`import_scope`	Optional `string`. Name scope to add to the `Variable.` Only used when initializing from protocol buffer.
`constraint`	An optional projection function to be applied to the variable after being updated by an `Optimizer` (e.g. used to implement norm constraints or value constraints for layer weights). The function must take as input the unprojected Tensor representing the value of the variable and return the Tensor for the projected value (which must have the same shape). Constraints are not safe to use when doing asynchronous distributed training.
`synchronization`	Indicates when a distributed variable will be aggregated. Accepted values are constants defined in the class `tf.VariableSynchronization`. By default the synchronization is set to `AUTO` and the current `DistributionStrategy` chooses when to synchronize.
`aggregation`	Indicates how a distributed variable will be aggregated. Accepted values are constants defined in the class `tf.VariableAggregation`.
`shape`	(optional) The shape of this variable. If None, the shape of `initial_value` will be used. When setting this argument to `tf.TensorShape(None)` (representing an unspecified shape), the variable can be assigned with values of different shapes.
`experimental_enable_variable_lifting`	Whether to lift the variable out if it's in a `tf.function`. Default is `True`. When this argument is `True`, variable creation will follow the behavior and restrictions described here. If this argument is `False`, that description doesn't apply, and you can freely create and use the variable in the `tf.function`, as if it's a "mutable `tf.Tensor`". You can't return the variable though.

Raises
`ValueError`	If both `variable_def` and initial_value are specified.
`ValueError`	If the initial value is not specified, or does not have a shape and `validate_shape` is `True`.

Attributes
`aggregation`
`constraint`	Returns the constraint function associated with this variable.
`device`	The device of this variable.
`dtype`	The `DType` of this variable.
`graph`	The `Graph` of this variable.
`initial_value`	Returns the Tensor used as the initial value for the variable. Note that this is different from `initialized_value()` which runs the op that initializes the variable before returning its value. This method returns the tensor that is used by the op that initializes the variable.
`initializer`	The initializer operation for this variable.
`name`	The name of this variable.
`op`	The `Operation` of this variable.
`shape`	The `TensorShape` of this variable.
`synchronization`
`trainable`

Args
`value`	A `Tensor`. The new value for this variable.
`use_locking`	If `True`, use locking during the assignment.
`name`	The name of the operation to be created
`read_value`	if True, will return something which evaluates to the new value of the variable; if False will return the assign op.

Args
`delta`	A `Tensor`. The value to add to this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	The name of the operation to be created
`read_value`	if True, will return something which evaluates to the new value of the variable; if False will return the assign op.

Args
`delta`	A `Tensor`. The value to subtract from this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	The name of the operation to be created
`read_value`	if True, will return something which evaluates to the new value of the variable; if False will return the assign op.

Args
`sparse_delta`	`tf.IndexedSlices` to be assigned to this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`indices`	A `Tensor`. Must be one of the following types: `int32`, `int64`. Index tensor.
`name`	A name for the operation (optional).

Args
`value`	New variable value
`session`	The session to use to evaluate this variable. If none, the default session is used.

Args
`sparse_delta`	`tf.IndexedSlices` to be added to this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`sparse_delta`	`tf.IndexedSlices` to divide this variable by.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`sparse_delta`	`tf.IndexedSlices` to use as an argument of max with this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`sparse_delta`	`tf.IndexedSlices` to multiply this variable by.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`indices`	The indices to be used in the operation.
`updates`	The values to be used in the operation.
`name`	the name of the operation.

Args
`sparse_delta`	`tf.IndexedSlices` to be subtracted from this variable.
`use_locking`	If `True`, use locking during the operation.
`name`	the name of the operation.

Args
`indices`	The index `Tensor`. Must be one of the following types: `int32`, `int64`. Must be in range `[0, params.shape[axis])`.
`name`	A name for the operation (optional).

Args
`x`	A `Tensor`. Must be one of the following types: `float32`, `float64`, `int32`, `uint8`, `int16`, `int8`, `int64`, `bfloat16`, `uint16`, `half`, `uint32`, `uint64`.
`y`	A `Tensor`. Must have the same type as `x`.
`name`	A name for the operation (optional).

Args
`var`	An `ops.Variable` object.
`slice_spec`	The arguments to `Tensor.getitem`.

Raises
`ValueError`	If a slice range is negative size.
`TypeError`	TypeError: If the slice indices aren't int, slice, ellipsis, tf.newaxis or int32/int64 tensors.

tf.Variable

Used in the notebooks

Args

Raises

Attributes

Child Classes

Methods

assign

assign_add

assign_sub

batch_scatter_update

count_up_to

eval

experimental_ref

from_proto

gather_nd

get_shape

initialized_value

load

read_value

ref

scatter_add

scatter_div

scatter_max

scatter_min

scatter_mul

scatter_nd_add

scatter_nd_sub

scatter_nd_update

scatter_sub

scatter_update

set_shape

sparse_read

to_proto

value

__abs__

__add__

__and__

__div__

__eq__

__floordiv__

__ge__

Example:

__getitem__

__gt__

Example:

__invert__

__iter__

__le__

Example:

__lt__

Example:

__matmul__

__mod__

__mul__

__ne__

__neg__

__or__

__pow__

__radd__

__rand__

__rdiv__

__rfloordiv__

__rmatmul__

__rmod__

__rmul__

__ror__

__rpow__

__rsub__

__rtruediv__

__rxor__

__sub__

__truediv__

__xor__

`assign`

`assign_add`

`assign_sub`

`batch_scatter_update`

`count_up_to`

`eval`

`experimental_ref`

`from_proto`

`gather_nd`

`get_shape`

`initialized_value`

`load`

`read_value`

`ref`

`scatter_add`

`scatter_div`

`scatter_max`

`scatter_min`

`scatter_mul`

`scatter_nd_add`

`scatter_nd_sub`

`scatter_nd_update`

`scatter_sub`

`scatter_update`

`set_shape`

`sparse_read`

`to_proto`

`value`

`abs`

`add`

`and`

`div`

`eq`

`floordiv`

`ge`

`getitem`

`gt`

`invert`

`iter`

`le`

`lt`

`matmul`

`mod`

`mul`

`ne`

`neg`

`or`

`pow`

`radd`

`rand`

`rdiv`

`rfloordiv`

`rmatmul`

`rmod`

`rmul`

`ror`

`rpow`

`rsub`

`rtruediv`

`rxor`

`sub`

`truediv`

`xor`