the directory to save the model results and log files.
initial_clusters
specifies how to initialize the clusters for training.
See clustering_ops.kmeans for the possible values.
distance_metric
the distance metric used for clustering.
See clustering_ops.kmeans for the possible values.
random_seed
Python integer. Seed for PRNG used to initialize centers.
use_mini_batch
If true, use the mini-batch k-means algorithm. Else assume
full batch.
mini_batch_steps_per_iteration
number of steps after which the updated
cluster centers are synced back to a master copy. See clustering_ops.py
for more details.
kmeans_plus_plus_num_retries
For each point that is sampled during
kmeans++ initialization, this parameter specifies the number of
additional points to draw from the current distribution before selecting
the best. If a negative value is specified, a heuristic is used to
sample O(log(num_to_sample)) additional points.
relative_tolerance
A relative tolerance of change in the loss between
iterations. Stops learning if the loss changes less than this amount.
Note that this may not work correctly if use_mini_batch=True.
config
See Estimator
Attributes
config
model_dir
Returns a path in which the eval process will look for checkpoints.
model_fn
Returns the model_fn which is bound to self.params.
Exports inference graph into given dir. (deprecated)
Args
export_dir
A string containing a directory to write the exported graph
and checkpoints.
input_fn
If use_deprecated_input_fn is true, then a function that given
Tensor of Example strings, parses it into features that are then
passed to the model. Otherwise, a function that takes no argument and
returns a tuple of (features, labels), where features is a dict of
string key to Tensor and labels is a Tensor that's currently not
used (and so can be None).
input_feature_key
Only used if use_deprecated_input_fn is false. String
key into the features dict returned by input_fn that corresponds to a
the raw Example strings Tensor that the exported model will take as
input. Can only be None if you're using a custom signature_fn that
does not use the first arg (examples).
use_deprecated_input_fn
Determines the signature format of input_fn.
signature_fn
Function that returns a default signature and a named
signature map, given Tensor of Example strings, dict of Tensors
for features and Tensor or dict of Tensors for predictions.
prediction_key
The key for a tensor in the predictions dict (output
from the model_fn) to use as the predictions input to the
signature_fn. Optional. If None, predictions will pass to
signature_fn without filtering.
default_batch_size
Default batch size of the Example placeholder.
exports_to_keep
Number of exports to keep.
checkpoint_path
the checkpoint path of the model to be exported. If it is
None (which is default), will use the latest checkpoint in
export_dir.
Returns
The string path to the exported directory. NB: this functionality was
added ca. 2016/09/25; clients that depend on the return value may need
to handle the case where this function returns None because subclasses
are not returning a value.
Exports inference graph as a SavedModel into given dir.
Args
export_dir_base
A string containing a directory to write the exported
graph and checkpoints.
serving_input_fn
A function that takes no argument and
returns an InputFnOps.
default_output_alternative_key
the name of the head to serve when none is
specified. Not needed for single-headed models.
assets_extra
A dict specifying how to populate the assets.extra directory
within the exported SavedModel. Each key should give the destination
path (including the filename) relative to the assets.extra directory.
The corresponding value gives the full path of the source file to be
copied. For example, the simple case of copying a single file without
renaming it is specified as
{'my_asset_file.txt': '/path/to/my_asset_file.txt'}.
as_text
whether to write the SavedModel proto in text format.
checkpoint_path
The checkpoint path to export. If None (the default),
the most recent checkpoint found within the model directory is chosen.
graph_rewrite_specs
an iterable of GraphRewriteSpec. Each element will
produce a separate MetaGraphDef within the exported SavedModel, tagged
and rewritten as specified. Defaults to a single entry using the
default serving tag ("serve") and no rewriting.
strip_default_attrs
Boolean. If True, default-valued attributes will be
removed from the NodeDefs. For a detailed guide, see
Stripping Default-Valued
Attributes.
Incremental fit on a batch of samples. (deprecated arguments)
This method is expected to be called several times consecutively
on different or the same chunks of the dataset. This either can
implement iterative training or out-of-core/online training.
This is especially useful when the whole dataset is too big to
fit in memory at the same time. Or when model is taking long time
to converge, and you want to split up training into subparts.
Args
x
Matrix of shape [n_samples, n_features...]. Can be iterator that
returns arrays of features. The training input samples for fitting the
model. If set, input_fn must be None.
y
Vector or matrix [n_samples] or [n_samples, n_outputs]. Can be
iterator that returns array of labels. The training label values
(class labels in classification, real numbers in regression). If set,
input_fn must be None.
input_fn
Input function. If set, x, y, and batch_size must be
None.
steps
Number of steps for which to train model. If None, train forever.
batch_size
minibatch size to use on the input, defaults to first
dimension of x. Must be None if input_fn is provided.
monitors
List of BaseMonitor subclass instances. Used for callbacks
inside the training loop.
Returns
self, for chaining.
Raises
ValueError
If at least one of x and y is provided, and input_fn is
provided.
Returns predictions for given features. (deprecated arguments)
Args
x
Matrix of shape [n_samples, n_features...]. Can be iterator that
returns arrays of features. The training input samples for fitting the
model. If set, input_fn must be None.
input_fn
Input function. If set, x and 'batch_size' must be None.
batch_size
Override default batch size. If set, 'input_fn' must be
'None'.
outputs
list of str, name of the output to predict.
If None, returns all.
as_iterable
If True, return an iterable which keeps yielding predictions
for each example until inputs are exhausted. Note: The inputs must
terminate if you want the iterable to terminate (e.g. be sure to pass
num_epochs=1 if you are using something like read_batch_features).
iterate_batches
If True, yield the whole batch at once instead of
decomposing the batch into individual samples. Only relevant when
as_iterable is True.
Returns
A numpy array of predicted classes or regression values if the
constructor's model_fn returns a Tensor for predictions or a dict
of numpy arrays if model_fn returns a dict. Returns an iterable of
predictions if as_iterable is True.
The method works on simple estimators as well as on nested objects
(such as pipelines). The former have parameters of the form
<component>__<parameter> so that it's possible to update each
component of a nested object.
Transforms each element to distances to cluster centers. (deprecated)
Note that this function is different from the corresponding one in sklearn.
For SQUARED_EUCLIDEAN distance metric, sklearn transform returns the
EUCLIDEAN distance, while this function returns the SQUARED_EUCLIDEAN
distance.
Args
input_fn
see predict.
as_iterable
see predict
Returns
Array with same number of rows as x, and num_clusters columns, containing
distances to the cluster centers.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2020-10-01 UTC."],[],[]]