tfr.utils.de_noise
Stay organized with collections
Save and categorize content based on your preferences.
Returns a float Tensor
as the de-noised counts
.
tfr.utils.de_noise(
counts, noise, ratio=0.9
)
The implementation is based on the the paper by Zhang and Xu: "Fast Exact
Maximum Likelihood Estimation for Mixture of Language Models." It assumes that
the observed counts
are generated from a mixture of noise
and the true
distribution: ratio * noise_distribution + (1 - ratio) * true_distribution
,
where the contribution of noise
is controlled by ratio
. This method
returns the true distribution.
Args |
counts
|
A 2-D Tensor representing the observations. All values should be
nonnegative.
|
noise
|
A 2-D Tensor representing the noise distribution. This should be
the same shape as counts . All values should be positive and are
normalized to a simplex per row.
|
ratio
|
A float in (0, 1) representing the contribution from noise.
|
Returns |
A 2-D float Tensor and each row is a simplex.
|
Raises |
ValueError
|
if ratio is not in (0,1).
|
InvalidArgumentError
|
if any of counts is negative or any of noise is
not positive.
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2023-08-18 UTC.
[{
"type": "thumb-down",
"id": "missingTheInformationINeed",
"label":"Missing the information I need"
},{
"type": "thumb-down",
"id": "tooComplicatedTooManySteps",
"label":"Too complicated / too many steps"
},{
"type": "thumb-down",
"id": "outOfDate",
"label":"Out of date"
},{
"type": "thumb-down",
"id": "samplesCodeIssue",
"label":"Samples / code issue"
},{
"type": "thumb-down",
"id": "otherDown",
"label":"Other"
}]
[{
"type": "thumb-up",
"id": "easyToUnderstand",
"label":"Easy to understand"
},{
"type": "thumb-up",
"id": "solvedMyProblem",
"label":"Solved my problem"
},{
"type": "thumb-up",
"id": "otherUp",
"label":"Other"
}]
{"lastModified": "Last updated 2023-08-18 UTC."}
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2023-08-18 UTC."],[],[]]