View source on GitHub |
Gives scores for all items in a batch.
tf_agents.bandits.agents.ranking_agent.compute_score_tensor_for_cascading(
chosen_index: tf_agents.typing.types.Int
,
chosen_value: tf_agents.typing.types.Float
,
num_slots: int,
non_click_score: float = -1.0
) -> tf_agents.typing.types.Float
The score of items that are before the chosen index is -1
, the score of
the chosen values are given by chosen_value
. The rest of the items receive
a score of 0
.
selected the negative feedback reward.
Returns | |
---|---|
A tensor of shape [batch_size, num_slots] , with scores for every item in
the recommendation.
|