-
Notifications
You must be signed in to change notification settings - Fork 99
Open
Description
Hi, on the README you write:
estimated_rewards_by_reg_model = regression_model.fit_predict(
context=bandit_feedback_test["context"],
action=bandit_feedback_test["action"],
reward=bandit_feedback_test["reward"],
)But this is basically fitting on test rewards. Is this legal?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels