bandit.Reward

bandit.Reward(self, outcome, covariates=None)

A simple container class for the value returned by a reward function.

Each reward function should return a Reward object. For covariate-adjusted algorithms, the reward should contain both the outcome and the corresponding covariates. For non-covariate-adjusted algorithms, only the outcome should be specified.

Attributes

Name	Type	Description
outcome	float	The outcome of the reward function.
covariates	pd.DataFrame | None	(Optional) The corresponding individual-level covariates.
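
Examples

The two cases described above (outcome only, and outcome plus covariates) might look roughly like the sketch below. To keep the snippet self-contained, it defines a hypothetical stand-in `Reward` dataclass that only mirrors the documented signature; in real code you would use `bandit.Reward` itself. The function names `plain_reward` and `adjusted_reward`, the `age` covariate, and the outcome model are all assumptions made for illustration and are not part of the library.

```python
import random
from dataclasses import dataclass
from typing import Optional

import pandas as pd


@dataclass
class Reward:
    """Hypothetical stand-in mirroring Reward(outcome, covariates=None)."""

    outcome: float
    covariates: Optional[pd.DataFrame] = None


# Seeded generator so the sketch is reproducible.
rng = random.Random(0)


def plain_reward() -> Reward:
    """Reward function for a non-covariate-adjusted algorithm: outcome only."""
    return Reward(outcome=rng.gauss(1.0, 1.0))


def adjusted_reward() -> Reward:
    """Reward function for a covariate-adjusted algorithm.

    Returns the outcome together with a one-row DataFrame holding the
    individual-level covariates (here a single illustrative column, "age").
    """
    age = float(rng.randint(18, 80))
    covariates = pd.DataFrame({"age": [age]})
    # Illustrative outcome model: linear in age plus Gaussian noise.
    outcome = 0.05 * age + rng.gauss(0.0, 1.0)
    return Reward(outcome=outcome, covariates=covariates)


r1 = plain_reward()
r2 = adjusted_reward()
```

Here `r1.covariates` is left as `None`, while `r2.covariates` holds one row per individual, matching the attribute table above.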