Javadoc
Initializes for a given epsilon value. The set of agents for which joint actions are returned, target agent whose q-values are maximized,
and multi-agent q-source provider will need to be set manually with the methods
#setAgentsInJointPolicy(List),
#setTargetAgent(int), and
#setQSourceProvider(MultiAgentQSourceProvider) before the policy can be queried.
Note that the
MultiAgentQLearning and
burlap.behavior.stochasticgames.agents.madp.MultiAgentDPPlanningAgent agents may do this themselves. Consult the documentation
to check.