How to use
GridGame$GGJointRewardFunction
in
burlap.domain.stochasticgames.gridgame

Best Java code snippets using burlap.domain.stochasticgames.gridgame.GridGame$GGJointRewardFunction (Showing top 6 results out of 315)

JointRewardFunction rf = new GridGame.GGJointRewardFunction(domain, -1, 100, false);
TerminalFunction tf = new GridGame.GGTerminalFunction(domain);
SGAgentType at = GridGame.getStandardGridGameAgentType(domain);

JointRewardFunction rf = new GridGame.GGJointRewardFunction(domain, -1, 100, false);
TerminalFunction tf = new GridGame.GGTerminalFunction(domain);

JointRewardFunction rf = new GridGame.GGJointRewardFunction(domain, -1, 100, false);
TerminalFunction tf = new GridGame.GGTerminalFunction(domain);
SGAgentType at = GridGame.getStandardGridGameAgentType(domain);

for(ObjectInstance o : obs){
  int aid = ((GGAgent)o).player;
  rewards[aid] = this.defaultCost(aid, ja);
  if(gp.isTrue(osp)){
    int aid = ((GGAgent)((OOState) sp).object(agentName)).player;
    rewards[aid] = this.getPersonalGoalReward(osp, agentName);

public static void VICorrelatedTest(){
  GridGame gridGame = new GridGame();
  final OOSGDomain domain = gridGame.generateDomain();
  final HashableStateFactory hashingFactory = new SimpleHashableStateFactory();
  final State s = GridGame.getPrisonersDilemmaInitialState();
  JointRewardFunction rf = new GridGame.GGJointRewardFunction(domain, -1, 100, false);
  TerminalFunction tf = new GridGame.GGTerminalFunction(domain);
  SGAgentType at = GridGame.getStandardGridGameAgentType(domain);
  MAValueIteration vi = new MAValueIteration(domain, rf, tf, 0.99, hashingFactory, 0., new CorrelatedQ(CorrelatedEquilibriumSolver.CorrelatedEquilibriumObjective.UTILITARIAN), 0.00015, 50);
  World w = new World(domain, rf, tf, s);
  //for correlated Q, use a correlated equilibrium policy joint policy
  ECorrelatedQJointPolicy jp0 = new ECorrelatedQJointPolicy(CorrelatedEquilibriumSolver.CorrelatedEquilibriumObjective.UTILITARIAN, 0.);
  MultiAgentDPPlanningAgent a0 = new MultiAgentDPPlanningAgent(domain, vi, new PolicyFromJointPolicy(0, jp0, true), "agent0", at);
  MultiAgentDPPlanningAgent a1 = new MultiAgentDPPlanningAgent(domain, vi, new PolicyFromJointPolicy(1, jp0, true), "agent1", at);
  w.join(a0);
  w.join(a1);
  GameEpisode ga = null;
  List<GameEpisode> games = new ArrayList<GameEpisode>();
  for(int i = 0; i < 10; i++){
    ga = w.runGame();
    games.add(ga);
  }
  Visualizer v = GGVisualizer.getVisualizer(9, 9);
  new GameSequenceVisualizer(v, domain, games);
}

public static void main(String[] args) {
  GridGame gg = new GridGame();
  OOSGDomain domain = gg.generateDomain();
  State s = GridGame.getTurkeyInitialState();
  JointRewardFunction jr = new GridGame.GGJointRewardFunction(domain);
  TerminalFunction tf = new GridGame.GGTerminalFunction(domain);
  World world = new World(domain, jr, tf, new ConstantStateGenerator(s));
  DPrint.toggleCode(world.getDebugId(),false);
  SGAgent ragent1 = new RandomSGAgent();
  SGAgent ragent2 = new RandomSGAgent();
  SGAgentType type = new SGAgentType("agent", domain.getActionTypes());
  world.join(ragent1);
  world.join(ragent2);
  GameEpisode ga = world.runGame(20);
  System.out.println(ga.maxTimeStep());
  String serialized = ga.serialize();
  System.out.println(serialized);
  GameEpisode read = GameEpisode.parse(serialized);
  System.out.println(read.maxTimeStep());
  System.out.println(read.state(0).toString());
}

Javadoc

Specifies goal rewards and default rewards for agents. Defaults rewards to 0 reward everywhere except transition to unviersal or personal goals which return a reward 1.

Most used methods

<init>
Initializes for a given domain, step cost reward, universal goal reward, and unique personal goal re
defaultCost
Returns a default cost for an agent assuming the agent didn't transition to a goal state. If noops i
getPersonalGoalReward
Returns the personal goal rewards. If a single common personal goal reward was set then that is retu

Popular in Java

Reactive rest calls using spring rest template
getContentResolver (Context)
runOnUiThread (Activity)
getApplicationContext (Context)
FileNotFoundException (java.io)
Thrown when a file specified by a program cannot be found.
Thread (java.lang)
A thread is a thread of execution in a program. The Java Virtual Machine allows an application to ha
InetAddress (java.net)
An Internet Protocol (IP) address. This can be either an IPv4 address or an IPv6 address, and in pra
TreeMap (java.util)
Walk the nodes of the tree left-to-right or right-to-left. Note that in descending iterations, next
JarFile (java.util.jar)
JarFile is used to read jar entries and their associated data from jar files.
Pattern (java.util.regex)
Patterns are compiled regular expressions. In many cases, convenience methods such as String#matches
Top Vim plugins

How to useGridGame$GGJointRewardFunction in burlap.domain.stochasticgames.gridgame

Best Java code snippets using burlap.domain.stochasticgames.gridgame.GridGame$GGJointRewardFunction (Showing top 6 results out of 315)

How to use
GridGame$GGJointRewardFunction
in
burlap.domain.stochasticgames.gridgame