java.lang.Object
org.tweetyproject.machinelearning.rl.mdp.algorithms.IterativePolicyEvaluation<S,A>
- Type Parameters:
S
- The type of statesA
- The type of actions
- All Implemented Interfaces:
PolicyEvaluation<S,
A>
public class IterativePolicyEvaluation<S extends State,A extends Action>
extends Object
implements PolicyEvaluation<S,A>
Determines utilities iteratively.
- Author:
- Matthias Thimm
-
Constructor Summary
ConstructorDescriptionIterativePolicyEvaluation
(long num_iterations) Creates a new policy evaluation algorithm -
Method Summary
Modifier and TypeMethodDescriptiongetUtilities
(MarkovDecisionProcess<S, A> mdp, Policy<S, A> pi, double gamma) Determines the utilities of the states in the MDP wrt.
-
Constructor Details
-
IterativePolicyEvaluation
public IterativePolicyEvaluation(long num_iterations) Creates a new policy evaluation algorithm- Parameters:
num_iterations
- the given number of num_iterations
-
-
Method Details
-
getUtilities
Description copied from interface:PolicyEvaluation
Determines the utilities of the states in the MDP wrt. the given policy.- Specified by:
getUtilities
in interfacePolicyEvaluation<S extends State,
A extends Action> - Parameters:
mdp
- some MDPpi
- some policygamma
- the discount factor- Returns:
- the utilities of the states of the MDP.
-