java.lang.Object
org.tweetyproject.machinelearning.rl.mdp.algorithms.IterativePolicyEvaluation<S,A>
- Type Parameters:
S - The type of states
A - The type of actions
- All Implemented Interfaces:
PolicyEvaluation<S,A>
public class IterativePolicyEvaluation<S extends State,A extends Action>
extends Object
implements PolicyEvaluation<S,A>
Determines utilities iteratively.
- Author:
- Matthias Thimm
Constructor Summary
Constructors
- Constructor:
IterativePolicyEvaluation(long num_iterations)
- Description:
Creates a new policy evaluation algorithm.
Method Summary
- Method:
getUtilities(MarkovDecisionProcess<S, A> mdp, Policy<S, A> pi, double gamma)
- Description:
Determines the utilities of the states in the MDP with respect to the given policy.
Constructor Details
IterativePolicyEvaluation
public IterativePolicyEvaluation(long num_iterations)
Creates a new policy evaluation algorithm.
- Parameters:
num_iterations - the number of iterations to perform
Method Details
getUtilities
Description copied from interface: PolicyEvaluation
Determines the utilities of the states in the MDP with respect to the given policy.
- Specified by:
getUtilities in interface PolicyEvaluation<S extends State,A extends Action>
- Parameters:
mdp - some MDP
pi - some policy
gamma - the discount factor
- Returns:
the utilities of the states of the MDP.
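The following is a minimal, self-contained sketch of what iterative policy evaluation generally does. It is not the implementation of this class and it does not use any TweetyProject types. The class name PolicyEvaluationSketch, the method evaluate, and the array-based encoding of the MDP are hypothetical and chosen only for illustration. It assumes that the policy has already been folded into the transition probabilities, that rewards are attached to transitions, and that every sweep updates all states from the previous sweep's values. Whether the library makes the same choices is not stated on this page.

```java
import java.util.Arrays;

// Hypothetical illustration only; not part of TweetyProject.
public class PolicyEvaluationSketch {

    /**
     * Estimates state utilities under a fixed policy by repeated Bellman backups.
     *
     * @param transition    transition[s][t] is the probability of moving from state s
     *                      to state t when the policy's action is taken in s (assumed encoding)
     * @param reward        reward[s][t] is the reward received on that transition (assumed encoding)
     * @param gamma         the discount factor
     * @param numIterations the number of sweeps over all states
     * @return the estimated utility of each state
     */
    public static double[] evaluate(double[][] transition, double[][] reward,
                                    double gamma, long numIterations) {
        int n = transition.length;
        double[] utilities = new double[n]; // every state starts with utility 0
        for (long i = 0; i < numIterations; i++) {
            double[] next = new double[n];
            for (int s = 0; s < n; s++) {
                double sum = 0.0;
                for (int t = 0; t < n; t++) {
                    // expected immediate reward plus discounted utility of the successor
                    sum += transition[s][t] * (reward[s][t] + gamma * utilities[t]);
                }
                next[s] = sum;
            }
            utilities = next; // synchronous update: new values are used from the next sweep on
        }
        return utilities;
    }

    public static void main(String[] args) {
        // Two states: state 0 loops back to itself with probability 0.5 (reward 1),
        // otherwise it moves to the absorbing state 1 (reward 0).
        double[][] p = { {0.5, 0.5}, {0.0, 1.0} };
        double[][] r = { {1.0, 0.0}, {0.0, 0.0} };
        System.out.println(Arrays.toString(evaluate(p, r, 0.9, 100)));
    }
}
```

In this sketch, a larger numIterations brings the estimate closer to the fixed point of the Bellman equation for the policy. With gamma below 1 the error shrinks geometrically with each sweep, which is why a fixed iteration count can stand in for a convergence test.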