java.lang.Object
org.tweetyproject.machinelearning.rl.mdp.algorithms.OfflineAlgorithm<S,A>
- Type Parameters:
S- The type of statesA- The type of actions
- Direct Known Subclasses:
PolicyIteration,ValueIteration
A general interface for algorithms to determine optimal
policies directly from an MDP
- Author:
- Matthias Thimm
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionDetermines the best policy, given the utilitiesgetPolicy(MarkovDecisionProcess<S, A> mdp, double gamma) Determines the optimal policy for the given MDP.
-
Constructor Details
-
OfflineAlgorithm
public OfflineAlgorithm()
-
-
Method Details
-
getPolicy
Determines the optimal policy for the given MDP.- Parameters:
mdp- some MDPgamma- the used discount factor for utility determination- Returns:
- the optimal policy
-
getPolicy
Determines the best policy, given the utilities- Parameters:
utilities- a mapping of states to utilitiesmdp- some MDPgamma- discount factor- Returns:
- the best policy
-