StrixTheKiet Notes

Search

❯

❯

ArtificialIntelligence

❯

❯

❯

Model-free Reinforcement Learning

Model-free Reinforcement Learning

Mar 20, 20251 min read

Description:

Agent can use the feedback it receives to iteratively update its policy while learning until eventually determining the optimal policy after sufficient exploration.
Estimate the values or q-values of states directly, without ever using any memory to construct a model of the rewards and transitions in the MDP.

Value Learning

Direct Evaluation
Temporal Difference Learning

Q-learning

Graph View

Description:
Value Learning
Q-learning

Backlinks

Passive Reinforcement Learning

Created with strixthekiet

GitHub
Email