Reinforcement Learning

  • Set of states, S
  • Set of actions, A
  • Reward function, R
  • Policy, π
  • Value function, V
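
To make the five components above concrete, here is a minimal sketch on a toy problem. Everything specific in it is an assumption added for illustration and not part of the original list: the four-cell corridor, the deterministic transition helper `step`, the discount factor `GAMMA`, and the always-move-right policy.

```python
from typing import Callable, Dict, List

# Hypothetical 4-cell corridor; all names and numbers here are illustrative.
S: List[int] = [0, 1, 2, 3]        # set of states; state 3 is terminal
A: List[str] = ["left", "right"]   # set of actions
TERMINAL = 3
GAMMA = 0.9                        # discount factor (an added assumption)


def step(s: int, a: str) -> int:
    """Deterministic transition: move one cell, clamped to the corridor."""
    if s == TERMINAL:
        return s
    return max(0, s - 1) if a == "left" else min(TERMINAL, s + 1)


def R(s: int, a: str) -> float:
    """Reward function: 1.0 for the move that enters the terminal state."""
    return 1.0 if s != TERMINAL and step(s, a) == TERMINAL else 0.0


def pi(s: int) -> str:
    """Policy: a fixed rule mapping each state to an action."""
    return "right"


def evaluate(policy: Callable[[int], str], sweeps: int = 50) -> Dict[int, float]:
    """Value V(s): discounted return from s when following the policy."""
    V = {s: 0.0 for s in S}
    for _ in range(sweeps):
        for s in S:
            if s == TERMINAL:
                continue  # no further reward after the terminal state
            a = policy(s)
            V[s] = R(s, a) + GAMMA * V[step(s, a)]
    return V


V = evaluate(pi)
```

Under these assumptions the values come out to roughly 0.81, 0.9, and 1.0 for states 0, 1, and 2: the closer a state is to the rewarded move, the less the reward is discounted.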




