The Q-function, or action-value function, is a fundamental concept in reinforcement learning that measures the quality of an action taken in a given state. It gives the expected return (the cumulative, typically discounted, future reward) an agent obtains by taking a particular action in a particular state and following a specific policy thereafter.
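To make the definition concrete, here is a minimal sketch that computes the Q-function of a fixed policy by repeatedly applying the Bellman expectation update. Everything in it is an illustrative assumption and does not come from the text above: the three-state chain environment, the reward of 1 for reaching the terminal state, the discount factor of 0.9, the "always move right" policy, and the names `step`, `policy`, and `evaluate_q`.

```python
GAMMA = 0.9          # discount factor (assumed for illustration)
TERMINAL = 2         # states 0 and 1 are non-terminal, state 2 ends the episode
STAY, RIGHT = 0, 1   # the two available actions


def step(state, action):
    """Hypothetical deterministic environment: return (next_state, reward)."""
    if action == RIGHT:
        next_state = state + 1
        reward = 1.0 if next_state == TERMINAL else 0.0
        return next_state, reward
    return state, 0.0  # STAY keeps the state and earns nothing


def policy(state):
    """Fixed policy being evaluated: always move right."""
    return RIGHT


def evaluate_q(num_sweeps=100):
    """Estimate Q(s, a) for the fixed policy by iterating
    Q(s, a) = r + GAMMA * Q(s', policy(s')), with Q = 0 at the terminal state."""
    q = {(s, a): 0.0 for s in (0, 1) for a in (STAY, RIGHT)}
    for _ in range(num_sweeps):
        for (s, a) in q:
            next_state, reward = step(s, a)
            if next_state == TERMINAL:
                future = 0.0
            else:
                future = q[(next_state, policy(next_state))]
            q[(s, a)] = reward + GAMMA * future
    return q


q_values = evaluate_q()
for (s, a), value in sorted(q_values.items()):
    print(f"Q(state={s}, action={'RIGHT' if a == RIGHT else 'STAY'}) = {value:.2f}")
```

In this toy setting, choosing STAY in a state is worth exactly GAMMA times choosing RIGHT there, because the first action wastes one step and the policy moves right from then on. The sketch assumes a known, deterministic environment; when the dynamics are unknown, Q is usually estimated from sampled experience instead.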
