Function Approximators For Solving Reinforcement Learning Problems

Choosing a function approximator is the most important decision in how you solve a reinforcement learning problem.

It determines how you represent:

  • The State-Value Function v(s)
  • The Action-Value Function q(s, a)
  • The Policy π(a|s)
  • The Transition Model p(s'|s,a)

These are the key functions we approximate when solving a reinforcement learning problem.

Tabular Representations

Appropriate use cases: small, discrete state and action spaces (gridworlds, bandits). A minimal tabular Q-learning sketch follows the lists below.

Pros:

  • Exact
  • Simple
  • Converges with theoretical guarantees

Cons:

  • Infeasible for large or continuous state spaces
  • No generalization across states
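
As promised above, here is a minimal tabular Q-learning sketch for a small discrete environment. The environment interface (`env.reset()` and `env.step(a)` returning `(next_state, reward, done)`) and the hyperparameters are illustrative assumptions, not a specific library's API.

```python
import numpy as np

def tabular_q_learning(env, n_states, n_actions,
                       episodes=500, alpha=0.1, gamma=0.99, epsilon=0.1):
    """Tabular Q-learning: one exact table cell per (state, action) pair."""
    Q = np.zeros((n_states, n_actions))
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # Epsilon-greedy action selection
            if np.random.rand() < epsilon:
                a = np.random.randint(n_actions)
            else:
                a = int(np.argmax(Q[s]))
            s_next, r, done = env.step(a)
            # Update toward the bootstrapped one-step target
            target = r + gamma * (0.0 if done else np.max(Q[s_next]))
            Q[s, a] += alpha * (target - Q[s, a])
            s = s_next
    return Q
```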

Linear Function Approximators

Used heavily in classic RL (Sutton & Barto, tile coding, linear TD/SARSA methods).

Feature types:

  • Tile coding (very popular)
  • Radial basis functions
  • Fourier basis
  • Polynomial features

Linear approximators work well for control tasks like Mountain Car, Acrobot, and Lunar Lander when paired with SARSA, TD learning, or actor–critic methods.
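
For example, a linear state-value approximator represents v̂(s) = wᵀφ(s) and updates the weights with semi-gradient TD(0). The sketch below uses a simple Fourier basis over a one-dimensional state scaled to [0, 1]; the basis order and step size are illustrative assumptions.

```python
import numpy as np

def fourier_features(s, order=3):
    """Fourier basis: phi_i(s) = cos(i * pi * s) for a 1-D state s scaled to [0, 1]."""
    return np.cos(np.pi * np.arange(order + 1) * s)

def td0_update(w, s, r, s_next, done, alpha=0.05, gamma=0.99, order=3):
    """One semi-gradient TD(0) step for a linear value function v(s) = w . phi(s)."""
    phi = fourier_features(s, order)
    v = w @ phi
    v_next = 0.0 if done else w @ fourier_features(s_next, order)
    td_error = r + gamma * v_next - v
    # For a linear approximator the gradient of v w.r.t. w is simply phi
    return w + alpha * td_error * phi
```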

Neural Networks (Deep Function Approximation)

When state spaces are large, high-dimensional, or continuous, neural networks are the standard choice.

Types:

a) Multilayer Perceptrons (MLPs)

Used in the following algorithms (a minimal Q-network sketch follows this list):

  • DQN (low-dimensional observations)
  • A2C
  • PPO
  • DDPG
  • TD3
  • SAC
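
As a concrete example, a small MLP Q-network of the kind used in DQN-style agents on low-dimensional observations might look like the following PyTorch sketch; the layer sizes and dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn

class MLPQNetwork(nn.Module):
    """Maps a flat observation vector to one Q-value per discrete action."""
    def __init__(self, obs_dim, n_actions, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, obs):
        return self.net(obs)

# Greedy action selection from a batch of observations
q_net = MLPQNetwork(obs_dim=8, n_actions=4)
obs = torch.randn(1, 8)
action = q_net(obs).argmax(dim=-1)
```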

b) Convolutional Neural Networks (CNNs)

Used in:

  • Atari
  • Robotic vision
  • Autonomous driving simulation

c) Recurrent Neural Networks (RNNs: LSTM / GRU)

Used when the environment is partially observable (a POMDP).
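
A minimal sketch of a recurrent policy head: a GRU carries a hidden state across timesteps so the agent can aggregate partial observations into a summary of the history. Sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class RecurrentPolicy(nn.Module):
    """GRU over observation sequences; the hidden state summarizes history in a POMDP."""
    def __init__(self, obs_dim, n_actions, hidden=64):
        super().__init__()
        self.gru = nn.GRU(obs_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, obs_seq, h=None):
        # obs_seq: (batch, time, obs_dim); h carries memory between calls
        out, h = self.gru(obs_seq, h)
        return self.head(out), h
```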

d) Transformers

Used in:

  • Decision Transformers
  • Gato
  • World-model RL (like "Ghost" / large agent models)

Model-Based Approximators

Function approximators used to learn transition or reward models:

a) Neural networks

Learn f(s, a) → s' and r(s, a)
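
A minimal sketch of such a learned dynamics model: a single MLP takes (s, a) and predicts both the next state and the reward. The architecture and sizes are assumptions for illustration.

```python
import torch
import torch.nn as nn

class DynamicsModel(nn.Module):
    """Learns f(s, a) -> (s', r) from transition data."""
    def __init__(self, obs_dim, act_dim, hidden=200):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.next_state = nn.Linear(hidden, obs_dim)
        self.reward = nn.Linear(hidden, 1)

    def forward(self, s, a):
        h = self.body(torch.cat([s, a], dim=-1))
        return self.next_state(h), self.reward(h)
```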

b) Probabilistic models

  • Gaussian mixtures
  • Ensembles (as in MBPO, PETS; a minimal ensemble sketch follows the list below)
  • Bayesian neural networks

Used in:

  • Dreamer / World Models
  • Model-based control (MPC + learned model)
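
Ensembles of dynamics models are a common way to capture model uncertainty in PETS/MBPO-style methods: train several independently initialized models and treat disagreement between their predictions as an uncertainty signal. The sketch below reuses the hypothetical `DynamicsModel` class from the earlier example; the ensemble size and dimensions are assumptions.

```python
import torch

# Hypothetical ensemble over the DynamicsModel sketch above
models = [DynamicsModel(obs_dim=4, act_dim=1) for _ in range(5)]

def predict_with_uncertainty(s, a):
    """Mean next-state prediction plus ensemble disagreement as a crude uncertainty estimate."""
    # s: (batch, obs_dim), a: (batch, act_dim) tensors
    preds = torch.stack([m(s, a)[0] for m in models])  # (ensemble, batch, obs_dim)
    return preds.mean(dim=0), preds.std(dim=0)
```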

Hybrid Approximators

Examples:

  • Tile coding for input → neural network for output
  • Fourier basis → critic
  • CNN encoder → MLP policy (sketched at the end of this section)

Useful when:

  • The raw observation is large or unstructured (e.g., images)
  • But the structure inside it is simple (angles, positions)
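
A minimal sketch of the CNN-encoder → MLP-policy pattern: a small convolutional encoder compresses image observations into a feature vector, and an MLP head outputs action logits. The input shape (84x84 grayscale frames) and layer sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class CNNEncoderPolicy(nn.Module):
    """CNN encoder over 84x84 grayscale frames feeding an MLP policy head."""
    def __init__(self, n_actions):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        self.policy = nn.Sequential(
            nn.Linear(64 * 7 * 7, 256), nn.ReLU(),
            nn.Linear(256, n_actions),
        )

    def forward(self, frames):
        # frames: (batch, 1, 84, 84) -> action logits
        return self.policy(self.encoder(frames))
```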