Sutton & Barto's canonical RL text, implemented in kdb+/q.
Read online: cloudstreet-dev.github.io/Reinforcement-Learning-with-Q
There are many reinforcement learning tutorials. All of them are in Python. This one is not.
This book follows Sutton & Barto's Reinforcement Learning: An Introduction (2nd ed.) chapter by chapter, implementing every algorithm in kdb+/q. The reader is a q developer who wants to understand RL properly, not port someone else's NumPy code. The code runs. The math is right. The environments match S&B so you can cross-reference.
| Chapter | Topic | S&B Reference |
|---|---|---|
| Introduction | Why RL in q? Why not? | — |
| Bandits | Epsilon-greedy, UCB, gradient bandit | Ch. 2 |
| MDPs | States, actions, rewards, GridWorld | Ch. 3 |
| Dynamic Programming | Policy eval/iteration, value iteration | Ch. 4 |
| Monte Carlo | Blackjack, first-visit MC, MC control | Ch. 5 |
| Temporal Difference | TD(0), SARSA, Q-Learning, cliff walking | Ch. 6 |
| N-Step Methods | N-step TD/SARSA, eligibility traces | Ch. 7 |
| Function Approximation | Tile coding, semi-gradient TD, mountain car | Ch. 9–10 |
| Policy Gradient | REINFORCE, baseline, actor-critic | Ch. 13 |
| Conclusion | Where to go from here | — |
- kdb+/q — the personal edition covers everything in this book
- Familiarity with q syntax (`imax`, `{x+y}/`, functional amend)
- Basic ML literacy helps; deep RL background is not required
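As a flavor of the q idioms listed above, here is a minimal sketch (variable names are illustrative, not taken from the book's code) of greedy action selection and an incremental value update for a bandit:

```q
/ action-value estimates for a 5-armed bandit (illustrative values)
Q:0.1 0.5 0.3 0.2 0.4

/ imax: index of the maximum, i.e. the greedy action
a:imax Q                    / -> 1

/ {x+y}/ : fold addition over a list (sum, spelled as an over)
total:{x+y}/[1 2 3 4]       / -> 10

/ functional amend: nudge Q[a] toward an observed reward r
r:1f
Q:@[Q;a;+;0.1*r-Q a]        / Q[1] is now 0.55
```

If these three idioms read naturally to you, the rest of the book's code will too.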
```sh
# Install mdBook (https://rust-lang.github.io/mdBook/)
cargo install mdbook

# Build
mdbook build

# Serve with live reload
mdbook serve --open
```

See LICENSE.