Q-learning V 003

Description
A variable-length reinforcement learning algorithm that uses the Bellman equation to learn, over time, how to pick the best loadout for a given team. To add more weapons, add an action object, assign the User Zulu label to it, and add the selection to the brain.
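For readers unfamiliar with the technique, here is a minimal Python sketch of tabular Q-learning with a Bellman-style update. It is not the script itself. The class name `LoadoutLearner`, the weapon names, the reward values, and the `alpha`, `gamma`, and `epsilon` parameters are all assumptions made for illustration. The sketch treats the team as the state and the loadout as the action, and `add_action` loosely mirrors adding another action object.

```python
import random
from collections import defaultdict


class LoadoutLearner:
    """Hypothetical tabular Q-learner: state = team, action = loadout."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)   # available loadouts (can grow later)
        self.alpha = alpha             # learning rate (assumed value)
        self.gamma = gamma             # discount factor (assumed value)
        self.epsilon = epsilon         # exploration rate (assumed value)
        self.rng = random.Random(seed)
        self.q = defaultdict(float)    # (state, action) -> estimated value

    def add_action(self, action):
        """Register a new loadout; its Q-values start at 0."""
        if action not in self.actions:
            self.actions.append(action)

    def best_action(self, state):
        """Return the loadout with the highest current estimate."""
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def choose(self, state):
        """Epsilon-greedy: mostly exploit, sometimes explore."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return self.best_action(state)

    def update(self, state, action, reward, next_state):
        """Bellman update: Q += alpha * (r + gamma * max Q' - Q)."""
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


# Usage with made-up weapons and rewards.
learner = LoadoutLearner(["rifle", "pistol"])
learner.update("red", "rifle", reward=1.0, next_state="red")
learner.add_action("sniper")           # table grows without retraining
print(learner.best_action("red"))
```

Because the Q-table is keyed by `(state, action)` pairs, adding a loadout only extends the set of keys, so estimates already learned for other loadouts are kept.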