Content Browser

Featured Browse All My Bookmarks

Q-learning V 003

3 Ratings

7

Bookmarks

0

Plays

Your Rating

About

Description

A variable-length reinforcement learning algorithm that uses the Bellman Equation to learn how to pick the best loadout for a given team over time. If you want to add more weapons, add an action object, assign User Zulu label to it, add selection to brain.