跳至主要内容
前往所有服役记录

Content Browser

Q-learning V 003

Thumbnail: Q-learning

3 Ratings

7

Bookmarks

0

Plays

Your Rating

About

Description
A variable-length reinforcement learning algorithm that uses the Bellman Equation to learn how to pick the best loadout for a given team over time. If you want to add more weapons, add an action object, assign User Zulu label to it, add selection to brain.