2021-01-07から1日間の記事一覧

2021-01-07

深層分布強化学習 ① Categorical DQN（C51）

分布強化学習（distributional reinforcement learning）の概念を深層強化学習へ導入したCategorical DQN（C51）をtensorflow2で実装します。 why restrict ourselves to the mean? ― [1707.06887] A Distributional Perspective on Reinforcement Learning …

#強化学習 #tensorflow2 #CategoricalDQN

どこから見てもメンダコ

軟体動物門頭足綱八腕類メンダコ科

2021-01-07から1日間の記事一覧

深層分布強化学習 ① Categorical DQN（C51）