V2EX = way to explore
V2EX 是一个关于分享和探索的地方
Sign Up Now
For Existing Member  Sign In
V2EX  ›  reinforcement
zoe1016aaa NVIDIA Shanghai LLM Reinforcement Learning Algorithm Engineer
酷工作  •  zoe1016aaa  •  14 days ago  •  Lastly replied by tyroshu
1
lufficc Reinforcement Learning 的核心基础概念及实现
  •  2   
    Python  •  lufficc  •  May 3, 2017  •  Lastly replied by aphorism
    5
    About   ·   Help   ·   Advertise   ·   Blog   ·   API   ·   FAQ   ·   Solana   ·   5405 Online   Highest 6679   ·     Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 · 11ms · UTC 03:54 · PVG 11:54 · LAX 20:54 · JFK 23:54
    ♥ Do have faith in what you're doing.