作者搜索結果

Select result number 1
1

A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward Empirical Results with Yield Management and Convergence Analysis. 由 Gosavi, Abhijit

發表在 Machine learning.

索引號: loading...
位於: loading...

Article loading...

添加到書包從書包裡刪除

TUKLAS: UP Libraries' Resource Discovery Tool
Copyright © 2020-2021. The University Library, University of the Philippines Diliman