Gosavi, A. A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis. Machine learning..
Chicagoスタイル(17版)引用形式Gosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning. .
MLA(9版)引用形式Gosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning., .
警告: この引用は必ずしも正確ではありません.