Gosavi, A. A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis. Machine learning..
Chicago Style (17. basım) AtıfGosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning. .
MLA (9th ed.) AtıfGosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning., .
Uyarı: Bu alıntı herzaman %100 doğru olmayabilir..