Gosavi, A. A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis. Machine Learning.
Chicago Style (17th ed.) Citation
Gosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning.
MLA (9th ed.) Citation
Gosavi, Abhijit. "A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis." Machine Learning.