Author Search Results

A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis. by Gosavi, Abhijit

Published in Machine Learning.

TUKLAS: UP Libraries' Resource Discovery Tool
Copyright © 2020-2021. The University Library, University of the Philippines Diliman