[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: some ideas
Ives,
As opposed to for instance Backgammon [1], Go is strictly
deterministic, leading to some problems when applying
algorithms such as temporal difference learning (TD).
Still, there has been attempts to work that out [2].
Henrik
[1] G. Tesauro's TD-gammon (Known, I guess)
[2] N. Schraudolph, P. Dayan, T.J. Sejnowski,
"Temporal difference leraning of position evaluation
in the game of go", Preprint. Check salk.edu?
--
Henrik Rydberg (http://fy.chalmers.se/~rydberg),
Department of Applied Physics, Chalmers University of Technology.