[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: some ideas



Ives,

As opposed to for instance Backgammon [1], Go is strictly
deterministic, leading to some problems when applying
algorithms such as temporal difference learning (TD).
Still, there has been attempts to work that out [2].

Henrik

[1]   G. Tesauro's TD-gammon (Known, I guess)
[2]   N. Schraudolph, P. Dayan, T.J. Sejnowski,
      "Temporal difference leraning of position evaluation
       in the game of go", Preprint. Check salk.edu?

-- 
Henrik Rydberg (http://fy.chalmers.se/~rydberg),
Department of Applied Physics, Chalmers University of Technology.