[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: computer-go: Temporal Difference Learning



> Are there any major deviations from this plan within TD
> programs? Specifically, is anyone:
>
> ...doing more than 1 ply lookahead?

there is an algorithm called TDLeaf, but I am not convinced 
that it is useful.

> ...using TD to learn to answer more specific questions,
> such as, "can these two chains connect"?

NeuroGo in its most recent version uses local connectivity 
and single-point eyes as additional outputs that are 
trained with TD. I will present a paper about this at 
ACG2003 which takes place together with the Computer 
Olympiad in Graz/Austria in November.

- Markus