Re: computer-go: Temporal Difference Learning

To: computer-go@xxxxxxxxxxxxxxxxx
Subject: Re: computer-go: Temporal Difference Learning
From: Peter Drake <drake@xxxxxxxxxx>
Date: Mon, 14 Jul 2003 16:09:54 -0700
In-reply-to: <200307141621.14922.compgo@xxxxxxxxxxxxxxxxx>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx

On Monday, July 14, 2003, at 03:21  PM, Markus Enzenberger wrote:

>> Are there any major deviations from this plan within TD
>> programs? Specifically, is anyone:
>>
>> ...doing more than 1 ply lookahead?
>
> there is an algorithm called TDLeaf, but I am not convinced
> that it is useful.

A quick web search found a paper by Baxter, Tridgell, and Weaver. Is this the canonical one?

Also, can you say why you're not convinced this is useful?

>> ...using TD to learn to answer more specific questions,
>> such as, "can these two chains connect"?
>
> NeuroGo in its most recent version uses local connectivity
> and single-point eyes as additional outputs that are
> trained with TD. I will present a paper about this at
> ACG2003 which takes place together with the Computer
> Olympiad in Graz/Austria in November.

So when and how do those of us stuck stateside get ahold of it?  :-)

Thanks,

Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/
