
Re: computer-go: Learning from existing games



On Mon, Jan 13, 2003 at 03:36:55PM -0800, Piotr Kaminski wrote:
> Nicol Schraudolph et al.:  Learning to Evaluate Go Positions Via Temporal
> Difference Methods (http://www.inf.ethz.ch/~schraudo/pubs/gochap.pdf)

Interesting paper. I would not have thought that a neural network that sees
only the raw board position could ever get that far.

My own theory, which I have not had the time to do much about, and probably
never will, is that feeding the raw board position to the network is not
sufficient; we need a higher level of abstraction.

Most current programs extract a lot of information from the position
(grouping stones into strings and strings into groups; counting liberties,
eyes, territory, and so on). I feel that this sort of information would be
a much better input for a network that estimates the winning probability
(or the score).

Actually, I believe it would be enough to compute a handful of key numbers
that summarize the position: the number of captured stones; the number of
strings (and stones) in atari or with 2-5 liberties; the number and total
size of living groups, weak groups, and dead groups; the number of points
under more or less strong black/white control; and so on. A small set of
(say) 100 numerical inputs ought to suffice. I believe TD-learning would
work better on such a network.
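To make the idea concrete, here is a minimal sketch (in Python, and nothing
like a real engine's analysis) of turning a raw board into a few of those
summary numbers: per-colour counts of strings in atari and strings with 2-5
liberties. The board encoding ('.', 'B', 'W') and the feature names are my
own illustrative choices.

```python
def strings_with_liberties(board):
    """Yield (colour, stones, liberties) for every string on the board."""
    size = len(board)
    seen = set()
    for r in range(size):
        for c in range(size):
            colour = board[r][c]
            if colour == '.' or (r, c) in seen:
                continue
            # Flood-fill the string of connected same-colour stones,
            # collecting its empty neighbours (liberties) as we go.
            stack, stones, libs = [(r, c)], set(), set()
            while stack:
                y, x = stack.pop()
                if (y, x) in stones:
                    continue
                stones.add((y, x))
                for ny, nx in ((y-1, x), (y+1, x), (y, x-1), (y, x+1)):
                    if 0 <= ny < size and 0 <= nx < size:
                        if board[ny][nx] == colour:
                            stack.append((ny, nx))
                        elif board[ny][nx] == '.':
                            libs.add((ny, nx))
            seen |= stones
            yield colour, stones, libs

def feature_vector(board):
    """Count strings in atari and strings with 2-5 liberties, per colour."""
    feats = {'B_atari': 0, 'B_2to5_libs': 0, 'W_atari': 0, 'W_2to5_libs': 0}
    for colour, stones, libs in strings_with_liberties(board):
        if len(libs) == 1:
            feats[colour + '_atari'] += 1
        elif 2 <= len(libs) <= 5:
            feats[colour + '_2to5_libs'] += 1
    return feats

board = ["....",
         ".BW.",
         ".BW.",
         "...."]
print(feature_vector(board))
# → {'B_atari': 0, 'B_2to5_libs': 1, 'W_atari': 0, 'W_2to5_libs': 1}
```

A real feature set would of course add capture counts, group status, and
territory estimates, but the shape of the output - a short dictionary of
numbers instead of 361 intersections - is the point.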

Has anyone tried this sort of thing?  It should not be an overly difficult
job to take (for example) GnuGo's engine, extract the first analysis phase,
calculate those numbers, feed them to a network, and train with TD(0).
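For what the training step would look like, here is a hedged sketch of
TD(0) over such a feature vector, using a plain linear value function
V(s) = w . x(s) in place of a network to keep it short. The learning rate,
discount, and the toy episode format are illustrative assumptions, not
anything from the paper.

```python
def td0_update(w, x, x_next, reward, alpha=0.1, gamma=1.0):
    """One TD(0) step: move V(x) toward reward + gamma * V(x_next)."""
    v = sum(wi * xi for wi, xi in zip(w, x))
    v_next = sum(wi * xi for wi, xi in zip(w, x_next))
    delta = reward + gamma * v_next - v          # the TD error
    return [wi + alpha * delta * xi for wi, xi in zip(w, x)]

def train(episodes, n_feats, alpha=0.1):
    """episodes: list of (feature-vector sequence, final result) pairs,
    where result is e.g. 1.0 for a win and 0.0 for a loss."""
    w = [0.0] * n_feats
    for states, result in episodes:
        # Intermediate moves carry no reward; only the final outcome does.
        for x, x_next in zip(states, states[1:]):
            w = td0_update(w, x, x_next, 0.0, alpha)
        # Terminal transition: V(terminal) = 0, reward = game result.
        terminal = [0.0] * n_feats
        w = td0_update(w, states[-1], terminal, result, alpha)
    return w
```

With a single always-on feature and a stream of won games, the weight
climbs toward 1.0, which is all the machinery TD(0) needs; replacing the
linear V with a small network only changes how the gradient is applied.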

Regards
	Heikki


-- 
Heikki Levanto  LSD - Levanto Software Development   <heikki@xxxxxxxxxxxxxxxxx>