[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: computer-go: Programs learning to play Go

To: computer-go@xxxxxxxxxxxxxxxxx,computer-go@xxxxxxxxxxxxxxxxx
Subject: Re: computer-go: Programs learning to play Go
From: David Fotland <fotland@xxxxxxxxxxxxxxx>
Date: Tue, 21 Aug 2001 20:33:45 -0700
In-reply-to: <Pine.GSO.4.21.0108210924260.29772-100000@eldar_0>
References: <5.1.0.14.1.20010820102124.028543a0@xxxxxxxxxxxxxxxxx>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx



At 09:47 AM 8/21/2001 -0700, Matthew Corey Brown wrote:

The one thing in my opinion that makes it different from backgamonn and
checkers (blondie24) model, is passing.. THe other games there are a set
condition that determines the end of the game. Not so in Go, where the
players them selves choose when the game ends. (Well there is the rare
condition when there are no legal moves left) The trick is to seprate the
pasiing desicion from the best move descion. And this is what i am
planning once i finish the rewrite of my NN computation program.

No.  Treat a pass just like any other move, and pass when the pass is
the best move.  This is what Many Faces does.  There is no need to separate
the passing decision.

Put other intracatble problems (like TSP) are manageable with social
learning techniques. I do expect to take many 1000's of generations to
be even slightly competent as the dimensionality of the net is so
high. But social learning speeds up the training of nueral nets. (I expect
the 1000's of generations with social learning)

I'm don't think that a net with inputs of stone positions can ever learn to play
go with any understanding of simple tactics. I'll make the following challenge:

After you train it up until you thing it is slightly competent, play it against the
Larson algorithm. I expect it will lose and so demonstrate its incompetence.

The Larson algorithm is: Pick the string with the fewest number of liberties
(choose randomly if there are ties). Fill a liberty of that string, chosen at random.
If this move is illegal, pick another one. If you program doesn't understand liberties and
eyes, it will lose to this trival algorithm.

If you want a slightly tough opponent, choose the liberty that has the highest number of adjacent
empty points rather than a random one. But I think the random algorithm will
beat the neural net.

This simple test is what convinced me that my first go program, based on influence, was
the wrong approach, and that tactics is much more important than shapes or influence.

Matthew Corey Brown                             bromoc@xxxxxxxxxxxxxxxxx
                  Happiness is a dry place to live.

David Fotland

References:
- computer-go: Programs learning to play Go
  - From: Dave Dyer
- Re: computer-go: Programs learning to play Go
  - From: Matthew Corey Brown

Prev by Date: Re: computer-go: Programs learning to play Go
Next by Date: Re: computer-go: Programs learning to play Go
Previous by thread: Re: computer-go: Programs learning to play Go
Next by thread: Re: computer-go: Programs learning to play Go
Index(es):
- Date
- Thread