[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

computer-go: RE: Learn from Self-Play

To: "Computer-Go" <computer-go@xxxxxxxxxxxxxxxxx>,Måns Ullerstam <mans@xxxxxxxxxxxx>
Subject: computer-go: RE: Learn from Self-Play
From: "Ran Xiao" <ranxiao@xxxxxxxxxxxx>
Date: Mon, 17 Feb 2003 13:29:35 -0600
Importance: Normal
In-reply-to: <000301c2d658$05225f40$6600a8c0@kanin>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx

-----Original Message-----
From: Måns Ullerstam [mailto:mans@xxxxxxxxxxxxxxxxx]
Sent: Monday, February 17, 2003 1:42 AM
To: 'Ran Xiao'
Subject: RE: computer-go: Learn from Self-Play

- Is that a neural network or are you storing all board positions and moves
directly?

All board positions are stored, and NN is used to learn the evaluation
function.

- You are saying that you let them play other computer go programs first.
What programs and for how many games?

ManyFaces, HandTalk, EZGO, Goliath, and TurboGO. All downloadable from
somewhere Internet free on 9x9 board.

- You are not specifically specifying the learning mechanism, but it souds
like you have implemented some form of Reinforcement Learning,TD(lambda) or
something similar. Can you elaborate on that.

Since I save all board positions with their scores, direct TD or RL are not
really needed.

- The evaluation function is evaluating what, the score of the board, the
probability of winning, something else?

Based on the score of the board.

Weimin Xiao

Follow-Ups:
- computer-go: RE: Learn from Self-Play
  - From: Måns Ullerstam

Prev by Date: computer-go: Re: life and death problem sets?
Next by Date: computer-go: email privacy on web archives
Previous by thread: computer-go: Re: life and death problem sets?
Next by thread: computer-go: RE: Learn from Self-Play
Index(es):
- Date
- Thread