[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

computer-go: RE: Learn from Self-Play



-----Original Message-----
From: Måns Ullerstam [mailto:mans@xxxxxxxxxxxxxxxxx]
Sent: Monday, February 17, 2003 1:42 AM
To: 'Ran Xiao'
Subject: RE: computer-go: Learn from Self-Play

- Is that a neural network or are you storing all board positions and moves
directly?

All board positions are stored, and NN is used to learn the evaluation
function.

- You are saying that you let them play other computer go programs first.
What programs and for how many games?

ManyFaces, HandTalk, EZGO, Goliath, and TurboGO. All downloadable from
somewhere Internet free on 9x9 board.

- You are not specifically specifying the learning mechanism, but it souds
like you have implemented some form of Reinforcement Learning,TD(lambda) or
something similar. Can you elaborate on that.

Since I save all board positions with their scores, direct TD or RL are not
really needed.

- The evaluation function is evaluating what, the score of the board, the
probability of winning, something else?

Based on the score of the board.


Weimin Xiao