[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: computer-go: Perl Module for next move.

To: computer-go@xxxxxxxxxxxxxxxxx
Subject: RE: computer-go: Perl Module for next move.
From: "Staelin, Carl" <staelin@xxxxxxxxxxxxxxx>
Date: Tue, 5 Jun 2001 23:19:43 -0700
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx

The original work on using temporal difference
learning for game players was by Gerald Tesauro
to train a backgammon player.  A copy of the 
paper can be found at:
	
	http://www.research.ibm.com/massive/tdl.html

Others thought it might be a nice technique to
apply to other games, such as checkers or GO.
Chellapilla and Fogel trained checkers players
using techniques similar to those used by 
Tesauro for his backgammon player.

	http://vision.ucsd.edu/~kchellap/papers
(Look for Chellapilla and Fogel, "Co-evolving
checkers playing programs using only win, lose
or draw", from SPIE 1999.

There has also been some work done on applying
similar techniques to GO players, such as Nici
Schraudolphs work on applying temporal difference
learning neural networks to GO:

	http://www.idsia.ch/~nic/pubs.html#gochap

Cheers,

Carl
_________________________________________________
[(hp)]	Carl Staelin
	Senior Research Scientist
	Hewlett-Packard Laboratories
	Technion City
	Haifa, 32000
	ISRAEL
	+972(4)823-1237x221	+972(4)822-0407 fax
	staelin@xxxxxxxxxxxxxxxxx
_______http://www.hpl.hp.com/personal/Carl_Staelin_______


> -----Original Message-----
> From: Matthew Corey Brown [mailto:bromoc@xxxxxxxxxxxxxxxxx]
> Sent: Wednesday, June 06, 2001 12:51 AM
> To: computer-go@xxxxxxxxxxxxxxxxx
> Subject: RE: computer-go: Perl Module for next move.
> 
> 
> On Tue, 5 Jun 2001, Mark Boon wrote:
> 
> > In a previous post a checkers project was said to use neural nets
> > successfully, but I'm wondering how much was done by a fast 
> mini-max program
> > which used the neural net in some way (as a move-selector 
> for example)
> > instead of having the net learn the mini-max strategy as 
> well. I somehow
> > feel the latter is necessary for neural networks to be 
> successful in Go.
> > 
> 
> What i was told all it did was say which move you made result 
> in a better
> position.. it never looked ahead.. just what was best in the here and
> now and which move would it be to improve your position. I 
> learned it from
> a SIG in AI that the local game dev community has started up. 
> I'm going to
> a different SIG meeting tonight but the person who told me 
> about it will
> be there and I'll try and get more reference. But the neural 
> net was the
> only descion maker of the checkers game to my understanding. The idea
> intrigued me when i heard it last week. 
> 
> 
> Matthew Corey Brown                             
> bromoc@xxxxxxxxxxxxxxxxx 
>   "Death can not stop true love. All it can do is delay it 
> for awhile."
>

Follow-Ups:
- computer-go: Who is "Xiaohan Huang"?
  - From: Jan van der Steen
- Re: computer-go: Perl Module for next move.
  - From: Toni Cebrian

Prev by Date: RE: computer-go: Perl Module for next move.
Next by Date: Re: computer-go: Perl Module for next move.
Previous by thread: Re: computer-go: Perl Module for next move.
Next by thread: Re: computer-go: Perl Module for next move.
Index(es):
- Date
- Thread