[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: computer-go: Learning from existing games

To: computer-go@xxxxxxxxxxxxxxxxx
Subject: Re: computer-go: Learning from existing games
From: Erik van der Werf <E.vanderWerf@xxxxxxxxxxxxx>
Date: Thu, 16 Jan 2003 11:06:58 -0100
References: <Pine.LNX.4.44.0301141957260.31137-100000@xxxxxxxxxxxxxxxxx> <200301141943.09995.compgo@xxxxxxxxxxxxxxxxx>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx

Markus Enzenberger wrote:
> The big problem is the training time. I found TD with self-played games
> superior to other training methods and it takes at least 100000 games
> for best results (several weeks or even months on a fast PC).
> Algorithms for faster weight update are not necessarily helpful, because
> they decrease the exploration that is done with a given set of weights.

Do you train on-line or in batch style (e.g., several games before a
weight update)?

I think algorithms for faster weight update only work well with stable
gradient information. On-line learning may not provide good enough
gradient information.

Erik.

Follow-Ups:
- Re: computer-go: Learning from existing games
  - From: schraudo
- Re: computer-go: Learning from existing games
  - From: Markus Enzenberger
- Re: computer-go: Learning from existing games
  - From: Jan Ramon

References:
- Re: computer-go: Learning from existing games
  - From: schraudo
- Re: computer-go: Learning from existing games
  - From: Markus Enzenberger

Prev by Date: Re: computer-go: Upcoming competitions?
Next by Date: computer-go: Number of Go players in the world
Previous by thread: Re: computer-go: Learning from existing games
Next by thread: Re: computer-go: Learning from existing games
Index(es):
- Date
- Thread