[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: computer-go: Learning from existing games

To: computer-go@xxxxxxxxxxxxxxxxx
Subject: Re: computer-go: Learning from existing games
From: Markus Enzenberger <compgo@xxxxxxxxxxxxxxxxxxxxx>
Date: Thu, 16 Jan 2003 17:59:24 -0700
In-reply-to: <3E26A062.4365024@xxxxxxxxxxxxxxxxx>
References: <Pine.LNX.4.44.0301141957260.31137-100000@xxxxxxxxxxxxxxxxx> <200301141943.09995.compgo@xxxxxxxxxxxxxxxxx> <3E26A062.4365024@xxxxxxxxxxxxxxxxx>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx
User-agent: KMail/1.4.3

On Thursday 16 January 2003 05:06, Erik van der Werf wrote:

> Do you train on-line or in batch style (e.g., several games before a
> weight update)?
>
> I think algorithms for faster weight update only work well with
> stable gradient information. On-line learning may not provide good
> enough gradient information.

I tried batch update, but it gave worse results.
online training with TD(0) seemed to have a stabilizing effect
on the training if you go backwards through the game
and update the weights once per position.

- Markus

Follow-Ups:
- Re: computer-go: Learning from existing games
  - From: schraudo

References:
- Re: computer-go: Learning from existing games
  - From: schraudo
- Re: computer-go: Learning from existing games
  - From: Markus Enzenberger
- Re: computer-go: Learning from existing games
  - From: Erik van der Werf

Prev by Date: Re: computer-go: Discarding the rubbish moves
Next by Date: Re: computer-go: Number of Go players in the world
Previous by thread: Re: computer-go: Learning from existing games
Next by thread: Re: computer-go: Learning from existing games
Index(es):
- Date
- Thread