Re: computer-go: Temporal Difference Learning

To: computer-go@xxxxxxxxxxxxxxxxx
Subject: Re: computer-go: Temporal Difference Learning
From: Don Dailey <drd@xxxxxxx>
Date: Tue, 22 Jul 2003 01:46:37 -0400
Cc: computer-go@xxxxxxxxxxxxxxxxx
In-reply-to: <3.0.32.20030721181550.015125d8@xxxxxxxxxxxxxxxxx> (message from Vincent Diepeveen on Mon, 21 Jul 2003 18:15:51 +0100)
References: <3.0.32.20030721181550.015125d8@xxxxxxxxxxxxxxxxx>
Reply-to: computer-go@xxxxxxxxxxxxxxxxx
Sender: owner-computer-go@xxxxxxxxxxxxxxxxx

   My conclusion and that of everyone I spoke with was ...

I think my results show that you need to rethink your conclusions,
because I reported something much better than what you describe below.
If your results were really this bad, you are doing something
different than I am, and you should be trying to figure out what that
is.

   ... that within 1
   sequential movement through the evaluation function I can pick better
   values than TD learning will learn in 2 years on a 500-processor machine,
   under the assumption that the evaluation function is the evaluation
   function of a reasonable-strength chess program with not too many bugs (and
   because of that first condition the number of parameters is trivially
   several thousand or more). What does it take me to go through the
   parameters of a chess program in 1 sequential movement, 3 hours or 6 hours
   at most for a big evaluation function?

   Setting up the TD learning experiment takes longer already...
