[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: computer-go: Temporal Difference Learning
My conclusion and that of everyone i spoke with was ...
I think my results show that you need to rethink your conclusions.
Because I reported something much better than what you descibe below.
If your results were really this bad, you are doing something
different than I am and you should be trying to figure out what that
is.
... that within 1
sequential movement through the evaluation function i can pick better
values than TD learning in 2 years at a 500 processor machine will learn,
under the assumption that the evaluation function is the evaluation
function of a reasonable strength chessprogram with not too many bugs (and
because of that first condition the number of parameters is several
thousands trivially or more). What does it take me to go through the
parameters of a chessprogram in 1 sequential movement, 3 hours or 6 hours
at most for a big evaluation function?
Setting up the TD learning experiment takes longer already...