Re: [computer-go] TD(lambda), Neural Networks and evaluation functions
On Tue, 16 Sep 2003, Imran Ghory wrote:
> 1) Create 81 neural networks (one associated with each intersection on the
> board). Let's represent them by N(x, input-board) with x=1...81.
> 2) Use temporal difference learning to teach the neural networks, with the
> rewards being +1/-1 depending on which side controls that intersection at
> the end of that game.
>
> Has anyone experimented with this kind of approach before?
That's exactly what we did 10 years ago, except that we reduced the model
complexity by having all but the bias weights shared between the networks.
See http://n.schraudolph.org/pubs/gochap.pdf. Yes, it does work to some
extent, but you need better features to really get anywhere, à la NeuroGo.
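
To make the weight sharing concrete, here is a minimal sketch of one way to
set it up. It is not the architecture from the paper: the 3x3 input patch,
the single tanh unit per intersection, the names patch, predict and
td_lambda_episode, and the alpha and lam defaults are all assumptions chosen
for illustration. Each of the 81 outputs uses the same weight vector w and
has its own bias b[x]; the TD(lambda) target is the next position's
prediction, or the final +1/-1 ownership at the end of the game.

```python
import math

SIZE = 9                      # 9x9 board, 81 intersections
N = SIZE * SIZE
OFFSETS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)]

def patch(board, x):
    """3x3 neighbourhood of intersection x; off-board points read as 0."""
    r, c = divmod(x, SIZE)
    out = []
    for dr, dc in OFFSETS:
        rr, cc = r + dr, c + dc
        inside = 0 <= rr < SIZE and 0 <= cc < SIZE
        out.append(board[rr * SIZE + cc] if inside else 0.0)
    return out

def predict(w, b, board):
    """N(x, board) for every x: shared weights w, private bias b[x]."""
    return [math.tanh(sum(wi * fi for wi, fi in zip(w, patch(board, x))) + b[x])
            for x in range(N)]

def td_lambda_episode(w, b, positions, final_owner, alpha=0.01, lam=0.7):
    """One TD(lambda) pass over a finished game; updates w and b in place.

    positions   -- list of boards (81 floats each: +1, -1 or 0)
    final_owner -- 81 values in {+1, -1}: who controls each point at the end
    """
    k = len(OFFSETS)
    e_w = [[0.0] * k for _ in range(N)]   # eligibility traces, one row per output
    e_b = [0.0] * N
    for t, board in enumerate(positions):
        preds = predict(w, b, board)
        last = t + 1 == len(positions)
        # Bootstrap from the next prediction; use the true outcome at the end.
        targets = final_owner if last else predict(w, b, positions[t + 1])
        dw = [0.0] * k
        for x in range(N):
            g = 1.0 - preds[x] ** 2       # derivative of tanh at this output
            f = patch(board, x)
            e_w[x] = [lam * e + g * fi for e, fi in zip(e_w[x], f)]
            e_b[x] = lam * e_b[x] + g
            delta = targets[x] - preds[x]
            for i in range(k):
                dw[i] += alpha * delta * e_w[x][i]   # all outputs update the shared w
            b[x] += alpha * delta * e_b[x]           # only output x updates b[x]
        for i in range(k):
            w[i] += dw[i]
```

Starting from w = [0.0] * 9 and b = [0.0] * 81, calling td_lambda_episode
once per recorded game is enough to train it. With only raw stone colours in
a 3x3 window as input, it should not be expected to learn much beyond local
ownership tendencies.
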
- Nici.
--
Dr. Nicol N. Schraudolph http://n.schraudolph.org/
Steinwiesstr. 32 mobile: +41-76-585-3877
CH-8032 Zurich, Switzerland tel: -1-251-3661