[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [computer-go] Poster available

To: computer-go <computer-go@xxxxxxxxxxxxxxx>
Subject: Re: [computer-go] Poster available
From: Peter Drake <drake@xxxxxxxxxx>
Date: Mon, 27 Oct 2003 08:59:30 -0800
Delivered-to: computer-go@xxxxxxxxxxxxxxxxx
In-reply-to: <3F9D4D9A.7040409@xxxxxxxxxxxxxxxxx>
List-archive: <http://computer-go.org/pipermail/computer-go>
List-help: <mailto:computer-go-request@xxxxxxxxxxxxxxxxx?subject=help>
List-id: computer-go <computer-go.computer-go.org>
List-post: <mailto:computer-go@xxxxxxxxxxxxxxxxx>
List-subscribe: <http://computer-go.org/mailman/listinfo/computer-go>,<mailto:computer-go-request@xxxxxxxxxxxxxxxxx?subject=subscribe>
List-unsubscribe: <http://computer-go.org/mailman/listinfo/computer-go>,<mailto:computer-go-request@xxxxxxxxxxxxxxxxx?subject=unsubscribe>
Reply-to: computer-go <computer-go@xxxxxxxxxxxxxxx>
Sender: computer-go-bounces@xxxxxxxxxxxxxxx

Well, we're thinking about putting in much more structure to the program aside from the neural network part, such as keeping track of blocks, eyes, joiners, etc. in some incremental fashion. Perhaps we'll figure out which parts we can't build by hand and apply machine learning to those, rather than to the entire problem.

The idea of using an Elman net or somesuch, which could remember its previous computations, might be interesting...

On Monday, October 27, 2003, at 08:53 AM, Erik van der Werf wrote:

Ah yes, this seems somewhat similar to the type of networks you get with cascade correlation. IME such an architecture easily overfits on the training data, leading to poor generalization.

For a moment I thought you were doing something pseudo-recurrent like with time-delayed inputs. (Which could be derived from the internal states of the net at previous board positions.) :-)

Maybe you can compare some different architectures for your next draft?

Best,
Erik

Peter Drake wrote:
On Monday, October 27, 2003, at 01:28  AM, Erik van der Werf wrote:
Interesting. What do you mean by: "each hidden unit also receives information from all previous hidden units".

Exactly that. Suppose the network has three input units A, B, and C, three input units D, E, and F, and three output units G, H, and I. Each unit has incoming connections as follows:
D: ABC
E: ABCD
F: ABCDE
G: DEF
H: DEF
I: DEF
The intent was to avoid any decisions about how many hidden layers to have, how big to make them, etc. Any arrangement of hidden layers is a special case of this architecture.
In retrospect, the philosophy of making the architecture as general as possible and leaving the details up to the backpropagation algorithm may not have been wise. Our next draft will have far more structure.
Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/
_______________________________________________
computer-go mailing list
computer-go@xxxxxxxxxxxxxxxxx
http://computer-go.org/mailman/listinfo/computer-go
_______________________________________________
computer-go mailing list
computer-go@xxxxxxxxxxxxxxxxx
http://computer-go.org/mailman/listinfo/computer-go

Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/

_______________________________________________
computer-go mailing list
computer-go@xxxxxxxxxxxxxxxxx
http://computer-go.org/mailman/listinfo/computer-go

References:
- Re: [computer-go] Poster available
  - From: Erik van der Werf

Prev by Date: Re: [computer-go] Poster available
Next by Date: [computer-go] Introduction
Previous by thread: Re: [computer-go] Poster available
Next by thread: Re: [computer-go] Poster available
Index(es):
- Date
- Thread