[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
computer-go: Need help building Graded Gnu Go Problem Database
From: Pierce Wetter <pierce@xxxxxxxxxxxxxxxxx>
Date: Sat Jul 28, 2001 09:04:54 AM America/Phoenix
To: computer-go@xxxxxxxxxxxxxxxxx
Cc: gnugo@xxxxxxxxxxxxxxxxx, T.Wolf@xxxxxxxxxxxxxxxxx
Subject: Need help building Graded Gnu Go Problem Database
A while ago, there was a discussion about Graded Go Problems for
Beginners....
The problem with using GGPFB is that the problems are copyrighted...
So I asked Thomas Wolf the author of GoTools if he would be willing to
make a small fraction of his 24,000 problem set available as part of a
public domain benchmark for go program developers.
He agreed! He's willing to make somewhere between 500-1000 problems
available. Which is really cool, as the computer-go world will be able
to have a standard set of problems they can exchange. I'll put them
under the LGPL (Library Gnu Public License or "Lesser" GPL) so its clear
that by including them your program doesn't be come public too...
That was a month ago (I took time off to learn perl to parse his problem
database).
So yesterday, I worked out a program to convert from his format to sgf
format. (in Perl!)
Now here's the rub: I need to now filter this problem set down to size
(from 24,000 problems down to 500-1000 problems). I've also found that
SGF isn't the best format for problem sets.
So I need a combination of help and advice:
I need help with the filtering and grading of the problems.
You'll need access to your own go program for the grading step. You'll
also need access to the database of problems, either by owning GoTools
(preferred), or by having sent Thomas a license agreement for the 40,000
problem set he distributes for non-profit use:
http://alpha.qmw.ac.uk/~ugah006/gotools/t.wolf.gotools.problems.html .
Then I can send you versions of the perl program that converts the files
to sgf format, and you can try them against your program... (That is,
I'm distributing the program, not the problems. While we can distribute
the resulting problem set freely, during the beta period, we'll only be
distributing the programs...)
For the filtering step, a program that could tell how similar two
problems are would be useful, which requires much less go knowledge.
That way the 500 problems we choose would be diverse as possible.
I need advice on the best way to translate the information from
GoTools format to SGF since SGF is really designed for games, not
problem sets. Here you need knowledge of SGF in practice, as opposed to
the spec.
Three emails to follow, one on each of these topics, plus one
"background" email on the problems.
Pierce
P.S.
Thanks again to Thomas Wolf for allowing me to put this together!