[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: computer-go: FPGA
FPGA in combination with running at a cluster will not speed your
local tactical search up much. Cluster communication is pretty slow
compared to the speed of the local tactical search.
At 10:02 PM 8/28/00 EDT, you wrote:
>Recently I tried to write code for local tactical search. To improve the
>search result, the code becomes more and more complicated and takes more and
>more time. It now can only search about 20 end nodes per second on a 233 MHz
>machine and it still need more code to produce desired results. It seems to
>me a hardware improvement is needed to produce a good program.
>
>The options for a better hardware are following
>
>1. Accessible to a super computer. Today's super computers use the parallel
>structure. Unforturnately adapting the search algorithm to a parallel
>computer is not a straight forward matter.
>
>2. Use PC clusters. This is a very realistic possibility. But it has the
same
>problem as in 1 plus more. One problem probably was not notice that is the
>electricity bill. Ten PC could amount to 2 kW.
>
>3. Specilized hardware. Accessible for us is the FPGA. I did some reading on
>this subject. I'll make some comment here.
>
>A specialized hardware based on FPGA is quite possible. For us realistically
>it probably will take the form of a PCI (or ISA) card. Because of this the
>speed gain from the FPGA must compensate the speed loss in the communication
>between the computer and the PCI card. Further because the search program is
>so complicated, the FPGA can only be used to do part of the calculation.
>Using this approach I estimate that the search speed can probably be
>increased to 10,000 end nodes per second. using a 250 MHz FPGA. To achieve
>this speed, one must take advantage of parrallel processing in the FPGA. I
>did some simple layout in VHDL. It seems work out ok. The resource it
>requires is similar to those in high end FPGAs in the market. However, if
>this layout can be realized in a commercial FPGA is till unknown, even
though
>the number of gates required is enough. For example, to use parallel
>processing, one need to use a RAM with only 1 word in depth. For a 1k byte
>memory. This will be a 1X 8000 RAM. One way to create such a RAM in FPGA is
>to use latches. It would need 8000 laches. In Altera's PLDs one latch takes
>one logic units. Only devices with more than 1 million gates provide such
>many logical units. Even the number of logical units is enough, question is
>still open if the 8000 latches can be routed out in the device. There is
also
>the financial obstacles. One need to purchase the manufacturer specific
>software to do the layout. For Xilinx such software cost $3500 /year. Altera
>is more customer friendly in this aspect, but still need to purchase
software
>to do layout in million gates PLDs. Then there is the cost to implement the
>FPGA in a PCI card. Many company will do this for you but with a big charge
>(probably more than $10,000). The total cost to implement such a PCI card
>with FPGA could run as high as $20,000 to $30,000. Unless one can find some
>funding, it's difficult for an individual to do.
>
>
>Dan Liu
>
>