Ralph & Rosetta optimized for GPU CUDA?

Zarck

Joined: 17 Mar 06
Posts: 6
Credit: 5,188
RAC: 0
Message 4401 - Posted: 11 Dec 2008, 22:44:33 UTC
Last modified: 11 Dec 2008, 22:50:47 UTC

http://setiweb.ssl.berkeley.edu/beta/cuda.php

So it's possible for SETI. Is it possible for Rosetta/Ralph too?

See you
*_*

Profile robertmiles
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4700 - Posted: 14 Feb 2009, 20:36:25 UTC - in response to Message 4401.  

What I've seen indicates that it's possible, but rather costly in programmer time, due to the large number of changes needed to make the programs fit on a GPU. It's not likely to happen on BOINC projects that update their software as often as Ralph/Rosetta do, only on projects that have a large amount of data to crunch but little need to keep updating the software used to do it.

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5136 - Posted: 12 May 2010, 5:35:48 UTC - in response to Message 4700.  

I hope Ralph produces an OpenCL fork of the code.
http://www.opencldev.com/

Profile robertmiles
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 5269 - Posted: 2 May 2011, 18:38:03 UTC - in response to Message 5136.  

I've seen a thread saying that their current algorithm uses so much memory for each processor core that they're unlikely to get a significant speedup from a GPU version, and they have therefore decided to stop trying for now. It would need an entirely new version of their application, based on a new algorithm, and likely another programming language to write it in as well.
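
As a rough illustration of that memory argument, here is a back-of-the-envelope sketch. Every number below is an assumption chosen for the example, not a figure from the project:

# Illustrative estimate of why a memory-hungry algorithm gains little
# from a GPU. All numbers are assumptions, not Rosetta measurements.

per_task_memory_mb = 500        # assumed working set of one Rosetta-style task
gpu_memory_mb = 1024            # assumed total memory on a GPU of that era
threads_for_occupancy = 10_000  # rough thread count a GPU needs to stay busy

# How many independent tasks of that size even fit on the card?
print("Tasks that fit in GPU memory:", gpu_memory_mb // per_task_memory_mb)

# Memory needed to keep the GPU fully occupied, one task per thread.
needed_gb = per_task_memory_mb * threads_for_occupancy / 1024
print(f"Memory needed for full occupancy: {needed_gb:,.0f} GB")

With numbers like these, the card can hold only a couple of tasks at once, while keeping it fully occupied would need thousands of gigabytes; that mismatch is the problem described above.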

Jim
Joined: 13 Feb 08
Posts: 4
Credit: 339,036
RAC: 0
Message 5369 - Posted: 3 Aug 2011, 3:59:26 UTC
Last modified: 3 Aug 2011, 4:08:42 UTC

Since Ralph and Rosetta are research-type programs, the software needs to be changed and tweaked all the time. As I understand it, they just don't have the time and staff to code the needed GPU software on any kind of timely basis.
I don't work here or represent the project; that's just my understanding of the problem.

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5370 - Posted: 3 Aug 2011, 7:09:28 UTC - in response to Message 5369.  

Since Ralph and Rosetta are research-type programs, the software needs to be changed and tweaked all the time. As I understand it, they just don't have the time and staff to code the needed GPU software on any kind of timely basis.


I agree. But Rosetta/Ralph researchers also have to consider:
1. The enormous power of GPU cards (a “simple” 6950 has 2.2 TFLOPS in single precision!), which increases every 6-8 months.
2. The great improvements on the software side (CUDA and OpenCL).
The problem, I think, is that not every kind of project can be converted for GPUs (Rosetta's memory problem is well known). For example, the POEM@HOME project has had an OpenCL CPU client since March, but they are having trouble converting it for the GPU...


uBronan
Joined: 26 May 07
Posts: 2
Credit: 9,863
RAC: 0
Message 5377 - Posted: 16 Aug 2011, 22:45:50 UTC - in response to Message 5370.  
Last modified: 16 Aug 2011, 23:03:20 UTC

Well, just my two cents on this matter:
They should try one of the GPU gurus out there who made it possible for a few projects that are now famous (MilkyWay/Collatz).
It's simple: you have to trust the knowledge available from, for instance, crunch3r, who as far as I know helps every project that is struggling with conversion to GPU processing.
Or maybe Raistmer, another guru, who is already busy transforming SETI into a multi-GPU environment, first on NVIDIA but for a while now also on ATI.
If these guys can't see a fresh approach to the actual math and can't find a performance gain from GPU processing, then it's simply not possible.
But more and more projects are finally starting to learn from the knowledge available in their user base.
Now, I know it's sometimes hard to think differently, but it's a learning process, and sometimes it takes some fresh ideas on how to approach a certain problem.
Of course I don't expect a project to have a working solution in a short time, but as we can see at Einstein, they are experimenting with it as well.
That project also has very complex calculations, but slowly we are seeing some minor progress in the development of their beta software too.
In that project some parts of the jobs can be assisted by the GPU as an old-school co-processor, which in time will speed up some parts of the work.
OpenCL is in itself not yet mature enough; the reason is simple: NVIDIA, who is also part of this open standard, should help work on it, but doesn't really want it to succeed.
They would rather see their CUDA adopted, which is why it's more mature; to be honest, I believe they do more harm than good in OpenCL.
ATI Stream processing is not a good solution if you need complex work done (though if it can do it, it will do it lightning fast); the commands available to do the work are very limited.

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5378 - Posted: 17 Aug 2011, 12:52:39 UTC - in response to Message 5377.  


In that project some parts of the jobs can be assisted by the GPU as an old-school co-processor, which in time will speed up some parts of the work.


Another way might be not to convert all of the Rosetta CPU code to the GPU, but to create a "sub-project" with specific functions (and a small part of the code) and test it extensively... Ralph is here for that. Afterwards they could add, little by little, functions and code from Rosetta to this GPU project.
But we don't know the code, so these are only ideas...

uBronan
Joined: 26 May 07
Posts: 2
Credit: 9,863
RAC: 0
Message 5379 - Posted: 19 Aug 2011, 5:43:42 UTC - in response to Message 5378.  

That is exactly how Einstein got to a working GPU portion of the work.

In the first betas it seemed as if the GPU did not enhance any of the work, and it sometimes even did worse than the normal CPU version.
But later they took another approach and, with some help, got to a now perfectly working GPU unit system which partially (much of it) uses the GPU.

Some work still gets done by the CPU, simply because of the complexity of this work.
That approach is not uncommon, and in time the GPU languages will evolve as well, which will give a better way of handling the instructions.
I myself had hoped that OpenCL would make more progress, so that everyone could benefit from an open standard and many scientists would get access to this huge power more easily.

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5385 - Posted: 26 Aug 2011, 15:08:15 UTC - in response to Message 5378.  

Another way might be not to convert all of the Rosetta CPU code to the GPU, but to create a "sub-project" with specific functions (and a small part of the code) and test it extensively... Ralph is here for that. Afterwards they could add, little by little, functions and code from Rosetta to this GPU project.


Despite the RAM problem on GPUs, it may be interesting to know what the Ralph team thinks about the ideas in this forum...

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5386 - Posted: 7 Sep 2011, 19:21:34 UTC - in response to Message 5385.  


Despite the RAM problem on GPUs, it may be interesting to know what the Ralph team thinks about the ideas in this forum...


Anybody out there?

Rocco Moretti
Volunteer moderator
Project developer
Project scientist
Joined: 18 May 10
Posts: 11
Credit: 30,188
RAC: 0
Message 5388 - Posted: 9 Sep 2011, 18:10:36 UTC - in response to Message 5385.  

I don't think anyone would argue that speeding up Ralph/Rosetta@home would be anything but a positive.

That said, from what I understand, GPU programming is *hard*, especially if you don't have a naturally parallel application. SETI@Home has it a bit easy, as Fourier transforms and signal processing map onto GPUs very well.

In contrast, because of the way Rosetta works internally, it's non-trivial to convert to a GPU programming style. From what I understand, while bits and pieces might be easily moved to a GPU, you'd be killed by the data transfer to and from the GPU when switching between GPU-using sections and CPU-using sections. My sketchy understanding is that on several occasions groups with experience doing GPU programming have taken a look at the Rosetta code, with little to nothing ultimately coming of it.
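
To make the transfer cost concrete, here is a minimal PyOpenCL sketch (a hypothetical illustration, not Rosetta code) of the round trip described above: copy data to the GPU, run a small kernel, copy the result back. When the kernel itself is cheap, the two copies dominate the total time:

import time
import numpy as np
import pyopencl as cl

ctx = cl.create_some_context()
queue = cl.CommandQueue(ctx)
mf = cl.mem_flags

x = np.random.rand(1_000_000).astype(np.float32)
out = np.empty_like(x)

# A trivial kernel standing in for a small GPU-friendly section of code.
prg = cl.Program(ctx, """
__kernel void scale(__global const float *x, __global float *out) {
    int i = get_global_id(0);
    out[i] = 2.0f * x[i];
}
""").build()

t0 = time.perf_counter()
x_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=x)  # host -> device
out_buf = cl.Buffer(ctx, mf.WRITE_ONLY, out.nbytes)
prg.scale(queue, x.shape, None, x_buf, out_buf)                     # the compute step
cl.enqueue_copy(queue, out, out_buf)                                # device -> host
queue.finish()
print(f"Full round trip: {time.perf_counter() - t0:.4f} s")

Switching between CPU and GPU sections means paying that copy-in/copy-out price at every boundary, which is the overhead being described.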

That said, one of the researchers in the lab was/is working on a "skunkworks"-type project regarding incorporating the use of GPUs into Rosetta - though I hesitate to mention this because it's very uncertain if anything will actually come of it, or if it does, when. Even if there are parts of Rosetta that are GPU-ized, there's no guarantee that it would be those parts which would make a significant difference in the typical runs performed on Rosetta@Home.

Short answer: No one would refuse a GPU version of Rosetta, but the considered opinion is that making it would likely be a lot of effort for not all that much actual speedup.

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5389 - Posted: 10 Sep 2011, 7:44:11 UTC - in response to Message 5388.  

Thanks for the answer!!

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5651 - Posted: 9 Mar 2013, 8:43:34 UTC - in response to Message 5388.  

That said, one of the researchers in the lab was/is working on a "skunkworks"-type project regarding incorporating the use of GPUs into Rosetta - though I hesitate to mention this because it's very uncertain if anything will actually come of it, or if it does, when.


That answer was 18 months ago.
Any news?

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5661 - Posted: 6 May 2013, 15:57:56 UTC - in response to Message 5388.  

From what I understand, while bits and pieces might be easily moved to a GPU, you'd be killed by the data transfer to and from the GPU when switching between GPU-using sections and CPU-using sections.


As I wrote on the Rosetta forum, this may be the answer:
hUMA (heterogeneous Uniform Memory Access, AMD's design that lets the CPU and GPU share the same memory, which would avoid exactly that transfer overhead).

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5667 - Posted: 24 Jun 2013, 8:24:59 UTC - in response to Message 5661.  

I saw a post on the Rosetta forum about GPUs.
An admin is testing part of the Rosetta code with Python and OpenCL...

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 5937 - Posted: 10 Dec 2015, 10:15:19 UTC - in response to Message 5667.  

I saw a post on the Rosetta forum about GPUs.
An admin is testing part of the Rosetta code with Python and OpenCL...


Documentation about PyOpenCL
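
A first experiment with PyOpenCL is usually just enumerating the platforms and devices it can see; a minimal sketch:

# List the OpenCL platforms and devices PyOpenCL can reach, typically
# the first step in an experiment like the one described above.
import pyopencl as cl

for platform in cl.get_platforms():
    print(platform.name, platform.version)
    for device in platform.get_devices():
        print(f"  {device.name}: {device.global_mem_size // 2**20} MiB, "
              f"{device.max_compute_units} compute units")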

Profile [VENETO] boboviz
Joined: 9 Apr 08
Posts: 840
Credit: 1,888,960
RAC: 0
Message 6067 - Posted: 19 Mar 2016, 14:16:24 UTC - in response to Message 5370.  

I agree. But Rosetta/Ralph researchers also have to consider:
1. The enormous power of GPU cards (a “simple” 6950 has 2.2 TFLOPS in single precision!), which increases every 6-8 months.


That was 5 years ago.
Now the AMD Radeon Pro Duo has 16 TFLOPS in SP and 1 TFLOPS in DP!! :-O