Message boards : RALPH@home bug list : RoseTTAFold All-Atom
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 · Next
Author | Message |
---|---|
[VENETO] boboviz Send message Joined: 9 Apr 08 Posts: 913 Credit: 1,892,541 RAC: 294 |
It is like GPU grid Python apps for GPU hosts So, why i'm running some wus exclusively on cpu? |
[VENETO] boboviz Send message Joined: 9 Apr 08 Posts: 913 Credit: 1,892,541 RAC: 294 |
Yet you forgot the most important part of that- the time taken to do a given Task by the slowest of CPUs must be the same as the time taken to do that very same Task by the most powerful of GPUs. If not, all Scheduling is screwed & deadlines will be missed, Resource Share balancing will take forever, if it were to occur at all (oh, and i forgot about the random nature of the amount of Credit being awarded). It seems so clear and logical to me |
kotenok2000 Send message Joined: 26 Feb 21 Posts: 22 Credit: 1,893 RAC: 0 |
Rosettafold doesn't suspend whend told do. When i suspended it it continued running. |
Henk Haneveld Send message Joined: 13 Apr 21 Posts: 8 Credit: 88 RAC: 0 |
Why is there a stupid discussion over a combined CPU and GPU app, No such thing exists. If you look in the properties for Ralph in your client you will see it shows: Project has no apps for NVIDIA GPU There is also no GPU app listed in Applications on the site and the app just runs on the CPU and cannot run on GPU. |
mikey Send message Joined: 28 Nov 20 Posts: 9 Credit: 114,771 RAC: 17 |
Look at the Einstein GPU tasks, they use BOTH the cpu and gpu. Just like Ralph, precisely no difference.That shows just how confused you are. The GPU processes the Task, the CPU supports the GPU by keeping it fed. The CPU doesn't actually do any processing, the GPU does that. Depending on the Task, with a very well written application, CPU support can be next to nothing. Actually Peter is right about the Einstein gpu tasks, the newer tasks pause gpu crunching for a bit at 2 different times and process stuff on the cpu then go back to running more of the task on the gpu again. They said the reason is the gpu isn't as accurate as the cpu is and they need the more accurate cpu numbers. |
mikey Send message Joined: 28 Nov 20 Posts: 9 Credit: 114,771 RAC: 17 |
Rosettafold doesn't suspend whend told do. give it time, mine does the same thing but does suspend eventually |
mikey Send message Joined: 28 Nov 20 Posts: 9 Credit: 114,771 RAC: 17 |
Already done that, but just using max_concurrent. Even so, that doesn't limit the number of threads per Task, just the number of Tasks.There needs to be a way to limit the number of threads a single Task can use. Thank you, I will un-suspend the Project then. |
kotenok2000 Send message Joined: 26 Feb 21 Posts: 22 Credit: 1,893 RAC: 0 |
And i was trying to run it on 4 gb gpu. |
mikey Send message Joined: 28 Nov 20 Posts: 9 Credit: 114,771 RAC: 17 |
Mikey said: The other thing I'm seeing is that the Ralph tasks are taking about 10gb of ram for EACH task so I had to limit my running tasks accordingly. I'm running 1 cpu core per task and they are taking over 2 days to finish. I'm still getting some errors but have not ruled out it being a pc problem as yet. Grant SSF said: The most i've seen for a CPU processed Task in use is a bit over 1.5GB. mikey said: Application Generalized biomolecular modeling and design with RoseTTAFold All-Atom 0.02 Name RF_SAVE_ALL_OUT_NOJRAN_IGNORE_THE_REST_validation_env_g_pred_171_16903_6 State Running Received 6/14/2024 3:05:08 AM Report deadline 6/15/2024 3:05:09 AM Estimated computation size 80,000 GFLOPs CPU time 00:03:30 CPU time since checkpoint 00:03:30 Elapsed time 1d 07:17:48 Estimated time remaining --- Fraction done 100.000% Virtual memory size 12.77 GB Working set size 4.75 GB Directory slots/16 Process ID 8508 Progress rate 3.240% per hour Executable w_0.02_windows_x86_64.exe |
[VENETO] boboviz Send message Joined: 9 Apr 08 Posts: 913 Credit: 1,892,541 RAC: 294 |
I completed my first "cpu-ony" wu: 5465161 |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
Why is there a stupid discussion over a combined CPU and GPU app, No such thing exists.That is why my earlier long & pointless discussion took place. There is now an application for both CPU & GPU- we are testing it here, now, but BOINC is completely unaware of that. It asks for CPU work, it gets a Task. If it can't run on the GPU, it runs on the CPU. If it can run on the GPU, then it does. Hence my attempt to point out the ramifications of this behaviour earlier. And unfortunately, unless there is an error with the Task, a completed Tasks doesn't give any indication in the Stderr output of what it was processed on (let alone what work or how much was done). Grant Darwin NT |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
I had Tasks suspended for over an hour, they were still running.Rosettafold doesn't suspend whend told do. Yes, the BOINC Manager shows them as suspended, but in Task Manager they are still running and using CPU time, along with the other TTAFold Tasks that show as running. Grant Darwin NT |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
You don't have an Nvidia GPU with the right driver.It is like GPU grid Python apps for GPU hostsSo, why i'm running some wus exclusively on cpu? If you did, they would run on the GPU. Grant Darwin NT |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
mikey said:OK, and...? (You do realise the swap file & Virtual Memory are the same? (Virtual Memory makes use of the swap file) It is disk space and not physical RAM that is in use?) Grant Darwin NT |
rilian Send message Joined: 7 Sep 07 Posts: 35 Credit: 107,666 RAC: 725 |
i ve got <core_client_version>7.24.1</core_client_version> <![CDATA[ <message> The access code is invalid. (0xc) - exit code 12 (0xc)</message> <stderr_txt> 'C:Program' is not recognized as an internal or external command, operable program or batch file. </stderr_txt> ]]> but few WUs validated fine -- I crunch for Ukraine |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
i ve gotHaven't seen that error message before. but few WUs validated fineOn a different system. When we get some more work, if you still get the same error on your WIn7 system, i'd suggest resetting the project. If after re-downloading all the files, it could be that the application isn't supported by WIn7 Edit- actually a quick search on "Python for Windows" shows that none of the versions released in the last 12-18 months can be used on Win7 or earlier. Edit- the version of Python being used here is 3.9.19 from 19/3/2024, and cannot be used on Win7 or earlier. I suggest you set no new Tasks for Ralph. So two more things on the Developer's to do list- block any attempt to get work from Win7 or older Operating Systems. - advise us of the minimum video driver version required for GPU processing, and stop it from attempting to run on systems with unsupported drivers. Grant Darwin NT |
kotenok2000 Send message Joined: 26 Feb 21 Posts: 22 Credit: 1,893 RAC: 0 |
Looks like they stopped workunit generation |
Grant (SSSF) Send message Joined: 13 Jun 24 Posts: 126 Credit: 193,939 RAC: 2,635 |
Looks like they stopped workunit generationYes, almost 2 days ago. Hopefully the next batch will have at least a few of the issues of the last batch sorted out. Grant Darwin NT |
[VENETO] boboviz Send message Joined: 9 Apr 08 Posts: 913 Credit: 1,892,541 RAC: 294 |
Hopefully the next batch will have at least a few of the issues of the last batch sorted out. Waiting for 0.03 version...and for clarifications about app (OS, gpu, etc) P.S. If you see my profile, i'm here since...well, i don't remember and the Ralph admins rarely are clear about project/app |
Bill F Send message Joined: 1 Jan 18 Posts: 21 Credit: 34,272 RAC: 52 |
In my Windows environment my CPU only system completed and was credited with 4 tasks. My CPU / GPU system with a lot more resources failed ever task and tried to run too many at once and tied itself up so bad that I could not get control of the system short of a Hard ungraceful power down. I have set this box to no new Ralph tasks until the app is better behaved. Bill F In October 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic; There was no expiration date. |
Message boards :
RALPH@home bug list :
RoseTTAFold All-Atom
©2024 University of Washington
http://www.bakerlab.org