Message boards : RALPH@home bug list : Bug reports for Ralph 5.14
Author | Message |
---|---|
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Please post bugs in 5.14. A few comments: (1) The "phantom chain" or "broken chain" that appear in some workunits are OK -- they're new science modes we're testing that either focus on specific parts of the protein or rearrange the protein topology to better sample long-range contacts. (2) The debugger messages (which caused slowdowns for some users with 5.10-5.12, and were removed in 5.13) have been put back into ralph. But they're not on by default. We'll ask Rom to post here and fill you in on how to turn them on. (3) We're testing a new science mode which uses the sequence and structural information from homologous proteins in an early phase of the simulation, but then returns to the target protein sequence in the final refinement phase. (4) We're also continuing our efforts to reduce memory usage by rosetta/ralph! |
dainenyu Send message Joined: 19 Feb 06 Posts: 6 Credit: 7,772 RAC: 0 |
Downloaded 6 WUs, all failed immediately, giving message 5/12/2006 5:41:23 PM|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom023_1fna__509_8_0 (Incorrect function. (0x1) - exit code 1 (0x1)) WU numbers are 97527, 97483, 97460, 97440, 97430, 97382. Edit: stderr out reads <core_client_version>5.4.9</core_client_version> |
TCU Computer Science Send message Joined: 16 Feb 06 Posts: 5 Credit: 241,166 RAC: 0 |
I upgraded to the latest stable release of the BOINC client (5.4.9 for Mac OS X) and now I'm getting immediate failures: Fri May 12 16:47:16 2006|ralph@home|Starting task MAPRELAX_TEST_hom018_1fna__509_7_0 using rosetta_beta version 514 Fri May 12 16:47:18 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom018_1fna__509_7_0 (process exited with code 1 (0x1)) Fri May 12 16:48:28 2006|ralph@home|Starting task MAPRELAX_TEST_hom001_1fna__509_11_0 using rosetta_beta version 514 Fri May 12 16:48:30 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom001_1fna__509_11_0 (process exited with code 1 (0x1)) Fri May 12 16:53:20 2006|ralph@home|Starting task MAPRELAX_TEST_hom003_1fna__509_13_0 using rosetta_beta version 514 Fri May 12 16:53:22 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom003_1fna__509_13_0 (process exited with code 1 (0x1)) Fri May 12 16:57:32 2006|ralph@home|Starting task MAPRELAX_TEST_hom022_1fna__509_10_0 using rosetta_beta version 514 Fri May 12 16:57:34 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom022_1fna__509_10_0 (process exited with code 1 (0x1)) Fri May 12 17:01:47 2006|ralph@home|Starting task MAPRELAX_TEST_hom013_1fna__509_11_0 using rosetta_beta version 514 Fri May 12 17:01:50 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom013_1fna__509_11_0 (process exited with code 1 (0x1)) Fri May 12 17:05:50 2006|ralph@home|Starting task MAPRELAX_TEST_hom028_1fna__509_12_0 using rosetta_beta version 514 Fri May 12 17:05:53 2006|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom028_1fna__509_12_0 (process exited with code 1 (0x1)) |
Fuzzy Hollynoodles Send message Joined: 19 Feb 06 Posts: 37 Credit: 2,089 RAC: 0 |
I wanted my Seti Beta 5.14 WU finished, so I suspended the 5.14 Ralph WU, which apparently killed it (or what?) https://ralph.bakerlab.org/workunit.php?wuid=97405 Result: https://ralph.bakerlab.org/result.php?resultid=111703 From my log: 5/13/2006 12:57:29 AM|SETI@home|Restarting task 01mr99aa.26277.31696.309670.3.101_1 using setiathome_enhanced version 512 5/13/2006 12:57:29 AM|SETI@home Beta Test|Pausing task 05au01ab.24507.112.847158.3.93_6 (removed from memory) 5/13/2006 12:58:58 AM||Rescheduling CPU: result suspended, resumed or aborted by user 5/13/2006 12:58:58 AM|SETI@home|Pausing task 01mr99aa.26277.31696.309670.3.101_1 (removed from memory) 5/13/2006 12:58:58 AM|ralph@home|Starting task MAPRELAX_TEST_hom003_1fna__509_6_0 using rosetta_beta version 514 5/13/2006 12:59:01 AM||Rescheduling CPU: result suspended, resumed or aborted by user 5/13/2006 12:59:02 AM|rosetta@home|Restarting task TEST_HOMOLOG_ABRELAX_hom003_1fna__503_12404_0 using rosetta version 513 5/13/2006 12:59:02 AM|ralph@home|Pausing task MAPRELAX_TEST_hom003_1fna__509_6_0 (removed from memory) 5/13/2006 12:59:03 AM|ralph@home|Unrecoverable error for result MAPRELAX_TEST_hom003_1fna__509_6_0 (Forkert funktion. (0x1) - exit code 1 (0x1)) 5/13/2006 12:59:03 AM|ralph@home|Deferring scheduler requests for 1 minutes and 0 seconds 5/13/2006 12:59:03 AM||Rescheduling CPU: application exited 5/13/2006 12:59:03 AM|ralph@home|Computation for task MAPRELAX_TEST_hom003_1fna__509_6_0 finished 5/13/2006 12:59:06 AM||Rescheduling CPU: result suspended, resumed or aborted by user 5/13/2006 12:59:06 AM|LHC@home|Restarting task wfeb1A_v6s4vvnom_mqx__19__64.258_59.268__4_6__6__75_1_sixvf_boinc202329_1 using sixtrack version 467 5/13/2006 12:59:06 AM|rosetta@home|Pausing task TEST_HOMOLOG_ABRELAX_hom003_1fna__503_12404_0 (removed from memory) 5/13/2006 12:59:09 AM||Rescheduling CPU: result suspended, resumed or aborted by user 5/13/2006 12:59:10 AM|LHC@home|Pausing task wfeb1A_v6s4vvnom_mqx__19__64.258_59.268__4_6__6__75_1_sixvf_boinc202329_1 (removed from memory) 5/13/2006 12:59:10 AM|LHC@home|Starting task wfeb1A_v6s4vvnom_mqx__3__64.265_59.275__10_12__6__55_1_sixvf_boinc181092_5 using sixtrack version 467 5/13/2006 12:59:14 AM||Rescheduling CPU: result suspended, resumed or aborted by user 5/13/2006 12:59:15 AM|SETI@home Beta Test|Restarting task 05au01ab.24507.112.847158.3.93_6 using setiathome_enhanced version 514 5/13/2006 12:59:15 AM|LHC@home|Pausing task wfeb1A_v6s4vvnom_mqx__3__64.265_59.275__10_12__6__55_1_sixvf_boinc181092_5 (removed from memory) 5/13/2006 12:59:15 AM|SETI@home Beta Test|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/beta_cgi/cgi 5/13/2006 12:59:15 AM|SETI@home Beta Test|Reason: To fetch work 5/13/2006 12:59:15 AM|SETI@home Beta Test|Requesting 3994 seconds of new work 5/13/2006 12:59:20 AM|SETI@home Beta Test|Scheduler request succeeded 5/13/2006 12:59:22 AM|SETI@home Beta Test|Started download of file 01jn01aa.7728.12640.709662.3.121 5/13/2006 12:59:35 AM|SETI@home Beta Test|Finished download of file 01jn01aa.7728.12640.709662.3.121 5/13/2006 12:59:35 AM|SETI@home Beta Test|Throughput 29497 bytes/sec 5/13/2006 12:59:36 AM||Rescheduling CPU: files downloaded 5/13/2006 12:59:56 AM|ralph@home|Sending scheduler request to https://ralph.bakerlab.org/ralph_cgi/cgi 5/13/2006 12:59:56 AM|ralph@home|Reason: Requested by user 5/13/2006 12:59:56 AM|ralph@home|Reporting 1 tasks 5/13/2006 1:00:01 AM|ralph@home|Scheduler request succeeded [color=navy][b]"I'm trying to maintain a shred of dignity in this world." - Me[/b][/color] |
wizzszz Send message Joined: 28 Apr 06 Posts: 17 Credit: 1,128 RAC: 0 |
Had the same error using BOINC V5.4.9, WU aborted immediately, reporting this error: Unrecoverable error for result MAPRELAX_TEST_hom007_1fna__510_3_0 (Unzulässige Funktion. (0x1) - exit code 1 (0x1)) |
Moderator9 Volunteer moderator Send message Joined: 16 Feb 06 Posts: 251 Credit: 0 RAC: 0 |
Had the same error using BOINC V5.4.9, WU aborted immediately, reporting this error: ALL of these groups of errors look like a bad batch of Work Units. I have over 20 on each of my machines as well. I will bring this to Rhiju's attantion. EDIT: Rhiju is "commuting" at the moment but I am advised that as I expected this is a bad batch of Work Units. Rhiju says to let you know- "A new batch has been queued up. These should pass through very quickly. Sorry for the inconvience." Moderator9 RALPH@home FAQs RALPH@home Guidelines Moderator Contact |
dainenyu Send message Joined: 19 Feb 06 Posts: 6 Credit: 7,772 RAC: 0 |
EDIT: Rhiju is "commuting" at the moment but I am advised that as I expected this is a bad batch of Work Units. Rhiju says to let you know- I've got a couple of the new WUs (HOMOLOG_ABRELAX_hom*) and they seem to be running fine, almost an hour in. |
wizzszz Send message Joined: 28 Apr 06 Posts: 17 Credit: 1,128 RAC: 0 |
Fetched a new WU, this time it started w/o error. RMSD is missing, I assume that it should be like that, because the native graphic is missing, too... This causes the RMSD/Lowest Energy graphic to vanish, only a single red spot at the left edge is displayed. And the description text is a bit too long. (display end at "has very close seque") So nothing serious, everything else works fine, even the graphics! Accepted Energy is now below -216 for the second model. Seems like the stranding algorithm improvements work fine. Virtual memory load is about 132 MB, no clue what it was before... |
Moderator9 Volunteer moderator Send message Joined: 16 Feb 06 Posts: 251 Credit: 0 RAC: 0 |
Fetched a new WU, this time it started w/o error. All of the CASP7 target Work Units will have this display type. All that you describe is normal (except the long text overrun). Since they do not know the structure, they do not have the RMSD value, the Natural structure, or any other comparative information so it cannot be displayed. Because the RMSD is unknown, this forces the value to zero and the red dots all display at what would be the zero point of the RMSD graph (to the left of the box). As close as they can get to the graphic we all are familiar with is to show the accepted and lowest energy shapes as they occur. Rhiju has said they will work on the text overrun. Moderator9 RALPH@home FAQs RALPH@home Guidelines Moderator Contact |
suguruhirahara Send message Joined: 5 Mar 06 Posts: 40 Credit: 11,320 RAC: 0 |
OS : WindowsXP Professional x64 Edition CPU : Intel PentiumD 920 (2.80GHz) Used RAM : approx. 115MB x2 at max. / 1GB Graphic card: nVidia GeForce6600GT 128MB BOINC version : the newest, 5.4.9 Work tasks - OK before closed Graphic - OK They worked fine without error at first. However, once BOINC client has been closed and restarted, the taskes which were being done more than half started from the beginning. Is it an error, or due to my preference of RALPH? |
rbpeake Send message Joined: 16 Feb 06 Posts: 19 Credit: 3,370 RAC: 0 |
In case anyone missed this on the Rosetta board, here is an interesting thought on why the debugger code might have been causing the many page faults. Rosetta Post |
wizzszz Send message Joined: 28 Apr 06 Posts: 17 Credit: 1,128 RAC: 0 |
In case anyone missed this on the Rosetta board, here is an interesting thought on why the debugger code might have been causing the many page faults. So I think it would be useful, if all the guys with the 'hanging/slow' machines post here what cpu type they got (HT/dual core/single core)! If the error occures only there, it would help the developers a lot! |
Moderator9 Volunteer moderator Send message Joined: 16 Feb 06 Posts: 251 Credit: 0 RAC: 0 |
OS : WindowsXP Professional x64 Edition [color=darkred]If the work units start, and then you stop BOINC before about 25-40 min of processing, or in any case before the percent complete is more than 1.4%, when you restart BOINC they will start from zero. [color] Moderator9 RALPH@home FAQs RALPH@home Guidelines Moderator Contact |
suguruhirahara Send message Joined: 5 Mar 06 Posts: 40 Credit: 11,320 RAC: 0 |
...They worked fine without error at first. However, once BOINC client has been closed and restarted, the taskes which were being done more than half started from the beginning. Is it an error, or due to my preference of RALPH? Is it an unavoidable thing or an error with just this version? |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Hi: I wanted to quickly apologize for the batch of bad WU's yesterday on ralph. Thanks for your patience! Its actually a new scientific mode in Rosetta, and I think I know why the WUs were failing on ralph. Will be testing the fix later today. Had the same error using BOINC V5.4.9, WU aborted immediately, reporting this error: |
Message boards :
RALPH@home bug list :
Bug reports for Ralph 5.14
©2024 University of Washington
http://www.bakerlab.org