Message boards : RALPH@home bug list : Bug reports for Ralph 5.24
Author | Message |
---|---|
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
We've added a couple features: (1) We can now use prior predictions for which parts of the chain are buried or exposed to guide the Rosetta search. (2) We can efficiently assemble predefined domains of the protein chain into a whole structure. If this round of ralph goes well, we'll probably update the application on Rosetta@home later this week! |
doc :) Send message Joined: 16 Feb 06 Posts: 46 Credit: 4,437 RAC: 0 |
id love to run 5.24, but i am getting "no work from project". i thought there was supposed to be work available at any time now? :) server status page is all green, it only shows no WUs ready to send. |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
Still no work. |
bt1228 Send message Joined: 22 Mar 06 Posts: 7 Credit: 9,385 RAC: 0 |
I just received this error: 2006-Jun-19 7:34:43 pm|ralph@home|Requesting 366120 seconds of new work 2006-Jun-19 7:34:48 pm|ralph@home|Scheduler request succeeded 2006-Jun-19 7:34:48 pm|ralph@home|Message from server: No work sent 2006-Jun-19 7:34:48 pm|ralph@home|Message from server: (reached daily quota of 1 results) Is there actually a quota of one (1) result per day ? I do have one 5.24 WU in my current task list. --- bt |
doc :) Send message Joined: 16 Feb 06 Posts: 46 Credit: 4,437 RAC: 0 |
no clue how your quota got that low, i got mine by 18 right now, after a couple of errors. finally got some 5.24 work, all of them that got their turn so far errored out with incorrect function. all had names like t299_LOOPRELAX_hom00X_S_000XX_000XXXX_0_727_1_X and i was not the only one where those failed. the result page contains the following error: <stderr_txt> ERROR:: Unable to obtain sequence information. fasta file must be provided. ERROR:: Exit at: .initialize.cc line:236 </stderr_txt> |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
9 jobs with this error on Windows and 2 jobs on Linux process exited with code 1 (0x1) </message> <stderr_txt> Graphics are disabled due to configuration... ERROR:: Unable to obtain sequence information. fasta file must be provided. ERROR:: Exit at: initialize.cc line:236 I am somewhat lazy by now to provide a link for each WU -; So, a link to 1 on Linux https://ralph.bakerlab.org/result.php?resultid=187655 and a link to 1 on Windows https://ralph.bakerlab.org/result.php?resultid=187566 Thanks Click signature for global team stats |
Deborah Goldsmith Send message Joined: 16 Feb 06 Posts: 3 Credit: 253,789 RAC: 0 |
Mac OS X (Intel), failed immediately on running the application: 2006-06-19 16:59:46 [ralph@home] Starting task t299_LOOPRELAX_hom004_S_00018_0004609_0_728_2_0 using rosetta_beta version 524 2006-06-19 16:59:48 [ralph@home] Unrecoverable error for result t299_LOOPRELAX_hom004_S_00018_0004609_0_728_2_0 (process exited with code 1 (0x1)) 2006-06-19 16:59:48 [ralph@home] Unrecoverable error for result t299_LOOPRELAX_hom004_S_00018_0004609_0_728_2_0 (process exited with code 1 (0x1)) 2006-06-19 16:59:48 [---] Rescheduling CPU: application exited It did this 12 times in a row before exhausting the daily quota. 5.24 is working fine on my PPC machines. 5.23 worked fine on my Intel machine. This failure appears to be Intel-specific and 5.24-specific. |
suguruhirahara Send message Joined: 5 Mar 06 Posts: 40 Credit: 11,320 RAC: 0 |
https://ralph.bakerlab.org/result.php?resultid=187639 <stderr_txt> |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
6 failed in a row upon startup. Same Fasta file not found error. |
doc :) Send message Joined: 16 Feb 06 Posts: 46 Credit: 4,437 RAC: 0 |
just a little update, after those failures i had with my first couple of 5.24s i have some successes now (they are out of another WU batch as far as i can tell though, 731 instead of 727 in my case, so it was probably just a bad batch of WUs) |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Thanks for posting. I think it was just all the simultaneous downloading produced some strain on our fileserver and some users got incomplete jobs. Once things settled down, we don't seem to be seeing many errors! just a little update, after those failures i had with my first couple of 5.24s i have some successes now (they are out of another WU batch as far as i can tell though, 731 instead of 727 in my case, so it was probably just a bad batch of WUs) |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
We tried the "continuous work for ralph" for a week or so, but our turnaround time for testing new WUs was just too slow! It would take about 24 hours for computers to free up, accept new WUs, and give back initial results -- we want the tunaround time to be less than 12 hours. Hope that's a reasonable explanation! My advice is to crunch for both rosetta@home and ralph, if you haven't already... the good news is that we're setting up the debugging stuff over on rosetta@home, so we'll be hopefully be getting a steady stream of callstacks from whatever errors are going on over there too! id love to run 5.24, but i am getting "no work from project". |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
...we want the tunaround time to be less than 12 hours. This doesn't leave you much time to test running the WUs for 24hrs then. Nor much time to test running in environments where you aren't the only application. |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
Looking back in my messages, I see that I did complete a download of a fasta file prior to the 6 WUs that failed on startup. 6/19/2006 6:45:30 PM|ralph@home|Started download of file nterm_nohelix3_hom001_t299_.fasta.gz 6/19/2006 6:45:31 PM|ralph@home|Finished download of file nterm_nohelix3_hom001_t299_.fasta.gz I see no failed attempts to download any other fasta files when those WUs came down. It seems there may be more to the story then a busy server. |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Hmm, thanks for checking on this. I'll look at those WUs more carefully... Looking back in my messages, I see that I did complete a download of a fasta file prior to the 6 WUs that failed on startup. |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
OK, I checked those jobs. Looks like those work units all failed for the same known reason (a problem in the submitter's submission script), and then were fixed and sent out again. The second round went well. Phew. It wasn't the app! By the way, its pretty great that we had so many postings -- its very useful to get this continuous feedback . Hmm, thanks for checking on this. I'll look at those WUs more carefully... |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
We tried the "continuous work for ralph" for a week or so, but our turnaround time for testing new WUs was just too slow! It would take about 24 hours for computers to free up, accept new WUs, and give back initial results -- we want the tunaround time to be less than 12 hours. Hope that's a reasonable explanation! I make a new host, venue school , connect to network every second , running *only* ralph@home *This one should download only 1 job at a time, crunch and upload, report and then download the next -:) and on and on However my WUs are failing on them ! May be a host setup problem ? -or- just bad WUs ? Exit status -1073741684 (0xffffffffc000008c) https://ralph.bakerlab.org/result.php?resultid=190094 https://ralph.bakerlab.org/result.php?resultid=190240 https://ralph.bakerlab.org/result.php?resultid=190262 any help appreciated -> this host owns a virtual CPU and had crunched simap and einstein , with no problems -:) my host specs Average upload rate 0.53 KB/sec Average download rate 9.59 KB/sec Average turnaround time 0 days Also was able to meet TANPAKU deadline of 1 hour Thanks Click signature for global team stats |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
We tried the "continuous work for ralph" for a week or so, but our turnaround time for testing new WUs was just too slow! It would take about 24 hours for computers to free up, accept new WUs, and give back initial results -- we want the tunaround time to be less than 12 hours. Hope that's a reasonable explanation! Found the problem, and fixed it. was a corrupted NTFS virtual FileSystem Now the problem is another Outcome Client error Client state Downloading Exit status -186 (0xffffffffffffff46 I looked at the download progress, and before all files of the WU has finished downloading, the job started to run, and imediatelly got this error. https://ralph.bakerlab.org/result.php?resultid=190486 I am believing on a ralph server problem, or my ISP had some glitch with my Link Anyway the job (WU) should not start to run before all files downloaded Now, I'll wait there is work again ... Xlab (backup project) is downloading / crunching / uploading 100% OK ;) I select Xlab to "backup project" cause their WUs crunch in 9 minutes so, when there is work, immediattely the vpc switch to ralph. Thanks Click signature for global team stats |
[B^S] sTrey Send message Joined: 15 Feb 06 Posts: 58 Credit: 15,430 RAC: 0 |
Errored out on t299__CASP7_ABRELAX_SAVE_ALL_OUT_TEST_ASSEMBLE_hom001__753_34_0 with -161 <file_xfer_error> <file_name>t299__CASP7_ABRELAX_SAVE_ALL_OUT_TEST_ASSEMBLE_hom001__753_34_0_0 </file_name> <error_code>-161</error_code> |
anders n Send message Joined: 16 Feb 06 Posts: 166 Credit: 131,419 RAC: 0 |
Error -WARNING! attempt to gzip file .xxt299.out failed: file does not exist. on this wu https://ralph.bakerlab.org/result.php?resultid=191298 Anders n |
Message boards :
RALPH@home bug list :
Bug reports for Ralph 5.24
©2024 University of Washington
http://www.bakerlab.org