Posts by Jose

1) Message boards : RALPH@home bug list : Bug reports for Ralph 5.16 (Message 1688)
Posted 22 May 2006 by Jose
Post:
Jose: great, thanks for posting this! This is actually
a very rare error code that you are seeing. You can
see a list of top errors at http://www.romwnet.org/dasblogce/.
Have you seen this if you run other BOINC apps, e.g. Seti@home?

http://ralph.bakerlab.org/result.php?resultid=133961




That bug has been reported in some of the Rosetta Work Units that have failed.
Other BOINC applications I have run have not reported that error.
2) Message boards : RALPH@home bug list : Bug reports for Ralph 5.16 (Message 1683)
Posted 21 May 2006 by Jose
Post:
http://ralph.bakerlab.org/result.php?resultid=133961

3) Message boards : RALPH@home bug list : Bug reports for Ralph 5.05 and higher (Message 1400)
Posted 27 Apr 2006 by Jose
Post:
Okies I have been running the following RALPH Work Unit:
ID 6204
Name AB_CASP6_t198__438_5_0

It ran for around 57 minutes and was then preempted (the CPU time was kept in my work record), and the corresponding Rosetta Work Unit restarted. Once the Rosetta Work Unit stopped, the application switched to RALPH and Work Unit ID 6204 restarted. It started DE NOVO, that is, from a CPU time of 0 and not from the accumulated 57+ minutes it had when it was preempted and the application switch happened. My preferences are set so that work is kept in memory, and that did not happen in this case.

So to make the story short: the 57+ minutes of CPU time for the Work Unit that had been stored in memory disappeared into the big void in the sky. :)





4) Message boards : Current tests : Weird.. I kept the 60 minutes switch between Applications (Message 1389)
Posted 27 Apr 2006 by Jose
Post:
And more than 2 hours have passed since my Rosetta application started, and there has been no switch to the RALPH application, which is marked as Ready To Run and is in line to start when the Rosetta WU is preempted.

(I will wait to see what happens before reporting: the Rosetta WU has less than one hour to run, and when a new unit starts I will check whether it is a new Rosetta WU or the RALPH one that is supposed to run next.)


Later edit: After the Rosetta WU finished, another Rosetta WU started instead of a RALPH WU... The switch between applications has failed 3 times today. After the first RALPH operation was done, only Rosetta WUs have been running, even though RALPH WUs are in my pending work area in the BOINC Manager.




What do I do? Do I wait until all the Rosetta WUs finish and see if the RALPH units start running, or do I suspend a Rosetta WU to see what happens and risk a computing error in a "real life unit"?
5) Message boards : Current tests : Weird.. I kept the 60 minutes switch between Applications (Message 1388)
Posted 27 Apr 2006 by Jose
Post:
And more than 2 hours have passed since my Rosetta application started, and there has been no switch to the RALPH application, which is marked as Ready To Run and is in line to start when the Rosetta WU is preempted.

(I will wait to see what happens before reporting: the Rosetta WU has less than one hour to run, and when a new unit starts I will check whether it is a new Rosetta WU or the RALPH one that is supposed to run next.)


Later edit: After the Rosetta WU finished, another Rosetta WU started instead of a RALPH WU... The switch between applications has failed 3 times today. After the first RALPH operation was done, only Rosetta WUs have been running, even though RALPH WUs are in my pending work area in the BOINC Manager.
6) Message boards : Current tests : Weird.. I kept the 60 minutes switch between Applications (Message 1387)
Posted 26 Apr 2006 by Jose
Post:
And more than 2 hours have passed since my Rosetta application started, and there has been no switch to the RALPH application, which is marked as Ready To Run and is in line to start when the Rosetta WU is preempted.

(I will wait to see what happens before reporting: the Rosetta WU has less than one hour to run, and when a new unit starts I will check whether it is a new Rosetta WU or the RALPH one that is supposed to run next.)
7) Message boards : RALPH@home bug list : Bug reports for Ralph 5.05 and higher (Message 1386)
Posted 26 Apr 2006 by Jose
Post:
This unit was aborted after less than one hour of running (my time preference is 2 hours).

http://ralph.bakerlab.org/result.php?resultid=97305

AB_CASP6_t216__438_3_0

Workunit 86138

CPU time 3180.21875
stderr out <core_client_version>5.2.13</core_client_version>
<stderr_txt>
# random seed: 3882811
# cpu_run_time_pref: 7200
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is killing the run!
Stuck at score 71.0875 for 3600 seconds
**********************************************************************
GZIP SILENT FILE: .xxt216.out
WARNING! attempt to gzip file .xxt216.out failed: file does not exist.

</stderr_txt>
<message><file_xfer_error>
<file_name>AB_CASP6_t216__438_3_0_0</file_name>
<error_code>-161</error_code>
<error_message></error_message>
</file_xfer_error>

</message>
Validate state Invalid
Claimed credit 11.1502124855248
Granted credit 0
application version 5.05
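
For anyone who wants to compare the numbers in a report like this one, here is a minimal Python sketch that pulls the run-time preference and the watchdog's stall duration out of the stderr text and puts them next to the reported CPU time. The helper name `summarize_watchdog` and the dictionary keys are hypothetical, made up for illustration; the regular expressions only assume the line formats shown in the stderr above and may not match output from other application versions.

```python
import re

# Excerpt of the stderr text from the report above.
STDERR_TXT = """\
# random seed: 3882811
# cpu_run_time_pref: 7200
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is killing the run!
Stuck at score 71.0875 for 3600 seconds
**********************************************************************
"""


def summarize_watchdog(stderr_txt, cpu_time):
    """Hypothetical helper: collect the key figures from one result's stderr."""
    # Run-time preference in seconds, as echoed by the application.
    pref = re.search(r"cpu_run_time_pref:\s*(\d+)", stderr_txt)
    # Score and duration reported by the watchdog message.
    stuck = re.search(
        r"Stuck at score\s+(-?[\d.]+)\s+for\s+(\d+)\s+seconds", stderr_txt
    )
    return {
        "pref_seconds": int(pref.group(1)) if pref else None,
        "stuck_score": float(stuck.group(1)) if stuck else None,
        "stuck_seconds": int(stuck.group(2)) if stuck else None,
        "cpu_minutes": round(cpu_time / 60, 1),
        "killed_by_watchdog": "Watchdog is killing the run" in stderr_txt,
    }


# CPU time is the value listed in the result above.
summary = summarize_watchdog(STDERR_TXT, 3180.21875)
print(summary)
```

With the values from this result, the sketch shows the run ending at about 53 minutes of CPU time against a 7200-second preference, with the watchdog reporting a 3600-second stall.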






©2024 University of Washington
http://www.bakerlab.org