Posts by apohawk

1) Message boards : RALPH@home bug list : Minirosetta 2.14 (Message 5177)
Posted 7 Sep 2010 by Profile apohawk
Post:
SAXS batch strikes back. This time with new twist. They end right after the start.
I got several of those already. No luck for SAXS ;)
e.g. WU named SAXS-score-1kbwC__14888_2_0 http://ralph.bakerlab.org/result.php?resultid=1912150
ERROR: Option file open failed for: 1kbwC-linker_sampler_protocol.flags
2) Message boards : RALPH@home bug list : Minirosetta 2.14 (Message 5167)
Posted 6 Sep 2010 by Profile apohawk
Post:
No luck here either on those "SAXS-rescore".
For 13 tasks crunched so far on two machines (1 windows, 1 linux), all 13 got "computation error".
<message>
<file_xfer_error>
  <file_name>SAXS-rescore-1tf4B__14879_16_0_0</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>


For Your Information.

EDIT

I just saw one WU finish on win7, got to 100% and then this:
2010-09-06 22:37:57	ralph@home	Output file SAXS-rescore-1mgtA__14879_16_0_0 for task SAXS-rescore-1mgtA__14879_16_0 absent
3) Message boards : RALPH@home bug list : Minirosetta 2.14 (Message 5151)
Posted 28 May 2010 by Profile apohawk
Post:
parvalbumin_redo_targets_CONTROL_BOINC_abrelax.score12.fastrelax.v1_SAVE_ALL_OUT_14803_2_1
http://ralph.bakerlab.org/workunit.php?wuid=1631447

This unit crashed in similiar fashion at both hosts crunching it.

http://ralph.bakerlab.org/result.php?resultid=1848472


2010-05-28 07:30:03 ralph@home Output file parvalbumin_redo_targets_CONTROL_BOINC_abrelax.score12.fastrelax.v1_SAVE_ALL_OUT_14803_2_1_0 for task parvalbumin_redo_targets_CONTROL_BOINC_abrelax.score12.fastrelax.v1_SAVE_ALL_OUT_14803_2_1 absent
# cpu_run_time_pref: 28800
======================================================
DONE :: 50 starting structures 28554.5 cpu seconds
This process generated 50 decoys from 50 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>parvalbumin_redo_targets_CONTROL_BOINC_abrelax.score12.fastrelax.v1_SAVE_ALL_OUT_14803_2_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
4) Message boards : RALPH@home bug list : minirosetta 2.10 (Message 5117)
Posted 8 Apr 2010 by Profile apohawk
Post:
New issue. I have BM set to keep applications in memory when switching. Now during the BM update the boinc service is shut down. During that shutdown all those remaining in memory applications are being closed. 2 of minirosetta_2.10_windows_x86_64.exe did not close during boinc shutdown and became orphaned in the system. Unfortunately i can't tell which WU they were crunching.
5) Message boards : RALPH@home bug list : minirosetta 2.10 (Message 5116)
Posted 8 Apr 2010 by Profile apohawk
Post:
7gbnnotyr_3gbn work units tend to behave weird. They kinda get stuck 10 minutes to completion. Then it either jumps to 100% or stays at ~10 minutes to finish, but the progress bar is slowly advancing. I had pref time set to 4 hours, then i reduced it to 2 hours, but the magic 10 minutes is still the same.
Also i had one task (7gbnnotyr_3gbn) in that "10 minutes" phase, which i pressed "show graphics", which probably broke it. It was on model 873, step 187500 or 196500, something like that. 2 hours later it was on model 873, step 174500 os something like that, but continued slowly going up. I aborted this task.

This ha_notyr caught compute error. On www it says it was running for ~10 minutes, but in BOINC Manager it was shown as running for over 6 hours. It happened overnight, so i can't be sure what exactly happened there.
http://ralph.bakerlab.org/result.php?resultid=1782992

OS: WinXP 64
CPU: phenomII x4 945, NOT overclocked
RAM: 8GB
BOINC Manager was 6.10.37, now i updated to 6.10.43 to see if it helps anything ;)






©2024 University of Washington
http://www.bakerlab.org