Posts by AdeB

41) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.39 (Message 4307)
Posted 26 Oct 2008 by AdeB
Post:
1151331

stderr out:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
# cpu_run_time_pref: 14400
recovering checkpoint of tag S_1LVHA_6_00000001 with id abrelax_rg_state
SIGSEGV: segmentation violation
Stack trace (12 frames):
[0x8b104a7]
[0x8b3aba0]
[0xffffe420]
[0x81723f8]
[0x81822ae]
[0x8181580]
[0x8180f11]
[0x8131626]
[0x8133829]
[0x804b891]
[0x8b9669c]
[0x8048111]

Exiting...

</stderr_txt>
]]>
42) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4290)
Posted 23 Oct 2008 by AdeB
Post:
Two workunits running too long, both with error after calling boinc_finish.

1141183:
Rosetta is going too long. Watchdog is ending the run!
CPU time: 50277.8 seconds. Greater than 3X preferred time: 14400 seconds
**********************************************************************
called boinc_finish
SIGSEGV: segmentation violation

1142108:
Rosetta is going too long. Watchdog is ending the run!
CPU time: 50018.7 seconds. Greater than 3X preferred time: 14400 seconds
**********************************************************************
called boinc_finish
SIGILL: illegal instruction
43) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4284)
Posted 16 Oct 2008 by AdeB
Post:
At this moment i have a ralph and a rosetta task running at the same time on a single-core computer, both using 50 %CPU.
The ralph task has the status 'Waiting to run' in boinc manager, and it is a hombench_tex_looprelax_tex_cst_oneparam_looprelax_tex_...-task, so that is the one i'm going to abort.
Something is seriously wrong with these tasks.


I too have had this same experience with a four core machine running 7 boinc work units. 4 Docking and 3 Ralph, the Ralph work units had "waiting to run" next to them but were still running. I had to stop Boinc and restart it to get just 4 WU's running.

This has happened on more than one machine and more than once. All with the 'hombench_tex' name.


Some time later i saw the same thing happening on my other PC with a rosetta work unit. This w.u. was the only one running, when i checked 20 minutes later it had status 'Waiting to run' but was still running side by site with a Seti work unit. I've let them run and both work units had a valid result.
So apparently these tasks sometimes don't react to the boinc-command to suspend. Which is a bad thing when you look at it from any other boinc-project, but it does not screw up the science.
44) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4274)
Posted 14 Oct 2008 by AdeB
Post:
At this moment i have a ralph and a rosetta task running at the same time on a single-core computer, both using 50 %CPU.
The ralph task has the status 'Waiting to run' in boinc manager, and it is a hombench_tex_looprelax_tex_cst_oneparam_looprelax_tex_...-task, so that is the one i'm going to abort.
Something is seriously wrong with these tasks.
45) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4271)
Posted 13 Oct 2008 by AdeB
Post:
five failures:
1127424, 1127480, 1127566, 1128390 and 1122258.
All have the name: hombench_tex_looprelax_tex_cst_oneparam_looprelax_tex_cst_... and all have the same error:

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763
called boinc_finish


And this one ran too long.
stderr out:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 14400
**********************************************************************
Rosetta is going too long. Watchdog is ending the run!
CPU time: 49551.9 seconds. Greater than 3X preferred time: 14400 seconds
**********************************************************************
called boinc_finish
SIGSEGV: segmentation violation
Stack trace (20 frames):
[0x89f8027]
[0x8a22720]
[0xffffe420]
[0x887f36b]
[0x82f934a]
[0x830358d]
[0x8749cb6]
[0x8945431]
[0x874b4a6]
[0x874e760]
[0x8749400]
[0x8066196]
[0x8084942]
[0x8092d68]
[0x808c16e]
[0x805e8f8]
[0x809795c]
[0x804bed3]
[0x8a7e21c]
[0x8048111]

Exiting...

</stderr_txt>
]]>
46) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4251)
Posted 8 Oct 2008 by AdeB
Post:
task 1109308 and task 1111517

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763

and Granted credit: 0 for both after running more than 4 hours.
47) Message boards : RALPH@home bug list : Bug Reports for Minirosetta v1.36 (Message 4240)
Posted 6 Oct 2008 by AdeB
Post:
May I ask where the heck Application Version 1.00 came from if we are supposed to be up to 1.36 ???

I have had 3 validate errors and all had version 1.00 stamped on them.

All have the same messages in the result that version 1.35 have (such as "recovering checkpoint" etc for about 30 lines for all new work units but does not validate.

See 1107308
1107311
1107417


Also still no RAC decay on this project for participants and hosts.

Thanks, Conan.


A few days ago there was a new application running on my computer: minirosetta_split_terms version 100.
48) Message boards : RALPH@home bug list : Bug Reports for Rosetta version 5.98 (Message 4227)
Posted 29 Sep 2008 by AdeB
Post:
This task had an error at startup.

stderr out:

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
ERROR:: Exit from: options.cc line: 525

</stderr_txt>
]]>
49) Message boards : RALPH@home bug list : Bug Reports for Minirosetta version 1.34 (Message 4220)
Posted 26 Sep 2008 by AdeB
Post:
Hmm, "NANs occured in hbonding!"
I don't know what it means, but it didn't give any credit.

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
# cpu_run_time_pref: 14400

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763
called boinc_finish

</stderr_txt>
]]>
50) Message boards : RALPH@home bug list : Bug Reports for Rosetta version 5.98 (Message 4172)
Posted 31 Jul 2008 by AdeB
Post:
segmentation violation
51) Message boards : RALPH@home bug list : Bug report for Minirosetta version 1.29 (Message 4158)
Posted 3 Jul 2008 by AdeB
Post:
Error in resultid=1050453

stderr out:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Unrecognized XML in parse_init_data_file: fraction_done_update_period
Skipping: 1.000000
Skipping: /fraction_done_update_period
ERROR: Option matching -fragdate not found in command line top-level context

</stderr_txt>
]]>
52) Message boards : RALPH@home bug list : Scheduler/Server Problems? (Message 4068)
Posted 26 May 2008 by AdeB
Post:
No errors here on 5/25 and 5/26.
53) Message boards : RALPH@home bug list : minirosetta v1.24 bug thread (Message 4059)
Posted 24 May 2008 by AdeB
Post:
Workunit 972507

stderr out
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
ERROR: Option matching -description_file not found in command line top-level context

</stderr_txt>
]]>
54) Message boards : RALPH@home bug list : minirosetta 1.19 bug thread (Message 4013)
Posted 8 May 2008 by AdeB
Post:
resultid=950809

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 14400
======================================================
DONE :: 1 starting structures 11991.7 cpu seconds
This process generated 2 decoys from 2 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>1tig__BOINC_ABRELAX_IGNORE_THE_REST-S25-17-S3-17--1tig_-_3581_28_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
55) Message boards : RALPH@home bug list : Bug reports for 5.96 (Message 4000)
Posted 7 May 2008 by AdeB
Post:
Had these two fail with the same error after 14 seconds,
This WU
and this WU

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 21600
# random seed: 1381311
ERROR:: Exit from: loop_relax.cc line: 1745


Same error here in resultid=949856 and resultid=949857
56) Message boards : RALPH@home bug list : minirosetta 1.19 bug thread (Message 3999)
Posted 7 May 2008 by AdeB
Post:
Rack up another "compute error" for me and my wing man. This one failed immediately at start up on my PC:

Task ID 949788
Name fa_max_dis_9-1c8cA-test_2008-5-6_3655_9_1
Workunit 842742
Outcome Client error
Client state Compute error
Exit status 1 (0x1)

stderr out

<core_client_version>6.1.0</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR: Option matching -psipred_ss2:1c8cA.fa_max_dis_9.psipred_ss2 not found in command line top-level context

</stderr_txt>

]]>

Similar error in resultid=950615

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
ERROR: Option matching -psipred_ss2:1opd_.fa_max_dis_9.psipred_ss2 not found in command line top-level context

</stderr_txt>
]]>
57) Message boards : RALPH@home bug list : Bug reports for 5.96 (Message 3890)
Posted 12 Apr 2008 by AdeB
Post:
2 errors, both on linux PC's.

resultid=905699
stderr out:
<core_client_version>5.10.28</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
# random seed: 1398092
SIGSEGV: segmentation violation
Stack trace (14 frames):
[0x8e1b49b]
[0x8e15d8c]
[0xffffe420]
[0x8c836d4]
[0x87d2748]
[0x87d7a5e]
[0x8d6bbd9]
[0x8b89e24]
[0x8b8c203]
[0x8629ccb]
[0x8768a9f]
[0x8768b4a]
[0x8e80034]
[0x8048111]

Exiting...

</stderr_txt>
]]>

resultid=885864
stderr out:
<core_client_version>5.10.28</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
# random seed: 1417106
ERROR:: Exit from: minimize.cc line: 2088

</stderr_txt>
]]>
58) Message boards : RALPH@home bug list : Bug reports for 5.96 (Message 3842)
Posted 31 Mar 2008 by AdeB
Post:
What happened with this workunit?
Stderr out looks normal to me.
Valid state: Workunit error - check skipped
59) Message boards : RALPH@home bug list : Bug Reports for Rosetta Mini Versions 1.+ (Message 3809)
Posted 29 Feb 2008 by AdeB
Post:
Error in resultid=798264

<core_client_version>5.10.28</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
Failed to find rotamer: 3 0 2
Amongst options: 3 2 2
3 3 3
3 3 2
3 2 1
3 2 3
3 1 1
1 3 2
3 1 2
1 2 2
3 3 1
1 2 1
1 2 3
3 1 3
1 3 1
2 2 2
1 3 3
2 2 1
2 2 3
2 1 1
2 3 2
2 1 2
1 1 2
1 1 1
2 3 3
2 1 3
1 1 3
2 3 1
ERROR:: Exit from: src/core/scoring/dunbrack/SingleResidueDunbrackLibrary.tmpl.hh line: 142
called boinc_finish

</stderr_txt>
]]>
60) Message boards : Current tests : Help us debug minirosetta. (Message 3773)
Posted 20 Feb 2008 by AdeB
Post:
I'm sometimes getting double of what I'm claiming.

These are my last three Mini tasks (Linux)
CPU time (sec)  	claimed credit  	granted credit
 	13,005.13  	31.92  			50.88
 	13,564.10  	33.29  			50.73
	12,407.80  	30.77  			50.87



In contrast, it seems like I get exactly what I claim on Beta tasks.


Not all linux machines get more credit than claimed.
My mobile AMD Athlon(tm) XP2500+ gets less than half of claimed, on Beta tasks it got more.
On my AMD Athlon(tm) XP 2000+ there isn't much difference between Mini and Beta tasks


Previous 20 · Next 20



©2024 University of Washington
http://www.bakerlab.org