Bug reports for 5.96

Message boards : RALPH@home bug list : Bug reports for 5.96

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4

AuthorMessage
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 4122 - Posted: 20 Jun 2008, 14:10:13 UTC
Last modified: 20 Jun 2008, 14:11:04 UTC

Also errors on 't419N' type work units.
This WU gave this error


<core_client_version>5.10.38</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600
ERROR:: Unable to determine sequence length from pdb file
# random seed: 1178541
ERROR:: Exit from: .refold.cc line: 338

And this WU gave the error that "Maximum Disk Space Exceeded".


ID: 4122 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 4123 - Posted: 20 Jun 2008, 14:33:31 UTC

Linux Boinc 6.2.4, t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0: task got stuck at 100% and around 3 1/2 hour CPU time, for half day long, after restarting the client progress jumped to 48% and two hours CPU time. Preference is set to 2 hours.

In the logs I've found following:

4:27:44 ralph@home [cpu_sched] Resuming t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0
4:27:44 ralph@home [task_debug] task_state=EXECUTING for t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 from unsuspend
4:27:44 ralph@home Resuming task t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 using rosetta_beta version 596
4:30:45 --- Restarting t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 - message timeout
4:30:45 ralph@home [task_debug] task_state=UNINITIALIZED for t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 from kill_task
4:30:45 ralph@home [cpu_sched] Starting t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0(resume)
4:30:45 --- [task_debug] ACTIVE_TASK::start(): forked process: pid 11473
4:30:45 ralph@home [task_debug] task_state=EXECUTING for t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 from start
4:30:45 ralph@home Restarting task t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 using rosetta_beta version 596
4:30:46 --- [error] Process 735 not found
4:34:23 ralph@home [task_debug] result t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 checkpointed
4:37:58 ralph@home [task_debug] result t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 checkpointed
.....
5:26:16 ralph@home [task_debug] result t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 checkpointed
5:29:51 ralph@home [task_debug] result t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0 checkpointed

that's all, idle machine since.

I think it might be possible that the sudden exit at 4:30:45 and the lockup might have something common?

Slot's stdout.txt contains 52591 lines with "res 13 and var 1 at position 1 is not a proper Nterm variant". It is still just 96% of 3.2 MB file :-)
I'll try to let it finish.

Peter
ID: 4123 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 4124 - Posted: 20 Jun 2008, 19:13:58 UTC - in response to Message 4123.  

Linux Boinc 6.2.4, t405__CASP8_JUMPAB_TYPE2_RES81to192_SAVE_ALL_OUT_BARCODE__4233_55_0: task got stuck at 100% and around 3 1/2 hour CPU time, for half day long, after restarting the client progress jumped to 48% and two hours CPU time. Preference is set to 2 hours.

I'll try to let it finish.

The task is again idle at 100%, 4:16 hours and marked as running. Restarted the client again - task jumped to 59.9% and 2:24 hours... bye bye. (And the aborted one got replaced with a coil2_* task - hopefully not another beast from the same family.)

Peter
ID: 4124 · Report as offensive    Reply Quote
Path7

Send message
Joined: 11 Feb 08
Posts: 56
Credit: 4,974
RAC: 0
Message 4125 - Posted: 20 Jun 2008, 21:03:25 UTC
Last modified: 20 Jun 2008, 21:09:15 UTC

Hello all,
Having errors on T419 WU's:
Link to my Linux computer
process exited with code 1 (0x1, -255)
ERROR:: Unable to determine sequence length from pdb file
ERROR:: Exit from: refold.cc line: 338

Ubuntu 7.10 x86, Boinc 5.10.45

Have a nice day,
Path7.
ID: 4125 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 4130 - Posted: 21 Jun 2008, 13:51:57 UTC - in response to Message 4122.  

Also errors on 't419N' type work units.
This WU gave this error


<core_client_version>5.10.38</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600
ERROR:: Unable to determine sequence length from pdb file
# random seed: 1178541
ERROR:: Exit from: .refold.cc line: 338

And this WU gave the error that "Maximum Disk Space Exceeded".



Also had same "ERROR:: Exit from: .refold.cc line: 338" on WU 1044472 and WU 1044495
ID: 4130 · Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4

Message boards : RALPH@home bug list : Bug reports for 5.96



©2024 University of Washington
http://www.bakerlab.org