minirosetta 1.58

Paul D. Buck
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4723 - Posted: 6 Mar 2009, 6:46:55 UTC

The error is: "Unable to open weights."

1331227
1331235
ID: 4723
Evan
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 4724 - Posted: 6 Mar 2009, 9:33:09 UTC

validate error:
all first time runs
1329199
1329195
1329205
1329212
1329219
1329224


What causes a validation error? It appears that my 6 errant work units are well on the way to having successful second runs.
ID: 4724
Evan
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 4725 - Posted: 9 Mar 2009, 23:42:46 UTC

There are going to be quite a few errors coming through until the system sorts itself out after the closure. I have had a good number of ghost work units reportedly downloaded to me, but I can't find them. They have already been reported as successes and awarded points, but were handed out again.
ID: 4725
Speedy
Joined: 4 Dec 06
Posts: 8
Credit: 1,985
RAC: 0
Message 4726 - Posted: 10 Mar 2009, 7:57:10 UTC
Last modified: 10 Mar 2009, 8:04:33 UTC

This workunit displayed a black window when I pushed the "Show graphics" button under the Tasks tab in BOINC Manager. It completed successfully after about 55 minutes, and I got credit for it. Could the reason be that the task was a resend? Is there an ETA for when 1.58 will be deployed on the main Rosetta?
ID: 4726
feet1st
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 4727 - Posted: 13 Mar 2009, 19:00:36 UTC
Last modified: 13 Mar 2009, 19:01:33 UTC

A few tasks lately have produced large outfiles and ended prematurely (compared to my 24hr runtime preference), presumably due to the 99 model limit. I thought you would want to be aware of them.

runtime   result size   task
11:41     1.6MB         1343012
10hrs     unknown       1339103
21:41     2.56MB        1343325

These are all the loopbuild tasks of various flavors.
ID: 4727
Pentti Kiesi
Joined: 2 Jan 09
Posts: 2
Credit: 111,437
RAC: 0
Message 4728 - Posted: 15 Mar 2009, 7:49:43 UTC

One WU seems not willing to upload at all. Others before and after it are
uploading correctly:

13.3.2009 21:38:39|ralph@home|Started upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0
13.3.2009 21:38:39|ralph@home|Started upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1f5s_99_8360_1_0_0
13.3.2009 21:38:39|Poem@Home|Restarting task Peptide_387_1236485967_1676836187_0 using poem version 100
13.3.2009 21:38:39|malariacontrol.net|Restarting task wu_510_511_2640_0_1236930257_1 using malariacontrolBeta version 612
13.3.2009 21:38:39|malariacontrol.net|Restarting task wu_510_415_2640_0_1236930257_0 using malariacontrolBeta version 612
13.3.2009 21:38:39|malariacontrol.net|Restarting task wu_510_414_2640_0_1236930257_1 using malariacontrolBeta version 612
13.3.2009 21:38:40|ralph@home|[error] Error reported by file upload server: [loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0] locked by file_upload_handler PID=-1
13.3.2009 21:38:40|ralph@home|Temporarily failed upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0: transient upload error
13.3.2009 21:38:40|ralph@home|Backing off 3 hr 5 min 27 sec on upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0


...

15.3.2009 9:34:49|ralph@home|Started upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0
15.3.2009 9:34:51|ralph@home|[error] Error reported by file upload server: [loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0] locked by file_upload_handler PID=-1
15.3.2009 9:34:51|ralph@home|Temporarily failed upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0: transient upload error
15.3.2009 9:34:51|ralph@home|Backing off 2 hr 45 min 43 sec on upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0

What is the problem?
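
For anyone who wants to check their own message log for the same pattern, here is a minimal sketch (not an official BOINC tool) that counts how often each file hits "Temporarily failed upload". It assumes the `timestamp|project|message` layout seen in the excerpt above; the function name `count_failed_uploads` is made up for illustration.

```python
import re
from collections import Counter

# Matches the message text "Temporarily failed upload of <file>: <reason>"
# as it appears in the log excerpt above.
FAILED_UPLOAD = re.compile(r"Temporarily failed upload of (\S+): (.+)$")


def count_failed_uploads(log_lines):
    """Return a Counter mapping file name -> number of failed upload attempts."""
    failures = Counter()
    for line in log_lines:
        # Assumed layout: "timestamp|project|message". Split at most twice
        # so any '|' inside the message itself is preserved.
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue  # not a log line in the assumed layout
        _timestamp, _project, message = parts
        match = FAILED_UPLOAD.search(message)
        if match:
            failures[match.group(1)] += 1
    return failures


# Three lines copied from the excerpt: two failures for one file, one normal start.
sample = [
    "13.3.2009 21:38:40|ralph@home|Temporarily failed upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0: transient upload error",
    "15.3.2009 9:34:51|ralph@home|Temporarily failed upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1te2_99_8360_1_0_0: transient upload error",
    "13.3.2009 21:38:39|ralph@home|Started upload of loopbuild_mamaln_full_hb_t303__IGNORE_THE_REST_1f5s_99_8360_1_0_0",
]

for name, count in count_failed_uploads(sample).items():
    print(f"{count} failed attempts: {name}")
```

A file that keeps failing across days, while others uploaded in the same minute go through, points at that one file rather than the connection.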

ID: 4728
BigMike
Joined: 23 Feb 06
Posts: 63
Credit: 58,730
RAC: 0
Message 4729 - Posted: 15 Mar 2009, 7:56:03 UTC - in response to Message 4725.  

I have had a good number of ghost work units reportedly downloaded to me but I can't find them.

Same here. I have about 40 WU's that R@H thinks are "in progress", but I never saw them. Something's broken...

==Mike

Don't believe everything you think.
ID: 4729
robertmiles
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4730 - Posted: 15 Mar 2009, 12:39:00 UTC

Another lockfile problem:

https://ralph.bakerlab.org/result.php?resultid=1358791

I'm still running at 95% CPU time but don't think I enabled graphics at any time for this workunit.
ID: 4730
feet1st
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 4731 - Posted: 15 Mar 2009, 14:39:19 UTC
Last modified: 15 Mar 2009, 14:41:19 UTC

This mamaln task was preempted at about 98% complete. When BOINC got back to it, it finished immediately (16 seconds later). There were no other suspicious messages about the task.

Status page shows scheduler is active, but I'm getting these when I try to update.
Scheduler request failed: Server returned nothing (no headers, no data)
ID: 4731
KC0ISW
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 4732 - Posted: 15 Mar 2009, 20:34:55 UTC

https://ralph.bakerlab.org/result.php?resultid=1357308
ID: 4732
KC0ISW
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 4733 - Posted: 15 Mar 2009, 20:38:24 UTC


https://ralph.bakerlab.org/result.php?resultid=1357307
https://ralph.bakerlab.org/result.php?resultid=1357306
https://ralph.bakerlab.org/result.php?resultid=1357304
https://ralph.bakerlab.org/result.php?resultid=1357303
https://ralph.bakerlab.org/result.php?resultid=1357302
https://ralph.bakerlab.org/result.php?resultid=1357274
https://ralph.bakerlab.org/result.php?resultid=1357231

ID: 4733
BigMike
Joined: 23 Feb 06
Posts: 63
Credit: 58,730
RAC: 0
Message 4734 - Posted: 15 Mar 2009, 22:19:12 UTC - in response to Message 4729.  

I have had a good number of ghost work units reportedly downloaded to me but I can't find them.

Same here. I have about 40 WU's that R@H thinks are "in progress", but I never saw them.


It just did it to me again. Three WU's completed...37 non-existent ones "in progress". And I've reached my daily "quota".

==Mike

Don't believe everything you think.
ID: 4734
KC0ISW
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 4735 - Posted: 16 Mar 2009, 5:10:03 UTC

https://ralph.bakerlab.org/result.php?resultid=1357309
https://ralph.bakerlab.org/result.php?resultid=1357310
https://ralph.bakerlab.org/result.php?resultid=1357311
https://ralph.bakerlab.org/result.php?resultid=1357312
ID: 4735
KC0ISW
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 4736 - Posted: 16 Mar 2009, 5:22:15 UTC - in response to Message 4735.  

OK, my errors may be because of my DEP settings. I told DEP to overlook BOINC, so we'll see if that helps.
ID: 4736
Paul D. Buck
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4737 - Posted: 16 Mar 2009, 8:39:12 UTC

First death in like forever ... I have run off a ton of tasks recently and only the one death:

1362077 0x006C43DD write attempt to address 0x00000000

Pretty sure this is an address reported previously ...
ID: 4737
KC0ISW
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 4738 - Posted: 16 Mar 2009, 14:47:46 UTC

All my errors keep coming from:

Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x005286C6 read attempt to address 0x06CA4FF8
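
To compare crashes across results, it can help to pull the addresses out of the "Reason:" line so that reports with the same faulting address can be grouped. The following is only a sketch that assumes the exact wording shown above; `parse_access_violation` is a made-up helper name, not part of BOINC or Rosetta.

```python
import re

# Assumes the wording of the "Reason:" line quoted above:
# "Access Violation (<code>) at address <addr> read|write attempt to address <addr>"
ACCESS_VIOLATION = re.compile(
    r"Access Violation \((0x[0-9A-Fa-f]+)\) at address (0x[0-9A-Fa-f]+) "
    r"(read|write) attempt to address (0x[0-9A-Fa-f]+)"
)


def parse_access_violation(text):
    """Return the parsed fields as a dict, or None if the text does not match."""
    match = ACCESS_VIOLATION.search(text)
    if match is None:
        return None
    code, fault_address, access, target_address = match.groups()
    target = int(target_address, 16)
    return {
        "exception_code": int(code, 16),           # 0xC0000005 = access violation
        "fault_address": int(fault_address, 16),   # where in the program it crashed
        "access": access,                          # "read" or "write"
        "target_address": target,                  # the memory it tried to touch
        "null_pointer": target == 0,               # True for address 0x00000000
    }


reason = ("Reason: Access Violation (0xc0000005) at address 0x005286C6 "
          "read attempt to address 0x06CA4FF8")
info = parse_access_violation(reason)
print(hex(info["fault_address"]), info["access"], info["null_pointer"])
```

Grouping by `fault_address` shows whether repeated failures come from the same place in the program, which is more useful to report than each result ID on its own.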

ID: 4738
feet1st
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 4739 - Posted: 16 Mar 2009, 19:55:10 UTC
Last modified: 16 Mar 2009, 20:04:45 UTC

Stage "unk"?? So... it's "unknown"? (and truncated?)

The protein seemed to slip off the pane too. I just saw black until it later came into view.



ID: 4739
feet1st
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 4740 - Posted: 16 Mar 2009, 20:12:17 UTC - in response to Message 4735.  

https://ralph.bakerlab.org/result.php?resultid=1357309
https://ralph.bakerlab.org/result.php?resultid=1357310
https://ralph.bakerlab.org/result.php?resultid=1357311
https://ralph.bakerlab.org/result.php?resultid=1357312


In each case, when your task failed another task was generated for the work unit, and the next person was able to run it without error.

Seems to point to some problem on your machine. Overclocking? Dust bunnies clogging cooling system? Memory failing?
ID: 4740
svincent
Joined: 4 Apr 08
Posts: 34
Credit: 51,768
RAC: 0
Message 4742 - Posted: 17 Mar 2009, 16:45:54 UTC

A couple of recent compute errors on Mac OS X 10.4.11:

Workunit 1210811

ERROR: Cannot open PDB file "1a17_201.pdb"
ERROR:: Exit from: src/core/io/pdb/pose_io.cc line: 179
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

Workunit 1210764

ERROR: Cannot open PDB file "1ad6_197.pdb"
ERROR:: Exit from: src/core/io/pdb/pose_io.cc line: 179
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish


ID: 4742
Evan
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 4743 - Posted: 17 Mar 2009, 16:54:21 UTC
Last modified: 17 Mar 2009, 16:55:10 UTC

A PDB error, this time on Windows, with this one:

1369045

Cannot open PDB file "1xqo_213.pdb"
ERROR:: Exit from: ..\..\src\core\io\pdb\pose_io.cc line: 179
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish
ID: 4743



©2024 University of Washington
http://www.bakerlab.org