minirosetta 1.58

Message boards : RALPH@home bug list : minirosetta 1.58

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4698 - Posted: 14 Feb 2009, 19:05:12 UTC - in response to Message 4697.  

This 1.58 workunit:

https://ralph.bakerlab.org/workunit.php?wuid=1151935


It looks like the second try it ran to completion ...

It looks to me like my assertion that running at less than 100% CPU causes issues ... like the "can't acquire lockfile" error ...


Mike, have you thought of adding some debug code to the parts of the next version of minirosetta that have anything to do with the lockfile, and occasionally recording the percentage of the CPU time BOINC runs?


Mike,

Also, debug code for any parts that can do an exit from the program without setting the status.

Some of the messages so far from a workunit likely to need this debug code to determine just what's going on:

2/14/2009 12:29:53 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:29:53 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:29:53 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:30:34 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:30:34 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:30:35 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:31:16 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:31:16 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:31:16 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:31:57 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:31:57 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:31:57 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:32:38 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:32:38 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:32:39 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:33:20 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:33:20 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:33:20 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:34:01 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:34:01 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:34:01 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:34:42 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:34:42 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:34:42 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:35:23 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:35:23 PM|ralph@home|If this happens repeatedly you may need to reset the project.
2/14/2009 12:35:23 PM|ralph@home|Restarting task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 using minirosetta version 158
2/14/2009 12:36:04 PM|ralph@home|Task loopbuild_chunk_2_7_B_hb_t327__IGNORE_THE_REST_1Z7UA_9_7851_1_0 exited with zero status but no 'finished' file
2/14/2009 12:36:04 PM|ralph@home|If this happens repeatedly you may need to reset the project.


https://ralph.bakerlab.org/workunit.php?wuid=1156395

04:40:56 CPU so far with 6 hours requested, and no longer changing even while the workunit is running. Task Manager indicates that it is using 0% CPU time.

This is with a 1.58 workunit at 90% CPU, under BOINC 6.2.28 with 32-bit Vista SP1 with a dual-core AMD CPU. I don't know if it's significant that I only saw this problem after enabling the graphics for a few minutes, even though I normally keep it disabled. The graphics looked reasonable, though.
ID: 4698 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4701 - Posted: 14 Feb 2009, 20:52:51 UTC - in response to Message 4698.  

https://ralph.bakerlab.org/workunit.php?wuid=1156395

04:40:56 CPU so far with 6 hours requested, and no longer changing even while the workunit is running. Task Manager indicates that it is using 0% CPU time.

This is with a 1.58 workunit at 90% CPU, under BOINC 6.2.28 with 32-bit Vista SP1 with a dual-core AMD CPU. I don't know if it's significant that I only saw this problem after enabling the graphics for a few minutes, even though I normally keep it disabled. The graphics looked reasonable, though.


Now that workunit has ended with a Computation error after a lot of these messages (not visible until the workunit ends):

[2009- 2-14 11:55:25:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:56: 7:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:56:48:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:57:29:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:58:11:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:58:52:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 11:59:33:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 12: 0:14:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting
[2009- 2-14 12: 0:56:] :: BOINC:: Initializing ... ok.
Can't acquire lockfile - exiting

Hope these results are at least useful in tracking down the lockfile problem.
ID: 4701 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4702 - Posted: 14 Feb 2009, 21:01:15 UTC - in response to Message 4701.  

I tried resetting the Ralph@home project as the messages suggested, and got these messages as a result:

2/14/2009 2:56:12 PM|ralph@home|Resetting project
2/14/2009 2:56:18 PM|ralph@home|[error] Couldn't delete file projects/ralph.bakerlab.org/minirosetta_1.58_windows_intelx86.exe
2/14/2009 2:56:33 PM|ralph@home|Sending scheduler request: To fetch work. Requesting 3853 seconds of work, reporting 0 completed tasks
2/14/2009 2:56:38 PM|ralph@home|Scheduler request succeeded: got 0 new tasks

Looks like there's also a problem in your reset procedure.
ID: 4702 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4703 - Posted: 14 Feb 2009, 21:49:21 UTC - in response to Message 4701.  

I don't know if it's significant that I only saw this problem after enabling the graphics for a few minutes, even though I normally keep it disabled. The graphics looked reasonable, though.


Ha! There may be the clue I was missing!

I don't recall if I had been looking at the graphics or not. But it is likely when I saw a failure. Until I gave up in disgust.

Though there are no tasks perhaps the test would be to run a few tasks with no looking and some where you look at the graphics.

I am not sure why the launching of the graphics application would cause this issue but this could be the missing clue ... and why I never saw the issue in Einstein even when I had the setting that caused this issue in Rosetta ...

MOST interesting is that you can launch graphics at 100% and have no issue. But, that the switch to pause the application would cause it.

Oh, and if you don't want to lose that much CPU you can use 99% like I did and get the same effect ...
ID: 4703 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4704 - Posted: 14 Feb 2009, 23:47:10 UTC - in response to Message 4703.  

Oh, and if you don't want to lose that much CPU you can use 99% like I did and get the same effect ...


I tried 99% for a while but had two problems with this setting:

1. Problems making this setting reduce the CPU percentage at all - now fixed.

2. Problems getting Task Manager to show me such small gaps in CPU usage.

I may try again soon at 95%, though.



Mike, in order to save time in testing, you may want to consider these ideas:

1. Try to send a larger share of any workunits aimed at the lockfile problem to machines known to have had these problems recently.

2. If these same machines get a workunit aimed at testing anything else, immediately put a copy of that workunit back on the queue to be sent to machines not in this group.
ID: 4704 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4705 - Posted: 15 Feb 2009, 4:15:04 UTC - in response to Message 4702.  
Last modified: 15 Feb 2009, 4:17:41 UTC

I tried resetting the Ralph@home project as the messages suggested, and got these messages as a result:

2/14/2009 2:56:12 PM|ralph@home|Resetting project
2/14/2009 2:56:18 PM|ralph@home|[error] Couldn't delete file projects/ralph.bakerlab.org/minirosetta_1.58_windows_intelx86.exe
2/14/2009 2:56:33 PM|ralph@home|Sending scheduler request: To fetch work. Requesting 3853 seconds of work, reporting 0 completed tasks
2/14/2009 2:56:38 PM|ralph@home|Scheduler request succeeded: got 0 new tasks

Looks like there's also a problem in your reset procedure.


Better check just what that Ralph@home reset procedure does. Since the reset, I haven't been able to connect to Rosetta@home, either through BOINC or through its website. I have a Rosetta@home result I haven't been able to send, or I'd try resetting the Rosetta@home project also.

Has Rosetta@home been offline for several hours, or is this part of the result of the Ralph@home reset attempt?
ID: 4705 · Report as offensive    Reply Quote
I _ quit

Send message
Joined: 13 Jan 09
Posts: 44
Credit: 88,562
RAC: 0
Message 4706 - Posted: 15 Feb 2009, 9:37:07 UTC - in response to Message 4705.  

I tried resetting the Ralph@home project as the messages suggested, and got these messages as a result:

2/14/2009 2:56:12 PM|ralph@home|Resetting project
2/14/2009 2:56:18 PM|ralph@home|[error] Couldn't delete file projects/ralph.bakerlab.org/minirosetta_1.58_windows_intelx86.exe
2/14/2009 2:56:33 PM|ralph@home|Sending scheduler request: To fetch work. Requesting 3853 seconds of work, reporting 0 completed tasks
2/14/2009 2:56:38 PM|ralph@home|Scheduler request succeeded: got 0 new tasks

Looks like there's also a problem in your reset procedure.


Better check just what that Ralph@home reset procedure does. Since the reset, I haven't been able to connect to Rosetta@home, either through BOINC or through its website. I have a Rosetta@home result I haven't been able to send, or I'd try resetting the Rosetta@home project also.

Has Rosetta@home been offline for several hours, or is this part of the result of the Ralph@home reset attempt?



It's been nearly 24 hrs and Rosetta is still down. It's almost like a system failure happened there. The problems there have nothing to do with Ralph and that problem you are having deleting the file.
ID: 4706 · Report as offensive    Reply Quote
AdeB
Avatar

Send message
Joined: 22 Dec 07
Posts: 61
Credit: 161,367
RAC: 0
Message 4707 - Posted: 15 Feb 2009, 11:27:17 UTC
Last modified: 15 Feb 2009, 11:28:03 UTC

This looks like a long-running model: resultid=1309799
Name: loopbuild_chunk_2_7_B_hb_t286__IGNORE_THE_REST_1YZFA_5_7846_1_0
Outcome: Validate error
stderr out:
. . .
BOINC:: Worker startup. 
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 14400
Hbond tripped !!!
BOINC:: CPU time: 28889.1s, 14400s + 14400s[2009- 2-14 15:24: 2:] :: BOINC 
======================================================
DONE ::     2 starting structures  28889.1 cpu seconds
This process generated      3 decoys from       3 attempts
======================================================
called boinc_finish


AdeB
ID: 4707 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4708 - Posted: 19 Feb 2009, 7:01:57 UTC

At least we got a little work again ...
ID: 4708 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4709 - Posted: 19 Feb 2009, 8:11:41 UTC

A failed workunit:

https://ralph.bakerlab.org/workunit.php?wuid=1160172

Some typical messages from it:

2/19/2009 12:19:42 AM|ralph@home|Restarting task 1ig5A_BOINC_ABRELAX_IGNORE_THE_REST-ENV10000--1ig5A-_7875_1_0 using minirosetta version 158
2/19/2009 12:20:22 AM|ralph@home|Task 1ig5A_BOINC_ABRELAX_IGNORE_THE_REST-ENV10000--1ig5A-_7875_1_0 exited with zero status but no 'finished' file
2/19/2009 12:20:22 AM|ralph@home|If this happens repeatedly you may need to reset the project.
2/19/2009 12:20:22 AM|ralph@home|Restarting task 1ig5A_BOINC_ABRELAX_IGNORE_THE_REST-ENV10000--1ig5A-_7875_1_0 using minirosetta version 158
2/19/2009 12:21:04 AM|ralph@home|Task 1ig5A_BOINC_ABRELAX_IGNORE_THE_REST-ENV10000--1ig5A-_7875_1_0 exited with zero status but no 'finished' file
2/19/2009 12:21:04 AM|ralph@home|If this happens repeatedly you may need to reset the project.

These messages are repeated many times.

I'm now running at 95% CPU, in order to help pin down the cause of this problem.
ID: 4709 · Report as offensive    Reply Quote
Profile Ian_D

Send message
Joined: 16 Feb 06
Posts: 16
Credit: 39,518
RAC: 0
Message 4710 - Posted: 19 Feb 2009, 18:23:25 UTC
Last modified: 19 Feb 2009, 18:25:15 UTC

https://ralph.bakerlab.org/result.php?resultid=1311870

<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
[2009- 2-17 6:40:25:] :: BOINC:: Initializing ... ok.
[2009- 2-17 6:40:25:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Trying to access options object.
Success.
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/ralph.bakerlab.org/minirosetta_database_rev26003.zip
Setting database description ...
Setting up checkpointing ...
Setting up folding (abrelax) ...

ERROR: ERROR: FragmentIO: could not open file cs_aa_1ji8A09_05.200_v1_3.gz
ERROR:: Exit from: ....srccorefragmentFragmentIO.cc line: 245
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>
]]>

https://ralph.bakerlab.org/result.php?resultid=1311551



<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
[2009- 2-15 22:23:26:] :: BOINC:: Initializing ... ok.
[2009- 2-15 22:23:26:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Trying to access options object.
Success.
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/ralph.bakerlab.org/minirosetta_database_rev26003.zip
Unpacking WU data ...
Unpacking data: ../../projects/ralph.bakerlab.org/loopbuild_chunk_cheat_3_5.loopbuild_chunk.t326_.mtyka.boinc_files.zip
Setting database description ...
Setting up checkpointing ...

ERROR: [ERROR] Error opening RBSeg file 'core_2GHRA_10_noloop_loops.txt'
ERROR:: Exit from: ....srcprotocolsloopsLoopClass.cc line: 443
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>
]]>
ID: 4710 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4711 - Posted: 19 Feb 2009, 19:44:06 UTC - in response to Message 4709.  

These messages are repeated many times.

I'm now running at 95% CPU, in order to help pin down the cause of this problem.


The one hint is that POSSIBLY only those tasks where you have the screen saver activate or use the graphics application ALONG with CPU throtteling may be linked ... can you make note of that?
ID: 4711 · Report as offensive    Reply Quote
Profile Ian_D

Send message
Joined: 16 Feb 06
Posts: 16
Credit: 39,518
RAC: 0
Message 4712 - Posted: 20 Feb 2009, 20:35:22 UTC

Invalid, Huh ?

https://ralph.bakerlab.org/result.php?resultid=1316610

<core_client_version>6.4.5</core_client_version>
<![CDATA[
<stderr_txt>
[2009- 2-20 5:36:36:] :: BOINC:: Initializing ... ok.
[2009- 2-20 5:36:36:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Trying to access options object.
Success.
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/ralph.bakerlab.org/minirosetta_database_rev26003.zip
Unpacking WU data ...
Unpacking data: ../../projects/ralph.bakerlab.org/loopbuild_mamaln_ideal.loopbuild.t312_.mtyka.boinc_files.zip
Setting database description ...
Setting up checkpointing ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
======================================================
DONE :: 1 starting structures 3034.98 cpu seconds
This process generated 5 decoys from 5 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
]]>

ID: 4712 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4713 - Posted: 21 Feb 2009, 12:03:56 UTC - in response to Message 4711.  
Last modified: 21 Feb 2009, 12:29:46 UTC

These messages are repeated many times.

I'm now running at 95% CPU, in order to help pin down the cause of this problem.


The one hint is that POSSIBLY only those tasks where you have the screen saver activate or use the graphics application ALONG with CPU throtteling may be linked ... can you make note of that?


Since then, I've had two 1.58 workunits complete successfully with no graphics application activation. Still running at 95% CPU.

Doing the same for 1.54 over on Rosetta@home doesn't trigger the problem.

In other words, the combination of all of the following trigger the problem for me: 1.58, less than 100% CPU, activating graphics after the workunit starts without them, shutting down the graphics window. Running 1.58 at less than 100% CPU, but with no graphics, doesn't trigger it for me. The problem doesn't trigger for 1.54. I haven't tested the other possibilities yet.

I use an all-black screen saver these days.
ID: 4713 · Report as offensive    Reply Quote
I _ quit

Send message
Joined: 13 Jan 09
Posts: 44
Credit: 88,562
RAC: 0
Message 4714 - Posted: 22 Feb 2009, 22:25:50 UTC

Just FYI, ALL tasks assigned to me completed ok. NO compute errors!
ID: 4714 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4715 - Posted: 26 Feb 2009, 13:42:35 UTC

The last few times I looked at the System Status, the File Deleter was not running. Does it need to be running more often?
ID: 4715 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4719 - Posted: 3 Mar 2009, 7:01:12 UTC

It looks like three tasks with the same error... and it is not one I have seen before:

ERROR: ( vol_a.length() == 2 ) && ( std::isalpha( vol_a[ 0 ] ) ) && ( vol_a[ 1 ] == ':' )
ERROR:: Exit from: ....srcutilityfileFileName.cc line: 41
BOINC:: Error reading and gzipping output datafile: default.out

1325665
1325692
1325691
ID: 4719 · Report as offensive    Reply Quote
svincent

Send message
Joined: 4 Apr 08
Posts: 34
Credit: 51,768
RAC: 0
Message 4720 - Posted: 3 Mar 2009, 17:16:55 UTC

I've had 4 workunits on Mac OS X 10.4.11 that all failed after apparent successful completion

</stderr_txt>
<message>
<file_xfer_error>
<file_name>homobench_natrelax_t312__8094_1_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>


Workunit ID's

1170292
1170291
1170290
1170289

It appears in each case, that they had previously been sent to a Windows machine where they failed (as noted by Paul Buck) in the manner shown below, but at the start, not at the end:

ERROR: ( vol_a.length() == 2 ) && ( std::isalpha( vol_a[ 0 ] ) ) && ( vol_a[ 1 ] == ':' )
ERROR:: Exit from: ....srcutilityfileFileName.cc line: 41
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

ID: 4720 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 4721 - Posted: 4 Mar 2009, 23:48:42 UTC
Last modified: 4 Mar 2009, 23:51:12 UTC

This one had a result file over 1MB. It's name is cc_2_2_mamcstmix_cen_bounded_0.1_hb_t311__ IGNORE_THE_REST_1B0NA_6_8133_1_0
It also ended after under 15 hrs on my 24hr preference. Looks like it hit the 99 model limit.
ID: 4721 · Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 4722 - Posted: 5 Mar 2009, 18:27:44 UTC

validate error:
all first time runs
1329199
1329195
1329205
1329212
1329219
1329224
ID: 4722 · Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : RALPH@home bug list : minirosetta 1.58



©2024 University of Washington
http://www.bakerlab.org