Rosetta mini 3.68 about 50% of WUs in "Validate error" state

zioriga

Joined: 16 Feb 06
Posts: 7
Credit: 253,519
RAC: 0
Message 5961 - Posted: 6 Jan 2016, 11:05:43 UTC

As stated in the title, about 50% of the WUs terminate with no particular message after a run time of only a few minutes (2-4 minutes), and the server puts them in the "Validate error" state.

I think 50% is really too much. Maybe it's time to suspend the creation of new WUs, debug the problem, and then restart the process?
[VENETO] boboviz

Joined: 9 Apr 08
Posts: 581
Credit: 1,028,705
RAC: 180
Message 5962 - Posted: 6 Jan 2016, 15:12:32 UTC - in response to Message 5961.  

As stated in the title, about 50% of the WUs terminate with no particular message after a run time of only a few minutes (2-4 minutes), and the server puts them in the "Validate error" state.


I have the same problem (the errors are visible on my user profile).
From the home page:
Successes last 24h: 20,981
Failures last 24h: 5,630

Even 25% is high!
Dr. Merkwürdigliebe

Joined: 12 Jun 15
Posts: 16
Credit: 23,473
RAC: 0
Message 5963 - Posted: 6 Jan 2016, 16:06:21 UTC
Last modified: 6 Jan 2016, 16:07:48 UTC

I don't see any problems on Linux x64

http://ralph.bakerlab.org/results.php?hostid=35093
Snagletooth

Joined: 4 May 07
Posts: 67
Credit: 132,513
RAC: 0
Message 5964 - Posted: 6 Jan 2016, 16:30:44 UTC

100% success rate (completed and validated) so far on my Mac. One of mine (there may be more; I haven't had time to trawl through the whole list) was a resend that had ended quickly (68.02 CPU seconds) on a Windows machine and received a validate error. The stderr output included this bit:

ERROR: Can't create a polymer bond after residue 4 due to incompatible type: LYS:CtermProteinFull
ERROR:: Exit from: ..\..\..\src\core\conformation\Conformation.cc line: 845
DummyMover::apply() should never have been called! (JobDistributor/Parser should have replaced DummyMover.)

ERROR: false
ERROR:: Exit from: ..\..\..\src\apps\public\boinc\minirosetta.cc line: 96
DummyMover::apply() should never have been called! (JobDistributor/Parser should have replaced DummyMover.)


repeat, repeat, repeat...before ending with:

DONE :: 99 starting structures 1201 cpu seconds
This process generated 99 decoys from 99 attempts



It ran for 5248 CPU seconds on my machine, successfully generating two decoys from two attempts. Click the link above ("one of mine") to see the details.
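
For anyone comparing failed tasks, here is a rough Python sketch that tallies the distinct "ERROR: ..." messages in a stderr dump like the one quoted above. The helper name `tally_errors` is made up for illustration and is not part of BOINC or Rosetta; the sample text is just the excerpt from this post, and the script assumes the full stderr has been saved or pasted as plain text.

```python
from collections import Counter

# Excerpt copied from the stderr output quoted in this post.
STDERR_EXCERPT = r"""
ERROR: Can't create a polymer bond after residue 4 due to incompatible type: LYS:CtermProteinFull
ERROR:: Exit from: ..\..\..\src\core\conformation\Conformation.cc line: 845
DummyMover::apply() should never have been called! (JobDistributor/Parser should have replaced DummyMover.)

ERROR: false
ERROR:: Exit from: ..\..\..\src\apps\public\boinc\minirosetta.cc line: 96
DummyMover::apply() should never have been called! (JobDistributor/Parser should have replaced DummyMover.)
"""


def tally_errors(stderr_text: str) -> Counter:
    """Count each distinct 'ERROR: ...' message (hypothetical helper)."""
    counts = Counter()
    prefix = "ERROR: "
    for line in stderr_text.splitlines():
        line = line.strip()
        # 'ERROR::' lines only report the exit location, so they are skipped:
        # they do not start with 'ERROR: ' (colon followed by a space).
        if line.startswith(prefix):
            counts[line[len(prefix):]] += 1
    return counts


# Print the messages, most frequent first.
for message, count in tally_errors(STDERR_EXCERPT).most_common():
    print(f"{count:4d}  {message}")
```

Run against the complete stderr of a failed task, this should show at a glance whether one message (for example the polymer bond error) accounts for most of the repeats.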
zioriga

Joined: 16 Feb 06
Posts: 7
Credit: 253,519
RAC: 0
Message 5969 - Posted: 7 Jan 2016, 2:36:47 UTC

Judging from your replies, it seems the problem occurs only on my Windows 10 machine.
[VENETO] boboviz

Joined: 9 Apr 08
Posts: 581
Credit: 1,028,705
RAC: 180
Message 5972 - Posted: 7 Jan 2016, 8:37:00 UTC - in response to Message 5969.  

Judging from your replies, it seems the problem occurs only on my Windows 10 machine.


Same here: 3 machines with Windows 10 (two 64-bit and one 32-bit).
[VENETO] boboviz

Joined: 9 Apr 08
Posts: 581
Credit: 1,028,705
RAC: 180
Message 5975 - Posted: 7 Jan 2016, 16:01:44 UTC - in response to Message 5962.  

From home page
Successes last 24h: 20,981
Failures last 24h: 5,630


Successes last 24h: 18,786
Failures last 24h: 5,467

Failure rate is increasing :-(
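
To put numbers on that, here is a small Python sketch that turns the two home-page snapshots quoted in this thread into failure rates. It assumes "failure rate" means failures divided by all reported results (successes plus failures); dividing by successes alone would give a higher figure. The function name is only for illustration.

```python
def failure_rate(successes: int, failures: int) -> float:
    """Return failures as a fraction of all reported results."""
    total = successes + failures
    if total == 0:
        raise ValueError("no results reported")
    return failures / total


# Home-page counters quoted in this thread: (successes, failures).
snapshots = {
    "6 Jan 2016": (20_981, 5_630),
    "7 Jan 2016": (18_786, 5_467),
}

for day, (ok, bad) in snapshots.items():
    print(f"{day}: {failure_rate(ok, bad):.1%} of results failed")
```

Under that definition the two snapshots come out at roughly 21.2% and 22.5%, so the rate did go up by a bit more than one percentage point.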
[VENETO] boboviz

Joined: 9 Apr 08
Posts: 581
Credit: 1,028,705
RAC: 180
Message 5987 - Posted: 14 Jan 2016, 10:02:10 UTC - in response to Message 5975.  

Failure rate is increasing :-(


With the new version, the failure rate is decreasing! :-)



©2018 University of Washington
http://www.bakerlab.org