minirosetta v1.55 bug thread

Message boards : RALPH@home bug list : minirosetta v1.55 bug thread

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
I _ quit

Send message
Joined: 13 Jan 09
Posts: 44
Credit: 88,562
RAC: 0
Message 4661 - Posted: 6 Feb 2009, 0:06:16 UTC

nothing today either and i got room on my system now after playing catch up again.
ID: 4661 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4671 - Posted: 7 Feb 2009, 19:28:50 UTC - in response to Message 4661.  

nothing today either and i got room on my system now after playing catch up again.


Cheep, Breek, Nereek ...

I can hear the crickets ...

The calm before the storm?

The storm before the hurricane?

Or was it something I said?


:)
ID: 4671 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4673 - Posted: 10 Feb 2009, 8:54:03 UTC

Got a couple more tasks, ran fine ...

A post over at Rosetta NC from another user on the lock file problem and the indication there is that they also cured it with using CPU at 100% and not less ...

The more cores you have running the worse the problem was my experience ... for example I did not see it that much on my 4 cores but it was a real killer on my 8 CPU i7 ... YMMV
ID: 4673 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4674 - Posted: 10 Feb 2009, 13:19:23 UTC - in response to Message 4673.  

Then maybe that's why I'm not seeing it even though I've tried to set my CPU to 90% to catch the problem. For some reason, something seems to be making my CPU stay at 100% even though I've told it to change to 90%. I have a dual CPU core machine.
ID: 4674 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4675 - Posted: 10 Feb 2009, 15:52:19 UTC - in response to Message 4674.  

Then maybe that's why I'm not seeing it even though I've tried to set my CPU to 90% to catch the problem. For some reason, something seems to be making my CPU stay at 100% even though I've told it to change to 90%. I have a dual CPU core machine.


Did you try to set it locally?

If you have local preferences already set, then changing the web ones will not do anything. Sadly, there is no clear indication on the preference pane that you are using local vs. remote preferences. Try setting them locally for a shot and then use clear to go back to the web settings...
ID: 4675 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4676 - Posted: 10 Feb 2009, 17:50:07 UTC - in response to Message 4675.  

I've attempted to set them both locally and remotely, both at less than 100%. I'm not sure if I've found the correct way to set them locally, though, and I'll want to know the difference when I get the new computer I'm planning to order.
ID: 4676 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4677 - Posted: 10 Feb 2009, 21:27:45 UTC - in response to Message 4676.  
Last modified: 10 Feb 2009, 21:28:03 UTC

I've attempted to set them both locally and remotely, both at less than 100%. I'm not sure if I've found the correct way to set them locally, though, and I'll want to know the difference when I get the new computer I'm planning to order.


Advanced menu/preferences ... make the setting ... click "Ok" button ...

To go back to the web settings:

Advanced menu/preferences ... click "Clear" button ...

You should be able to tell if the setting has taken, assuming you set it to less than 100% by watching the CPU trend-line and there will be periodic dips in the level ... the way BOINC does "throttling" is to halt operation on a periodic basis. If you set it lower the dips should be more dramatic.
ID: 4677 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4678 - Posted: 11 Feb 2009, 1:35:48 UTC
Last modified: 11 Feb 2009, 1:36:48 UTC

Every task is failing at line 330 ... halt the madness ...


ERROR: [ERROR] Unable to open constraints file: .pdb.distances.csts.bounded_1.0
ERROR:: Exit from: ....srccorescoringconstraintsConstraintIO.cc line: 330
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish
ID: 4678 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4679 - Posted: 11 Feb 2009, 12:09:53 UTC - in response to Message 4678.  

Every task is failing at line 330 ... halt the madness ...


ERROR: [ERROR] Unable to open constraints file: .pdb.distances.csts.bounded_1.0
ERROR:: Exit from: ....srccorescoringconstraintsConstraintIO.cc line: 330
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish


Another workunit that failed at that line both for me and my wingman:

https://ralph.bakerlab.org/result.php?resultid=1302770

In 1.58 at least for me.
ID: 4679 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4680 - Posted: 11 Feb 2009, 12:25:39 UTC - in response to Message 4677.  
Last modified: 11 Feb 2009, 12:38:22 UTC

I've attempted to set them both locally and remotely, both at less than 100%. I'm not sure if I've found the correct way to set them locally, though, and I'll want to know the difference when I get the new computer I'm planning to order.


Advanced menu/preferences ... make the setting ... click "Ok" button ...

To go back to the web settings:

Advanced menu/preferences ... click "Clear" button ...

You should be able to tell if the setting has taken, assuming you set it to less than 100% by watching the CPU trend-line and there will be periodic dips in the level ... the way BOINC does "throttling" is to halt operation on a periodic basis. If you set it lower the dips should be more dramatic.


Thank you - finally, a method that works. Now set at 90%, so when I get another workunit, I can look for any effects that causes, or maybe faster if Rosetta@home workunits are affected also.

I've never used that function before, so I suspect that one of the other BOINC projects I participate in set it at 100% without providing any way to reverse that action.
ID: 4680 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4682 - Posted: 11 Feb 2009, 18:40:44 UTC - in response to Message 4680.  

I've attempted to set them both locally and remotely, both at less than 100%. I'm not sure if I've found the correct way to set them locally, though, and I'll want to know the difference when I get the new computer I'm planning to order.


Advanced menu/preferences ... make the setting ... click "Ok" button ...

To go back to the web settings:

Advanced menu/preferences ... click "Clear" button ...

You should be able to tell if the setting has taken, assuming you set it to less than 100% by watching the CPU trend-line and there will be periodic dips in the level ... the way BOINC does "throttling" is to halt operation on a periodic basis. If you set it lower the dips should be more dramatic.


Thank you - finally, a method that works. Now set at 90%, so when I get another workunit, I can look for any effects that causes, or maybe faster if Rosetta@home workunits are affected also.

I've never used that function before, so I suspect that one of the other BOINC projects I participate in set it at 100% without providing any way to reverse that action.


THe only project that "poisoned" the well that I know of was QCN where a debug action set the date to a very bad value. So, you had to do lots of stuff to try to get the settings right. The problem is that the setting with the latest date is the one that prevails ... and in the case of QCN the server got the date 2030 or something like that ... so, any computer with the settings from that set would keep "updating" all the old projects with the settings you no longer wanted.

They have directons on the site to fix the issue though you have to touch all projects affected for some reason ...

Anyway,

*MY* experience is that it is an intermittent problem, that is to say, that it will not affect all tasks run. It caused me problems on only about 40% of the tasks run ... I have no idea what is the other issue in the mix that gives rise to the problem. And, to a large extent I am too production oriented to spend much time and energy I don't have to poke and prod the systems looking for these problems.

I posted a note on the BOINC Dev list where it vanished without a comment or even a gurgle as it sank out of sight about the fact that the preferences pane does not really show that you are on local settings vice global settings ...
ID: 4682 · Report as offensive    Reply Quote
mtyka
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 19 Mar 08
Posts: 79
Credit: 0
RAC: 0
Message 4683 - Posted: 11 Feb 2009, 18:54:21 UTC

Hi all,

sorry i've been quiet on the boards recently. Been away last week and focusing on some other projects.

SOrry about the bunch of work units that went out with a faulty file reference - my bad. More today that should actually work.

The error rate has been excellent recently, there a couple of minor fixes coming soon. Then we'll start focusing on scientific improvements again. woo!


Mike
ID: 4683 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 100
Credit: 331,865
RAC: 0
Message 4687 - Posted: 12 Feb 2009, 16:39:34 UTC - in response to Message 4682.  

I've attempted to set them both locally and remotely, both at less than 100%. I'm not sure if I've found the correct way to set them locally, though, and I'll want to know the difference when I get the new computer I'm planning to order.


Advanced menu/preferences ... make the setting ... click "Ok" button ...

To go back to the web settings:

Advanced menu/preferences ... click "Clear" button ...

You should be able to tell if the setting has taken, assuming you set it to less than 100% by watching the CPU trend-line and there will be periodic dips in the level ... the way BOINC does "throttling" is to halt operation on a periodic basis. If you set it lower the dips should be more dramatic.


Thank you - finally, a method that works. Now set at 90%, so when I get another workunit, I can look for any effects that causes, or maybe faster if Rosetta@home workunits are affected also.

I've never used that function before, so I suspect that one of the other BOINC projects I participate in set it at 100% without providing any way to reverse that action.


THe only project that "poisoned" the well that I know of was QCN where a debug action set the date to a very bad value. So, you had to do lots of stuff to try to get the settings right. The problem is that the setting with the latest date is the one that prevails ... and in the case of QCN the server got the date 2030 or something like that ... so, any computer with the settings from that set would keep "updating" all the old projects with the settings you no longer wanted.

They have directons on the site to fix the issue though you have to touch all projects affected for some reason ...

Anyway,

*MY* experience is that it is an intermittent problem, that is to say, that it will not affect all tasks run. It caused me problems on only about 40% of the tasks run ... I have no idea what is the other issue in the mix that gives rise to the problem. And, to a large extent I am too production oriented to spend much time and energy I don't have to poke and prod the systems looking for these problems.

I posted a note on the BOINC Dev list where it vanished without a comment or even a gurgle as it sank out of sight about the fact that the preferences pane does not really show that you are on local settings vice global settings ...


I've never tried to run QCN. The only project I know of that even offers the chance of setting device-specific parameters is WCG, and their method of changing them to 90% CPU time didn't work.

I didn't try to change these settings on a few projects I try to participate in that look unlikely to offer any workunits soon. One of them, Cels@home, is no longer even reachable online.

I've never seen it when I was using 100% CPU time. However, I'm now using 90% CPU time, and it may be related to the problem I just posted over in the hard-to-find thread for 1.58 problems.
ID: 4687 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4690 - Posted: 12 Feb 2009, 20:42:02 UTC - in response to Message 4687.  

I've never tried to run QCN. The only project I know of that even offers the chance of setting device-specific parameters is WCG, and their method of changing them to 90% CPU time didn't work.

I didn't try to change these settings on a few projects I try to participate in that look unlikely to offer any workunits soon. One of them, Cels@home, is no longer even reachable online.

I've never seen it when I was using 100% CPU time. However, I'm now using 90% CPU time, and it may be related to the problem I just posted over in the hard-to-find thread for 1.58 problems.


I have sensors on the way so that I can try QCN as another non-cpu intense project. Living in California, though away from where most of the fault lines are ... still ... it is a topic of interest ...

Cels@Home has changed their URL so that there is now a Cels@Home (old) and a Cels@Home that is in Alpha state ...

Now to the other thread ...
ID: 4690 · Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 14 Jan 09
Posts: 62
Credit: 33,293
RAC: 0
Message 4691 - Posted: 12 Feb 2009, 20:42:47 UTC - in response to Message 4683.  

Hi all,

sorry i've been quiet on the boards recently. Been away last week and focusing on some other projects.

SOrry about the bunch of work units that went out with a faulty file reference - my bad. More today that should actually work.

The error rate has been excellent recently, there a couple of minor fixes coming soon. Then we'll start focusing on scientific improvements again. woo!


Mike,

You should change the link on the front page to point to the latest bug thread ...

ID: 4691 · Report as offensive    Reply Quote
Previous · 1 · 2 · 3

Message boards : RALPH@home bug list : minirosetta v1.55 bug thread



©2024 University of Washington
http://www.bakerlab.org