Help us debug minirosetta.

Message boards : Current tests : Help us debug minirosetta.

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3695 - Posted: 8 Feb 2008, 19:51:45 UTC
Last modified: 10 Feb 2008, 2:45:01 UTC

Minirosetta is currently running well on R@h with a 92% success rate but we would like to make it even better.

If you have experienced errors while running minirosetta and would like to help, we ask that you download our application symbols file on windows platforms so that we can get debugging information. Ideally, the windows symstore utilities that Rom Walton added to boinc would take care of this and automatically retrieve the symbols from our site but it doesn't seem to be working as it did before so we are asking for your help.

The file can be retrieved at:

https://ralph.bakerlab.org/download/minirosetta_1.07_windows_intelx86.pdb

Please place it in the ralph@home project directory of your BOINC installation usually located in 'c:/Program Files/BOINC/projects/ralph.bakerlab.org/'

Thanks in advance!

David K


edit: You can also place the symbols (pdb) file in the Rosetta@home project usually located in
'c:/Program Files/BOINC/projects/boinc,balerkab.org_rosetta/'


edit: The symbols file is 25MB.

ID: 3695 · Report as offensive    Reply Quote
j2satx

Send message
Joined: 17 Feb 06
Posts: 42
Credit: 168,797
RAC: 0
Message 3696 - Posted: 9 Feb 2008, 0:13:24 UTC - in response to Message 3695.  

Minirosetta is currently running well on R@h with a 92% success rate but we would like to make it even better.

If you have experienced errors while running minirosetta and would like to help, we ask that you download our application symbols file on windows platforms so that we can get debugging information. Ideally, the windows symstore utilities that Rom Walton added to boinc would take care of this and automatically retrieve the symbols from our site but it doesn't seem to be working as it did before so we are asking for your help.

The file can be retrieved at:

https://ralph.bakerlab.org/download/minirosetta_1.07_windows_intelx86.pdb

Please place it in the ralph@home project directory of your BOINC installation usually located in 'c:/Program Files/BOINC/projects/ralph.bakerlab.org/'

Thanks in advance!

David K


edit: You can also place the symbols (pdb) file in the Rosetta@home project usually located in
'c:/Program Files/BOINC/projects/boinc,balerkab.org_rosetta/'


We can't help much when you don't give us WUs.

I've only had 19 WUs since 5 Feb on 16 cores.


ID: 3696 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3697 - Posted: 9 Feb 2008, 4:05:16 UTC

Please let us know if any of you are experiencing errors on either ralph or R@h and have downloaded the symbols file and placed it in the appropriate directories. I'll add more tests on ralph soon.
ID: 3697 · Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 3699 - Posted: 9 Feb 2008, 10:13:15 UTC

Can you put a larger number on as they are all taken up before before daylight reaches this side of the Atlantic - ie +0 hours Go Make Tea Time.
ID: 3699 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3700 - Posted: 9 Feb 2008, 13:23:01 UTC

Does the Linux application not need debugging information as well? Or is it working ok?
I have recently done a couple that went ok on Linux, though much shorter in run time than my preferences.
I have also reported already a problem with a 'mini' WU that did not validate on my Windows machine, KSMarksPhych has also had the same problem, see following two threads

id 357
id 361
ID: 3700 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3701 - Posted: 9 Feb 2008, 13:26:56 UTC

You could of told us before we downloaded that the file is over 25 MB in size.
What are the uploads likely to be?
ID: 3701 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3702 - Posted: 10 Feb 2008, 2:43:23 UTC

Oh, shoot! I really meant to but just forgot. Sorry about that. Yeah, the symbol file is pretty big but you just have to download it once for each application version update. sorry again.

The validation bug was fixed in 1.07. There might be another validation bug but it is very very rare. I'm looking into it.
ID: 3702 · Report as offensive    Reply Quote
Profile Angus

Send message
Joined: 17 Feb 06
Posts: 10
Credit: 1,007
RAC: 0
Message 3705 - Posted: 10 Feb 2008, 20:44:19 UTC

No work. Can't debug nothingingness.
ID: 3705 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 3706 - Posted: 11 Feb 2008, 15:43:43 UTC

...and don't forget to enable the mini .exe application in your firewall so that it can send in any dump of the symbol table enabled by downloading the PDB file.
ID: 3706 · Report as offensive    Reply Quote
Profile Angus

Send message
Joined: 17 Feb 06
Posts: 10
Credit: 1,007
RAC: 0
Message 3708 - Posted: 12 Feb 2008, 2:56:17 UTC

First problem (or two)

Ralph (or the mini app) does not appear to honor the runtime preference set in Ralph Preferences

It also does not even recognize the preferences were changed. I had to change the General Preferences for a Boinc manager Update to recognize a change had been made and download new preferences. I still don't know if it has the correct run-time preference set. I configured it for 1 hour, and the "To Completion" in the Boinc Manager shows 5:24:14 for all tasks.


ID: 3708 · Report as offensive    Reply Quote
Profile Angus

Send message
Joined: 17 Feb 06
Posts: 10
Credit: 1,007
RAC: 0
Message 3709 - Posted: 12 Feb 2008, 3:01:08 UTC

Out of work again?? Says there are 6000+ WU queued on the front page, and 10000+ on the server status page. Getting "no work available" messages.

<grump> Can't help if there's no work. </grump>
ID: 3709 · Report as offensive    Reply Quote
j2satx

Send message
Joined: 17 Feb 06
Posts: 42
Credit: 168,797
RAC: 0
Message 3710 - Posted: 12 Feb 2008, 4:07:50 UTC - in response to Message 3709.  

Out of work again?? Says there are 6000+ WU queued on the front page, and 10000+ on the server status page. Getting "no work available" messages.

<grump> Can't help if there's no work. </grump>


Are you still not getting WUs?

All the computers I have attached to Ralph are getting work and have warm and happy CPUs.....Intel, AMD, Windows and Linux, 32-bit and 64-bit.
ID: 3710 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 3711 - Posted: 12 Feb 2008, 9:45:02 UTC - in response to Message 3708.  

Ralph (or the mini app) does not appear to honor the runtime preference set in Ralph Preferences. [... After] download new preferences. I still don't know if it has the correct run-time preference set. I configured it for 1 hour, and the "To Completion" in the Boinc Manager shows 5:24:14 for all tasks.

Is it just the "To Completion" in Boinc Manager or also the real runtime, which does not honour your prefs? What's your Duration Correction Factor? It seems that the run time estimate is "overestimated" and will be correct just when the DCF settles down.

For instance, my Pentium III has target run time 4 hours, DCF=0.15 and real runtime ~2:46 (average of 8 results, 2:18-3:31, 1-3 decoys). Other machine, C2D T7200 has target runtime 2 hours, DCF=0.29 and real runtime ~1:35 (average of 11 results 1:06-2:16, 1-6 decoys).

Peter
ID: 3711 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 3712 - Posted: 12 Feb 2008, 12:40:39 UTC

If your preferences did not get updated, then perhaps your runtime preference didn't update until later as well.

Your "to completion" really doesn't tell anyone if your runtime preference took effect or not. The behavior would be that the to completion time would be modified every 5 seconds, and that if your true preference were only an hour, then you'd see like 25 seconds knocked off the completion time with each update.

Once your preference is know, BOINC still uses the time it took your last WU to complete and adjusts the runtime it was sent from Ralph. That's why you have to look at the rate at which it is declining.
ID: 3712 · Report as offensive    Reply Quote
Profile KC0ISW

Send message
Joined: 17 Feb 06
Posts: 20
Credit: 11,725
RAC: 0
Message 3714 - Posted: 12 Feb 2008, 19:14:14 UTC

not sure if this is a problem but i just finished my first mini workunit had been runn slowly for 18and half hours and at 76% showing 8 hours to go about 5mins later to it finished and uploaded with success here the result https://ralph.bakerlab.org/result.php?resultid=747540


ID: 3714 · Report as offensive    Reply Quote
Azurrio

Send message
Joined: 27 Jun 07
Posts: 12
Credit: 8,020
RAC: 0
Message 3715 - Posted: 12 Feb 2008, 19:32:14 UTC - in response to Message 3714.  
Last modified: 12 Feb 2008, 19:32:33 UTC

This WU failed on my system.
ID: 3715 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3716 - Posted: 12 Feb 2008, 19:35:23 UTC
Last modified: 12 Feb 2008, 19:43:41 UTC

current minirosetta 1.07 status



We are still not getting any trace information for debugging so if you are getting errors please download and install our symbols file. See the first post of this thread.

If you use the NOD32 virus scanner w/ IMON (web access protection enabled), you can configure the web access protection to exclude the boinc.exe application from content filtering or add the download url(s) (ralph.bakerlab.org/download* for ralph and *.bakerlab.org/rosetta/download* for r@h) in the list of addresses to exclude. I haven't heard back from anyone at ESET about this issue.

edit: be sure to submit the virus detection for review back to ESET so that they could hopefully include this as a false positive in their databases.


ID: 3716 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3717 - Posted: 12 Feb 2008, 19:40:26 UTC - in response to Message 3715.  

This WU failed on my system.


we looked into that error and the only explanation is that the application can't see or open one of the input files. It is specific to your situation and is not a general error. without being able work on your computer, I can't really figure out what is happening. did a file get removed accidentally?
ID: 3717 · Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 3719 - Posted: 12 Feb 2008, 20:05:53 UTC

I aborted this one on mini 1.07:
It was stuck for over an hour at 92.912%

score13_hb_envtest62_A_1cg5B_3225_1018_0

work unit 665842
ID: 3719 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3720 - Posted: 12 Feb 2008, 20:51:17 UTC - in response to Message 3719.  

I aborted this one on mini 1.07:
It was stuck for over an hour at 92.912%

score13_hb_envtest62_A_1cg5B_3225_1018_0

work unit 665842



If you suspect another stuck work unit can you let it keep going for a while? at least a few times greater than your cpu run time preference.
ID: 3720 · Report as offensive    Reply Quote
1 · 2 · 3 · Next

Message boards : Current tests : Help us debug minirosetta.



©2024 University of Washington
http://www.bakerlab.org