Bug Reports for 5.44

Message boards : RALPH@home bug list : Bug Reports for 5.44

To post messages, you must log in.

AuthorMessage
Chu
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 26 Sep 06
Posts: 61
Credit: 12,545
RAC: 0
Message 2687 - Posted: 21 Jan 2007, 5:08:31 UTC
Last modified: 21 Jan 2007, 17:30:48 UTC

This update has some new rosetta applications added in, such as a preliminary version of rosetta protein design protocol and a special rosetta docking protocol which handles symmetric oligomers. The primary developers of those protocols will post more details about their applications. Please note that we are still working on adding thread synchronization features to the rosetta graphics and we are sorry that this update DOES NOT have the graphic-related problem fixed.
ID: 2687 · Report as offensive    Reply Quote
Profile Inais
Avatar

Send message
Joined: 30 Jul 06
Posts: 12
Credit: 13,115
RAC: 0
Message 2689 - Posted: 21 Jan 2007, 10:17:53 UTC

The progress sometimes socked - for example at 23.142% or on an other WU at 1,041%. But some crunched WU's completed correctly (hope so)
I wish I can fly like a bird in the sky
ID: 2689 · Report as offensive    Reply Quote
genes
Avatar

Send message
Joined: 16 Feb 06
Posts: 45
Credit: 43,300
RAC: 0
Message 2690 - Posted: 21 Jan 2007, 12:49:20 UTC

I saw this WU (which is a 5.44) running this morning, and thought "Ooh, maybe they fixed the graphics", so I clicked on "show graphics". Well, the graphics ran for a few seconds, then locked up. Not only that, but the whole machine locked up. After about 20 seconds or so, a feature of this new ATI driver I'm using (7.1) kicked in: it's called VPU recover. The display driver basically reset itself, and everything came back. The WU in question is still running, it didn't error out. I guess I can safely assume, though, that the graphics bug is not fixed yet.

ID: 2690 · Report as offensive    Reply Quote
Profile slavko.sk
Avatar

Send message
Joined: 16 Feb 06
Posts: 4
Credit: 6,755
RAC: 0
Message 2691 - Posted: 21 Jan 2007, 12:58:41 UTC

With last WU on Linux box I'm getting this message right after WU starts:

ralph@home 21.1.07 11:09:15 Task 1c9oA_BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1640_3_0 exited with zero status but no 'finished' file
ralph@home 21.1.07 14:50:47 If this happens repeatedly you may need to reset the project.

Resseting project doesn't help.
Any idea?
ALL GLORY TO THE HYPNOTOAD!
My Stats
ID: 2691 · Report as offensive    Reply Quote
Conrad Poohs
Avatar

Send message
Joined: 29 Aug 06
Posts: 9
Credit: 1,955
RAC: 0
Message 2692 - Posted: 21 Jan 2007, 15:15:20 UTC

I hoped that the graphics were fixed in 5.44 but when I left the screensaver on and came back to it I couldnt exit screensaver mode without cancelling This WU.

ID: 2692 · Report as offensive    Reply Quote
Chu
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 26 Sep 06
Posts: 61
Credit: 12,545
RAC: 0
Message 2693 - Posted: 21 Jan 2007, 15:53:14 UTC - in response to Message 2691.  

Did you get errors like this for all the recent WUs so far?
With last WU on Linux box I'm getting this message right after WU starts:

ralph@home 21.1.07 11:09:15 Task 1c9oA_BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1640_3_0 exited with zero status but no 'finished' file
ralph@home 21.1.07 14:50:47 If this happens repeatedly you may need to reset the project.

Resseting project doesn't help.
Any idea?

ID: 2693 · Report as offensive    Reply Quote
Profile slavko.sk
Avatar

Send message
Joined: 16 Feb 06
Posts: 4
Credit: 6,755
RAC: 0
Message 2694 - Posted: 21 Jan 2007, 16:37:07 UTC - in response to Message 2693.  

Yes, for below mentioned one and also for:

ralph@home 21.1.07 18:30:19 Task 1urnA_BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1641_3_0 exited with zero status but no 'finished' file

ralph@home 21.1.07 18:29:55 Task 1tif__BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1640_7_0 exited with zero status but no 'finished' file

You may check my results:
https://ralph.bakerlab.org/results.php?userid=197

Did you get errors like this for all the recent WUs so far?
With last WU on Linux box I'm getting this message right after WU starts:

ralph@home 21.1.07 11:09:15 Task 1c9oA_BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1640_3_0 exited with zero status but no 'finished' file
ralph@home 21.1.07 14:50:47 If this happens repeatedly you may need to reset the project.

Resseting project doesn't help.
Any idea?



ALL GLORY TO THE HYPNOTOAD!
My Stats
ID: 2694 · Report as offensive    Reply Quote
Profile slavko.sk
Avatar

Send message
Joined: 16 Feb 06
Posts: 4
Credit: 6,755
RAC: 0
Message 2695 - Posted: 21 Jan 2007, 17:10:47 UTC

It is special Linux 2.4.27-grsec build ... maybe there is a problem. I seem the same error on ABC@home project, Malariacontrol.net doesn't work at all cause no graphic libraries are on the system. Rosetta so far work and RALF was working also except of the last WU's.
ID: 2695 · Report as offensive    Reply Quote
Hans Sveen

Send message
Joined: 17 Feb 06
Posts: 11
Credit: 368,311
RAC: 0
Message 2696 - Posted: 22 Jan 2007, 8:22:42 UTC

Hello!
Work unit 351148 just ended without any problems (screensaver not running!) on my host id 474......

BUT....
The next one wuid 352037 has a coupple of issues so far; first its cpu usage was 50 % of my 3 cores currently running; still after suspending it still uses 25 %! The cpu clock still runs after I suspended it hmmmm... I am not sure what is common memory usage, but this wu uses 221968 K when not running !
Restarted the wu then the memory usage jumped to nearly 225 000 K then dropped to about 219 000 K , this time with "show graphic" running I can see it works ok. Memory usage jumps between the two before mentioned "digits".
It has currenly completed 1.5 hour of max 6 hours!

Hope this can be useful in the debugging!

With regards,


Hans Sveen
Oslo, Norway

ID: 2696 · Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 16 Feb 06
Posts: 166
Credit: 131,419
RAC: 0
Message 2697 - Posted: 22 Jan 2007, 8:39:33 UTC

The memory usage is alot bigger on my new tasks than before.

About 207000-215000kB.

It´s not a problem on my computers but might be for others.

Anders n
ID: 2697 · Report as offensive    Reply Quote
Hans Sveen

Send message
Joined: 17 Feb 06
Posts: 11
Credit: 368,311
RAC: 0
Message 2698 - Posted: 22 Jan 2007, 8:44:27 UTC - in response to Message 2696.  

Hello!
Work unit 351148 just ended without any problems (screensaver not running!) on my host id 474......

BUT....
The next one wuid 352037 has a coupple of issues so far; first its cpu usage was 50 % of my 3 cores currently running; still after suspending it still uses 25 %! The cpu clock still runs after I suspended it hmmmm... I am not sure what is common memory usage, but this wu uses 221968 K when not running !
Restarted the wu then the memory usage jumped to nearly 225 000 K then dropped to about 219 000 K , this time with "show graphic" running I can see it works ok. Memory usage jumps between the two before mentioned "digits".
It has currenly completed 1.5 hour of max 6 hours!

Hope this can be useful in the debugging!

With regards,



Edit:

A small follow up, I guess vthe cpu usage has to do with the Not fixed graphic problem; when closing down boinc and after the restart ralph went back to 25 %cpu usage, so it seems open the graphic starts as a whole new work unit!!

See You soon!



Hans Sveen
Oslo, Norway

ID: 2698 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 2699 - Posted: 22 Jan 2007, 16:00:54 UTC

This thread needs a sticky... and perhaps UNsticky some of the older releases.

Wondering if there is anything in the WU names, or elsewhere that one can use to confirm a given WU was specifically designed for the "high memory" systems? I mean these docking WUs seem to take 200MB rather then the 110 that has been passing for normal. As a tester, just wanting to assure these WUs were properly flagged to only go to systems with 512MB. And how does that work if the system is a hyperthreaded CPU? Would it require the machine to have 1GB? (512 per active core?)
ID: 2699 · Report as offensive    Reply Quote
Pieface

Send message
Joined: 16 Feb 06
Posts: 64
Credit: 203,513
RAC: 0
Message 2700 - Posted: 23 Jan 2007, 18:01:00 UTC

I don't know if this is exactly a 'problem', but i have been noticing that
my pentium-m Win-xp machine hostid 2606 occasionally is being granted twice what it requests credit wise, for example, running four hours it requests forty someting credits and is granted 100 or so. A couple of the wu's i noticed were:
wu 346971
wu 348266
wu 347050
Might cause a 'stir' over on the production side.
ID: 2700 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 2702 - Posted: 24 Jan 2007, 2:39:24 UTC - in response to Message 2700.  

I don't know if this is exactly a 'problem', but i have been noticing that
my pentium-m Win-xp machine hostid 2606 occasionally is being granted twice what it requests credit wise, for example, running four hours it requests forty someting credits and is granted 100 or so. A couple of the wu's i noticed were:
wu 346971
wu 348266
wu 347050
Might cause a 'stir' over on the production side.


I think you will see that on the 'production' side the cobblestones awarded are lower than in Ralph, a lot depends on what science has been done in the WU. From my point of view it is not a problem.
ID: 2702 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 2703 - Posted: 24 Jan 2007, 2:44:59 UTC

On a different note, I had a WU that after 4 hours 27 seconds (4.00.27) according to Boinc my WU was 100% complete. Unfortunately it was still 'running' according to Boinc with the time not progressing. It was stuck so I had to abort it. I do not run graphics on this Linux machine. My time preference is 6 hours.

https://ralph.bakerlab.org/result.php?resultid=402096
ID: 2703 · Report as offensive    Reply Quote
Profile Silver Streak

Send message
Joined: 11 Dec 06
Posts: 5
Credit: 216,369
RAC: 0
Message 2707 - Posted: 25 Jan 2007, 15:16:35 UTC

Just getting around to reporting this:

Yesterday while updating some software on one of my computers, I had to restart to finish the install. Upon restart, both of the WU's in progress had erred out. Sorry I can't give more information, but I thought I should report this.
ID: 2707 · Report as offensive    Reply Quote
Profile slavko.sk
Avatar

Send message
Joined: 16 Feb 06
Posts: 4
Credit: 6,755
RAC: 0
Message 2711 - Posted: 27 Jan 2007, 9:00:18 UTC

I'm still getting these messages:

2007-01-27 10:37:00 [ralph@home] Restarting task 1bgf__BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1660_17_0 using rosetta_beta version 544
2007-01-27 10:37:01 [ralph@home] Task 1bgf__BOINC_ABRELAX_NEWRELAXFLAGS_frags83__1660_17_0 exited with zero status but no 'finished' file
2007-01-27 10:37:01 [ralph@home] If this happens repeatedly you may need to reset the project.

Nothing in stderrdae.txt, only above messages in stdoutdae.txt.

ALL GLORY TO THE HYPNOTOAD!
My Stats
ID: 2711 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 2712 - Posted: 27 Jan 2007, 12:02:27 UTC

> Just had work unit fail

https://ralph.bakerlab.org/result.php?resultid=403445

stderr out

<core_client_version>5.2.14</core_client_version>
<message>process exited with code 1 (0x1)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
fullatom_setup.cc: CHANGING fa_max_dis to 6!!!!!!!!!!!!!
ERROR:: Exit at: read_paths.cc line:346

</stderr_txt>

ID: 2712 · Report as offensive    Reply Quote
Pieface

Send message
Joined: 16 Feb 06
Posts: 64
Credit: 203,513
RAC: 0
Message 2713 - Posted: 27 Jan 2007, 13:51:19 UTC

I'm seeing quite a few of those failures as well, but a few - maybe 1/3 - are getting thru OK.
Bad ones are:
[/url=https://ralph.bakerlab.org/result.php?resultid=403020] resultid 403020 [/url]
[/url=https://ralph.bakerlab.org/result.php?resultid=403021] resultid 403021 [/url]
[/url=https://ralph.bakerlab.org/result.php?resultid=403093] resultid 403093 [/url]
[/url=https://ralph.bakerlab.org/result.php?resultid=403094] resultid 403094 [/url]
[/url=https://ralph.bakerlab.org/result.php?resultid=403262] resultid 403262 [/url]
[/url=https://ralph.bakerlab.org/result.php?resultid=405824] resultid 405824 [/url]

ID: 2713 · Report as offensive    Reply Quote
Profile Trog Dog
Avatar

Send message
Joined: 8 Aug 06
Posts: 38
Credit: 41,996
RAC: 0
Message 2716 - Posted: 27 Jan 2007, 22:48:13 UTC

Same with this wu.
ID: 2716 · Report as offensive    Reply Quote

Message boards : RALPH@home bug list : Bug Reports for 5.44



©2024 University of Washington
http://www.bakerlab.org