Rosetta beta 4.84 wu seems to hang on Linux

Message boards : Current tests : Rosetta beta 4.84 wu seems to hang on Linux

To post messages, you must log in.

AuthorMessage
tgm

Send message
Joined: 19 Feb 06
Posts: 5
Credit: 1,066
RAC: 0
Message 559 - Posted: 24 Feb 2006, 6:21:22 UTC

I'm seeing some bizare behavior from a wu that is running on a P3 600 Linux (Fedora 4) w/ 768 MB ram.

The configuration is set to remove the wu from memory when pre-empted.

The work unit gets to a point where it is not doing anything and just sort of sits there and does NOT consume any CPU load. I don't know when it is happening (within the 60 minute cycle), but it is definitely occurring while ralph is running. It's not crashed and boinc isn't crashed either. If I restart the boinc service it restarts where it left off and runs normally.

I've been running a Boinc beta (5.2.15) and I just downgraded it back to 5.2.5 to make sure that boinc itself is not the culprit. Hopefully the client does not update anytime soon where I had to shift my configuration on the server to keep the wu in memory in order that Windows wu's can process (another thread).

The wu in question is: https://ralph.bakerlab.org/workunit.php?wuid=5558
ID: 559 · Report as offensive    Reply Quote
Dimitris Hatzopoulos

Send message
Joined: 16 Feb 06
Posts: 31
Credit: 2,308
RAC: 0
Message 594 - Posted: 25 Feb 2006, 0:34:05 UTC

Hi, I've had "hangs" as you described with R@H (not with RALPH sofar, but I've only run very few RALPH WUs).

Let's try to find possible common things between our setups:

1/ BOINC version
2/ RAM
3/ Linux kernel
4/ R settings (leave in mem=Y/N)

In my case

BOINC 5.2.14 (optimized) by Crunch3r
256MB RAM (only)
Linux kernel 2.4.27
R set to remain in mem while pre-empted

I was suspecting a memory issue, because it was a commonality with CarlosP's setup, but in your case you've too much RAM (786MB) so perhaps we can rule it out.

I wonder if it could be some race condition when the BOINC app starts Rosetta... But, I've ran 8 different BOINC projects on that Linux, and only Rosetta had this problem.

ID: 594 · Report as offensive    Reply Quote
LanDroid

Send message
Joined: 19 Feb 06
Posts: 1
Credit: 12,980
RAC: 0
Message 760 - Posted: 1 Mar 2006, 2:20:46 UTC - in response to Message 559.  
Last modified: 1 Mar 2006, 2:23:24 UTC

I have the same problem. The work unit has been stuck at 2:35:34 and 32.36% for a long time...

Boinc version 5.2.13
Ram 512K
Linux kernel 2.6.12 (Ubuntu distro)
Leave in memory = N
Pentium 4 2.8 Ghz

I tried to "unstick" the work unit by suspend/resume, but as soon as I hit resume, it switches to another project. What to do?
ID: 760 · Report as offensive    Reply Quote

Message boards : Current tests : Rosetta beta 4.84 wu seems to hang on Linux



©2024 University of Washington
http://www.bakerlab.org