Posts by Nuadormrac

1) Message boards : Number crunching : No new tasks (Message 4968)
Posted 17 Sep 2009 by Nuadormrac
Post:
Basically Rosetta is the actual science project. This is more a software testing project, to design apps for future use at Rosetta. They'll play fine together, they're different projects, running different versions of the same app (this project has the app in alpha or beta state).

As such, work is only available when software testing is needed here. This said, also be prepared for failed WUs, and just provide the necessary bug reporting. After this testing is done here, to the devs satisfaction, the app is moved off ralph and onto rosetta, for use in actual science crunching purposes.
2) Message boards : Number crunching : workunits encountering computation errors (Message 4962)
Posted 17 Sep 2009 by Nuadormrac
Post:
3 have now run into a computation error.

http://ralph.bakerlab.org/result.php?resultid=1605063
http://ralph.bakerlab.org/result.php?resultid=1605019

which also gave the same error to another

http://ralph.bakerlab.org/workunit.php?wuid=1418841

tbh, exceding maximum disk useage is rather not believable, especially as CPDN has no such problem... But besides

http://ralph.bakerlab.org/show_host_detail.php?hostid=19614

Free Disk Space 34.06 GB


If a single WU is requiring more then 34 GB (though the resource share for BOINC is 20 GB of which 19.65 GB is available to BOINC at present), then there might be something going on.

The last WU gives a much more detailed error report

http://ralph.bakerlab.org/result.php?resultid=1604908
3) Message boards : Feedback : Probably need to run RALPH again, and get you guys some debugging info (Message 1824)
Posted 12 Jun 2006 by Nuadormrac
Post:
Well, 2 completed fine in RALPH, I'll try bumping up the time setting and see what happens. It might be specific to 5.22 on Vista. Not sure...
4) Message boards : Feedback : Probably need to run RALPH again, and get you guys some debugging info (Message 1822)
Posted 12 Jun 2006 by Nuadormrac
Post:
It was mentioned that the debug code was optional or something? I s'pose for a moment I need to get some feedback returned. Since installing Windows Vista beta 2, I haven't gotten a successful WU crunch on Rossetta (though Einstein does seem to be going OK).

Though Vista (well it shows up as Longhorn in the BOINC computer stats) probably isn't the first thing on your mind, some info on how it's doing with the up-comming version of Windows might be of use
5) Message boards : RALPH@home bug list : Bug reports for Ralph 5.16 (Message 1700)
Posted 26 May 2006 by Nuadormrac
Post:
back to topic :)
just got this WU
got no stuff in the searching and accepted boxes just like wizzszz in his screenshot, i was able to get the structure in the low energy screen though through moving it randomly around, it was somewhere offscreen, looking ok at first, but started to get randomly broken after a while. now at 1.561% (or a little earlier maybe) everything looks like its normal, all pics where they should be, and the structure in the low energy window is in the center now too when i move it around.


I managed to nab a few WUs, though I'm currently running with a lower 1 hour setting on account of having a fair amount of Rosseta WUs to turn in, though if need be, I can play with the time setting there.

Anyhow, I'm seeing this exact same problem on my computer. It's a t283_lowHB_LOOPRELAX_hand_aligned_hom... WU type

Edit: Model number 2 didn't show anything in those 2 windows as mentioned above. Model 3 is showing a little something in the upper right hand corner of the windows for those 2, which has now turned to look like the low energy window... Model 6 is also messed up like model 2. Whatever this is, it might be specific to some models within the same WU, though not others...
6) Message boards : RALPH@home bug list : Bug reports for Ralph 5.13 (Message 1586)
Posted 11 May 2006 by Nuadormrac
Post:
There's also when the apps are released to other scientific projects... If the app is worked out by then, they might not want to take the time to run debugging (which is more important during the apps development process)... But who knows, and that will be between the respective scientists to discuss/decide among themselves...

If it's optimal enough, that it isn't like it's seizing the computer on one however, it won't matter so much for the user. It's when it becomes difficult to do anything else that issues start to arrise.

Anyhow, I went on, with both downloading/uploading some torrents, and catching up on some episodes of Lost I hadn't seen previously. I let RALPH run while doing all that, and all still seemed well...
7) Message boards : RALPH@home bug list : Bug reports for Ralph 5.13 (Message 1570)
Posted 10 May 2006 by Nuadormrac
Post:
So far, it's looking good. Actually, I had to do some fanagling to force WU downloads, as my resource share is 25, and had a lot of debt to other projects, but managed to grab some...

Anyhow, I did some initial testing with what was running into severe performance probs, and not seeing it right now. I've got some more things to do, so I'll let some other projects run while I'm away (which should help balance the debt anyhow); and resume testing when I'm back to push the computer some so as to get more testing with it under load.
8) Message boards : RALPH@home bug list : Bug reports for Ralph 5.11 and 5.12 (Message 1564)
Posted 10 May 2006 by Nuadormrac
Post:
Well, I do think a 1 liner is cutting things a little too tight for WU description, however there does come a point where large gfx can mount up...

BTW, just having the window not maximized, and then doing an (alt + print screen), will just copy the current app window to the clipboard, rather then the whole desktop. This would cut the start menu, and other such Windows items outa the screen capture...
9) Message boards : RALPH@home bug list : Bug reports for Ralph 5.11 and 5.12 (Message 1557)
Posted 9 May 2006 by Nuadormrac
Post:
I'm definitely not running screensaver, though am running as a single user install...
10) Message boards : RALPH@home bug list : Bug reports for Ralph 5.11 and 5.12 (Message 1555)
Posted 9 May 2006 by Nuadormrac
Post:
OK, I did some further checks. I went to uninstall the media player, at which point there was a long delay for even the control panel to open up while RALPH was running. It then wanted a reboot, so I let it. Making sure the prog directory is deleted so it would get a clean fresh install of the newer version (not that the older had given me this problem in like 6 months of useage), I then installed, it rebooted, and BOINC scheduled an Einstein unit to run, using (well I'm using) akosf's u41.04 science app.

It loaded up normally, no wait times, performance delays, or anything. So, I suspended Einstein, and forced it over to a RALPH WU after shutting down. I then tried to open the same exact media file again, and the performance delay/wait time was reintroduced.

As these were the only 2 projects loaded thus far, there was less in memory this time then before. (Before I had QMC, CPDN, seasonal attribution, as well as crunch3r's SETI app loaded.) With all of that loaded, that only represented 711 MB of memory allocation according to Task Manager, and this computer has 1 GB of RAM. Right now, with this memory allocation is only 402 MB.

In the past (though it could be worth a check again), without RALPH, but having both CPDN and seasonal attribution loaded (which are real taxing on the memory useage), I hadn't seen this. The RALPH app 5.11 and up does appear to be the difference between the performance delay on doing other stuff invoked vs. not invoked. Not sure if anyone else has noticed this or not...

Further note: Resuming Einstein didn't fix this performance drag, though shutting down BOINC and restarting it, so Ralph 5.12 wasn't loaded in RAM did fix it however...

Comp:

- Athlon 64 3500+ Venice core (socket 939)
- 1 GB Corsair 400 Pro DDR (2x 512 MB DIMMs)
- MSI Neo 2 (nForce 3 chipset)
- LVD SCSI HDs (Seagate Cheetahs), connected through an Adaptec SCSI card 29160
- Windows XP Professional Service Pack 2

Not sure if the other components would matter for a performance standpoint, though temp wise, things are kept nice and cool due to a fair number of fans and a Swifttech HSF, and Arctic Silver 5 thermal paste...
11) Message boards : RALPH@home bug list : Bug reports for Ralph 5.11 and 5.12 (Message 1554)
Posted 9 May 2006 by Nuadormrac
Post:
Not sure if this is totally related, though hmm... With 5.11 and now 5.12, when the rossetabeta process is running, I'm noticing more sluggish performance when starting my media player to listen to an MP3 file, watch a torrent, whatever... It's gone from the media file being ready right away, to a wait for "loading audio system..." Never seemed to run into this before version 5.11, or with other projects only being loaded in memory.

I updated the media player last night, and all seemed fine for awhile. Today, it's got a 10+ sec delay, whereas last night it didn't when I was crunching other stuff/no RALPH units for the app to be in memory. As the priority should be just as low with BOINC, I'm surprised/mystified about this noticing as well, or what to believe of it Humph... I'll need to do further testing myself to even begin to try to make heads/tails of this.
12) Message boards : RALPH@home bug list : Bug reports for Ralph 5.09 and 5.10 (Message 1526)
Posted 7 May 2006 by Nuadormrac
Post:
Got 3 more WUs with the same exact error.

stderr out

<core_client_version>5.4.1</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .barcode_classes.cc line:500

</stderr_txt>


And on these 3 computers, both computers the WUs were given to errored out...

http://ralph.bakerlab.org/result.php?resultid=105778
http://ralph.bakerlab.org/result.php?resultid=105764
http://ralph.bakerlab.org/result.php?resultid=105754
13) Message boards : RALPH@home bug list : Bug reports for Ralph 5.09 and 5.10 (Message 1523)
Posted 7 May 2006 by Nuadormrac
Post:
My first 2 units on 5.10 crashed, though the third is crunching now... Here are the particulars

http://ralph.bakerlab.org/result.php?resultid=105784

stderr out

<core_client_version>5.4.1</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .barcode_classes.cc line:500

</stderr_txt>


Crashed at 10 seconds of crunch time. Looking further, another computer crashed with this same WU

http://ralph.bakerlab.org/workunit.php?wuid=92286

The second WU, had the same exact symptoms and error message, though thus far it hasn't yet been assigned to a second computer:

http://ralph.bakerlab.org/result.php?resultid=105748
14) Message boards : Current tests : Weird.. I kept the 60 minutes switch between Applications (Message 1395)
Posted 27 Apr 2006 by Nuadormrac
Post:
If you really want to get to testing the WU, you can force the issue by suspending Rosseta, letting a test unit get through, and then resume it...

If you allow RALPH to try to get work at times, we're running low, then after a time RALPH will develop some debt which is owed it, and this will become less of an issue. This is what happens on LHC for instance, and can happen over here in time as well...
15) Message boards : Number crunching : Rosetta@Home is down (Message 1370)
Posted 26 Apr 2006 by Nuadormrac
Post:
Yeah, going to connect to the servers, I got:

4/26/2006 3:00:51 AM|ralph@home|Scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi succeeded
4/26/2006 3:00:51 AM|ralph@home|Message from server: Server has software problem
4/26/2006 3:00:51 AM|ralph@home|Project is down
16) Message boards : RALPH@home bug list : Old - Bug reports for Windows Ver - 5.00 (and higher) (Message 1239)
Posted 19 Apr 2006 by Nuadormrac
Post:
ps: I have not yet tested it on Linux, cause lack of enough WUs,
but at least the windows 5.00 I consider GOOD !


Not sure about 5.0 on Linux, but 4.98 did seem to go without incident on Linux, the problems we were seeing, being specific to the Windows app...

5.0 has been running without error for me as well...
17) Message boards : RALPH@home bug list : Umm, 5.0 or 5.01? (Message 1236)
Posted 19 Apr 2006 by Nuadormrac
Post:
I ask because a thread was created to report bugs with 5.01 WUs... However, connecting to the RALPH servers, 5.0 WUs were just downloaded to my comp, not 5.01?

Is anyone getting 5.01 WUs, or is there a mix of both versions on the servers, or hmm?
18) Message boards : Number crunching : Sorry to say, but my remaining RALPH units are gone (Message 1200)
Posted 17 Apr 2006 by Nuadormrac
Post:
and not due to an application error (well a RALPH application error).

Lets just say the CPDN model went, due to a problem they were having on that project. Basically, one of there files has incorrect data which was feeding into the model, so they sent a "kill trickle" out to all the comps running the older model. Upon loading, I got the thing faulting, so went to recover. Heck, didn't even know they could do this, let along it happened. Was completely perplexed on this happening during a model load, and attempts at a project folder restore failing... :eek:

Anyhow, trying to get the thing to recover, sorta ended up messing up my BOINC CC, and all projects ended up resetting :o

Anyhow, all the 5.0 units I had gotten and crunched through, completed successfully, so hadn't been having probs with the newer version of the app...
19) Message boards : Number crunching : Max time (Message 1199)
Posted 17 Apr 2006 by Nuadormrac
Post:
Not sure anyone has run into it, but with a 4 day setting, I would imagine there could come a point where one runs out of models in a given WU? On the HDLBR, at a 4 hour setting, I got 48+ models complete (48 when I last checked it, and it ran a bit over). This was on an Athlon 64 3500+...

At an 8 hour setting, the FACONTACTS got through about 60 or so models when I last checked, don't remember exactly... Which of course brings up 1 other thing with a 4 day setting, the variability in processor speed from say an Athlon FX 60! down to some Pentium II and IIIs, if anyone has attached any of those to this project. Not sure if they have...
20) Message boards : RALPH@home bug list : Debugger Stuff (Message 1197)
Posted 16 Apr 2006 by Nuadormrac
Post:
<offtopic> I can't see why it wouldn't work with other porjects. Since all 5.4.x really is, is a bug fixes and some added features to 5.2.13
If you want to see the 'change log' look at the Mac's changlog it shows the alterations (that are probably relevant) throughout the development (5.3.x series). Don't know why the Mac log has them and the windows only starts from 5.4.0

Mind I have 5.3.31 running and it's fine on all the projects I run so far. Actually a lot better than 5.2.13, so thank's to Rom and the rest of the boinc dev team.



Typically, one would expect that it would. But sometimes, some project servers get picky about CC versions... If you've crunched a CPDN result before, you'll know wny some give it some thought. This coupled ocean model will likely take > 1,800 hours crunch time on my Athlon 64, which mind you, an Athlon 64 isn't exactly an average computer, performance wise. The deadline on these is greater then 1 year, if that also gives some idea... Yeah, people who crunch CPDN end up learning they need to be careful with it. Too many sulpher units for instance have gone south on people, and for smaller things then an upgrade. For instance, it doesn't well handle another process using 100% CPU time if it's running, and could easily get "out of sync"...

BTW, some servers were being taken offline for additional support (aka server version 5.5.0), with notes that it was in anticipation of the next CC. Probably support for the extra debugging and what not...


Next 20



©2024 University of Washington
http://www.bakerlab.org