Posts by anders n

21) Message boards : Number crunching : Credits (Message 3256)
Posted 30 Jun 2007 by Profile anders n
Post:
Is there a site where you can see which BOINC clients we are using here on
Ralph and Rosetta?


22) Message boards : Number crunching : Credits (Message 3255)
Posted 30 Jun 2007 by Profile anders n
Post:
It may have something to do with the test work units. I did ask once if these Ralph work units are actual work units or not but don't recall getting an answer. So their actual value and therefore comparison to Rosetta may not have a lot to do with each other.


If I understand what we are doing here correctly, we are crunching the same type of work units here as will come on Rosetta later, but not all of them pass the test here.

What I don't know is if they use the Wu-s as a science basis or if the results
are "just" test runs for correct settings on Rosetta.

@Feet1st

I suspected that it could be the different computer mix that made the difference.

Anders n
23) Message boards : RALPH@home bug list : Bug reports for 5.69-5.70 (Message 3243)
Posted 28 Jun 2007 by Profile anders n
Post:
Not a bug, but some of the latest Wu-s are huge and take up to 600 in VM.

Anders n
24) Message boards : Number crunching : Credits (Message 3242)
Posted 28 Jun 2007 by Profile anders n
Post:
Yes, I know that there are projects that give more credit/H (I even run some of them).

But this is Rosetta / Ralph and credit should be about the same or??

Anders n

25) Message boards : RALPH@home bug list : Bug reports for 5.69-5.70 (Message 3239)
Posted 27 Jun 2007 by Profile anders n
Post:
1 more
2007-06-27 11:35:41|ralph@home|Reason: Unrecoverable error for result SSH1_BOINC_MFR_ABRELAX_2150_47_1 (<file_xfer_error> <file_name>SSH1_BOINC_MFR_ABRELAX_2150_47_1_0</file_name> <error_code>-161</error_code></file_xfer_error>)

http://ralph.bakerlab.org/workunit.php?wuid=499792

It seems the <error_code>-161</error_code>
from 5.68 is still here.
http://ralph.bakerlab.org/workunit.php?wuid=499558

Anders n
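
For anyone who wants to pull the file name and error code out of a message log line like the one above, here is a minimal Python sketch. It assumes the `<file_xfer_error>` fragment always looks like it does in this post; other client versions may format it differently, and the function name `parse_xfer_error` is just made up for illustration.

```python
import re

# Pattern for the <file_xfer_error> fragment as it appears in the post above.
# Assumption: the tags always come in this order; other clients may differ.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(?P<file_name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<error_code>-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)


def parse_xfer_error(line):
    """Return (file_name, error_code) for a file transfer error, else None."""
    match = XFER_ERROR.search(line)
    if match is None:
        return None
    return match.group("file_name"), int(match.group("error_code"))


# The log line quoted in the post, split across string literals for readability.
line = (
    "2007-06-27 11:35:41|ralph@home|Reason: Unrecoverable error for result "
    "SSH1_BOINC_MFR_ABRELAX_2150_47_1 (<file_xfer_error> "
    "<file_name>SSH1_BOINC_MFR_ABRELAX_2150_47_1_0</file_name> "
    "<error_code>-161</error_code></file_xfer_error>)"
)
print(parse_xfer_error(line))
```

Running this over a saved log would make it easy to count how often -161 shows up per app version, which might help the team when comparing 5.68 with 5.69/5.70.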

26) Message boards : Number crunching : Credits (Message 3238)
Posted 27 Jun 2007 by Profile anders n
Post:
I wonder what makes the credit difference on this host here and on Rosetta.

Rosetta
Ralph

Anders n

27) Message boards : RALPH@home bug list : Bug reports for 5.69-5.70 (Message 3237)
Posted 27 Jun 2007 by Profile anders n
Post:
It seems the <error_code>-161</error_code>
from 5.68 is still here.
http://ralph.bakerlab.org/workunit.php?wuid=499558

Anders n
28) Message boards : RALPH@home bug list : Bug reports for 5.69-5.70 (Message 3236)
Posted 26 Jun 2007 by Profile anders n
Post:
Just for info, this Wu was restarted at the same time and continued as usual.

http://ralph.bakerlab.org/result.php?resultid=569051




Great, that's what I was hoping for actually. We're testing a mode of ralph in which we can run an old app version and a new app version at the same time. This is to allow some of our workunits to continue with a stable version to enable consistent results for publication, while other workunits can take advantage of later bug fixes and features. That workunit has a checkpointing issue with 5.68, but works well with 5.69 (now 5.70); I wanted to make sure I could send out one batch of jobs for the old app and one for the newer app!


This Wu failed after trying to restart from last checkpoint.

(I updated BOINC so it was taken out of memory)

http://ralph.bakerlab.org/result.php?resultid=569191

Anders n


29) Message boards : RALPH@home bug list : Bug reports for 5.69-5.70 (Message 3234)
Posted 26 Jun 2007 by Profile anders n
Post:
This Wu failed after trying to restart from last checkpoint.

(I updated BOINC so it was taken out of memory)

http://ralph.bakerlab.org/result.php?resultid=569191

Anders n
30) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3206)
Posted 23 Jun 2007 by Profile anders n
Post:
This one failed after restarting computer.

http://ralph.bakerlab.org/result.php?resultid=562489

Anders n
31) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3204)
Posted 18 Jun 2007 by Profile anders n
Post:
Any ideas how come these 2 Wu-s were OK on my computers and not the others?

http://ralph.bakerlab.org/workunit.php?wuid=484660

http://ralph.bakerlab.org/workunit.php?wuid=488628

Anders n
32) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3191)
Posted 4 Jun 2007 by Profile anders n
Post:


Same problem as mentioned above. Permanently 10 mins (exactly) to run. Currently 92% complete, progress still incrementing but 'to complete' static at 10 minutes.


--
Rod Ellery


When a WU takes longer than your preferred run time, it looks like what you describe.

As long as the % complete increases, things should be OK :)

Anders n

ps
Some of the WU-s take up to 4 H to make 1 model on my computers.
33) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3189)
Posted 4 Jun 2007 by Profile anders n
Post:
Have got a WU running at present

1acf__TREEJUMP_ABRELAX_TJTOP3_SAVE_ALL_OUT_BARCODE__2095_19_0
rosetta_beta version 568
Windows XP,
Processor: AuthenticAMD Unknown CPU Type [x86 Family 6 Model 8 Stepping 1] [fpu tsc sse 3dnow mmx]
Memory: 751.49 MB physical, 3.43 GB virtual

Same problem as mentioned above. Permanently 10 mins (exactly) to run. Currently 92% complete, progress still incrementing but 'to complete' static at 10 minutes.

BOINC running as a service so no graphics, but Windows had blanked screen as screensaver kicked in overnight.

Should I abort?


--
Rod Ellery


How long has it been running, and what is your preferred run time?

34) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3183)
Posted 29 May 2007 by Profile anders n
Post:
This WU did a crash and burn when the computer ran out of VM.

Anders n
35) Message boards : RALPH@home bug list : not sure if this is a bug (Message 3176)
Posted 27 May 2007 by Profile anders n
Post:
Where does it say there are WU-s to download?

Can you please copy and show me?
Anders n
36) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3170)
Posted 26 May 2007 by Profile anders n
Post:
This WU uses A LOT of memory. My laptop has only got 512 MB RAM, so the WU uses above 95% of the pagefile (1,5 to 1,6 GB). Now, after running for 3 hours 27 minutes, the WU pauses and the status field in BOINC shows the message "Waiting for memory". BOINC then just switched to another WU.

What should I do?


Hi Bjarke

I'm not on the team, "just" a tester like you :)

Is there a chance for you to increase the VM?

Maybe it would get the WU kicking again.

Anders n
37) Message boards : RALPH@home bug list : Bug reports for 5.66-5.68 (Message 3168)
Posted 26 May 2007 by Profile anders n
Post:
This is on an Intel Mac (Macbook Pro). 5.68 seems to use far more memory than earlier versions. The VM size is 1.6GB, and the working set is over 600MB. This is causing a big impact on the machine.



I have this running on an XP machine and it takes at least 1,2 GB of VM.

Anders n

EDIT

It took 5H 20 min to do 1 model on a P4 2,8
38) Message boards : RALPH@home bug list : Bug reports for 5.65 (Message 3156)
Posted 25 May 2007 by Profile anders n
Post:
Please add […] to the list of aborted workunits running over 10 hours or more

Are we supposed to abort tasks that run for more than ten hours? I couldn’t find any announcement to that effect.


No, we are not supposed to abort any tasks!

Unless there is a direct instruction to do so from the team.

Anders n
39) Message boards : RALPH@home bug list : Project down? (Message 3148)
Posted 24 May 2007 by Profile anders n
Post:
I get

2007-05-24 07:03:03|ralph@home|Scheduler RPC succeeded
2007-05-24 07:03:03|ralph@home|Message from server: Project encountered internal error: shared memory
2007-05-24 07:03:03|ralph@home|Deferring communication for 1 hr 0 min 0 sec
2007-05-24 07:03:03|ralph@home|Reason: project is down

Anders n
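
If someone wants to scan a longer log for these deferrals, the lines above split cleanly on the `|` character into time, project and message. A minimal Python sketch, assuming every line follows the `timestamp|project|message` layout shown here (the helper name `parse_log_line` is only for illustration):

```python
from datetime import datetime


def parse_log_line(line):
    """Split 'timestamp|project|message' into (datetime, project, message)."""
    # maxsplit=2 keeps any '|' inside the message text intact.
    stamp, project, message = line.split("|", 2)
    return datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S"), project, message


# The four log lines quoted in the post above.
log = """2007-05-24 07:03:03|ralph@home|Scheduler RPC succeeded
2007-05-24 07:03:03|ralph@home|Message from server: Project encountered internal error: shared memory
2007-05-24 07:03:03|ralph@home|Deferring communication for 1 hr 0 min 0 sec
2007-05-24 07:03:03|ralph@home|Reason: project is down"""

entries = [parse_log_line(row) for row in log.splitlines()]

# Keep only the messages where the client backs off from the server.
deferred = [msg for _, _, msg in entries if msg.startswith("Deferring communication")]
print(deferred)
```

The same list of entries could be filtered on "Message from server" instead to see what the scheduler was reporting at the time.
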
40) Message boards : RALPH@home bug list : Bug reports for 5.65 (Message 3140)
Posted 23 May 2007 by Profile anders n
Post:
Intel Mac
Validate error
Silent_out::setup: silent output with symmetry info not compatible with non-ideal bonds yet.
http://ralph.bakerlab.org/result.php?resultid=523041
http://ralph.bakerlab.org/result.php?resultid=523042

Anders n

EDIT
And
http://ralph.bakerlab.org/result.php?resultid=523042





©2024 University of Washington
http://www.bakerlab.org