Posts by anders n

41) Message boards : RALPH@home bug list : Bug reports for 5.65 (Message 3136)
Posted 23 May 2007 by Profile anders n
Post:
Intresting WU. I have a 4 H setting and it errord out after 1H 23min. Next chrucher has a 1 H setting and it came out ok after 62 min.

http://ralph.bakerlab.org/workunit.php?wuid=462868

Anders n
42) Message boards : RALPH@home bug list : Bug reports for 5.65 (Message 3122)
Posted 22 May 2007 by Profile anders n
Post:
http://ralph.bakerlab.org/result.php?resultid=521786

- exit code -1073741819 (0xc0000005)

Anders n
43) Message boards : RALPH@home bug list : bug report for version 5.64 (Message 3112)
Posted 18 May 2007 by Profile anders n
Post:
@KC0ISW

Can you please just give us a link to the result instead of posting
all of it.

Oh and keep posting errors :)

Anders n
44) Message boards : RALPH@home bug list : bug report for version 5.64 (Message 3092)
Posted 12 May 2007 by Profile anders n
Post:
My dual Opteron 64bit Linux system (Ubuntu 7.04) is not receiving any work. It keeps requesting, but not getting anything. If I can help you out with additional information, please reply and say so :)

Regards,
BorkesComputers


This is a test project so there are not work availabel at all times.

Stick arround and you will get your share :)

Anders n


45) Message boards : RALPH@home bug list : bug report for version 5.64 (Message 3086)
Posted 8 May 2007 by Profile anders n
Post:
Just a heads up don't know if this is relevant here but on Einstien they have had much troubel with process "Caught SIGABRT in graphics thread" and
"got signal 11" mostley on LINUX.

I reported my first error on that 5 posts down and now I saw a report on Rosetta to.

Hope it's nothing.

Anders n
46) Questions and Answers : Preferences : Setting length of tasks (Message 3081)
Posted 6 May 2007 by Profile anders n
Post:
In the output from my first successful result, from version 5.63, SEARCH_PAIRINGS_-1dhn_-round2_1976_120 I see:
Rosetta score is stuck or going too long. Watchdog is ending the run!
CPU time: 14813 seconds. Greater than 4X preferred time: 3600 seconds

Does this mean I should increase my preferred run-length? (At the moment I’ve left the project preferences at their default settings.) BOINC’s initial to-completion estimate was something like 24 hours. Are these messages fairly common here, or might this be indicative of a problem?


It means that the WU got stuck and was aborted by the watchdog.

The first estimate is often off by alot :) but comes close to your preferred setting after a few WU-s.

Happy Cruching
Anders n
47) Message boards : RALPH@home bug list : bug report for version 5.64 (Message 3080)
Posted 6 May 2007 by Profile anders n
Post:
1. the cpu time & progress looks good, but the time to completion is meaningless.
the time allowed is 2H but the To completion shows 5H with 5% done

2. I closed BOINC manager (right click - exit) via the task bar icon, at about 7% & 12min then reloaded BOINC, but the running task started from 0 CPU time & 0% Progress, so it lost any results that should have been saved?

save to disk is set to 120sec.


Om my P4 2,8 the tasks I got today checkpoints every 10-12 min.

Anders n
48) Message boards : RALPH@home bug list : bug report for version 5.64 (Message 3078)
Posted 6 May 2007 by Profile anders n
Post:
I got a failure on my MAC.

<core_client_version>5.8.15</core_client_version>
<![CDATA[
<message>
process got signal 6
</message>
<stderr_txt>
Rosetta@home Macintosh Stack Size checker.
Original size: 8388608.
Maximum size: 0.
RLIM_INFINITY 67108864
# cpu_run_time_pref: 14400
SIGBUS: bus error

Crashed executable name: rosetta_beta_5.64_i686-apple-darwin
built using BOINC library version 5.9.5
Machine type Intel 80486
System version: Macintosh OS 10.4.9 build 8P2137
Sun May 6 04:29:25 2007
/Users/boinc/boinc_build/rosetta_5.64/boinc/mac_build/../lib/mac/QCrashReport.c:343: failed assertion `*crRefPtr == NULL'
Caught SIGABRT in graphics thread

This WU http://ralph.bakerlab.org/result.php?resultid=506638

Anders n
49) Message boards : RALPH@home bug list : Bug reports for 5.63 (Message 3055)
Posted 4 May 2007 by Profile anders n
Post:
same for me.
#503918
#503462
#503449

Anders n
50) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2991)
Posted 2 Apr 2007 by Profile anders n
Post:
@feet1st thanks :)
51) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2988)
Posted 2 Apr 2007 by Profile anders n
Post:
Can anyone explain the new text on MAC results?

It looks like this

Rosetta@home Macintosh Stack Size checker.
Original size: 8388608.
Maximum size: 0.
RLIM_INFINITY 67108864

Anders n
52) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2978)
Posted 1 Apr 2007 by Profile anders n
Post:
Anders n, I think the behavior you observe is partly due to an additional "correction" that the BOINC API applies when estimating time to completion -- it should never really be over 4 hours, right? We really don't have any control over that extra "correction".

But we do have control over percent complete, and that shouldn't go to zero upon resuming ralph! So I'm still worried. On my mac intel machine, I just tried to suspend a ralph WU, and ran einstein@home for a few minutes; then suspended the einstein@home workunit, and resumed the ralph WU. Everything was fine (pct complete never dropped to zero)... when you try this, does pct complete drop to zero?

[edit]
Another question: you posted that 5.57 was fine; are you seeing an issue only with 5.58? If so, this is totally puzzling, since the small change I made to the Mac app shouldn't affect behacior of pct complete.


OK, just talked to David K about this. Right now we keep track of time crunched based on a call to the BOINC API ... i.e. the BOINC manager keeps track of how much time was spent on each workunit. If you preempt after an hour and resume later, the BOINC manager will tell Rosetta about the hour already spent.

But if you shut BOINC down and restart that could cause a problem in a lot of estimates... we can try to make the Rosetta app more self-sufficient, keeping track of cpu time spent so far, but that might be a can of worms. Worth the time?


Just so we have all the facts right. When a Ralph Wu 5.58 is resumed after preemt the % done goes back to 0 and time to complete goes very high.
I just had one it preemted at 2 H and when restarted time to complete was
at nearly 6 H (rapidly going down as % was going up). I have a 4 H setting for
Ralph on that computer.

Anders n




Oups sorry I should have said that it was on a windows XP computer
the % went to 0. It works ok on the MAC.
As a side effect it does not happen when I suspend and resume in the middel of a model, it only happens when at model swich by Boinc it self.
(I have "Leave applications in memory while suspended" set to yes)

[edit] The MAC has done one more night swiching with Einstein without any trouble now with 5.58 [/edit]

Anders n
53) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2975)
Posted 31 Mar 2007 by Profile anders n
Post:
OK, just talked to David K about this. Right now we keep track of time crunched based on a call to the BOINC API ... i.e. the BOINC manager keeps track of how much time was spent on each workunit. If you preempt after an hour and resume later, the BOINC manager will tell Rosetta about the hour already spent.

But if you shut BOINC down and restart that could cause a problem in a lot of estimates... we can try to make the Rosetta app more self-sufficient, keeping track of cpu time spent so far, but that might be a can of worms. Worth the time?


Just so we have all the facts right. When a Ralph Wu 5.58 is resumed after preemt the % done goes back to 0 and time to complete goes very high.
I just had one it preemted at 2 H and when restarted time to complete was
at nearly 6 H (rapidly going down as % was going up). I have a 4 H setting for
Ralph on that computer.

Anders n
54) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2974)
Posted 31 Mar 2007 by Profile anders n
Post:
Update on my MAC
Ralph 5.57 and Einstein has been swiching all-night without any errors.

On to 5.58 :)
Anders n
55) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2967)
Posted 30 Mar 2007 by Profile anders n
Post:
MAC
I tried to get Ralph to "hang" again by pause and resume then
by manually get it to swich between Einstein and Ralph... no success. :)
I'll let it run by it self hopfully starting to swich by itself to se if
it still works as it should.
Anders n
56) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2961)
Posted 30 Mar 2007 by Profile anders n
Post:
% issue
I have a Wu that was at 40% and had started model no 4.

I restarted Boinc and the Wu restarted at model no 4 but with 0% and
started counting up.

Anders n
57) Message boards : RALPH@home bug list : Bug reports for 5.56-5.59 (Message 2932)
Posted 29 Mar 2007 by Profile anders n
Post:
The issue on hanging after preemting did start in the middel of a version.
I think there was some kind of security update on the OS about that time.
Just a thought.
Anders n
58) Message boards : RALPH@home bug list : Bug reports for 5.55 (Message 2930)
Posted 29 Mar 2007 by Profile anders n
Post:
Anders n, actually, wait, when did this start happening for you? Is there a discussion thread on this?

I haven't been able to reproduce the Mac issue (process not found) noted on the R@H message boards yet. But I'm hoping to find a fix for the next update


How about the other MAC issue where Ralph/Rosetta hangs after beening preemted and then resumed.
I just checked my MAC and 1 WU on each project was hanging.

Anders n



Se Bug reports 5.52-5.54.

It started 18/3.

Anders n
59) Message boards : RALPH@home bug list : Bug reports for 5.55 (Message 2925)
Posted 29 Mar 2007 by Profile anders n
Post:
I haven't been able to reproduce the Mac issue (process not found) noted on the R@H message boards yet. But I'm hoping to find a fix for the next update


How about the other MAC issue where Ralph/Rosetta hangs after beening preemted and then resumed.
I just checked my MAC and 1 WU on each project was hanging.

Anders n
60) Message boards : RALPH@home bug list : Bug reports for Ralph 5.52-5.54 (Message 2912)
Posted 27 Mar 2007 by Profile anders n
Post:
MAC my XP computers is running just fine.
As soon as a Ralph or Rosetta task is preemted and then resumed it
"hangs" that is shows as running but the only thing happening is
that the time counts up.
If I recall right Boinc only swiches tasks at checkpoints,
if the task don't progress = no checkpoint.
And it looks like the watchdog don't work on this problem :(

Anders n


Previous 20 · Next 20



©2024 University of Washington
http://www.bakerlab.org