RALPH@home

Bug reports for Ralph 5.42 and 5.43

  UW Seal
 
[ Home ] [ Join ] [ About ] [ Participants ] [ Community ] [ Statistics ]
  [ login/out ]


Advanced search

Message boards : RALPH@home bug list : Bug reports for Ralph 5.42 and 5.43

AuthorMessage
Rhiju
Forum moderator
Project developer
Project scientist

Joined: Feb 14 06
Posts: 161
ID: 4
Credit: 3,725
RAC: 0
Message 2586 - Posted 12 Dec 2006 1:53:32 UTC

    Last modified: 13 Dec 2006 3:37:02 UTC

    We are trying to increase stability in this release... We have turned off mouse rotation and sidechains temporarily. Please let us know if you can force a crash by playing with the \"show graphics\" option from the boinc manager, or with your screensaver!
    ____________

    Profile feet1st

    Joined: Mar 7 06
    Posts: 312
    ID: 1028
    Credit: 110,522
    RAC: 0
    Message 2587 - Posted 12 Dec 2006 2:19:30 UTC

      Last modified: 12 Dec 2006 3:17:07 UTC

      Might I suggest more WUs then average?? So people have time to read your message and perhaps attach to Ralph, and get some work to run? Otherwise I fear those first 1000 WUs all were consumed by the folks that run as a service and don\'t run the screensaver.

      Also, just got a new error updating to the project:

      12/11/2006 9:12:36 PM|ralph@home|Scheduler request succeeded
      12/11/2006 9:12:36 PM|ralph@home|Message from server: Project encountered internal error: shared memory
      12/11/2006 9:12:36 PM|ralph@home|Project is down

      At time of this post (just 25min after Rhiju\'s) there are zero WUs available and just over 1000 in progress.
      ____________

      Profile feet1st

      Joined: Mar 7 06
      Posts: 312
      ID: 1028
      Credit: 110,522
      RAC: 0
      Message 2588 - Posted 12 Dec 2006 2:58:19 UTC

        It should be pointed out that if you wish to know with some certainty that a given screensaver problem occured on Ralph, you\'re going to have to suspend Rosetta while you run Ralph this time. That way you will KNOW which was driving the screensaver.
        ____________

        FluffyChicken

        Joined: Feb 17 06
        Posts: 54
        ID: 414
        Credit: 710
        RAC: 0
        Message 2590 - Posted 12 Dec 2006 7:44:45 UTC

          Last modified: 12 Dec 2006 8:31:57 UTC

          Maybe you should also note in the front page news say in bold that it is graphics you are testing and please play or turn on the graphics/screensaver.

          Hint big time.



          Also put a request in the Rosetta news that anyone who uses the screensaver and has seen problems with crashing/failed tasks then please attach to Ralph to help in the testing (link to ralph as well)


          ____________

          Rhiju
          Forum moderator
          Project developer
          Project scientist

          Joined: Feb 14 06
          Posts: 161
          ID: 4
          Credit: 3,725
          RAC: 0
          Message 2591 - Posted 12 Dec 2006 8:34:28 UTC - in response to Message 2587.

            Hi. I queued a ton of WUs, but there\'s a problem with one of the work daemons. I\'m working on it. Thanks for the post!



            Might I suggest more WUs then average?? So people have time to read your message and perhaps attach to Ralph, and get some work to run? Otherwise I fear those first 1000 WUs all were consumed by the folks that run as a service and don\'t run the screensaver.

            Also, just got a new error updating to the project:

            12/11/2006 9:12:36 PM|ralph@home|Scheduler request succeeded
            12/11/2006 9:12:36 PM|ralph@home|Message from server: Project encountered internal error: shared memory
            12/11/2006 9:12:36 PM|ralph@home|Project is down

            At time of this post (just 25min after Rhiju\'s) there are zero WUs available and just over 1000 in progress.


            ____________

            Rhiju
            Forum moderator
            Project developer
            Project scientist

            Joined: Feb 14 06
            Posts: 161
            ID: 4
            Credit: 3,725
            RAC: 0
            Message 2592 - Posted 12 Dec 2006 8:49:30 UTC - in response to Message 2591.

              OK, I think I fixed the feeder; work should be sent out smoothly.

              Hi. I queued a ton of WUs, but there\'s a problem with one of the work daemons. I\'m working on it. Thanks for the post!



              Might I suggest more WUs then average?? So people have time to read your message and perhaps attach to Ralph, and get some work to run? Otherwise I fear those first 1000 WUs all were consumed by the folks that run as a service and don\'t run the screensaver.

              Also, just got a new error updating to the project:

              12/11/2006 9:12:36 PM|ralph@home|Scheduler request succeeded
              12/11/2006 9:12:36 PM|ralph@home|Message from server: Project encountered internal error: shared memory
              12/11/2006 9:12:36 PM|ralph@home|Project is down

              At time of this post (just 25min after Rhiju\'s) there are zero WUs available and just over 1000 in progress.



              ____________

              Profile feet1st

              Joined: Mar 7 06
              Posts: 312
              ID: 1028
              Credit: 110,522
              RAC: 0
              Message 2593 - Posted 12 Dec 2006 13:28:25 UTC

                Last modified: 12 Dec 2006 14:16:29 UTC

                Thanks Rhiju, I got 2 work units about 50 minutes before posting this. I\'ve suspended Rosetta, enabled my screensaver and she\'ll run all day (we hope).

                I also enabled the new version in my firewall, just in case it tries to report in with the debugger info.

                Also wanted to point out that timestamp shown on the homepage just above the number of WUs seems to be in the future. By as much as 30min.
                ____________

                Carolyn and Michael Bowen

                Joined: Dec 8 06
                Posts: 1
                ID: 2340
                Credit: 5,717
                RAC: 0
                Message 2594 - Posted 12 Dec 2006 14:04:09 UTC

                  For some reason, while 5.42 was downloading for the first time onto my XP SP2 system, my computer froze up for about 60 seconds. I was given back control once the download finished. The message log taken from around this time (posted below) looks normal, and there seem to be no long-lasting side effects; just a slight annoyance at not being able to work for a bit. The communication failure at the beginning of the listing was because my DSL connection was still in the process of setting itself up during Ralph\'s initial request. Times are U.S. PST.



                  2006/12/12 06:46:46|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                  2006/12/12 06:46:46|ralph@home|Reason: To fetch work
                  2006/12/12 06:46:46|ralph@home|Requesting 34560 seconds of new work
                  2006/12/12 06:46:47||Project communication failed: attempting access to reference site
                  2006/12/12 06:46:50||Access to reference site failed - check network connection or proxy configuration.
                  2006/12/12 06:46:52|ralph@home|Scheduler request failed: couldn\'t resolve host name
                  2006/12/12 06:46:52|ralph@home|Deferring scheduler requests for 1 minutes and 0 seconds
                  2006/12/12 06:47:53|ralph@home|Fetching scheduler list
                  2006/12/12 06:47:58|ralph@home|Scheduler list download succeeded
                  2006/12/12 06:48:05|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                  2006/12/12 06:48:05|ralph@home|Reason: To fetch work
                  2006/12/12 06:48:05|ralph@home|Requesting 34560 seconds of new work
                  2006/12/12 06:48:09|ralph@home|Scheduler request succeeded
                  2006/12/12 06:48:11|ralph@home|Started download of file rosetta_beta_5.42_windows_intelx86.exe
                  2006/12/12 06:48:11|ralph@home|Started download of file frags83_2chf_.fasta.gz
                  2006/12/12 06:48:12|ralph@home|Finished download of file frags83_2chf_.fasta.gz
                  2006/12/12 06:48:12|ralph@home|Throughput 1300 bytes/sec
                  2006/12/12 06:48:12|ralph@home|Started download of file frags83_2chf_.psipred_ss2.gz
                  2006/12/12 06:48:13|ralph@home|Finished download of file frags83_2chf_.psipred_ss2.gz
                  2006/12/12 06:48:13|ralph@home|Throughput 5046 bytes/sec
                  2006/12/12 06:48:13|ralph@home|Started download of file boinc_frags83_aa2chf_03_05.200_v1_3.gz
                  2006/12/12 06:48:26|ralph@home|Finished download of file boinc_frags83_aa2chf_03_05.200_v1_3.gz
                  2006/12/12 06:48:26|ralph@home|Throughput 86148 bytes/sec
                  2006/12/12 06:48:26|ralph@home|Started download of file boinc_frags83_aa2chf_09_05.200_v1_3.gz
                  2006/12/12 06:48:31|ralph@home|Finished download of file boinc_frags83_aa2chf_09_05.200_v1_3.gz
                  2006/12/12 06:48:31|ralph@home|Throughput 73067 bytes/sec
                  2006/12/12 06:48:31|ralph@home|Started download of file frags83_2chf.pdb.gz
                  2006/12/12 06:48:33|ralph@home|Finished download of file frags83_2chf.pdb.gz
                  2006/12/12 06:48:33|ralph@home|Throughput 27867 bytes/sec
                  2006/12/12 06:48:33|ralph@home|Started download of file casp7.description.shorter.txt
                  2006/12/12 06:48:35|ralph@home|Finished download of file casp7.description.shorter.txt
                  2006/12/12 06:48:35|ralph@home|Throughput 398 bytes/sec
                  2006/12/12 06:49:24|ralph@home|Finished download of file rosetta_beta_5.42_windows_intelx86.exe
                  2006/12/12 06:49:24|ralph@home|Throughput 133687 bytes/sec
                  2006/12/12 06:49:25||Rescheduling CPU: files downloaded
                  2006/12/12 06:49:26|ralph@home|Starting task 2chf__BOINC_ABINITIO_SAVE_ALL_OUT_frags83__1552_21_0 using rosetta_beta version 542

                  Profile sslickerson

                  Joined: Feb 15 06
                  Posts: 17
                  ID: 37
                  Credit: 4,006
                  RAC: 0
                  Message 2595 - Posted 12 Dec 2006 15:25:45 UTC

                    I\'ve put the new app through the paces regarding the screensaver and graphics window. I\'ve tried all fps speeds and all seems well.

                    --Tim

                    Profile sslickerson

                    Joined: Feb 15 06
                    Posts: 17
                    ID: 37
                    Credit: 4,006
                    RAC: 0
                    Message 2596 - Posted 12 Dec 2006 15:50:22 UTC

                      I tried the same with Rosetta and had the *graphics window* open when the WU froze completely after about a minute of moving the graphics around. This has been typical of my computer. The WU progressed per usual with the screensaver on and no problems this time (although there has been in the past). I should note that with Rosetta WU I see more of this happen at 30 fps than any other number higher or lower than 30. Also, I didn\'t need to force the WU to error out as BOINC was visible on the task bar and so I just exited out. It later continued at its last checkpoint.

                      genes
                      Avatar

                      Joined: Feb 16 06
                      Posts: 45
                      ID: 57
                      Credit: 43,300
                      RAC: 0
                      Message 2597 - Posted 12 Dec 2006 16:15:55 UTC

                        I got a few WU\'s on my home machine this morning, and it didn\'t have any Rosetta, and the Ralph WU\'s appear to be trouble-free so far, screensaver-wise. When I get home later, I\'ll see if it crashed.

                        Here\'s the machine: hostid=2016
                        ____________

                        Rhiju
                        Forum moderator
                        Project developer
                        Project scientist

                        Joined: Feb 14 06
                        Posts: 161
                        ID: 4
                        Credit: 3,725
                        RAC: 0
                        Message 2598 - Posted 12 Dec 2006 17:25:40 UTC - in response to Message 2596.

                          Things sound good so far -- I\'ll wait for a few more replies (e.g. fro Feet1st)
                          before doing the update on rosetta@home. Thanks to all who have checked in!

                          I tried the same with Rosetta and had the *graphics window* open when the WU froze completely after about a minute of moving the graphics around. This has been typical of my computer. The WU progressed per usual with the screensaver on and no problems this time (although there has been in the past). I should note that with Rosetta WU I see more of this happen at 30 fps than any other number higher or lower than 30. Also, I didn\'t need to force the WU to error out as BOINC was visible on the task bar and so I just exited out. It later continued at its last checkpoint.


                          ____________

                          Bjarke

                          Joined: Feb 25 06
                          Posts: 5
                          ID: 796
                          Credit: 5,523
                          RAC: 0
                          Message 2599 - Posted 12 Dec 2006 19:30:11 UTC

                            I crashed a WU during graphics. It either happened because I showed two RALPH-graphics at the same time (i have two cores), or because I tried to zoom in/out on the left-picture.
                            result


                            ____________

                            Profile feet1st

                            Joined: Mar 7 06
                            Posts: 312
                            ID: 1028
                            Credit: 110,522
                            RAC: 0
                            Message 2600 - Posted 12 Dec 2006 22:27:16 UTC

                              I haven\'t been at the proper PC all day, but the WUs still show as not reported yet, so that\'s a good indication that they are still happily crunching (24hr time preference). Previously I wasn\'t ever able to crunch ANY WU for the full 24hrs without a hang or failure or watchdog kicking in.

                              ...I suppose it COULD ALSO mean that a squirrel has found a creative new way to combine the electic transformer outside my home with a latent desire to pursue the afterlife... as was the cause of some WUs not reporting YESTERDAY :)
                              ____________

                              Profile feet1st

                              Joined: Mar 7 06
                              Posts: 312
                              ID: 1028
                              Credit: 110,522
                              RAC: 0
                              Message 2601 - Posted 13 Dec 2006 1:16:27 UTC

                                Last modified: 13 Dec 2006 2:10:46 UTC

                                Well, I\'m now back to my problem PC, and was about to blow the train whistle --Whoo hooo!-- when I saw my screensaver MOVING, and it was on model 94. But then I noticed it only crunched the first WU for almost exactly 3hrs. Now that I\'ve updated to project it shows watchdog ended it. And I commonly saw that same symptom on Rosetta once I activated the screensaver on the same host.

                                Because the watchdog ended it, the messages tab just shows that the very long WU name \"finished\".

                                The second WU I received is crunching happily now in to it\'s 10th hour. I could NEVER have lasted that long on v5.41. So, I can definately say \"more stable\". It is supposed to run for 24hrs, and that means if all goes well it will complete just after I leave for work in the AM.

                                I should note that my hyperthreaded CPU is set in BOINC to only use 1 at the moment. That measure did not seem to make Rosetta any more stable. Now I should set it back to two, but the only other work I have is for Rosetta :) and if I have both running, I don\'t know how BOINC chooses the screensaver, but I think I\'ll botch my test.

                                The \"docking\" WUs seemed to be the most graphically intense. Have some of those been queued up for test here?
                                ____________

                                Rhiju
                                Forum moderator
                                Project developer
                                Project scientist

                                Joined: Feb 14 06
                                Posts: 161
                                ID: 4
                                Credit: 3,725
                                RAC: 0
                                Message 2602 - Posted 13 Dec 2006 2:05:24 UTC - in response to Message 2601.

                                  By a weird coincidence, we also had a ralph WU stuck on our test PC that was shut down by the watchdog. I\'ve checked the results returned so far, though; most of the jobs are coming back fine. Your and my WUs are a little scary; we\'re going to work hard in the new year to trap these WUs and to track down where the jobs are getting stuck. But they look like the exception, rather than the rule.


                                  Thanks to Bjarke as well. I\'m guessing that Bjarke\'s hypothesis about opening two ralph windows simultaneously is correct.


                                  Well, I\'m now back to my problem PC, and was about to blow the train whistle --Whoo hooo!-- when I saw my screensaver MOVING, and it was on model 94. But then I noticed it only crunched the first WU for almost exactly 3hrs. Now that I\'ve updated to project it shows watchdog ended it. And I commonly saw that same symptom on Rosetta once I activated the screensaver on the same host.

                                  Because the watchdog ended it, the messages tab just shows that the very long WU name \"finished\".

                                  The second WU I received is crunching happily now in to it\'s 10th hour. I could NEVER have lasted that long on v5.41. So, I can definately say \"more stable\". It is supposed to run for 24hrs, and that means if all goes well it will complete just after I leave for work in the AM.

                                  I should note that my hyperthreaded CPU is set in BOINC to only use 1 at the moment. That measure did not seem to make Rosetta any more stable. Now I should set it back to two, but the only other work I have is for Rosetta :) and if I have both running, I don\'t know how BOINC chooses the screensaver, but I think I\'ll botch my test.

                                  The \"docking\" WUs seemed to be the most graphically intense. Have some of those been queued up for test here?


                                  ____________

                                  Profile feet1st

                                  Joined: Mar 7 06
                                  Posts: 312
                                  ID: 1028
                                  Credit: 110,522
                                  RAC: 0
                                  Message 2603 - Posted 13 Dec 2006 3:04:30 UTC

                                    Last modified: 13 Dec 2006 3:50:23 UTC

                                    Rhiju, I know you have some means of running the WUs with the same starting seed value, or heck if you know it died on model X, then start it on that model\'s starting value ...but if you run the WUs again, do they again require the watchdog on the same model? If so, then there\'s some better \"sniffing\" possible for your Rosetta blood hounds. If not, then there is something about the rest of the environment that causes the failure (like the screensaver, or memory issues, or what have you).

                                    I\'m still finishing the last Ralph WU to see if it will go 24hrs. And, after all of an hour apparently, it looks like that v5.43 WUs are all gone already, and I didn\'t get any, so I won\'t have any feedback on v5.43 for you.

                                    I\'ve also been puzzled why the watchdog shutting down the thread always seems to lose the completed models of that task? I thought it would preserve them.
                                    ____________

                                    Rhiju
                                    Forum moderator
                                    Project developer
                                    Project scientist

                                    Joined: Feb 14 06
                                    Posts: 161
                                    ID: 4
                                    Credit: 3,725
                                    RAC: 0
                                    Message 2604 - Posted 13 Dec 2006 3:49:08 UTC - in response to Message 2603.

                                      Yea, the good news is that we can rerun the jobs with the seed that was sent out, and thus reproduce the error (and see where the job gets stuck). It will probably take a day or two of debugging, and we\'ll get that going in the new year.

                                      There are a lot of WU\'s queued for 5.43; but I think I may need to increase the size of the buffer. Thanks for the suggestions!

                                      Rhiju, I know you have some means of running the WUs with the same starting seed value, or heck if you know it died on model X, then start it on that model\'s starting value ...but if you run the WUs again, do they again require the watchdog on the same model? If so, then there\'s some better \"sniffing\" possible for your Rosetta blood hounds. If not, then there is something about the rest of the environment that causes the failure (like the screensaver, or memory issues, or what have you).

                                      I\'m still finishing the last Ralph WU to see if it will go 24hrs. And, after all of an hour apparently, it looks like that v5.43 WUs are all gone already, and I didn\'t get any, so I won\'t have any feedback on v5.43 for you.

                                      I\'ve also been puzzled why the watchdog shutting down the thread always seems to lose the completed models of that task? I thought it would preserve them.


                                      ____________

                                      Profile feet1st

                                      Joined: Mar 7 06
                                      Posts: 312
                                      ID: 1028
                                      Credit: 110,522
                                      RAC: 0
                                      Message 2605 - Posted 13 Dec 2006 4:22:39 UTC

                                        Last modified: 13 Dec 2006 5:15:18 UTC

                                        Thanks. I just nabbed two more WUs! I\'ll suspend the 5.42 task so I can see how 5.43 does overnight... on both threads of my HT CPU.

                                        I guess the reason I brought up running on the same seed etc. was that it should be pretty easy to set up a machine to run that one model and see if it needs the watchdog to kill it. ...and if it does not, then it implies there are still some problems with the screensaver (or were at 5.42 anyway). If you find indeed it does invoke the watchdog, then you\'ve got something for your 2007 \"to do\" list. But it would help assure that it is indeed the random event you mention... that just happened to occur with both our systems, or not.
                                        ____________

                                        Rhiju
                                        Forum moderator
                                        Project developer
                                        Project scientist

                                        Joined: Feb 14 06
                                        Posts: 161
                                        ID: 4
                                        Credit: 3,725
                                        RAC: 0
                                        Message 2606 - Posted 13 Dec 2006 5:23:53 UTC - in response to Message 2605.

                                          Great, let me know how those 5.43 WUs go. So far no errors.

                                          Regarding the stuck WUs, my hypothesis is that they\'re not graphics related; we\'ll be able to check this, of course, by running them locally with executables compiled without graphics. I like your idea of checking whether the problems on your machine correlate with the ones on our machines ... I\'ll see if I can get the seed and workunit for the latest result you posted.

                                          Thanks. I just nabbed two more WUs! I\'ll suspend the 5.42 task so I can see how 5.43 does overnight... on both threads of my HT CPU.

                                          I guess the reason I brought up running on the same seed etc. was that it should be pretty easy to set up a machine to run that one model and see if it needs the watchdog to kill it. ...and if it does not, then it implies there are still some problems with the screensaver (or were at 5.42 anyway). If you find indeed it does invoke the watchdog, then you\'ve got something for your 2007 \"to do\" list. But it would help assure that it is indeed the random event you mention... that just happened to occur with both our systems, or not.


                                          ____________

                                          Chu
                                          Forum moderator
                                          Project developer
                                          Project scientist

                                          Joined: Sep 26 06
                                          Posts: 61
                                          ID: 1900
                                          Credit: 12,545
                                          RAC: 0
                                          Message 2607 - Posted 13 Dec 2006 6:07:48 UTC - in response to Message 2601.

                                            I just put some docking WUs on ralph for graphic stability test and let\'s what will come out from that.

                                            Well, I\'m now back to my problem PC, and was about to blow the train whistle --Whoo hooo!-- when I saw my screensaver MOVING, and it was on model 94. But then I noticed it only crunched the first WU for almost exactly 3hrs. Now that I\'ve updated to project it shows watchdog ended it. And I commonly saw that same symptom on Rosetta once I activated the screensaver on the same host.

                                            Because the watchdog ended it, the messages tab just shows that the very long WU name \"finished\".

                                            The second WU I received is crunching happily now in to it\'s 10th hour. I could NEVER have lasted that long on v5.41. So, I can definately say \"more stable\". It is supposed to run for 24hrs, and that means if all goes well it will complete just after I leave for work in the AM.

                                            I should note that my hyperthreaded CPU is set in BOINC to only use 1 at the moment. That measure did not seem to make Rosetta any more stable. Now I should set it back to two, but the only other work I have is for Rosetta :) and if I have both running, I don\'t know how BOINC chooses the screensaver, but I think I\'ll botch my test.

                                            The \"docking\" WUs seemed to be the most graphically intense. Have some of those been queued up for test here?

                                            Profile feet1st

                                            Joined: Mar 7 06
                                            Posts: 312
                                            ID: 1028
                                            Credit: 110,522
                                            RAC: 0
                                            Message 2608 - Posted 13 Dec 2006 13:31:01 UTC

                                              Whoo - Hooo!!!

                                              Both WUs are still crunching away, screensaver ran all night. I didn\'t have Docking WUs, but with 5.41 I couldn\'t seem to get ANY WU to run that long with screensaver active.

                                              So, they\'ve both run on my HT Windows PC for about 9hours, the screensaver was active and cut out normally when I awoke this morning. This looks very promising. Unless they fail, I won\'t have any other feedback until late tonight Central time when they should have completed their 24hr runtime preference.
                                              ____________

                                              Profile UBT - Mikeejones

                                              Joined: Mar 22 06
                                              Posts: 2
                                              ID: 1169
                                              Credit: 3,174
                                              RAC: 0
                                              Message 2609 - Posted 13 Dec 2006 15:12:57 UTC

                                                Not really a problem but I noticed during this WU...

                                                13/12/2006 15:24:08|ralph@home|Starting task eps1__BOINC_JUMPABINITIO_SAVE_ALL_OUT_BARCODE_CRYOEM_truncate_hom001__1555_46_1 using rosetta_beta version 543

                                                That when the screensaver is in full screen the bottom of the graphics disappears off the bottom edge - can\'t see the line which includes \'accepted energy etc. It\'s only just there when set to a window.

                                                Just thought I\'d mention it.

                                                Mike

                                                ____________

                                                Profile anders n

                                                Joined: Feb 16 06
                                                Posts: 166
                                                ID: 91
                                                Credit: 131,419
                                                RAC: 0
                                                Message 2610 - Posted 13 Dec 2006 16:54:28 UTC

                                                  I have tried to fore a grafics error... without any luck.

                                                  Not that my compurets has a history of grafic errors.

                                                  Anders n
                                                  ____________

                                                  Rhiju
                                                  Forum moderator
                                                  Project developer
                                                  Project scientist

                                                  Joined: Feb 14 06
                                                  Posts: 161
                                                  ID: 4
                                                  Credit: 3,725
                                                  RAC: 0
                                                  Message 2611 - Posted 13 Dec 2006 18:10:03 UTC - in response to Message 2610.

                                                    Great this is all looking good! From our end, error rates are extremely low, and we have at least a couple reports of formerly problematic machines being \"cured\".
                                                    I\'ll do a rosetta@home update later today, and we\'ll monitor rosetta@home stuff very carefully over the next few days.

                                                    I have tried to fore a grafics error... without any luck.

                                                    Not that my compurets has a history of grafic errors.

                                                    Anders n


                                                    ____________

                                                    Profile feet1st

                                                    Joined: Mar 7 06
                                                    Posts: 312
                                                    ID: 1028
                                                    Credit: 110,522
                                                    RAC: 0
                                                    Message 2612 - Posted 14 Dec 2006 1:50:21 UTC

                                                      My two WUs are both still crunching strong 21hrs in to a 24hr run. This on the machine that couldn\'t run for more then about 3hrs without failing in some way or presenting a screensaver which was, well, cumbersome to regain control of your PC from. So, I think you\'re good to go!
                                                      ____________

                                                      Jon Bahen

                                                      Joined: Dec 6 06
                                                      Posts: 3
                                                      ID: 2337
                                                      Credit: 1,007
                                                      RAC: 0
                                                      Message 2615 - Posted 14 Dec 2006 3:50:33 UTC - in response to Message 2586.

                                                        We are trying to increase stability in this release... We have turned off mouse rotation and sidechains temporarily. Please let us know if you can force a crash by playing with the \"show graphics\" option from the boinc manager, or with your screensaver!


                                                        If I open the show graphics window then minimize and re-open the window is just black and then crashes

                                                        Profile feet1st

                                                        Joined: Mar 7 06
                                                        Posts: 312
                                                        ID: 1028
                                                        Credit: 110,522
                                                        RAC: 0
                                                        Message 2616 - Posted 14 Dec 2006 4:09:55 UTC

                                                          First 24hr WU reported through just fine. The second has another hour to go (that shows you how much CPU that graphics consumed during the day with the screensaver!)

                                                          ...I just got more Ralph 5.43 WUs downloaded?
                                                          ____________

                                                          Chu
                                                          Forum moderator
                                                          Project developer
                                                          Project scientist

                                                          Joined: Sep 26 06
                                                          Posts: 61
                                                          ID: 1900
                                                          Credit: 12,545
                                                          RAC: 0
                                                          Message 2617 - Posted 14 Dec 2006 6:14:00 UTC - in response to Message 2615.

                                                            Did it freeze (and you had to maually kill it) or just crash itself? Thanks.

                                                            We are trying to increase stability in this release... We have turned off mouse rotation and sidechains temporarily. Please let us know if you can force a crash by playing with the \"show graphics\" option from the boinc manager, or with your screensaver!


                                                            If I open the show graphics window then minimize and re-open the window is just black and then crashes

                                                            Jon Bahen

                                                            Joined: Dec 6 06
                                                            Posts: 3
                                                            ID: 2337
                                                            Credit: 1,007
                                                            RAC: 0
                                                            Message 2618 - Posted 14 Dec 2006 10:47:23 UTC - in response to Message 2617.

                                                              Did it freeze (and you had to maually kill it) or just crash itself? Thanks.

                                                              We are trying to increase stability in this release... We have turned off mouse rotation and sidechains temporarily. Please let us know if you can force a crash by playing with the \"show graphics\" option from the boinc manager, or with your screensaver!


                                                              If I open the show graphics window then minimize and re-open the window is just black and then crashes



                                                              It froze

                                                              Profile feet1st

                                                              Joined: Mar 7 06
                                                              Posts: 312
                                                              ID: 1028
                                                              Credit: 110,522
                                                              RAC: 0
                                                              Message 2620 - Posted 14 Dec 2006 13:47:41 UTC

                                                                I got another hang too. Similar feel to what occured before, only this time there was zero CPU being used, and TWO rosetta.exe\'s shown in the applications tab of Windows.

                                                                I had 2 run fine for 24hrs, then got two more that were crunching when I went to bed, these apparently ended prematurely and two more came down, these apparently both were invoked at some point as screensaver and the 1gup task was displayed (according to processing time shown) at the time it got hung up. Awoke this AM, screensaver non-responsive. It contacted the project directly from Rosetta.exe uploaded some diagnostics for you, and was still unresponsive so I right click the BOINC icon in the task bar and SNOOZE. This then caused both WUs to fail.

                                                                Here\'s what my messages since last night look like:

                                                                12/13/2006 11:40:37 PM|ralph@home|Computation for task eps1__BOINC_JUMPABINITIO_SAVE_ALL_OUT_BARCODE_CRYOEM_truncate_hom001__1555_2_0 finished
                                                                12/13/2006 11:40:37 PM|ralph@home|Starting task 2chf__BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1560_3_0 using rosetta_beta version 543
                                                                12/13/2006 11:40:39 PM|ralph@home|Started upload of file eps1__BOINC_JUMPABINITIO_SAVE_ALL_OUT_BARCODE_CRYOEM_truncate_hom001__1555_2_0_0
                                                                12/13/2006 11:41:07 PM|ralph@home|Finished upload of file eps1__BOINC_JUMPABINITIO_SAVE_ALL_OUT_BARCODE_CRYOEM_truncate_hom001__1555_2_0_0
                                                                12/13/2006 11:41:07 PM|ralph@home|Throughput 5822 bytes/sec
                                                                12/13/2006 11:41:23 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                                12/13/2006 11:41:23 PM|ralph@home|Reason: Requested by user
                                                                12/13/2006 11:41:23 PM|ralph@home|Reporting 1 tasks
                                                                12/13/2006 11:41:28 PM|ralph@home|Scheduler request succeeded
                                                                12/14/2006 2:01:58 AM||Rescheduling CPU: application exited
                                                                12/14/2006 2:01:58 AM|ralph@home|Computation for task 1shfA_BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_3_0 finished
                                                                12/14/2006 2:01:58 AM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                                12/14/2006 2:01:58 AM|ralph@home|Reason: To fetch work
                                                                12/14/2006 2:01:58 AM|ralph@home|Requesting 306440 seconds of new work
                                                                12/14/2006 2:02:00 AM|ralph@home|Started upload of file 1shfA_BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_3_0_0
                                                                12/14/2006 2:02:07 AM|ralph@home|Finished upload of file 1shfA_BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_3_0_0
                                                                12/14/2006 2:02:07 AM|ralph@home|Throughput 1934 bytes/sec
                                                                12/14/2006 2:02:08 AM|ralph@home|Scheduler request succeeded
                                                                12/14/2006 2:02:10 AM|ralph@home|Started download of file frags83_1fna_.fasta.gz
                                                                12/14/2006 2:02:10 AM|ralph@home|Started download of file frags83_1fna_.psipred_ss2.gz
                                                                12/14/2006 2:02:15 AM|ralph@home|Finished download of file frags83_1fna_.fasta.gz
                                                                12/14/2006 2:02:15 AM|ralph@home|Throughput 38 bytes/sec
                                                                12/14/2006 2:02:15 AM|ralph@home|Finished download of file frags83_1fna_.psipred_ss2.gz
                                                                12/14/2006 2:02:15 AM|ralph@home|Throughput 310 bytes/sec
                                                                12/14/2006 2:02:15 AM|ralph@home|Started download of file boinc_frags83_aa1fna_03_05.200_v1_3.gz
                                                                12/14/2006 2:02:15 AM|ralph@home|Started download of file boinc_frags83_aa1fna_09_05.200_v1_3.gz
                                                                12/14/2006 2:02:24 AM|ralph@home|Finished download of file boinc_frags83_aa1fna_09_05.200_v1_3.gz
                                                                12/14/2006 2:02:24 AM|ralph@home|Throughput 24131 bytes/sec
                                                                12/14/2006 2:02:24 AM|ralph@home|Started download of file frags83_1fna.pdb.gz
                                                                12/14/2006 2:02:27 AM|ralph@home|Finished download of file frags83_1fna.pdb.gz
                                                                12/14/2006 2:02:27 AM|ralph@home|Throughput 6361 bytes/sec
                                                                12/14/2006 2:02:27 AM|ralph@home|Started download of file frags83_1gvp_.fasta.gz
                                                                12/14/2006 2:02:29 AM|ralph@home|Finished download of file frags83_1gvp_.fasta.gz
                                                                12/14/2006 2:02:29 AM|ralph@home|Throughput 113 bytes/sec
                                                                12/14/2006 2:02:29 AM|ralph@home|Started download of file frags83_1gvp_.psipred_ss2.gz
                                                                12/14/2006 2:02:31 AM|ralph@home|Finished download of file frags83_1gvp_.psipred_ss2.gz
                                                                12/14/2006 2:02:31 AM|ralph@home|Throughput 882 bytes/sec
                                                                12/14/2006 2:02:31 AM|ralph@home|Started download of file boinc_frags83_aa1gvp_03_05.200_v1_3.gz
                                                                12/14/2006 2:02:32 AM|ralph@home|Finished download of file boinc_frags83_aa1fna_03_05.200_v1_3.gz
                                                                12/14/2006 2:02:32 AM|ralph@home|Throughput 49290 bytes/sec
                                                                12/14/2006 2:02:32 AM|ralph@home|Started download of file boinc_frags83_aa1gvp_09_05.200_v1_3.gz
                                                                12/14/2006 2:02:33 AM||Rescheduling CPU: files downloaded
                                                                12/14/2006 2:02:33 AM|ralph@home|Starting task 1fna__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 using rosetta_beta version 543
                                                                12/14/2006 2:02:40 AM|ralph@home|Finished download of file boinc_frags83_aa1gvp_09_05.200_v1_3.gz
                                                                12/14/2006 2:02:40 AM|ralph@home|Throughput 26148 bytes/sec
                                                                12/14/2006 2:02:40 AM|ralph@home|Started download of file frags83_1gvp.pdb.gz
                                                                12/14/2006 2:02:43 AM|ralph@home|Finished download of file frags83_1gvp.pdb.gz
                                                                12/14/2006 2:02:43 AM|ralph@home|Throughput 7169 bytes/sec
                                                                12/14/2006 2:02:48 AM|ralph@home|Finished download of file boinc_frags83_aa1gvp_03_05.200_v1_3.gz
                                                                12/14/2006 2:02:48 AM|ralph@home|Throughput 47573 bytes/sec
                                                                12/14/2006 2:02:49 AM||Rescheduling CPU: files downloaded
                                                                12/14/2006 3:41:12 AM||Rescheduling CPU: application exited
                                                                12/14/2006 3:41:12 AM|ralph@home|Computation for task 2chf__BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1560_3_0 finished
                                                                12/14/2006 3:41:12 AM|ralph@home|Starting task 1gvp__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 using rosetta_beta version 543
                                                                12/14/2006 3:41:14 AM|ralph@home|Started upload of file 2chf__BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1560_3_0_0
                                                                12/14/2006 3:41:21 AM|ralph@home|Finished upload of file 2chf__BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1560_3_0_0
                                                                12/14/2006 3:41:21 AM|ralph@home|Throughput 586 bytes/sec
                                                                12/14/2006 8:37:49 AM||Suspending computation - user request
                                                                12/14/2006 8:37:49 AM|ralph@home|Pausing task 1fna__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 (left in memory)
                                                                12/14/2006 8:37:49 AM|ralph@home|Pausing task 1gvp__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 (left in memory)
                                                                12/14/2006 8:37:49 AM||Suspending network activity - user request
                                                                12/14/2006 8:37:51 AM|ralph@home|rosetta_beta not responding to screensaver, exiting
                                                                12/14/2006 8:38:00 AM|ralph@home|Unrecoverable error for result 1gvp__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 ( - exit code -1 (0xffffffff))
                                                                12/14/2006 8:38:00 AM|ralph@home|Deferring scheduler requests for 1 minutes and 0 seconds
                                                                12/14/2006 8:38:00 AM||Rescheduling CPU: application exited
                                                                12/14/2006 8:38:00 AM|ralph@home|Computation for task 1gvp__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 finished
                                                                12/14/2006 8:40:03 AM|ralph@home|Unrecoverable error for result 1fna__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 ( - exit code -1073741819 (0xc0000005))
                                                                12/14/2006 8:40:03 AM|ralph@home|Deferring scheduler requests for 1 minutes and 0 seconds
                                                                12/14/2006 8:40:03 AM||Rescheduling CPU: application exited
                                                                12/14/2006 8:40:03 AM|ralph@home|Computation for task 1fna__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1559_7_0 finished
                                                                12/14/2006 8:40:34 AM||Resuming computation
                                                                12/14/2006 8:40:34 AM||Rescheduling CPU: Resuming computation
                                                                12/14/2006 8:40:34 AM||Resuming round-robin CPU scheduling.
                                                                12/14/2006 8:41:12 AM||Resuming network activity
                                                                12/14/2006 8:41:12 AM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                                12/14/2006 8:41:12 AM|ralph@home|Reason: Requested by user
                                                                12/14/2006 8:41:12 AM|ralph@home|Requesting 345600 seconds of new work, and reporting 4 completed tasks

                                                                ____________

                                                                Profile feet1st

                                                                Joined: Mar 7 06
                                                                Posts: 312
                                                                ID: 1028
                                                                Credit: 110,522
                                                                RAC: 0
                                                                Message 2621 - Posted 14 Dec 2006 14:48:47 UTC

                                                                  Oh, forgot, that host isn\'t the only one of mine with Ralph WUs anymore. So that\'s the link to the host where I\'ve been running the screensaver tests. Perhaps bringing back the hyperthreading impacted it?
                                                                  ____________

                                                                  Rhiju
                                                                  Forum moderator
                                                                  Project developer
                                                                  Project scientist

                                                                  Joined: Feb 14 06
                                                                  Posts: 161
                                                                  ID: 4
                                                                  Credit: 3,725
                                                                  RAC: 0
                                                                  Message 2622 - Posted 14 Dec 2006 19:52:25 UTC - in response to Message 2621.

                                                                    Feet1st, I\'m a little confused. Did you do something (like activate hyperthreading) that might have caused that host to start crashing? It crunched one workunit fine, but the rest appear to be watchdog problems or outright crashes.

                                                                    Oh, forgot, that host isn\'t the only one of mine with Ralph WUs anymore. So that\'s the link to the host where I\'ve been running the screensaver tests. Perhaps bringing back the hyperthreading impacted it?


                                                                    ____________

                                                                    Profile feet1st

                                                                    Joined: Mar 7 06
                                                                    Posts: 312
                                                                    ID: 1028
                                                                    Credit: 110,522
                                                                    RAC: 0
                                                                    Message 2623 - Posted 14 Dec 2006 20:02:11 UTC

                                                                      Correct, I used to have BOINC set to use only one CPU. I had done this back on Rosetta 5.41 to see if it effected the screensaver issue. It didn\'t seem to. I retained that setting on 5.42 by default and later got more adventurous and changed BOINC to allow \"both\" to run.

                                                                      So, the PC enabled hypterthread all along. But now BOINC is set to run on 2 CPUs.
                                                                      ____________

                                                                      Rhiju
                                                                      Forum moderator
                                                                      Project developer
                                                                      Project scientist

                                                                      Joined: Feb 14 06
                                                                      Posts: 161
                                                                      ID: 4
                                                                      Credit: 3,725
                                                                      RAC: 0
                                                                      Message 2624 - Posted 14 Dec 2006 23:15:47 UTC - in response to Message 2623.

                                                                        OK, thanks for the clarification.

                                                                        Some followup news -- we can\'t seem to reproduce the hanging of particular workunits even if we use the same random seed. However, we do have a PC that hangs at least once a day, kind of like one of your hosts! And Chu has been able to put in a debugger to see where it gets stuck... very useful. Anyway, Chu is now testing to see if it behaves better when we disallow any display of graphics...

                                                                        Correct, I used to have BOINC set to use only one CPU. I had done this back on Rosetta 5.41 to see if it effected the screensaver issue. It didn\'t seem to. I retained that setting on 5.42 by default and later got more adventurous and changed BOINC to allow \"both\" to run.

                                                                        So, the PC enabled hypterthread all along. But now BOINC is set to run on 2 CPUs.


                                                                        ____________

                                                                        genes
                                                                        Avatar

                                                                        Joined: Feb 16 06
                                                                        Posts: 45
                                                                        ID: 57
                                                                        Credit: 43,300
                                                                        RAC: 0
                                                                        Message 2625 - Posted 15 Dec 2006 1:21:24 UTC

                                                                          I had a WU fail today, this message was in the log:

                                                                          12/14/2006 9:14:07 PM|ralph@home|Unrecoverable error for result 1ten__BOINC_POSE_ABRELAX_VARY_ALL_BOND_ANGLES_VARY_ALL_BOND_DISTANCES_NEWRELAXFLAGS_frags83__1561_15_0 ( - exit code -1073741819 (0xc0000005))

                                                                          This result: resultid=362757

                                                                          I came back to the computer and had a Windows error message on the screen \"Please tell Microsoft about this problem...\" . I don\'t know if graphics were involved, since I was out, however I do have graphics enabled on this machine, and it is a multiprocessor machine. hostid=2016

                                                                          ____________

                                                                          fastdude

                                                                          Joined: Dec 13 06
                                                                          Posts: 4
                                                                          ID: 2367
                                                                          Credit: 113
                                                                          RAC: 0
                                                                          Message 2627 - Posted 15 Dec 2006 9:45:42 UTC

                                                                            Last modified: 15 Dec 2006 10:30:32 UTC

                                                                            I\'ve got a live one!

                                                                            I can\'t do anyting with the graphic, no rotate, zoom etc. therefore no crash, as per all these \"beta / test\" WU\'s.
                                                                            but
                                                                            when viewing the graphic, very slow to show the image, black screen for a few sec then very slow to update the graphic.

                                                                            WU at 1% but has taken 50min of cpu.

                                                                            then go back to boinc when typing this message, and the WU shows 1.5% complete and 4 min of cpu time.

                                                                            it didn\'t crash though.

                                                                            boinc 5.75
                                                                            rosetta 5.43
                                                                            ralph rosetta_beta 5.43

                                                                            fastdude

                                                                            Joined: Dec 13 06
                                                                            Posts: 4
                                                                            ID: 2367
                                                                            Credit: 113
                                                                            RAC: 0
                                                                            Message 2628 - Posted 15 Dec 2006 11:43:29 UTC - in response to Message 2627.

                                                                              I\'ve got a live one!

                                                                              I can\'t do anyting with the graphic, no rotate, zoom etc. therefore no crash, as per all these \"beta / test\" WU\'s.
                                                                              but
                                                                              when viewing the graphic, very slow to show the image, black screen for a few sec then very slow to update the graphic.

                                                                              WU at 1% but has taken 50min of cpu.

                                                                              then go back to boinc when typing this message, and the WU shows 1.5% complete and 4 min of cpu time.

                                                                              it didn\'t crash though.

                                                                              boinc 5.75
                                                                              rosetta 5.43
                                                                              ralph rosetta_beta 5.43


                                                                              another of the same type of POS WU.

                                                                              Done 1 1/2 h but still showing 1% and the graphics perform like a stunned mullet.

                                                                              Chu
                                                                              Forum moderator
                                                                              Project developer
                                                                              Project scientist

                                                                              Joined: Sep 26 06
                                                                              Posts: 61
                                                                              ID: 1900
                                                                              Credit: 12,545
                                                                              RAC: 0
                                                                              Message 2629 - Posted 15 Dec 2006 17:03:34 UTC - in response to Message 2627.

                                                                                Sidechains, zooming and rotating has been disabled in the current application to help us narrow down the cause of graphic crash. So it is normal that you can not do anything on the screen and since Rosetta spends most of its time in high-resolution refinement ( moving backbone a little and refine sidechains ), it is also normal to hardly observe changes on the screen. However, the step number, cpu time should change frequently to reflect that the WU is still alive . If the WU is still working on generating its first model, it shows the progress at 1% for a while.

                                                                                Not sure about the slow graphic updating. Do you have other windows application running at the same time which also share cpu, memory and other resouces as well?

                                                                                I\'ve got a live one!

                                                                                I can\'t do anyting with the graphic, no rotate, zoom etc. therefore no crash, as per all these \"beta / test\" WU\'s.
                                                                                but
                                                                                when viewing the graphic, very slow to show the image, black screen for a few sec then very slow to update the graphic.

                                                                                WU at 1% but has taken 50min of cpu.

                                                                                then go back to boinc when typing this message, and the WU shows 1.5% complete and 4 min of cpu time.

                                                                                it didn\'t crash though.

                                                                                boinc 5.75
                                                                                rosetta 5.43
                                                                                ralph rosetta_beta 5.43

                                                                                Chu
                                                                                Forum moderator
                                                                                Project developer
                                                                                Project scientist

                                                                                Joined: Sep 26 06
                                                                                Posts: 61
                                                                                ID: 1900
                                                                                Credit: 12,545
                                                                                RAC: 0
                                                                                Message 2630 - Posted 15 Dec 2006 17:16:26 UTC - in response to Message 2625.

                                                                                  Hi gene, that job just crashed and did not freeze your computer, right? From users\' report and my local test, it looks like that if a frozen WU is forced to be terminated, it reports error code as - exit code 1073807364 (0x40010004). If a WU just crashes itself without freezing the host computer, it will reports error code as -1073741819 (0xc0000005).

                                                                                  I had a WU fail today, this message was in the log:

                                                                                  12/14/2006 9:14:07 PM|ralph@home|Unrecoverable error for result 1ten__BOINC_POSE_ABRELAX_VARY_ALL_BOND_ANGLES_VARY_ALL_BOND_DISTANCES_NEWRELAXFLAGS_frags83__1561_15_0 ( - exit code -1073741819 (0xc0000005))

                                                                                  This result: resultid=362757

                                                                                  I came back to the computer and had a Windows error message on the screen \"Please tell Microsoft about this problem...\" . I don\'t know if graphics were involved, since I was out, however I do have graphics enabled on this machine, and it is a multiprocessor machine. hostid=2016

                                                                                  Profile feet1st

                                                                                  Joined: Mar 7 06
                                                                                  Posts: 312
                                                                                  ID: 1028
                                                                                  Credit: 110,522
                                                                                  RAC: 0
                                                                                  Message 2631 - Posted 15 Dec 2006 23:32:16 UTC

                                                                                    I\'m having good success with 5.43 on my host which previously had problems when screensaver was active. Hyperthreaded, two WUs ran all night with no ss (I forgot! had turned it off when Ralph ran out of work so R@H wouldn\'t hang), then all day with ss active, one just reported in with over 22hrs crunch time (on 24hr pref), and the avg time per model seems like that was a reasonable time to end.

                                                                                    I\'ve received some work on my other two hosts as well, but they do not use ss. All seems well there as well.
                                                                                    ____________

                                                                                    genes
                                                                                    Avatar

                                                                                    Joined: Feb 16 06
                                                                                    Posts: 45
                                                                                    ID: 57
                                                                                    Credit: 43,300
                                                                                    RAC: 0
                                                                                    Message 2632 - Posted 16 Dec 2006 3:54:43 UTC - in response to Message 2630.

                                                                                      Last modified: 16 Dec 2006 4:47:34 UTC

                                                                                      Yes, that is correct, it did not freeze the computer.

                                                                                      Edit: I can usually prevent frozen WU\'s from being terminated by pressing ctrl-alt-del to get the task manager to be displayed. I will also usually get the taskbar, so I can then close the BOINC manager on the taskbar, then the BOINC CC in the systray. If I do that (in that order), BOINC shuts down all the projects in an orderly manner, and restarting BOINC will then restart all the current WU\'s from their last checkpoint, even the frozen Rosetta. It will then usually pass the point at which it froze and complete normally. This kind of freezing is usually due to the graphics (running in screensaver mode).

                                                                                      Edit: I\'ve had this one fail recently. resultid=363974 It produced a lot of nice debug output.

                                                                                      Hi gene, that job just crashed and did not freeze your computer, right? From users\' report and my local test, it looks like that if a frozen WU is forced to be terminated, it reports error code as - exit code 1073807364 (0x40010004). If a WU just crashes itself without freezing the host computer, it will reports error code as -1073741819 (0xc0000005).
                                                                                      I had a WU fail today, this message was in the log:

                                                                                      12/14/2006 9:14:07 PM|ralph@home|Unrecoverable error for result 1ten__BOINC_POSE_ABRELAX_VARY_ALL_BOND_ANGLES_VARY_ALL_BOND_DISTANCES_NEWRELAXFLAGS_frags83__1561_15_0 ( - exit code -1073741819 (0xc0000005))

                                                                                      This result: resultid=362757

                                                                                      I came back to the computer and had a Windows error message on the screen \"Please tell Microsoft about this problem...\" . I don\'t know if graphics were involved, since I was out, however I do have graphics enabled on this machine, and it is a multiprocessor machine. hostid=2016



                                                                                      ____________

                                                                                      fastdude

                                                                                      Joined: Dec 13 06
                                                                                      Posts: 4
                                                                                      ID: 2367
                                                                                      Credit: 113
                                                                                      RAC: 0
                                                                                      Message 2633 - Posted 16 Dec 2006 7:52:51 UTC

                                                                                        No further graphic issues noticed with rosetta 5.43, ralph is suspended & not getting tasks.

                                                                                        the previous issue happened when alternating between ralph & rosetta.

                                                                                        I will allow ralph to get new tasks again and check it out.

                                                                                        btw the box is using a GF4 mx 64MB agp 4x video card.

                                                                                        Profile feet1st

                                                                                        Joined: Mar 7 06
                                                                                        Posts: 312
                                                                                        ID: 1028
                                                                                        Credit: 110,522
                                                                                        RAC: 0
                                                                                        Message 2634 - Posted 16 Dec 2006 15:18:32 UTC

                                                                                          Well... I WAS having better success. Last night I had two ended by the watchdog. The ss this morning was just a BOINC logo, since I had suspended Rosetta, and the Ralph WUs completed.

                                                                                          I have another end with Exit status -1073741819.

                                                                                          Suggest you place a timestamp in stderr. That would help to see if the watchdog kicked both WUs out at about the same time (i.e. that they both had hung at the same time an hour before).
                                                                                          ____________

                                                                                          Leffe

                                                                                          Joined: Feb 19 06
                                                                                          Posts: 10
                                                                                          ID: 596
                                                                                          Credit: 3,683
                                                                                          RAC: 0
                                                                                          Message 2635 - Posted 16 Dec 2006 17:59:03 UTC

                                                                                            hereĀ“s 1 comp.error:
                                                                                            16/12/2006 03:27:17|ralph@home|Starting task 2reb__TREEJUMP_ABRELAX__NEWRELAXFLAGS_TOP1__1565_17_0 using rosetta_beta version 543
                                                                                            16/12/2006 03:27:19||Suspending work fetch because computer is overcommitted.
                                                                                            16/12/2006 04:19:16|ralph@home|Unrecoverable error for result 2reb__TREEJUMP_ABRELAX__NEWRELAXFLAGS_TOP1__1565_17_0 ( - exit code -1073741819 (0xc0000005))
                                                                                            16/12/2006 04:19:16|ralph@home|Deferring scheduler requests for 1 minutes and 0 seconds
                                                                                            16/12/2006 04:19:16||Rescheduling CPU: application exited
                                                                                            16/12/2006 04:19:16|ralph@home|Computation for task 2reb__TREEJUMP_ABRELAX__NEWRELAXFLAGS_TOP1__1565_17_0 finished

                                                                                            Leffe
                                                                                            ____________

                                                                                            FluffyChicken

                                                                                            Joined: Feb 17 06
                                                                                            Posts: 54
                                                                                            ID: 414
                                                                                            Credit: 710
                                                                                            RAC: 0
                                                                                            Message 2636 - Posted 17 Dec 2006 8:42:04 UTC

                                                                                              Have you given Jack Schonbrun a ring ? If I remember correctly he did the initial graphics setup, he may know an error or two ...
                                                                                              (though David Kim did the rotation if memory serves me right)
                                                                                              ____________

                                                                                              FluffyChicken

                                                                                              Joined: Feb 17 06
                                                                                              Posts: 54
                                                                                              ID: 414
                                                                                              Credit: 710
                                                                                              RAC: 0
                                                                                              Message 2637 - Posted 17 Dec 2006 8:54:04 UTC - in response to Message 2630.

                                                                                                it reports error code as - exit code 1073807364 (0x40010004).



                                                                                                I believe 0x40010004 means \'task is/was running\' which would correspond to it being forced to close (via task manager or the hung program do you wish to kill it question.)



                                                                                                Maybe you should give out the graphics code (and calling messages) see if anyone in that field can debug and help you.



                                                                                                ____________

                                                                                                Profile feet1st

                                                                                                Joined: Mar 7 06
                                                                                                Posts: 312
                                                                                                ID: 1028
                                                                                                Credit: 110,522
                                                                                                RAC: 0
                                                                                                Message 2638 - Posted 17 Dec 2006 16:56:44 UTC

                                                                                                  Last modified: 17 Dec 2006 17:44:35 UTC

                                                                                                  My two docking WUs both ended prematurely (watchdog...20 credits). When I awoke this morning the screen saver was not hung, then I moved the mouse to take the ss down and it froze. Task manager shows one task getting 65% of my hyperthreaded CPU, and the other getting 35%. Odd thing is the WU that was getting the 35% was the one the graphic was being displayed for (you could tell by the elapsed time on it as shown in the graphic and in the CPU time shown in task manager).

                                                                                                  Now that I ended the application that was not responding, I crashed a WU (- exit code 1073807364 ...a positive number?), but it was the one that was getting the 65% of CPU. The threads seem confused about who is doing what.
                                                                                                  ____________

                                                                                                  Profile feet1st

                                                                                                  Joined: Mar 7 06
                                                                                                  Posts: 312
                                                                                                  ID: 1028
                                                                                                  Credit: 110,522
                                                                                                  RAC: 0
                                                                                                  Message 2639 - Posted 17 Dec 2006 19:16:48 UTC

                                                                                                    I just had that happen again. 65/35 split, with the one being displayed being the one that\'s getting 35%, and it didn\'t hang until I tried to use the computer.
                                                                                                    ____________

                                                                                                    FluffyChicken

                                                                                                    Joined: Feb 17 06
                                                                                                    Posts: 54
                                                                                                    ID: 414
                                                                                                    Credit: 710
                                                                                                    RAC: 0
                                                                                                    Message 2640 - Posted 17 Dec 2006 20:22:32 UTC - in response to Message 2638.

                                                                                                      My two docking WUs both ended prematurely (watchdog...20 credits). When I awoke this morning the screen saver was not hung, then I moved the mouse to take the ss down and it froze. Task manager shows one task getting 65% of my hyperthreaded CPU, and the other getting 35%. Odd thing is the WU that was getting the 35% was the one the graphic was being displayed for (you could tell by the elapsed time on it as shown in the graphic and in the CPU time shown in task manager).

                                                                                                      Now that I ended the application that was not responding, I crashed a WU (- exit code 1073807364 ...a positive number?), but it was the one that was getting the 65% of CPU. The threads seem confused about who is doing what.


                                                                                                      Have you tested it with 5.8.0 yet to see if there is a difference ?


                                                                                                      ____________

                                                                                                      Profile feet1st

                                                                                                      Joined: Mar 7 06
                                                                                                      Posts: 312
                                                                                                      ID: 1028
                                                                                                      Credit: 110,522
                                                                                                      RAC: 0
                                                                                                      Message 2642 - Posted 18 Dec 2006 13:31:57 UTC

                                                                                                        Have you tested it with 5.8.0 yet to see if there is a difference ?

                                                                                                        I\'m trying to change as few variables at a time as I can. Right now I feel I\'m a pretty reliable litmas test for the straight, common, non-beta (\'cepting Ralph) Windows installation. So, even if updating BOINC would help, that isn\'t the answer we need until it becomes the stable and \"recommended\" release.

                                                                                                        With my 24hr work units, and not knowing how much longer they plan to test, I\'m never sure if I\'ve got another 2 or 3 days to feedback or not. And I\'d prefer to run a new Rosetta verison in that time if they release one, rather then a new BOINC version.

                                                                                                        But I\'d be glad to do it if Chu or Rhiju feel it would be helpful to their study of the problems.
                                                                                                        ____________

                                                                                                        Andrew Leaver-Fay

                                                                                                        Joined: Apr 14 06
                                                                                                        Posts: 2
                                                                                                        ID: 1285
                                                                                                        Credit: 5,272
                                                                                                        RAC: 0
                                                                                                        Message 2643 - Posted 18 Dec 2006 14:08:36 UTC

                                                                                                          Hi Rhiju

                                                                                                          I\'ve noticed that ralph is running on my mac during the day even when I set my BOINC preferences so that it should only run at night (on a single processor between 6pm and 8am).

                                                                                                          This is a lot like the problems I\'ve had with boinc before which seemed to disappear when I updated my boinc client following DK\'s advice. It almost seems it\'s ralph and not the boinc manager that is the source of the problem; when there were no jobs to run from ralph for a while (and I was running Rosetta@home jobs only) things went smoothly.

                                                                                                          Ralph is now taking 163% of my processor time (from top -- I have two processors). Top thinks the executables name is \"rosetta_be\" but clips the name at the \"be\" -- probably rosetta_beta. The job has been running for 157 hours. I haven\'t earned any credits for the past week, though.

                                                                                                          I ran ps -aux and the job looks like:

                                                                                                          rosetta_beta_5.43_powerpc-apple-darwin xx 1rnb A -output_silent_gz -silent -increase_cycles 10 -new_centroid_packing -pose_abinitio -pose_relax -pose_relax_fragment_moves -out

                                                                                                          ps clips the command line, so it could have gone on for another page or two.

                                                                                                          Seems to be a worrying bug: boinc is burning clock cycles, but not dispensing credit.

                                                                                                          Best,
                                                                                                          Andrew
                                                                                                          ____________

                                                                                                          Profile feet1st

                                                                                                          Joined: Mar 7 06
                                                                                                          Posts: 312
                                                                                                          ID: 1028
                                                                                                          Credit: 110,522
                                                                                                          RAC: 0
                                                                                                          Message 2644 - Posted 18 Dec 2006 15:09:10 UTC

                                                                                                            Andrew, it sounds like you have a task that has slipped away from BOINC\'s control. We see this happen once and a while on Windows. It is almost like BOINC doesn\'t realize it is still running. And this may be why it\'s running during non-scheduled hours, cuz it doesn\'t respond when BOINC says \"whoa!\".

                                                                                                            But you say it has been running for 157hrs. And THAT sounds like a task that has slipped away from the Rosetta (or Ralph) watchdog. Because it appears your run time preference is only an hour. The watchdog should have kicked in some time ago.

                                                                                                            Ralph shows you\'ve only got one task that is not yet completed and it was sent on Thursday. And by my math, 157hrs have not elapsed since Thursday, so now I\'m really confused. In fact, with a Ralph 4 day deadline, times 24hrs... the deadline is 96hrs. I did see a case on Windows where a task managed to tally up more then one second per second of wall-clock time. This was due to the screensaver problems, and a hyperthreaded CPU.

                                                                                                            Suggest you take note of the process ID at a minimum and that way you can tell for certain if it does end and another starts up. If the task really has run for that long, I\'d abort it. Or at least suspend it and resume it again. But, since BOINC doesn\'t seem in contol of that task... I guess I\'d end BOINC for 5 minutes, the task should then end. If it doesn\'t then reboot or end that process. Then restart BOINC.
                                                                                                            ____________

                                                                                                            FluffyChicken

                                                                                                            Joined: Feb 17 06
                                                                                                            Posts: 54
                                                                                                            ID: 414
                                                                                                            Credit: 710
                                                                                                            RAC: 0
                                                                                                            Message 2645 - Posted 19 Dec 2006 8:51:06 UTC - in response to Message 2644.

                                                                                                              Andrew, it sounds like you have a task that has slipped away from BOINC\'s control. We see this happen once and a while on Windows. It is almost like BOINC doesn\'t realize it is still running. And this may be why it\'s running during non-scheduled hours, cuz it doesn\'t respond when BOINC says \"whoa!\".

                                                                                                              But you say it has been running for 157hrs. And THAT sounds like a task that has slipped away from the Rosetta (or Ralph) watchdog. Because it appears your run time preference is only an hour. The watchdog should have kicked in some time ago.

                                                                                                              Ralph shows you\'ve only got one task that is not yet completed and it was sent on Thursday. And by my math, 157hrs have not elapsed since Thursday, so now I\'m really confused. In fact, with a Ralph 4 day deadline, times 24hrs... the deadline is 96hrs. I did see a case on Windows where a task managed to tally up more then one second per second of wall-clock time. This was due to the screensaver problems, and a hyperthreaded CPU.

                                                                                                              Suggest you take note of the process ID at a minimum and that way you can tell for certain if it does end and another starts up. If the task really has run for that long, I\'d abort it. Or at least suspend it and resume it again. But, since BOINC doesn\'t seem in contol of that task... I guess I\'d end BOINC for 5 minutes, the task should then end. If it doesn\'t then reboot or end that process. Then restart BOINC.


                                                                                                              Since 5.8.x is planned to be released shortly (shouldn\'t be to far away, but then they have said that before ;) The code is not going to change much, probably just sime simpleGUI fixes.

                                                                                                              side / anyways, why are you running 24hr tasks on Ralph I thought they wanted them short over here, would create more application swapping as well.#
                                                                                                              /side

                                                                                                              anyways, it was the communication problem (0x40010004) still running and boinc getting confused problem I was wondering about, the swithcing between the screensaver graphics. There where quite a few fixes going on with screensaver/graphics and starting/stopping/stalling of tasks.
                                                                                                              I thought a quick days testing on 3hr tasks should see if it is more stable. Seeing the rate you report them and seem to be able to cause it to happen ;-)

                                                                                                              The other error (0xc0000005) has been in many projects, Rosetta before, Einstien about a year ago, CPDN as well and was certainly always related to the graphics.


                                                                                                              Maybe a true test would be to replace the graphics with the default boinc one, see if it still happens .



                                                                                                              ____________

                                                                                                              Chu
                                                                                                              Forum moderator
                                                                                                              Project developer
                                                                                                              Project scientist

                                                                                                              Joined: Sep 26 06
                                                                                                              Posts: 61
                                                                                                              ID: 1900
                                                                                                              Credit: 12,545
                                                                                                              RAC: 0
                                                                                                              Message 2646 - Posted 19 Dec 2006 22:26:33 UTC

                                                                                                                We suspect it is a problem of thread synchronization. Basically Rosetta working thread does the simulation which changes all the atom coordinates ( which are saved in shared memory) while the graphic thread tries to read data from that place to draw the graphic or screensaver. Currently there is no locking mechanism to ensure the shared memory is accessed by one thread at a time and this could generate some conflicts or memory corruption and then trigger an error. On one of our local computers, when screensaver or graphic is turned on, it caught errors at a rate of at least one per day on average and without any graphics, it ran flawlessly. The errors which have been observed include crashing(0xc0000005), hung-up (0x40010004) and being stuck( watchdog ending). All the errors were not reproducable with same random number seeds and we think that is due to the radomness in graphic process. Another side proof was that showing sidechains requires accessing shared memory more often and intensively, and after turning off sidechains and rotating, the graphic error rates drop but the problem is not solved completely. There seems to be an correlation between two.

                                                                                                                Anyway, our plan is to add a thread locking mechanism in the next release to see if this helps. This will probably happen after the holiday season. I believe the new boinc 5.8.x should also help to reduce the error rate. Thank everyone for helping test on this issue.

                                                                                                                genes
                                                                                                                Avatar

                                                                                                                Joined: Feb 16 06
                                                                                                                Posts: 45
                                                                                                                ID: 57
                                                                                                                Credit: 43,300
                                                                                                                RAC: 0
                                                                                                                Message 2647 - Posted 20 Dec 2006 2:17:57 UTC

                                                                                                                  Got an error on this WU today: resultid=375110

                                                                                                                  It was an 0xC0000005.

                                                                                                                  Running with a Quad Xeon (Sossaman), XPSP2 and Boinc core 5.8.0. Graphics were enabled. No biggie, just sayin\'.

                                                                                                                  Errors seem to be somewhat less with 5.8.0 and 5.43, but they still happen. Not shown, of course, are the ones that I save by stopping and restarting Boinc instead of aborting them. Locking/mutexes are the way to go.

                                                                                                                  ____________

                                                                                                                  Profile feet1st

                                                                                                                  Joined: Mar 7 06
                                                                                                                  Posts: 312
                                                                                                                  ID: 1028
                                                                                                                  Credit: 110,522
                                                                                                                  RAC: 0
                                                                                                                  Message 2648 - Posted 20 Dec 2006 3:53:51 UTC

                                                                                                                    Sounds like a lot of the \"random failures\" will be eliminated!! YIPPIE!!!

                                                                                                                    I take it you can confirm that hyperthreaded CPUs saw more crashes as well?

                                                                                                                    ...but now you\'re in for the next problem. PERFORMANCE! The crunching thread will basically be on hold until the graphic thread does it\'s thing. If you weren\'t already doing that, it will be quite a difference.

                                                                                                                    I have no idea if this is feasible or not, but if you can segment memory in to classes or front to back of the protein or place some sort of order to it, perhaps you can have a semaphore that is more granular then just \"1=crunch\", \"0=Graphic\" access to shared memory. If you can get more granular, then perhaps the graphic can get what it needs to rendure the first 5 AAs of the protein and sidechains, and then release that memory so the crunch thread can start considering the next position for those specific atoms or AAs. Actually... the crunch thread COULD \"consider\" the next move, just don\'t write it back to shared memory yet.

                                                                                                                    Picture it like a segmented worm, looped into a circle. Where the threads would each process in order from head to tail, one segment at a time. Since ya gotta crunch something to draw it, I guess the graphic thread would always trail the crunch thread. Since the graphic is read-only, perhaps that helps too, perhaps the work in the crunch thread that actually updates the shared memory could be delayed until late in the compute cycle, thus allowing read-only access to both threads for a higher % of the time and avoiding lock waits.

                                                                                                                    Or, perhaps you could lock the different views with seperate semaphores. So we might be crunching a new... I don\'t know what ya call the leftmost box, but crunch on that while drawing the \"accepted\" graphic or the \"low energy\" graphic. Actually, the only real contention problem is going to be on that left-most box. Perhaps the graphic thread is already smart enough not to redraw the low energy shape unless it actually has changed? That would minimize contention there.

                                                                                                                    The other idea is to keep two copies of the info. needed for the graphic. The crunch thread pushes a copy out to the graphic thread, but only often enough to keep up with the frame rate desired. That way it always has access to the crunching memory it has now, and it is only delayed every 1/10th of a second when we have to synch up to do the push. Combine this with the first idea of having a more granular lock and you won\'t be waiting for the semaphore often at all.
                                                                                                                    ____________

                                                                                                                    Profile feet1st

                                                                                                                    Joined: Mar 7 06
                                                                                                                    Posts: 312
                                                                                                                    ID: 1028
                                                                                                                    Credit: 110,522
                                                                                                                    RAC: 0
                                                                                                                    Message 2649 - Posted 20 Dec 2006 4:07:47 UTC

                                                                                                                      Last modified: 20 Dec 2006 4:52:09 UTC

                                                                                                                      By the way, is there a prize for an \"accurate prediction\" of another sort? ...December 7, can anyone beat that?

                                                                                                                      ____________

                                                                                                                      Profile feet1st

                                                                                                                      Joined: Mar 7 06
                                                                                                                      Posts: 312
                                                                                                                      ID: 1028
                                                                                                                      Credit: 110,522
                                                                                                                      RAC: 0
                                                                                                                      Message 2650 - Posted 20 Dec 2006 4:33:56 UTC

                                                                                                                        Ok ok, enough of my glout-of-the-year. ...back to business

                                                                                                                        ...a better idea, just double buffer what is pushed out to the graphic thread. And come up with some way of making a reasonable guess at the refresh rate. I mean take the CPU benchmarks and the time to make one iteration through this loop and rough out how many iterations to make before pushing the next frame out to the open buffer. If I\'m running 30 frames per second maybe I have to push every other time through a given loop. If I\'m only running 10fps then we can crunch through the loop 6 times before taking the time to push out the info. for the next frame. Let\'s you crunch all the time, no lock wait on the crunching thread. Just adds the time required to do the push and this is minimized by only pushing memory out when it will actually be needed by the graphic thread.

                                                                                                                        I gotta think that compared to the 110MB typical size of a running WU, that two more copies of the info needed for the graphic thread is pretty minimal.
                                                                                                                        ____________

                                                                                                                        FluffyChicken

                                                                                                                        Joined: Feb 17 06
                                                                                                                        Posts: 54
                                                                                                                        ID: 414
                                                                                                                        Credit: 710
                                                                                                                        RAC: 0
                                                                                                                        Message 2651 - Posted 20 Dec 2006 14:02:33 UTC

                                                                                                                          Chu,


                                                                                                                          Could you put that problem summary in the \'technical news\' at the Rosetta@home site.

                                                                                                                          It would give people a definate place of what the problem is, it would also mean forum helpers could post a link to the news when the errors are happening.

                                                                                                                          ____________

                                                                                                                          darknightcl

                                                                                                                          Joined: Dec 21 06
                                                                                                                          Posts: 3
                                                                                                                          ID: 2414
                                                                                                                          Credit: 36
                                                                                                                          RAC: 0
                                                                                                                          Message 2652 - Posted 21 Dec 2006 4:00:24 UTC

                                                                                                                            K, so I\'ve been running Rosetta for over a year now, and only recently started having problems with the project. For instance, roughly 75% of the work units I download crash out with the funny graphics errors. One thing that I\'ve noticed that I haven\'t seen anyone else mention is this. Sometimes the model displayed seems to \"disconnect\" at some point along the backbone. For example, it will be a nice continuous C-alpha trace (I assume this is a C-alpha trace of the protein), then it will suddenly have a break in it, like the protein has been cleaved with a peptidase. The two ends will sometimes wave around in a manner that is just not consistent with them still being connected, so it almost looks like there is a problem with the science code, like it is taking liberties (such as introducing breaks where it is convenient to have them) with the sequence, which doesn\'t seem probable, as that would undermine the science being done. The other possibility might be that there is a problem with the communication between the graphics code and the science code, but I\'m not a programmer and cannot make any suggestions with regards to this. I\'m running version 5.43 of the rosetta application, with CC 5.4.11 (the official release version). I have an AMD64 X2 4200+ with 1GB DDR 400 RAM, GeForce 6150 integrated graphics, latest directx (9c), Windows XP Pro SP2 (32-bit) with all the necessary patches.

                                                                                                                            I\'m not sure if the helps but I figured I\'d mention it.

                                                                                                                            darknightcl

                                                                                                                            Joined: Dec 21 06
                                                                                                                            Posts: 3
                                                                                                                            ID: 2414
                                                                                                                            Credit: 36
                                                                                                                            RAC: 0
                                                                                                                            Message 2653 - Posted 21 Dec 2006 4:34:42 UTC

                                                                                                                              Another thought... One of the posts by Chu was mentioning that there is no locking mechanism in place to prevent the science thread and the graphics thread from trying to access the same memory at the same time, which can cause a problem if it occurs (or at least that is what I understand the post to mean).

                                                                                                                              If it is the case that the current problem is caused by the graphics and science threads conflicting in this manner, wouldn\'t we have started seeing this problem a long time ago, like when graphics were first introduced? Why has the problem only started cropping up now?

                                                                                                                              Just a thought...

                                                                                                                              FluffyChicken

                                                                                                                              Joined: Feb 17 06
                                                                                                                              Posts: 54
                                                                                                                              ID: 414
                                                                                                                              Credit: 710
                                                                                                                              RAC: 0
                                                                                                                              Message 2654 - Posted 21 Dec 2006 14:20:12 UTC - in response to Message 2653.

                                                                                                                                Another thought... One of the posts by Chu was mentioning that there is no locking mechanism in place to prevent the science thread and the graphics thread from trying to access the same memory at the same time, which can cause a problem if it occurs (or at least that is what I understand the post to mean).

                                                                                                                                If it is the case that the current problem is caused by the graphics and science threads conflicting in this manner, wouldn\'t we have started seeing this problem a long time ago, like when graphics were first introduced? Why has the problem only started cropping up now?

                                                                                                                                Just a thought...


                                                                                                                                - The more common usage of dual core processors today
                                                                                                                                - The increased level of comlexity in the graphics (the sidechains), this part of the \'theory of graphics crashes\' coincides quite happily with the release of the docking program and the increased graphics deisplay.
                                                                                                                                - More active people reporting things on the forum (due to more R@H members overall)
                                                                                                                                ____________

                                                                                                                                Profile feet1st

                                                                                                                                Joined: Mar 7 06
                                                                                                                                Posts: 312
                                                                                                                                ID: 1028
                                                                                                                                Credit: 110,522
                                                                                                                                RAC: 0
                                                                                                                                Message 2655 - Posted 21 Dec 2006 16:27:40 UTC

                                                                                                                                  ...so it almost looks like there is a problem with the science code, like it is taking liberties (such as introducing breaks where it is convenient to have them) with the sequence, which doesn\'t seem probable, as that would undermine the science being done.


                                                                                                                                  Yes! They have a \"jumping\" algorythm you will see used on some tasks. It does just that, break at what are believed may prove to be pivitol points in the chain and then search around for what the correct reconnection of the two points might be.

                                                                                                                                  See discussion of Dr. Baker\'s journal and his original journal entry which started the discussion.

                                                                                                                                  I don\'t understand it all. But basically it is not a symptom of a graphic problem, it is a visual queue that Rosetta\'s efficiency in finding the structure is improving.
                                                                                                                                  ____________

                                                                                                                                  darknightcl

                                                                                                                                  Joined: Dec 21 06
                                                                                                                                  Posts: 3
                                                                                                                                  ID: 2414
                                                                                                                                  Credit: 36
                                                                                                                                  RAC: 0
                                                                                                                                  Message 2656 - Posted 21 Dec 2006 16:50:41 UTC - in response to Message 2654.


                                                                                                                                    - The more common usage of dual core processors today
                                                                                                                                    - The increased level of comlexity in the graphics (the sidechains), this part of the \'theory of graphics crashes\' coincides quite happily with the release of the docking program and the increased graphics deisplay.
                                                                                                                                    - More active people reporting things on the forum (due to more R@H members overall)


                                                                                                                                    K, but is it only dual core processors that are having this problem? I have noticed a low rate of work unit failure on another computer, which is a single cored 64-bit AMD processor, but I\'m never watching closely enough to see if it is this same graphics failure, or some other problem (though I\'ll admit the problems did seem to stop with 5.43, but I also don\'t have any work unit history for this computer, it has been concentrating almost exclusively on CPDN for the last week or so, I\'ll go see if I can crash work units on it later).

                                                                                                                                    Also, previous versions of rosetta were stable on my Athlon X2, or at least the crash rate was low enough that I didn\'t notice it. I believe the last stable release was 5.37 or something like that. If memory serves you could rotate and zoom on a molecule in 5.37, with no problems. Essentially, you\'ve now reduced the level of the graphics complexity to below that of 5.37, and my computer is still crashing almost all of its work units. Right now I have one which hasn\'t done anything for about 40 minutes, but the time counter continues to increment, it is like the science code has stalled. I\'ll leave it to see if the watchdog kicks in. The important point is that I don\'t think (I\'m not certain on this point) the graphics had been displayed at all. I\'d noticed that the time remaining estimate was going up, not down, and decided to check on it.

                                                                                                                                    My point is that I didn\'t start noticing work unit failures until release 5.41, and these failure occur on computers other than my dual cored X2.

                                                                                                                                    In case anyone is curious I have stress tested my computer using Prime95, both cores (separately) with no problems. I\'ve also tested my RAM using Memtestx86, the most recent version, again, no problems.

                                                                                                                                    FluffyChicken

                                                                                                                                    Joined: Feb 17 06
                                                                                                                                    Posts: 54
                                                                                                                                    ID: 414
                                                                                                                                    Credit: 710
                                                                                                                                    RAC: 0
                                                                                                                                    Message 2657 - Posted 22 Dec 2006 10:52:43 UTC - in response to Message 2656.

                                                                                                                                      Last modified: 22 Dec 2006 11:38:35 UTC


                                                                                                                                      - The more common usage of dual core processors today
                                                                                                                                      - The increased level of comlexity in the graphics (the sidechains), this part of the \'theory of graphics crashes\' coincides quite happily with the release of the docking program and the increased graphics deisplay.
                                                                                                                                      - More active people reporting things on the forum (due to more R@H members overall)


                                                                                                                                      K, but is it only dual core processors that are having this problem? I have noticed a low rate of work unit failure on another computer, which is a single cored 64-bit AMD processor, but I\'m never watching closely enough to see if it is this same graphics failure, or some other problem (though I\'ll admit the problems did seem to stop with 5.43, but I also don\'t have any work unit history for this computer, it has been concentrating almost exclusively on CPDN for the last week or so, I\'ll go see if I can crash work units on it later).

                                                                                                                                      Also, previous versions of rosetta were stable on my Athlon X2, or at least the crash rate was low enough that I didn\'t notice it. I believe the last stable release was 5.37 or something like that. If memory serves you could rotate and zoom on a molecule in 5.37, with no problems. Essentially, you\'ve now reduced the level of the graphics complexity to below that of 5.37, and my computer is still crashing almost all of its work units. Right now I have one which hasn\'t done anything for about 40 minutes, but the time counter continues to increment, it is like the science code has stalled. I\'ll leave it to see if the watchdog kicks in. The important point is that I don\'t think (I\'m not certain on this point) the graphics had been displayed at all. I\'d noticed that the time remaining estimate was going up, not down, and decided to check on it.

                                                                                                                                      My point is that I didn\'t start noticing work unit failures until release 5.41, and these failure occur on computers other than my dual cored X2.

                                                                                                                                      In case anyone is curious I have stress tested my computer using Prime95, both cores (separately) with no problems. I\'ve also tested my RAM using Memtestx86, the most recent version, again, no problems.



                                                                                                                                      See option number 2,
                                                                                                                                      It started (or was noticed a lot more) when the docking code came into it.

                                                                                                                                      The part about duat/ht is that it is just more susceptible to the desyncronisation happening. I have also had a rare few fail on my P-M and Athlon64 and without graphics open. but it is nothing like what HT/dual people that play with graphics are reporting.

                                                                                                                                      If they where really smart about it (they being Rosetta@home) they would put a tick box inthe proeferences to say \'I do not want graphics\' and then they can sen the person a version with all the graphis ripped out of it, this often speeds up processing a touch (it does slighctly at seti) and decrease the size of the program along with the running memory requirements.

                                                                                                                                      Personaly I would love that option.
                                                                                                                                      ____________

                                                                                                                                      Profile Trog Dog
                                                                                                                                      Avatar

                                                                                                                                      Joined: Aug 8 06
                                                                                                                                      Posts: 38
                                                                                                                                      ID: 1670
                                                                                                                                      Credit: 41,996
                                                                                                                                      RAC: 0
                                                                                                                                      Message 2660 - Posted 3 Jan 2007 7:09:16 UTC

                                                                                                                                        Problem wu here
                                                                                                                                        ____________

                                                                                                                                        Profile Silver Streak

                                                                                                                                        Joined: Dec 11 06
                                                                                                                                        Posts: 5
                                                                                                                                        ID: 2357
                                                                                                                                        Credit: 216,369
                                                                                                                                        RAC: 0
                                                                                                                                        Message 2665 - Posted 11 Jan 2007 3:36:05 UTC

                                                                                                                                          I had over 40 WU\'s err out in the last 2 hrs!

                                                                                                                                          Pieface

                                                                                                                                          Joined: Feb 16 06
                                                                                                                                          Posts: 64
                                                                                                                                          ID: 234
                                                                                                                                          Credit: 203,513
                                                                                                                                          RAC: 0
                                                                                                                                          Message 2666 - Posted 11 Jan 2007 4:05:18 UTC

                                                                                                                                            Likewise, tons of them, but at least they are going quickly, like in 90 secs or so.

                                                                                                                                            Profile anders n

                                                                                                                                            Joined: Feb 16 06
                                                                                                                                            Posts: 166
                                                                                                                                            ID: 91
                                                                                                                                            Credit: 131,419
                                                                                                                                            RAC: 0
                                                                                                                                            Message 2667 - Posted 11 Jan 2007 4:09:37 UTC

                                                                                                                                              All seem to fail with this error message

                                                                                                                                              \" - exit code -1073741819 (0xc0000005) \"

                                                                                                                                              Anders n

                                                                                                                                              ____________

                                                                                                                                              Papagiorgio

                                                                                                                                              Joined: Nov 2 06
                                                                                                                                              Posts: 3
                                                                                                                                              ID: 2159
                                                                                                                                              Credit: 26,100
                                                                                                                                              RAC: 0
                                                                                                                                              Message 2668 - Posted 11 Jan 2007 7:34:24 UTC

                                                                                                                                                This result errored out the same way.

                                                                                                                                                Matthias

                                                                                                                                                Pepo
                                                                                                                                                Avatar

                                                                                                                                                Joined: Sep 8 06
                                                                                                                                                Posts: 104
                                                                                                                                                ID: 1812
                                                                                                                                                Credit: 36,890
                                                                                                                                                RAC: 0
                                                                                                                                                Message 2669 - Posted 11 Jan 2007 17:48:28 UTC

                                                                                                                                                  The same on my hosts, like
                                                                                                                                                  Windows here: exit code -1073741819 (0xc0000005), Reason: Access Violation (0xc0000005) at address 0x0066C28D read attempt to address 0x0405FF98 (with full BOINC Windows Runtime Debugger symbolic output),
                                                                                                                                                  or Linux here: Maximum disk usage exceeded, segmentation violation, with numeric Stack trace (12 frames).

                                                                                                                                                  Peter

                                                                                                                                                  Chu
                                                                                                                                                  Forum moderator
                                                                                                                                                  Project developer
                                                                                                                                                  Project scientist

                                                                                                                                                  Joined: Sep 26 06
                                                                                                                                                  Posts: 61
                                                                                                                                                  ID: 1900
                                                                                                                                                  Credit: 12,545
                                                                                                                                                  RAC: 0
                                                                                                                                                  Message 2670 - Posted 11 Jan 2007 19:36:19 UTC - in response to Message 2651.

                                                                                                                                                    I just posted it here. Sorry for the delay.

                                                                                                                                                    Chu,


                                                                                                                                                    Could you put that problem summary in the \'technical news\' at the Rosetta@home site.

                                                                                                                                                    It would give people a definate place of what the problem is, it would also mean forum helpers could post a link to the news when the errors are happening.

                                                                                                                                                    Chu
                                                                                                                                                    Forum moderator
                                                                                                                                                    Project developer
                                                                                                                                                    Project scientist

                                                                                                                                                    Joined: Sep 26 06
                                                                                                                                                    Posts: 61
                                                                                                                                                    ID: 1900
                                                                                                                                                    Credit: 12,545
                                                                                                                                                    RAC: 0
                                                                                                                                                    Message 2671 - Posted 11 Jan 2007 19:40:03 UTC - in response to Message 2669.

                                                                                                                                                      A bad batch, I think, maybe with bad memory management...

                                                                                                                                                      The same on my hosts, like
                                                                                                                                                      Windows here: exit code -1073741819 (0xc0000005), Reason: Access Violation (0xc0000005) at address 0x0066C28D read attempt to address 0x0405FF98 (with full BOINC Windows Runtime Debugger symbolic output),
                                                                                                                                                      or Linux here: Maximum disk usage exceeded, segmentation violation, with numeric Stack trace (12 frames).

                                                                                                                                                      Peter

                                                                                                                                                      darkpella

                                                                                                                                                      Joined: Mar 30 06
                                                                                                                                                      Posts: 4
                                                                                                                                                      ID: 1212
                                                                                                                                                      Credit: 15,691
                                                                                                                                                      RAC: 0
                                                                                                                                                      Message 2672 - Posted 12 Jan 2007 8:37:22 UTC

                                                                                                                                                        Hi,

                                                                                                                                                        when ralph is suspended by Boinc Core to let anothe task run:

                                                                                                                                                        11/01/2007 20.38.31|ralph@home|Pausing task 1mkyA_TREEJUMP_ABRELAX__NEWRELAXFLAGS_LARS_TOP2_BARCODE__1607_25_0 (removed from memory)


                                                                                                                                                        it doesn\'t get preempted nor it stops (i.e. it still runs at full power) hence abosrbing lots of CPU cycles form the task that should be running.

                                                                                                                                                        I tried forcing Boic Core to make ralph run (suspending every other task) and then let it switch again to another task (simply resuming all other tasks, since it switched to EDF) but it didn\'t stop \"rosetta_beta_5.\" form crunching at about 70% of the CPU time.

                                                                                                                                                        I also tried suspending ralph as a project, but it didn\'t work either.

                                                                                                                                                        Will try in a while rebooting to see what happens and let you know.
                                                                                                                                                        Should you need any information before I reboot let me know ASAP.
                                                                                                                                                        I\'m running Win 2000 SP4 on a PIV at 2,53 GHz.
                                                                                                                                                        Boinc version is 5.4.11
                                                                                                                                                        rosetta_beta version running is 5.43.

                                                                                                                                                        bye

                                                                                                                                                        darkpella
                                                                                                                                                        ____________

                                                                                                                                                        Profile Conan
                                                                                                                                                        Avatar

                                                                                                                                                        Joined: Feb 16 06
                                                                                                                                                        Posts: 344
                                                                                                                                                        ID: 145
                                                                                                                                                        Credit: 1,325,487
                                                                                                                                                        RAC: 484
                                                                                                                                                        Message 2673 - Posted 12 Jan 2007 12:47:49 UTC

                                                                                                                                                          > Had 21 WUs fail for various reasons, none should be Screensaver related as I no longer run it.

                                                                                                                                                          Maximum disk usage exceeded, WU stuck Incorrect fragment size requested for Phi alignment
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=338885

                                                                                                                                                          Maximum disk usage exceeded, WU stuck, SIGSEGV:Segmentation Violation
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=384950

                                                                                                                                                          Exited with code 1
                                                                                                                                                          ERROR:Exit at:loop_relax.cc line:1798
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382149
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382150

                                                                                                                                                          Exited with code 1
                                                                                                                                                          Incorrect Function, ERROR:Exit at:.\\read_aa_ss.cc line:559
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382209

                                                                                                                                                          Exit Code -1073741819 Access Violation
                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382799, 382800, 382872, 382875, 383015, 383016, 383146, 383148, 383298, 383352, 383405, 383406, 383459, 383460, 384728.

                                                                                                                                                          Computers are Opteron 275 (Linux), Opteron 285 (Linux) and 4800+ (Windows)
                                                                                                                                                          Only the windows machine has a screensaver running but it is not Boinc screensaver so does not appear to be related to graphics problem, a faulty batch? Testing what exactly?
                                                                                                                                                          ____________

                                                                                                                                                          Profile [B^S] Dr. Bill Skiba
                                                                                                                                                          Avatar

                                                                                                                                                          Joined: Feb 15 06
                                                                                                                                                          Posts: 4
                                                                                                                                                          ID: 34
                                                                                                                                                          Credit: 6,496
                                                                                                                                                          RAC: 0
                                                                                                                                                          Message 2674 - Posted 12 Jan 2007 17:28:09 UTC

                                                                                                                                                            Last modified: 12 Jan 2007 18:10:14 UTC

                                                                                                                                                            http://ralph.bakerlab.org/result.php?resultid=382713

                                                                                                                                                            A new one, at least for me.

                                                                                                                                                            Work unit would not preempt. Kept on running even though it said it was preempted and boinc manager said another wu from another project was running.

                                                                                                                                                            Ralph unit kept counting up both on cpu time and time to finish while saying it was preempted. BM said a uFluids wu was running, but cpu time stayed at zero and task manager showed Ralph using the cycles.

                                                                                                                                                            I aborted it.
                                                                                                                                                            ____________

                                                                                                                                                            Profile anders n

                                                                                                                                                            Joined: Feb 16 06
                                                                                                                                                            Posts: 166
                                                                                                                                                            ID: 91
                                                                                                                                                            Credit: 131,419
                                                                                                                                                            RAC: 0
                                                                                                                                                            Message 2675 - Posted 15 Jan 2007 15:09:39 UTC

                                                                                                                                                              Errors to report

                                                                                                                                                              http://ralph.bakerlab.org/result.php?resultid=387680
                                                                                                                                                              http://ralph.bakerlab.org/result.php?resultid=387679

                                                                                                                                                              ERROR:: Exit at: .\\rotamer_functions.cc line:1441

                                                                                                                                                              And

                                                                                                                                                              http://ralph.bakerlab.org/result.php?resultid=387637

                                                                                                                                                              file_name>H4H6_1lis_PAIRWISE_DOCK_MCM_1619_4_0_0</file_name>
                                                                                                                                                              <error_code>-161</error_code>

                                                                                                                                                              Anders n



                                                                                                                                                              ____________

                                                                                                                                                              Profile Silver Streak

                                                                                                                                                              Joined: Dec 11 06
                                                                                                                                                              Posts: 5
                                                                                                                                                              ID: 2357
                                                                                                                                                              Credit: 216,369
                                                                                                                                                              RAC: 0
                                                                                                                                                              Message 2676 - Posted 15 Jan 2007 16:32:07 UTC

                                                                                                                                                                Last modified: 15 Jan 2007 17:22:29 UTC

                                                                                                                                                                I had a few of these also, they occured over night. They seem to have ran a normal length of time before the error occured.

                                                                                                                                                                </stderr_txt>
                                                                                                                                                                <message>
                                                                                                                                                                <file_xfer_error>
                                                                                                                                                                <file_name>H1H7_1lis_PAIRWISE_DOCK_MCM_1619_1_0_0</file_name>
                                                                                                                                                                <error_code>-161</error_code>
                                                                                                                                                                </file_xfer_error>

                                                                                                                                                                </message>

                                                                                                                                                                Rhiju
                                                                                                                                                                Forum moderator
                                                                                                                                                                Project developer
                                                                                                                                                                Project scientist

                                                                                                                                                                Joined: Feb 14 06
                                                                                                                                                                Posts: 161
                                                                                                                                                                ID: 4
                                                                                                                                                                Credit: 3,725
                                                                                                                                                                RAC: 0
                                                                                                                                                                Message 2677 - Posted 15 Jan 2007 18:16:47 UTC - in response to Message 2676.

                                                                                                                                                                  Great, I think I know what the problem is, and I\'m sending them back out with the potential fix.

                                                                                                                                                                  I had a few of these also, they occured over night. They seem to have ran a normal length of time before the error occured.

                                                                                                                                                                  </stderr_txt>
                                                                                                                                                                  <message>
                                                                                                                                                                  <file_xfer_error>
                                                                                                                                                                  <file_name>H1H7_1lis_PAIRWISE_DOCK_MCM_1619_1_0_0</file_name>
                                                                                                                                                                  <error_code>-161</error_code>
                                                                                                                                                                  </file_xfer_error>

                                                                                                                                                                  </message>


                                                                                                                                                                  ____________

                                                                                                                                                                  Profile KSMarksPsych
                                                                                                                                                                  Avatar

                                                                                                                                                                  Joined: Feb 16 06
                                                                                                                                                                  Posts: 40
                                                                                                                                                                  ID: 72
                                                                                                                                                                  Credit: 8,226
                                                                                                                                                                  RAC: 0
                                                                                                                                                                  Message 2678 - Posted 16 Jan 2007 0:05:19 UTC - in response to Message 2677.

                                                                                                                                                                    Great, I think I know what the problem is, and I\'m sending them back out with the potential fix.

                                                                                                                                                                    I had a few of these also, they occured over night. They seem to have ran a normal length of time before the error occured.

                                                                                                                                                                    </stderr_txt>
                                                                                                                                                                    <message>
                                                                                                                                                                    <file_xfer_error>
                                                                                                                                                                    <file_name>H1H7_1lis_PAIRWISE_DOCK_MCM_1619_1_0_0</file_name>
                                                                                                                                                                    <error_code>-161</error_code>
                                                                                                                                                                    </file_xfer_error>

                                                                                                                                                                    </message>





                                                                                                                                                                    Hmmm.... I got one of these but not sure when it actually downloaded and ran.

                                                                                                                                                                    Profile anders n

                                                                                                                                                                    Joined: Feb 16 06
                                                                                                                                                                    Posts: 166
                                                                                                                                                                    ID: 91
                                                                                                                                                                    Credit: 131,419
                                                                                                                                                                    RAC: 0
                                                                                                                                                                    Message 2679 - Posted 16 Jan 2007 4:14:30 UTC - in response to Message 2678.


                                                                                                                                                                      Hmmm.... I got one of these but not sure when it actually downloaded and ran.


                                                                                                                                                                      If you look at this page you can see that it is an old one.

                                                                                                                                                                      Anders n

                                                                                                                                                                      ____________

                                                                                                                                                                      Hans Sveen

                                                                                                                                                                      Joined: Feb 17 06
                                                                                                                                                                      Posts: 8
                                                                                                                                                                      ID: 390
                                                                                                                                                                      Credit: 279,559
                                                                                                                                                                      RAC: 340
                                                                                                                                                                      Message 2680 - Posted 16 Jan 2007 6:02:55 UTC

                                                                                                                                                                        Hello!
                                                                                                                                                                        The ones that I have received late Yesterday (and some allready returned successfully!) seems to have solved(reduced?) the errors that showed up, just take a look at hostid 474 (http://ralph.bakerlab.org/results.php?hostid=474)!

                                                                                                                                                                        @ The Staff:Thank You for Your efforts to things out?

                                                                                                                                                                        With regards,
                                                                                                                                                                        ____________
                                                                                                                                                                        Hans Sveen
                                                                                                                                                                        Oslo, Norway

                                                                                                                                                                        Profile Conan
                                                                                                                                                                        Avatar

                                                                                                                                                                        Joined: Feb 16 06
                                                                                                                                                                        Posts: 344
                                                                                                                                                                        ID: 145
                                                                                                                                                                        Credit: 1,325,487
                                                                                                                                                                        RAC: 484
                                                                                                                                                                        Message 2681 - Posted 16 Jan 2007 9:35:06 UTC - in response to Message 2673.

                                                                                                                                                                          > Had 21 WUs fail for various reasons, none should be Screensaver related as I no longer run it.

                                                                                                                                                                          Maximum disk usage exceeded, WU stuck Incorrect fragment size requested for Phi alignment
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=338885

                                                                                                                                                                          Maximum disk usage exceeded, WU stuck, SIGSEGV:Segmentation Violation
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=384950

                                                                                                                                                                          Exited with code 1
                                                                                                                                                                          ERROR:Exit at:loop_relax.cc line:1798
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382149
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382150

                                                                                                                                                                          Exited with code 1
                                                                                                                                                                          Incorrect Function, ERROR:Exit at:.\\read_aa_ss.cc line:559
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382209

                                                                                                                                                                          Exit Code -1073741819 Access Violation
                                                                                                                                                                          http://ralph.bakerlab.org/workunit.php?wu=382799, 382800, 382872, 382875, 383015, 383016, 383146, 383148, 383298, 383352, 383405, 383406, 383459, 383460, 384728.

                                                                                                                                                                          Computers are Opteron 275 (Linux), Opteron 285 (Linux) and 4800+ (Windows)
                                                                                                                                                                          Only the windows machine has a screensaver running but it is not Boinc screensaver so does not appear to be related to graphics problem, a faulty batch? Testing what exactly?


                                                                                                                                                                          EDIT> Sorry these are results not work units so should be /result.php?result=, all except the first one which is a wu. I misread the columns.



                                                                                                                                                                          Have found a few more
                                                                                                                                                                          Incorrect fragment size requested for Phi alignment
                                                                                                                                                                          http://ralph.bakerlab.org/result.php?result=384574
                                                                                                                                                                          http://ralph.bakerlab.org/result.php?result=389196

                                                                                                                                                                          Unhandled Exception Errors
                                                                                                                                                                          http://ralph.bakerlab.org/result.php?result=387523
                                                                                                                                                                          http://ralph.bakerlab.org/result.php?result=387524

                                                                                                                                                                          file_xfer error
                                                                                                                                                                          <file_name>H4H6_1lis_PAIRWISE_DOCK_MCM_1619_4_2_0</file_name
                                                                                                                                                                          error code 161
                                                                                                                                                                          http://ralph.bakerlab.org/result.php?result=387843

                                                                                                                                                                          (New record on result 387843 in 6 hours did 819 decoys, yet still got no credit)


                                                                                                                                                                          ____________

                                                                                                                                                                          Profile Trog Dog
                                                                                                                                                                          Avatar

                                                                                                                                                                          Joined: Aug 8 06
                                                                                                                                                                          Posts: 38
                                                                                                                                                                          ID: 1670
                                                                                                                                                                          Credit: 41,996
                                                                                                                                                                          RAC: 0
                                                                                                                                                                          Message 2682 - Posted 18 Jan 2007 7:21:07 UTC

                                                                                                                                                                            Just got a 161 error myself - http://ralph.bakerlab.org/result.php?resultid=390970
                                                                                                                                                                            ____________

                                                                                                                                                                            Profile Jack Shaftoe

                                                                                                                                                                            Joined: Aug 8 06
                                                                                                                                                                            Posts: 13
                                                                                                                                                                            ID: 1673
                                                                                                                                                                            Credit: 105,163
                                                                                                                                                                            RAC: 0
                                                                                                                                                                            Message 2684 - Posted 19 Jan 2007 2:42:48 UTC - in response to Message 2682.

                                                                                                                                                                              http://ralph.bakerlab.org/result.php?resultid=390604

                                                                                                                                                                              # random seed: 2772809
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range
                                                                                                                                                                              sin_cos_range ERROR: -1.#IND000 is outside of [-1,+1] range

                                                                                                                                                                              etc...

                                                                                                                                                                              Profile Jack Shaftoe

                                                                                                                                                                              Joined: Aug 8 06
                                                                                                                                                                              Posts: 13
                                                                                                                                                                              ID: 1673
                                                                                                                                                                              Credit: 105,163
                                                                                                                                                                              RAC: 0
                                                                                                                                                                              Message 2685 - Posted 19 Jan 2007 2:44:11 UTC - in response to Message 2684.

                                                                                                                                                                                161 error.

                                                                                                                                                                                http://ralph.bakerlab.org/result.php?resultid=390967

                                                                                                                                                                                Message boards : RALPH@home bug list : Bug reports for Ralph 5.42 and 5.43


                                                                                                                                                                                Home | Join | About | Participants | Community | Statistics

                                                                                                                                                                                Copyright © 2017 University of Washington

                                                                                                                                                                                Last Modified: 20 Nov 2008 19:41:56 UTC
                                                                                                                                                                                Back to top ^