RALPH@home

Bug reports for 5.65


Rhiju
Forum moderator
Project developer
Project scientist

Joined: Feb 14 06
Posts: 161
ID: 4
Credit: 3,725
RAC: 0
Message 3119 - Posted 22 May 2007 6:57:32 UTC

    So far things have been pretty stable with 5.64; thanks to everyone for posting about crashes on ralph, it's helped us fine-tune our workunits. This update just has a small addition to give us more control over the energy function assumed in RNA workunits.
    ____________

    k6

    Joined: May 16 07
    Posts: 3
    ID: 3098
    Credit: 3,025
    RAC: 0
    Message 3120 - Posted 22 May 2007 8:32:20 UTC

      Last modified: 22 May 2007 8:36:50 UTC

      So far I've computed 2 units using the 5.65 beta, but both ended with a compute error. Here they are:

      521641
      521561

      My computer is now working on the next units; I'll edit this post and add links to any further failed WUs if they occur.

      Sorry for my bad English.

      k6

      Joined: May 16 07
      Posts: 3
      ID: 3098
      Credit: 3,025
      RAC: 0
      Message 3121 - Posted 22 May 2007 9:46:34 UTC

        Last modified: 22 May 2007 10:17:21 UTC

        Next bad WU:
        521745

        Good WUs:
        521746
        521770

        anders n

        Joined: Feb 16 06
        Posts: 166
        ID: 91
        Credit: 131,419
        RAC: 0
        Message 3122 - Posted 22 May 2007 11:06:22 UTC

          http://ralph.bakerlab.org/result.php?resultid=521786

          - exit code -1073741819 (0xc0000005)

          Anders n
          ____________
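A side note on the exit code reported above: -1073741819 and 0xc0000005 are the same 32-bit value, shown once as a signed integer and once as unsigned hex. 0xC0000005 is the Windows status code for an access violation. A minimal Python sketch of the conversion follows; the function name `to_unsigned32` is made up for illustration.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


exit_code = -1073741819  # value taken from the report above
print(hex(to_unsigned32(exit_code)))
```

Masking with 0xFFFFFFFF keeps only the low 32 bits, which is how the negative decimal and the hex form line up.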

          feet1st

          Joined: Mar 7 06
          Posts: 312
          ID: 1028
          Credit: 110,522
          RAC: 0
          Message 3123 - Posted 22 May 2007 14:36:07 UTC

            Mine failed too after just 17:27.

            Unrecoverable error for result 1urnA_BOINC_ABRELAX_BARCODE-1urnA-frags83__2061_1_1 ( - exit code -1073741819 (0xc0000005))

            ____________

            mdettweiler

            Joined: Apr 4 07
            Posts: 11
            ID: 2886
            Credit: 1,010
            RAC: 0
            Message 3124 - Posted 22 May 2007 22:03:15 UTC

              I got an error for this workunit. Here's what my BOINC client logged about the error:


              5/22/2007 5:56:27 PM|ralph@home|Deferring communication for 1 min 0 sec
              5/22/2007 5:56:27 PM|ralph@home|Reason: Unrecoverable error for result CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 ( - exit code -1073741819 (0xc0000005))
              5/22/2007 5:56:28 PM|ralph@home|Computation for task CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 finished
              5/22/2007 5:56:28 PM|ralph@home|Output file CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0_0 for task CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 absent


              The odd thing is, after it was done, my firewall told me that the Ralph application needed to access the internet. According to my firewall's logs, it sent back a couple of megabytes worth of information to the Ralph server after I clicked to allow internet access for the Ralph application. I've noticed that sometimes Ralph (and Rosetta, for that matter) workunits will oddly need to send back tons of data to the server if there is an error and the workunit has to stop. Is this because BOINC otherwise won't send back any data if the workunit errors out, and the Rosetta/Ralph admins want to see more error data than BOINC sends back?
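For anyone collecting these reports, the BOINC message lines quoted in this post appear to be pipe-delimited (timestamp|project|message). Below is a rough Python sketch, assuming that layout holds, that pulls the result name and error detail out of a "Reason:" line. The helper names and the regular expression are illustrative only and are not part of BOINC.

```python
import re

# One of the log lines quoted in the post above.
LINE = (
    "5/22/2007 5:56:27 PM|ralph@home|Reason: Unrecoverable error for result "
    "CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 "
    "( - exit code -1073741819 (0xc0000005))"
)

# Assumed message shape: "Unrecoverable error for result <name> (<detail>)".
ERROR_RE = re.compile(r"Unrecoverable error for result (?P<result>\S+) \((?P<detail>.*)\)$")


def parse_log_line(line):
    """Split a log line into (timestamp, project, message)."""
    timestamp, project, message = line.split("|", 2)
    return timestamp, project, message


def extract_error(message):
    """Return (result_name, detail), or None if the message is not an error line."""
    match = ERROR_RE.search(message)
    if match is None:
        return None
    return match.group("result"), match.group("detail").strip()


timestamp, project, message = parse_log_line(LINE)
print(project, extract_error(message))
```

Splitting with a maximum of two splits keeps any later `|` characters inside the message field intact.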

              Snagletooth

              Joined: May 4 07
              Posts: 65
              ID: 3020
              Credit: 112,601
              RAC: 0
              Message 3125 - Posted 22 May 2007 22:04:57 UTC

                unrecoverable error

                522925

                feet1st

                Joined: Mar 7 06
                Posts: 312
                ID: 1028
                Credit: 110,522
                RAC: 0
                Message 3126 - Posted 22 May 2007 22:43:20 UTC - in response to Message 3124.



                  The odd thing is, after it was done, my firewall told me that the Ralph application needed to access the internet... Is this because BOINC otherwise won't send back any data if the workunit errors out, and the Rosetta/Ralph admins want to see more error data than BOINC sends back?


                  When a failure occurs, additional details about the failure are collected and reported directly to the project by the application rather than via BOINC Manager. I always end up with the firewall msg, and it's been sitting there long enough that I assume it times out and doesn't send the goods. So, when I remember, and see a new Ralph application, I always download it from here (if it hasn't come down already), then identify it to my firewall to allow internet access.
                  ____________

                  feet1st

                  Joined: Mar 7 06
                  Posts: 312
                  ID: 1028
                  Credit: 110,522
                  RAC: 0
                  Message 3127 - Posted 22 May 2007 22:49:35 UTC

                    We have a pulse!


                    ____________

                    EvoDude

                    Joined: Feb 18 06
                    Posts: 28
                    ID: 527
                    Credit: 639,833
                    RAC: 0
                    Message 3128 - Posted 23 May 2007 0:59:41 UTC - in response to Message 3124.

                      I've had 7 'Computation Errors' in the last couple of days too. They report a client error and grant 0 credit.

                      The affected result IDs are: 524262 524263 524220 524165 524163 524121 522749

                      Any chance someone could look into this problem and get back to us?

                      I got an error for this workunit. Here's what my BOINC client logged about the error:


                      5/22/2007 5:56:27 PM|ralph@home|Deferring communication for 1 min 0 sec
                      5/22/2007 5:56:27 PM|ralph@home|Reason: Unrecoverable error for result CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 ( - exit code -1073741819 (0xc0000005))
                      5/22/2007 5:56:28 PM|ralph@home|Computation for task CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 finished
                      5/22/2007 5:56:28 PM|ralph@home|Output file CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0_0 for task CNTRL_01RELAXNATIVE_SAVE_ALL_OUT_-1n0u_-_2064_9_0 absent


                      The odd thing is, after it was done, my firewall told me that the Ralph application needed to access the internet. According to my firewall's logs, it sent back a couple of megabytes worth of information to the Ralph server after I clicked to allow internet access for the Ralph application. I've noticed that sometimes Ralph (and Rosetta, for that matter) workunits will oddly need to send back tons of data to the server if there is an error and the workunit has to stop. Is this because BOINC otherwise won't send back any data if the workunit errors out, and the Rosetta/Ralph admins want to see more error data than BOINC sends back?


                      ____________

                      Dr Who Fan

                      Joined: Sep 2 06
                      Posts: 63
                      ID: 1787
                      Credit: 46,809
                      RAC: 0
                      Message 3129 - Posted 23 May 2007 1:07:59 UTC

                        Last modified: 23 May 2007 1:08:36 UTC

                        Error:
                        http://ralph.bakerlab.org/result.php?resultid=522994

                        <core_client_version>5.8.16</core_client_version>
                        <![CDATA[
                        <message>
                        - exit code -1073741819 (0xc0000005)
                        </message>
                        <stderr_txt>
                        # cpu_run_time_pref: 7200
                        # random seed: 2664719


                        Unhandled Exception Detected...

                        - Unhandled Exception Record -
                        Reason: Access Violation (0xc0000005) at address 0x009B9479 read attempt to address 0x000C0010

                        Engaging BOINC Windows Runtime Debugger...



                        ********************

                        Dr Who Fan

                        Joined: Sep 2 06
                        Posts: 63
                        ID: 1787
                        Credit: 46,809
                        RAC: 0
                        Message 3130 - Posted 23 May 2007 1:09:57 UTC

                          Error:
                          http://ralph.bakerlab.org/result.php?resultid=523659

                          <core_client_version>5.8.16</core_client_version>
                          <![CDATA[
                          <message>
                          - exit code -1073741819 (0xc0000005)
                          </message>
                          <stderr_txt>
                          # cpu_run_time_pref: 7200
                          # random seed: 2662174


                          Unhandled Exception Detected...

                          - Unhandled Exception Record -
                          Reason: Access Violation (0xc0000005) at address 0x009B93DB read attempt to address 0x1133FE5C

                          Engaging BOINC Windows Runtime Debugger...



                          ********************

                          Rhiju
                          Forum moderator
                          Project developer
                          Project scientist

                          Joined: Feb 14 06
                          Posts: 161
                          ID: 4
                          Credit: 3,725
                          RAC: 0
                          Message 3131 - Posted 23 May 2007 1:24:31 UTC - in response to Message 3130.

                            Hi everybody:

                            Looks like there are a lot of problems with this version, actually -- a very high error rate. I'll track it down! Thanks for posting.


                            Error:
                            http://ralph.bakerlab.org/result.php?resultid=523659

                            <core_client_version>5.8.16</core_client_version>
                            <![CDATA[
                            <message>
                            - exit code -1073741819 (0xc0000005)
                            </message>
                            <stderr_txt>
                            # cpu_run_time_pref: 7200
                            # random seed: 2662174


                            Unhandled Exception Detected...

                            - Unhandled Exception Record -
                            Reason: Access Violation (0xc0000005) at address 0x009B93DB read attempt to address 0x1133FE5C

                            Engaging BOINC Windows Runtime Debugger...



                            ********************


                            ____________

                            Admin

                            Joined: Apr 20 07
                            Posts: 1
                            ID: 2950
                            Credit: 218
                            RAC: 0
                            Message 3132 - Posted 23 May 2007 5:40:36 UTC

                              This WU errored out after 35 secs:

                              http://ralph.bakerlab.org/result.php?resultid=525097

                              Deborah Goldsmith

                              Joined: Feb 16 06
                              Posts: 3
                              ID: 297
                              Credit: 209,690
                              RAC: 0
                              Message 3133 - Posted 23 May 2007 6:13:35 UTC

                                Lots of crashes on Mac OS X Intel -- here's a representative:

                                Exception: EXC_BAD_ACCESS (0x0001)
                                Codes: KERN_INVALID_ADDRESS (0x0001) at 0x0895fe3c

                                Thread 0:
                                0 libSystem.B.dylib 0x90038297 mach_wait_until + 7
                                1 libSystem.B.dylib 0x90037f19 sleep + 121
                                2 ...beta_5.65_i686-apple-darwin 0x00ea6402 0x1000 + 15356930
                                3 ...beta_5.65_i686-apple-darwin 0x00e97baa 0x1000 + 15297450
                                4 ...beta_5.65_i686-apple-darwin 0x00e97c3a 0x1000 + 15297594
                                5 ...beta_5.65_i686-apple-darwin 0x00e97210 0x1000 + 15294992
                                6 ...beta_5.65_i686-apple-darwin 0x007685fb 0x1000 + 7763451
                                7 ...beta_5.65_i686-apple-darwin 0x0000260e 0x1000 + 5646
                                8 ...beta_5.65_i686-apple-darwin 0x00002535 0x1000 + 5429

                                Thread 1 Crashed:
                                0 ...beta_5.65_i686-apple-darwin 0x00abf8cb 0x1000 + 11266251
                                1 ...beta_5.65_i686-apple-darwin 0x004e9ad2 0x1000 + 5147346
                                2 ...beta_5.65_i686-apple-darwin 0x008874fd 0x1000 + 8938749
                                3 ...beta_5.65_i686-apple-darwin 0x0088a88c 0x1000 + 8951948
                                4 ...beta_5.65_i686-apple-darwin 0x00555758 0x1000 + 5588824
                                5 ...beta_5.65_i686-apple-darwin 0x00556c59 0x1000 + 5594201
                                6 ...beta_5.65_i686-apple-darwin 0x00bd587a 0x1000 + 12404858
                                7 ...beta_5.65_i686-apple-darwin 0x00bd8444 0x1000 + 12416068
                                8 ...beta_5.65_i686-apple-darwin 0x00084547 0x1000 + 537927
                                9 ...beta_5.65_i686-apple-darwin 0x006064d7 0x1000 + 6313175
                                10 ...beta_5.65_i686-apple-darwin 0x00768548 0x1000 + 7763272
                                11 ...beta_5.65_i686-apple-darwin 0x00e97a25 0x1000 + 15297061
                                12 libSystem.B.dylib 0x90024987 _pthread_body + 84

                                Thread 2:
                                0 libSystem.B.dylib 0x90038297 mach_wait_until + 7
                                1 libSystem.B.dylib 0x90037f19 sleep + 121
                                2 ...beta_5.65_i686-apple-darwin 0x00e9a1cc 0x1000 + 15307212
                                3 ...beta_5.65_i686-apple-darwin 0x00e8e606 0x1000 + 15259142
                                4 libSystem.B.dylib 0x90024987 _pthread_body + 84

                                Thread 3:
                                0 libSystem.B.dylib 0x90038297 mach_wait_until + 7
                                1 libSystem.B.dylib 0x90037f19 sleep + 121
                                2 ...beta_5.65_i686-apple-darwin 0x00dfb342 0x1000 + 14656322
                                3 libSystem.B.dylib 0x90024987 _pthread_body + 84

                                Thread 1 crashed with X86 Thread State (32-bit):
                                eax: 0x0895fe38 ebx: 0x00abf80e ecx: 0x00000004 edx: 0xb3fff190
                                edi: 0x01490c00 esi: 0x0c2848bc ebp: 0xb3ffe098 esp: 0xb3ffe040
                                ss: 0x0000001f efl: 0x00010203 eip: 0x00abf8cb cs: 0x00000017
                                ds: 0x0000001f es: 0x0000001f fs: 0x0000001f gs: 0x00000037

                                I have the full report if you want it.
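A note on reading the frames in the trace above: each unsymbolicated line gives an absolute address followed by `0x1000 + <decimal>`, which looks like the image load base plus the offset into the binary. A small Python sketch, assuming that interpretation, recomputes the offset for the crashed frame. The function name `frame_offset` is hypothetical.

```python
def frame_offset(address: int, load_base: int = 0x1000) -> int:
    """Offset of a stack-frame address from the (assumed) image load base."""
    return address - load_base


# Frame 0 of the crashed thread: "0x00abf8cb 0x1000 + 11266251"
crashed_frame = 0x00ABF8CB
print(frame_offset(crashed_frame))

# The reported faulting address sits 4 bytes past the value in eax;
# this is just arithmetic on the dump, not a diagnosis.
fault_address = 0x0895FE3C
eax = 0x0895FE38
print(fault_address - eax)
```

Offsets like these are what a developer would feed to a symbolication tool together with the matching binary.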

                                Odysseus

                                Joined: May 4 07
                                Posts: 23
                                ID: 3023
                                Credit: 16,331
                                RAC: 0
                                Message 3134 - Posted 23 May 2007 7:06:11 UTC

                                  My dual-G5 Mac (OS 10.4.9) had an error with exit status 193 (0xc1) after about five minutes of crunching on CNTRL_01ABRELAX_SAVE_ALL_OUT_-1cc8A-_filters_2065_17_2, having successfully completed two other v5.65 tasks. My G4/733 (OS 10.3.9) also has returned two v5.65 results without errors.

                                  Conan

                                  Joined: Feb 16 06
                                  Posts: 344
                                  ID: 145
                                  Credit: 1,309,534
                                  RAC: 0
                                  Message 3135 - Posted 23 May 2007 8:55:31 UTC

                                    Work units starting with TST1 are not validating: after completing 6 hours of crunching they generate huge amounts of bug reports, then are marked invalid and don't validate.

                                    http://ralph.bakerlabs.org/result.php?resultid=523246
                                    http://ralph.bakerlabs.org/result.php?resultid=523511
                                    http://ralph.bakerlabs.org/result.php?resultid=523862
                                    http://ralph.bakerlabs.org/result.php?resultid=523863

                                    Also http://ralph.bakerlabs.org/result.php?resultid=523529
                                    gave the following

                                    <core_client_version>5.8.16</core_client_version>
                                    <![CDATA[
                                    <message>
                                    process exited with code 1 (0x1)
                                    </message>
                                    <stderr_txt>
                                    Graphics are disabled due to configuration...
                                    # cpu_run_time_pref: 21600
                                    # random seed: 2663137
                                    ERROR:: Unable to determine sequence length from pdb file
                                    ERROR:: Exit from: pose.cc line: 1929

                                    Hope this helps.
                                    ____________

                                    anders n

                                    Joined: Feb 16 06
                                    Posts: 166
                                    ID: 91
                                    Credit: 131,419
                                    RAC: 0
                                    Message 3136 - Posted 23 May 2007 9:45:36 UTC

                                      Interesting WU. I have a 4-hour setting and it errored out after 1 h 23 min. The next cruncher has a 1-hour setting and it came out OK after 62 min.

                                      http://ralph.bakerlab.org/workunit.php?wuid=462868

                                      Anders n
                                      ____________

                                      HTH

                                      Joined: Mar 6 06
                                      Posts: 9
                                      ID: 1005
                                      Credit: 10,226
                                      RAC: 0
                                      Message 3137 - Posted 23 May 2007 10:20:06 UTC

                                        A compute error: 521610.
                                        ____________

                                        Billy

                                        Joined: Jan 29 07
                                        Posts: 13
                                        ID: 2592
                                        Credit: 5,855
                                        RAC: 0
                                        Message 3138 - Posted 23 May 2007 13:10:44 UTC

                                          Last modified: 23 May 2007 13:12:00 UTC

                                          Result 522762

                                          Intel Mac on OSX

                                          k6

                                          Joined: May 16 07
                                          Posts: 3
                                          ID: 3098
                                          Credit: 3,025
                                          RAC: 0
                                          Message 3139 - Posted 23 May 2007 13:16:52 UTC

                                            Another WU that ends with compute error:

                                            522587

                                            anders n

                                            Joined: Feb 16 06
                                            Posts: 166
                                            ID: 91
                                            Credit: 131,419
                                            RAC: 0
                                            Message 3140 - Posted 23 May 2007 14:53:16 UTC

                                              Last modified: 23 May 2007 15:11:11 UTC

                                              Intel Mac
                                              Validate error
                                              Silent_out::setup: silent output with symmetry info not compatible with non-ideal bonds yet.
                                              http://ralph.bakerlab.org/result.php?resultid=523041
                                              http://ralph.bakerlab.org/result.php?resultid=523042

                                              Anders n

                                              EDIT
                                              And
                                              http://ralph.bakerlab.org/result.php?resultid=523042
                                              ____________

                                              Bober [B@P]

                                              Joined: Jun 18 06
                                              Posts: 6
                                              ID: 1538
                                              Credit: 15,427
                                              RAC: 0
                                              Message 3141 - Posted 23 May 2007 15:22:29 UTC

                                                Last modified: 23 May 2007 15:23:14 UTC

                                                Computation error:
                                                result 523948
                                                ____________

                                                Odysseus

                                                Joined: May 4 07
                                                Posts: 23
                                                ID: 3023
                                                Credit: 16,331
                                                RAC: 0
                                                Message 3142 - Posted 23 May 2007 20:34:32 UTC

                                                  Another crash on my G4/733; sorry, I can’t provide a link or exit-status code because the result seems to have been deleted from the servers already. BOINC Manager said:

                                                  Wed May 23 13:30:29 2007|ralph@home|[error] rosetta_beta not responding to screensaver, requesting exit
                                                  Wed May 23 13:30:30 2007|ralph@home|Task 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0 exited with zero status but no 'finished' file
                                                  Wed May 23 13:30:30 2007|ralph@home|If this happens repeatedly you may need to reset the project.
                                                  Wed May 23 13:30:30 2007|ralph@home|Restarting task 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0 using rosetta_beta version 565
                                                  […]
                                                  Wed May 23 13:41:30 2007|ralph@home|Deferring communication for 1 min 0 sec
                                                  Wed May 23 13:41:30 2007|ralph@home|Reason: Unrecoverable error for result 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0 (process got signal 6)
                                                  Wed May 23 13:41:30 2007|ralph@home|Computation for task 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0 finished
                                                  Wed May 23 13:41:30 2007|ralph@home|Output file 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0_0 for task 1bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7_0 absent

                                                  Odysseus

                                                  Joined: May 4 07
                                                  Posts: 23
                                                  ID: 3023
                                                  Credit: 16,331
                                                  RAC: 0
                                                  Message 3143 - Posted 23 May 2007 21:41:43 UTC - in response to Message 3142.

                                                    I can’t provide a link or exit-status code because the result seems to have been deleted from the servers already.

                                                    Sorry, my mistake, looking in the wrong place. Here it is, exit status 6 (0x6): bq9A_BOINC_ABINITIO-1bq9A-frags83__2067_7.

                                                    tallguy-13088

                                                    Joined: Feb 17 06
                                                    Posts: 10
                                                    ID: 376
                                                    Credit: 121,701
                                                    RAC: 0
                                                    Message 3144 - Posted 23 May 2007 21:49:47 UTC

                                                      Last modified: 23 May 2007 21:52:15 UTC

                                                      Hi,

                                                      In the last twelve hours, I have aborted three (3) work units that have run over 10 hours apiece. They are as follows:

                                                      1fkb__BOINC_ABRELAX_BARCODE-1fkb_-frags83__2061_4
                                                      (2acy__BOINC_ABRELAX_BARCODE-2acy_-frags83__2061_4)
                                                      and
                                                      (1eyvA_BOINC_ABRELAX_BARCODE-1eyvA-frags83__2061_4)

                                                      The OS is Win2K
                                                      O/S: Win2K: 5.00.2195 SP4
                                                      BOINC Manager: 5.8.15
                                                      and all three workunits were running under RALPH 5.65.

                                                      If you need more debug info, pls contact. Thanks!

                                                      Had to re-edit since I messed up on the URL tags.
                                                      ____________

                                                      Odysseus

                                                      Joined: May 4 07
                                                      Posts: 23
                                                      ID: 3023
                                                      Credit: 16,331
                                                      RAC: 0
                                                      Message 3145 - Posted 24 May 2007 1:50:58 UTC

                                                        Last modified: 24 May 2007 1:51:33 UTC

                                                        Some errors from my dual G5 Mac, all with exit code 193 (0xc1):
                                                        CNTRL_01ABRELAX_SAVE_ALL_OUT_-1pgx_-_filters_2065_23
                                                        CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ogw_-_filters_2065_18
                                                        CNTRL_01ABRELAX_SAVE_ALL_OUT_-1cc8A-_filters_2065_17

                                                        I’m pretty sure these ones are not screensaver-related, as they occurred while the system had been undisturbed (with screen blacked out & monitor sleeping) for several hours.

                                                        Trying to get more work, amidst the “no work from project” messages I see:

                                                        Wed 23 May 19:35:45 2007|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                        Wed 23 May 19:35:45 2007|ralph@home|Reason: To fetch work
                                                        Wed 23 May 19:35:45 2007|ralph@home|Requesting 172800 seconds of new work
                                                        Wed 23 May 19:35:50 2007|ralph@home|Scheduler request succeeded
                                                        Wed 23 May 19:35:50 2007|ralph@home|Message from server: Project encountered internal error: shared memory
                                                        Wed 23 May 19:35:50 2007|ralph@home|Project is down

                                                        mdettweiler

                                                        Joined: Apr 4 07
                                                        Posts: 11
                                                        ID: 2886
                                                        Credit: 1,010
                                                        RAC: 0
                                                        Message 3147 - Posted 24 May 2007 3:52:28 UTC - in response to Message 3145.

                                                          Trying to get more work, amidst the “no work from project” messages I see:
                                                          Wed 23 May 19:35:45 2007|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                          Wed 23 May 19:35:45 2007|ralph@home|Reason: To fetch work
                                                          Wed 23 May 19:35:45 2007|ralph@home|Requesting 172800 seconds of new work
                                                          Wed 23 May 19:35:50 2007|ralph@home|Scheduler request succeeded
                                                          Wed 23 May 19:35:50 2007|ralph@home|Message from server: Project encountered internal error: shared memory
                                                          Wed 23 May 19:35:50 2007|ralph@home|Project is down



                                                          I'm getting the same "shared memory" error. What could be causing this?

                                                          Conan

                                                          Joined: Feb 16 06
                                                          Posts: 344
                                                          ID: 145
                                                          Credit: 1,309,534
                                                          RAC: 0
                                                          Message 3149 - Posted 24 May 2007 9:59:41 UTC - in response to Message 3135.

                                                            Work units starting with TST1 are not validating. After completing 6 hours of crunching, they generate huge amounts of bug reports, then are marked invalid and don't validate:

                                                            http://ralph.bakerlabs.org/result.php?resultid=523246
                                                            http://ralph.bakerlabs.org/result.php?resultid=523511
                                                            http://ralph.bakerlabs.org/result.php?resultid=523862
                                                            http://ralph.bakerlabs.org/result.php?resultid=523863

                                                            Also http://ralph.bakerlabs.org/result.php?resultid=523529
                                                            gave the following

                                                            <core_client_version>5.8.16</core_client_version>
                                                            <![CDATA[
                                                            <message>
                                                            process exited with code 1 (0x1)
                                                            </message>
                                                            <stderr_txt>
                                                            Graphics are disabled due to configuration...
                                                            # cpu_run_time_pref: 21600
                                                            # random seed: 2663137
                                                            ERROR:: Unable to determine sequence length from pdb file
                                                            ERROR:: Exit from: pose.cc line: 1929

                                                            Hope this helps.


                                                            >> Another 3 Validate TST1 workunit errors
                                                            All have completed but failed to validate

                                                            http://ralph.bakerlabs.org/result.php?resultid=523534
                                                            http://ralph.bakerlabs.org/result.php?resultid=525016
                                                            http://ralph.bakerlabs.org/result.php?resultid=523245

                                                            Have a good day. It would be nice to get the cobblestones for these failed WUs.
                                                            ____________

                                                            Gary Tegner

                                                            Joined: Aug 9 06
                                                            Posts: 1
                                                            ID: 1678
                                                            Credit: 43,513
                                                            RAC: 0
                                                            Message 3150 - Posted 24 May 2007 10:27:38 UTC

                                                              invalid results (client error/compute error):
                                                              525985
                                                              523747
                                                              523123
                                                              523121
                                                              523015

                                                              valid results:
                                                              526004
                                                              525993

                                                              three other WUs are still running.

                                                              tallguy-13088

                                                              Joined: Feb 17 06
                                                              Posts: 10
                                                              ID: 376
                                                              Credit: 121,701
                                                              RAC: 0
                                                              Message 3151 - Posted 24 May 2007 21:41:01 UTC

                                                                Last modified: 24 May 2007 21:42:03 UTC

                                                                Please add

                                                                1vie__BOINC_ABRELAX_BARCODE-1vie_-frags83__2061_2

                                                                and

                                                                2acy__BOINC_ABRELAX_BARCODE-2acy_-frags83__2061_4

                                                                to the list of aborted workunits running over 10 hours or more
                                                                ____________

                                                                Odysseus

                                                                Joined: May 4 07
                                                                Posts: 23
                                                                ID: 3023
                                                                Credit: 16,331
                                                                RAC: 0
                                                                Message 3152 - Posted 24 May 2007 22:08:30 UTC - in response to Message 3151.

                                                                  Please add […] to the list of aborted workunits running over 10 hours or more

                                                                  Are we supposed to abort tasks that run for more than ten hours? I couldn’t find any announcement to that effect.

                                                                  Profile anders n

                                                                  Joined: Feb 16 06
                                                                  Posts: 166
                                                                  ID: 91
                                                                  Credit: 131,419
                                                                  RAC: 0
                                                                  Message 3156 - Posted 25 May 2007 4:32:43 UTC - in response to Message 3152.

                                                                    Please add […] to the list of aborted workunits running over 10 hours or more

                                                                    Are we supposed to abort tasks that run for more than ten hours? I couldn’t find any announcement to that effect.


                                                                    No, we are not supposed to abort any tasks!

                                                                    Unless there is a direct instruction to do so from the team.

                                                                    Anders n
                                                                    ____________


                                                                    Copyright © 2017 University of Washington

                                                                    Last Modified: 20 Nov 2008 19:41:56 UTC