RALPH@home

minirosetta 2.05


dekim
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jan 20 06
Posts: 202
ID: 1
Credit: 367,731
RAC: 512
Message 5039 - Posted 12 Jan 2010 0:53:43 UTC

    Last modified: 12 Jan 2010 0:59:41 UTC

    This version update includes a checkpointing fix and minor code updates.

    Please post bugs/issues here.

    FYI, I skipped over version 2.04 on purpose. I was about to update the 2.04 app, but partway through the update process I had to include some other code updates.

    thanks,

    David K
    ____________

    nenym

    Joined: Jan 16 09
    Posts: 14
    ID: 5145
    Credit: 1,007,003
    RAC: 0
    Message 5041 - Posted 13 Jan 2010 11:15:15 UTC

      2.05 task 9gbnnotyr_3gbn_1e5k_9Jan2010_13807_1_0 finished successfully, but a validate error occurred.
      4CPU Xeon 2.83GHz, 8GB RAM, XP x64.

      nenym

      Joined: Jan 16 09
      Posts: 14
      ID: 5145
      Credit: 1,007,003
      RAC: 0
      Message 5042 - Posted 13 Jan 2010 22:30:36 UTC

        Last modified: 13 Jan 2010 22:32:18 UTC

        2.05 task run2.loopbuild_hb_t286__IGNORE_THE_REST_13813_9_1 errored out after 25s.
        4CPU Xeon 2.83GHz, 8GB RAM, XP x64.

        <core_client_version>6.10.24</core_client_version>
        <![CDATA[
        <message>
        - exit code -1073741819 (0xc0000005)
        </message>
        <stderr_txt>

        Snagletooth

        Joined: May 4 07
        Posts: 65
        ID: 3020
        Credit: 112,601
        RAC: 3
        Message 5052 - Posted 24 Jan 2010 23:52:04 UTC

          validate errors

          tyrsim_3gbn_2qzq_20Jan2010_14017_2

          There are a couple of 2.05 bug reports in the 2.03 thread. You might want to sticky this thread and unsticky the 2.03 thread.

          Snags

          AdeB

          Joined: Dec 22 07
          Posts: 61
          ID: 3888
          Credit: 99,745
          RAC: 34
          Message 5054 - Posted 30 Jan 2010 11:16:18 UTC

            Task 1731436 had a normal runtime, was a success and is valid; however there is an ERROR in stderr out:

            ERROR: unknown atom_name: ILE CG
            ERROR:: Exit from: src/core/chemical/ResidueType.cc line: 1382

            AdeB

            [VENETO] boboviz

            Joined: Apr 9 08
            Posts: 474
            ID: 4205
            Credit: 681,030
            RAC: 132
            Message 5055 - Posted 1 Feb 2010 8:56:15 UTC

              Last modified: 1 Feb 2010 8:58:25 UTC

              Task 1733723 errored out after 10 seconds:

              ERROR: ERROR: FragmentIO: could not open file aa1mq9A09_05.200_v1_3.gz
              ERROR:: Exit from: ..\..\src\core\fragment\FragmentIO.cc line: 258
              BOINC:: Error reading and gzipping output datafile: default.out
              called boinc_finish

              [VENETO] boboviz

              Joined: Apr 9 08
              Posts: 474
              ID: 4205
              Credit: 681,030
              RAC: 132
              Message 5057 - Posted 3 Feb 2010 8:14:24 UTC

                A lot of validate errors:
                1734687
                1734658
                1734657


                # cpu_run_time_pref: 14400
                ======================================================
                DONE :: 2 starting structures 13935.9 cpu seconds
                This process generated 2 decoys from 2 attempts
                ======================================================

                BOINC :: Watchdog shutting down...
                BOINC :: BOINC support services shutting down cleanly ...
                called boinc_finish

                morse [E.R.] - BOINC.Italy

                Joined: Jan 26 10
                Posts: 7
                ID: 15141
                Credit: 66,816
                RAC: 0
                Message 5059 - Posted 3 Feb 2010 13:26:27 UTC

                  Validate Error:

                  1735917
                  1735916

                  AdeB

                  Joined: Dec 22 07
                  Posts: 61
                  ID: 3888
                  Credit: 99,745
                  RAC: 34
                  Message 5060 - Posted 3 Feb 2010 17:01:31 UTC

                    Task 1734255

                    <message>
                    Maximum memory exceeded
                    </message>


                    AdeB

                    Conan

                    Joined: Feb 16 06
                    Posts: 344
                    ID: 145
                    Credit: 1,309,534
                    RAC: 0
                    Message 5061 - Posted 5 Feb 2010 11:25:33 UTC

                      Work units are failing with "Validate Error". All show no errors in the readout and all have produced successful decoy runs, so why have they been marked as faulty?

                      See 1735252
                      1737376
                      1737381
                      1737630
                      1737631

                      Thanks
                      Conan.
                      ____________

                      Snagletooth

                      Joined: May 4 07
                      Posts: 65
                      ID: 3020
                      Credit: 112,601
                      RAC: 3
                      Message 5062 - Posted 5 Feb 2010 11:43:50 UTC

                        more validate errors:

                        dckCFA_1sq2_1xg8_0029_0002_ProteinInterfaceDesign_4Feb2010_14289_3

                        coturnix

                        Joined: Jan 5 10
                        Posts: 9
                        ID: 15073
                        Credit: 196,185
                        RAC: 0
                        Message 5063 - Posted 5 Feb 2010 15:28:42 UTC

                          tyrsim_3gbn_2qsb_Protein_interface_design_01Feb2010_14280_2 (Validate Error for task 1735683, second task succeeded)
                          tyrsim_3gbn_2qsv_Protein_interface_design_01Feb2010_14280_2 (Validate Error for task 1735685, second task succeeded)
                          tyrsim_3gbn_2qvk_Protein_interface_design_01Feb2010_14280_2 (Validate Error for task 1735699, second task succeeded)
                          dckCFA_1sq2_2ODU_ppk_0025_foldit_ProteinInterfaceDesign_3Feb2010_14288_4 (Compute Error for both tasks, file_xfer_error)
                          dckCFA_1sq2_1GS9_ppk_0056_0001_ProteinInterfaceDesign_3Feb2010_14288_7 (Compute Error for both tasks, file_xfer_error)
                          dckCFA_1sq2_1GS9_ppk_0056_0001_ProteinInterfaceDesign_3Feb2010_14288_4 (Compute Error for both tasks, file_xfer_error)
                          dckCFA_1sq2_3EGN_ppk_0002_foldit_ProteinInterfaceDesign_3Feb2010_14286_1 (Validate Error for both tasks)
                          1fxaA_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_7 (Validate Error for task 1737477, second task succeeded)
                          1hz6A_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_7 (Validate Error for task 1737482, second task succeeded)

                          Snagletooth

                          Joined: May 4 07
                          Posts: 65
                          ID: 3020
                          Credit: 112,601
                          RAC: 3
                          Message 5064 - Posted 5 Feb 2010 21:11:12 UTC

                            one more validate error:
                            1l33A_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_3_0

                            svincent

                            Joined: Apr 4 08
                            Posts: 34
                            ID: 4182
                            Credit: 51,768
                            RAC: 0
                            Message 5065 - Posted 6 Feb 2010 1:59:28 UTC

                              I have got compute errors on Mac OS X 10.6 for 4 tasks named

                              1738219 dckCFA_1sq2_3EGN_ppk_0002_foldit_ProteinInterfaceDesign_3Feb2010_14288_1_1
                              1738220 dckCFA_1sq2_2ODU_ppk_0025_foldit_ProteinInterfaceDesign_3Feb2010_14288_2_1
                              1738221 dckCFA_1sq2_2OJ4_ppk_0050_ProteinInterfaceDesign_3Feb2010_14288_2_1
                              1738222 dckCFA_1sq2_3ce7_0044_foldit_ProteinInterfaceDesign_3Feb2010_14288_2_1

                              All seemed to complete OK but gave an error code at the end.

                              </stderr_txt>
                              <message>
                              <file_xfer_error>
                              <file_name>dckCFA_1sq2_2ODU_ppk_0025_foldit_ProteinInterfaceDesign_3Feb2010_14288_2_1_0</file_name>
                              <error_code>-161</error_code>
                              </file_xfer_error>

                              My wingmen seemed to have similar problems

                              nenym

                              Joined: Jan 16 09
                              Posts: 14
                              ID: 5145
                              Credit: 1,007,003
                              RAC: 0
                              Message 5066 - Posted 6 Feb 2010 3:54:21 UTC

                                Last modified: 6 Feb 2010 4:03:35 UTC

                                1533632 dckCFA_1sq2_1s3g_0018_foldit_ProteinInterfaceDesign_3Feb2010_14288_2

                                errored out
                                <core_client_version>6.6.28</core_client_version>
                                <![CDATA[
                                <message>
                                Maximum disk usage exceeded
                                </message>
                                ]]>

                                validate errors:
                                1534306 1ifcA_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_7
                                1534258 1l23A_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_6
                                1534332 1shgA_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_7
                                1534338 200lA_boinc_70pct_loopbuild_threading_cst_relax_tex_IGNORE_THE_REST_14292_7

                                Win XP x64, 4CPU Xeon 2.83 GHz, 8GB RAM

                                morse [E.R.] - BOINC.Italy

                                Joined: Jan 26 10
                                Posts: 7
                                ID: 15141
                                Credit: 66,816
                                RAC: 0
                                Message 5067 - Posted 6 Feb 2010 19:12:50 UTC

                                  Validate Error:

                                  1735938

                                  AdeB

                                  Joined: Dec 22 07
                                  Posts: 61
                                  ID: 3888
                                  Credit: 99,745
                                  RAC: 34
                                  Message 5068 - Posted 8 Feb 2010 18:35:27 UTC

                                    Task 1738713 was marked Invalid. One of those 578 decoys must have been valid!

                                    AdeB

                                    Conan

                                    Joined: Feb 16 06
                                    Posts: 344
                                    ID: 145
                                    Credit: 1,309,534
                                    RAC: 0
                                    Message 5069 - Posted 11 Feb 2010 6:47:38 UTC

                                      This work unit ran for nearly 8 hours while showing 0.050% completed and 24 hours to go.
                                      It looks like it failed not long after I looked at it.
                                      The recorded CPU time is 22 minutes, so the WU was probably not using any CPU while the elapsed time kept ticking over.

                                      This WU
                                      ____________

                                      coturnix

                                      Joined: Jan 5 10
                                      Posts: 9
                                      ID: 15073
                                      Credit: 196,185
                                      RAC: 0
                                      Message 5070 - Posted 11 Feb 2010 11:05:32 UTC

                                        Lots of Compute Errors -- SIGSEGV (Linux) / SIGBUS (Mac) -- for example

                                        test_run1.1ttz.1ttz.IGNORE_THE_REST.c.1.0.pdb.pdb.JOB_14301_1
                                        test_run1.1ttz.1ttz.IGNORE_THE_REST.c.4.1.pdb.pdb.JOB_14301_1
                                        test_run1.1ttz.1ttz.IGNORE_THE_REST.c.3.4.pdb.pdb.JOB_14301_1

                                        and many more.

                                        AdeB

                                        Joined: Dec 22 07
                                        Posts: 61
                                        ID: 3888
                                        Credit: 99,745
                                        RAC: 34
                                        Message 5071 - Posted 13 Feb 2010 13:40:22 UTC

                                          File transfer errors!!!

                                          Task 1740222:

                                          <file_xfer_error>
                                          <file_name>igfhum_brub1_2dsrI_1ZZK_ProteinInterfaceDesign_12Feb2010_14321_1_0_0</file_name>
                                          <error_code>-161</error_code>
                                          </file_xfer_error>

                                          Similar errors in tasks 1740267 and 1741847.

                                          AdeB

                                          Conan

                                          Joined: Feb 16 06
                                          Posts: 344
                                          ID: 145
                                          Credit: 1,309,534
                                          RAC: 0
                                          Message 5073 - Posted 16 Feb 2010 6:31:04 UTC

                                            Got this Error on five Work Units so far

                                            ERROR: did not find topology_file: beta_lowE.top
                                            ERROR:: Exit from: ..\..\src\protocols\topology_broker\TemplateJumpClaimer.cc line: 93
                                            BOINC:: Error reading and gzipping output datafile: default.out
                                            called boinc_finish

                                            Work Units are 1745497
                                            1745711
                                            1745710
                                            1745699
                                            1745697

                                            Also had

                                            "Error Code 161"
                                            "/file_xfer_error/"

                                            on This WU

                                            They all ran for only a short time before failing.
                                            ____________

                                            Krzychu P.

                                            Joined: Feb 16 06
                                            Posts: 19
                                            ID: 114
                                            Credit: 10,236
                                            RAC: 0
                                            Message 5074 - Posted 16 Feb 2010 8:06:48 UTC

                                              Task 1746139

                                              In the "stderr out":

                                              <message>
                                              Incorrect function. (0x1) - exit code 1 (0x1)
                                              </message>
                                              (...)
                                              ERROR: did not find topology_file: beta_lowE.top
                                              ERROR:: Exit from: ..\..\src\protocols\topology_broker\TemplateJumpClaimer.cc line: 93
                                              BOINC:: Error reading and gzipping output datafile: default.out
                                              called boinc_finish


                                              In the manager message window:

                                              2010-02-16 08:28:03 ralph@home Starting t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0
                                              2010-02-16 08:28:06 ralph@home [task_debug] task_state=EXECUTING for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 from start
                                              2010-02-16 08:28:06 ralph@home Starting task t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 using minirosetta version 205
                                              2010-02-16 08:28:13 ralph@home update requested by user
                                              2010-02-16 08:28:24 ralph@home [task_debug] Process for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 exited
                                              2010-02-16 08:28:24 ralph@home [task_debug] task_state=EXITED for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 from handle_exited_app
                                              2010-02-16 08:28:24 ralph@home [task_debug] result state=COMPUTE_ERROR for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 from CS::report_result_error
                                              2010-02-16 08:28:24 ralph@home [task_debug] Process for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 exited
                                              2010-02-16 08:28:24 ralph@home [task_debug] exit code 1 (0x1): Incorrect function. (0x1)
                                              2010-02-16 08:28:24 ralph@home Computation for task t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 finished
                                              2010-02-16 08:28:24 ralph@home Output file t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0_0 for task t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 absent
                                              2010-02-16 08:28:24 ralph@home [task_debug] result state=COMPUTE_ERROR for t290__boinc__sel_core_1.5.broker_corebuild_tex_IGNORE_THE_REST_14338_8_0 from CS::app_finished

                                              ____________

                                              Tonno

                                              Joined: Nov 23 06
                                              Posts: 16
                                              ID: 2269
                                              Credit: 49,841
                                              RAC: 0
                                              Message 5075 - Posted 16 Feb 2010 8:34:12 UTC - in response to Message 5074.

                                                ERROR: Option file open failed for: relax_options_lr5_rama09_mix01_it03_run01_A_yfsong

                                                1744059
                                                1744057
                                                1743941

                                                [VENETO] boboviz

                                                Joined: Apr 9 08
                                                Posts: 474
                                                ID: 4205
                                                Credit: 681,030
                                                RAC: 132
                                                Message 5076 - Posted 16 Feb 2010 14:36:52 UTC

                                                  1742734
                                                  1742963

                                                  <message>
                                                  <file_xfer_error>
                                                  <file_name>igfhum_brub1_2dsrI_3FLG_ProteinInterfaceDesign_12Feb2010_14326_1_1_0</file_name>
                                                  <error_code>-161</error_code>
                                                  </file_xfer_error>
                                                  </message>

                                                  Tonno

                                                  Joined: Nov 23 06
                                                  Posts: 16
                                                  ID: 2269
                                                  Credit: 49,841
                                                  RAC: 0
                                                  Message 5077 - Posted 16 Feb 2010 23:34:05 UTC - in response to Message 5076.

                                                    1746875
                                                    <core_client_version>6.10.32</core_client_version>
                                                    <![CDATA[
                                                    <message>
                                                    Input file t286__boinc_corebuild_round2_rerun_sel_core_1.5.broker_corebuild_tex.boinc.flags missing or invalid: -119
                                                    </message>
                                                    ]]>

                                                    svincent

                                                    Joined: Apr 4 08
                                                    Posts: 34
                                                    ID: 4182
                                                    Credit: 51,768
                                                    RAC: 0
                                                    Message 5078 - Posted 19 Feb 2010 1:05:34 UTC

                                                      Some recent failures on Mac OS X

                                                      1749862
                                                      1749890
                                                      1749891

                                                      all failed as follows:

                                                      ERROR: start_res != middle_res
                                                      ERROR:: Exit from: src/protocols/moves/KinematicMover.cc line: 132
                                                      BOINC:: Error reading and gzipping output datafile: default.out
                                                      called boinc_finish

                                                      </stderr_txt>

                                                      1749898

                                                      failed differently

                                                      SIGPIPE: write on a pipe with no reader
                                                      0 0x006e2839 SIGPIPE: write on a pipe with no reader
                                                      1 0x00338ace SIGPIPE: write on a pipe with no reader

                                                      etc.

                                                      [VENETO] boboviz

                                                      Joined: Apr 9 08
                                                      Posts: 474
                                                      ID: 4205
                                                      Credit: 681,030
                                                      RAC: 132
                                                      Message 5079 - Posted 19 Feb 2010 8:19:06 UTC

                                                        1749893


                                                        - Unhandled Exception Record -
                                                        Reason: Access Violation (0xc0000005) at address 0x007E4651 read attempt to address 0x00000008

                                                        Engaging BOINC Windows Runtime Debugger...

                                                        - Unhandled Exception Record -
                                                        Reason: Access Violation (0xc0000005) at address 0x007E4651 read attempt to address 0x00000008

                                                        - Registers -
                                                        eax=00000000 ebx=00000000 ecx=017fe47c edx=07886300 esi=017f9d38 edi=00c08198
                                                        eip=007e4651 esp=017f8ffc ebp=017f9730
                                                        cs=001b ss=0023 ds=0023 es=0023 fs=003b gs=0000 efl=00010246

                                                        - Callstack -
                                                        ChildEBP RetAddr Args to Child
                                                        017f9730 0074785d 017fe47c 40d35fcd 017fe944 017fe47c minirosetta_2.05_windows_intelx!protocols::moves::KinematicMover::apply+0xf (d:\boinc_build\minirosetta_2.04\mini\src\protocols\moves\kinematicmover.cc:343)
                                                        017f9f70 006c9d40 017fe47c 40d3570d 078f28f0 00000010 minirosetta_2.05_windows_intelx!protocols::loops::LoopMover_Refine_KIC::apply+0x0 (d:\boinc_build\minirosetta_2.04\mini\src\protocols\loops\loopmover_kic.cc:780)
                                                        017fe33c 0068b980 017fe47c 40d32bc1 00000000 00000009 minirosetta_2.05_windows_intelx!protocols::loops::LoopRelaxMover::apply+0x0 (d:\boinc_build\minirosetta_2.04\mini\src\protocols\loops\looprelaxmover.cc:740)
                                                        017fed44 00405754 00000001 40d32551 00001db0 00000002 minirosetta_2.05_windows_intelx!protocols::loops::LoopRelax_main+0x0 (d:\boinc_build\minirosetta_2.04\mini\src\protocols\loops\loopbuild.cc:283)
                                                        017feedc 00405bb5 00000021 017feef4 000b2300 017feef4 minirosetta_2.05_windows_intelx!main+0x7 (d:\boinc_build\minirosetta_2.04\mini\src\apps\public\boinc\minirosetta.cc:197)
                                                        017ffef0 00418647 00400000 00000000 000b2342 0000000a minirosetta_2.05_windows_intelx!WinMain+0x0 (d:\boinc_build\minirosetta_2.04\mini\src\apps\public\boinc\minirosetta.cc:264)
                                                        017fff88 768b1174 7ffdf000 017fffd4 7716b3f5 7ffdf000 minirosetta_2.05_windows_intelx!__tmainCRTStartup+0x1c (f:\sp\vctools\crt_bld\self_x86\crt\src\crt0.c:324)
                                                        017fff94 7716b3f5 7ffdf000 47b5e9f0 00000000 00000000 kernel32!@BaseThreadInitThunk@12+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\crt0.c:324)
                                                        017fffd4 7716b3c8 004186b0 7ffdf000 00000000 00000000 ntdll!___RtlUserThreadStart@8+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\crt0.c:324)
                                                        017fffec 00000000 004186b0 7ffdf000 00000000 00000000 ntdll!__RtlUserThreadStart@8+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\crt0.c:324)

                                                        *** Dump of thread ID 2892 (state: Initialized): ***

                                                        - Information -
                                                        Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

                                                        - Registers -
                                                        eax=03c2f880 ebx=00000000 ecx=00000005 edx=0000007c esi=03c2ff48 edi=00000000
                                                        eip=771564f4 esp=03c2ff04 ebp=03c2ff6c
                                                        cs=001b ss=0023 ds=0023 es=0023 fs=003b gs=0000 efl=00000206

                                                        - Callstack -
                                                        ChildEBP RetAddr Args to Child
                                                        03c2ff00 77154c1c 753a1876 00000000 03c2ff48 4267f096 ntdll!_KiFastSystemCallRet@0+0x0 FPO: [0,0,0]
                                                        03c2ff04 753a1876 00000000 03c2ff48 4267f096 00000000 ntdll!_ZwDelayExecution@8+0x0 FPO: [2,0,0]
                                                        03c2ff6c 753a1818 00000064 00000000 03c2ff94 004088ab KERNELBASE!_SleepEx@8+0x0
                                                        03c2ff7c 004088ab 00000064 00000000 768b1174 00000000 KERNELBASE!_Sleep@4+0x0
                                                        03c2ff88 768b1174 00000000 03c2ffd4 7716b3f5 00000000 minirosetta_2.05_windows_intelx!timer_thread+0x0 (d:\boinc_build\minirosetta_2.04\mini\external\boinc\api\boinc_api.cpp:922)
                                                        03c2ff94 7716b3f5 00000000 4508e9f0 00000000 00000000 kernel32!@BaseThreadInitThunk@12+0x0 (d:\boinc_build\minirosetta_2.04\mini\external\boinc\api\boinc_api.cpp:922)
                                                        03c2ffd4 7716b3c8 004088a0 00000000 00000000 00000000 ntdll!___RtlUserThreadStart@8+0x0 (d:\boinc_build\minirosetta_2.04\mini\external\boinc\api\boinc_api.cpp:922)
                                                        03c2ffec 00000000 004088a0 00000000 00000000 fb8de5f8 ntdll!__RtlUserThreadStart@8+0x0 (d:\boinc_build\minirosetta_2.04\mini\external\boinc\api\boinc_api.cpp:922)

                                                        *** Dump of thread ID 3452 (state: Initialized): ***

                                                        - Information -
                                                        Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

                                                        - Registers -
                                                        eax=094dfe28 ebx=075d7900 ecx=094de734 edx=000000b9 esi=094dfdfc edi=00000000
                                                        eip=771564f4 esp=094dfdb8 ebp=094dfe20
                                                        cs=001b ss=0023 ds=0023 es=0023 fs=003b gs=0000 efl=00000206

                                                        - Callstack -
                                                        ChildEBP RetAddr Args to Child
                                                        094dfdb4 77154c1c 753a1876 00000000 094dfdfc 48e8f1da ntdll!_KiFastSystemCallRet@0+0x0 FPO: [0,0,0]
                                                        094dfdb8 753a1876 00000000 094dfdfc 48e8f1da 000000bb ntdll!_ZwDelayExecution@8+0x0 FPO: [2,0,0]
                                                        094dfe20 753a1818 000007d0 00000000 768aef66 006a42d1 KERNELBASE!_SleepEx@8+0x0
                                                        094dfe30 006a42d1 000007d0 48e136cd 00000000 075d7908 KERNELBASE!_Sleep@4+0x0
                                                        094dff40 006a44e7 00000000 00414e2c 00000000 48e1370d minirosetta_2.05_windows_intelx!protocols::boinc::watchdog::main_watchdog+0x0 (d:\boinc_build\minirosetta_2.04\mini\src\protocols\boinc\watchdog.cc:316)
                                                        094dff48 00414e2c 00000000 48e1370d 00000000 075d7908 minirosetta_2.05_windows_intelx!protocols::boinc::watchdog::main_watchdog_windows+0x7 (d:\boinc_build\minirosetta_2.04\mini\src\protocols\boinc\watchdog.cc:94)
                                                        094dff80 00414ed1 00000000 768b1174 075d7908 094dffd4 minirosetta_2.05_windows_intelx!_callthreadstartex+0x6 (f:\sp\vctools\crt_bld\self_x86\crt\src\threadex.c:348)
                                                        094dff88 768b1174 075d7908 094dffd4 7716b3f5 075d7908 minirosetta_2.05_windows_intelx!_threadstartex+0x5 (f:\sp\vctools\crt_bld\self_x86\crt\src\threadex.c:326)
                                                        094dff94 7716b3f5 075d7908 4f87e9f0 00000000 00000000 kernel32!@BaseThreadInitThunk@12+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\threadex.c:326)
                                                        094dffd4 7716b3c8 00414e52 075d7908 00000000 00000000 ntdll!___RtlUserThreadStart@8+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\threadex.c:326)
                                                        094dffec 00000000 00414e52 075d7908 00000000 09690000 ntdll!__RtlUserThreadStart@8+0x0 (f:\sp\vctools\crt_bld\self_x86\crt\src\threadex.c:326)


                                                        *** Debug Message Dump ****

                                                        Profile Conan

                                                        Joined: Feb 16 06
                                                        Posts: 344
                                                        ID: 145
                                                        Credit: 1,309,534
                                                        RAC: 0
                                                        Message 5080 - Posted 19 Feb 2010 8:52:31 UTC

                                                          Had the following error message:

                                                          ERROR: start_res != middle_res
                                                          ERROR:: Exit from: ..\..\src\protocols\moves\KinematicMover.cc line: 132
                                                          BOINC:: Error reading and gzipping output datafile: default.out

                                                          It was on 1749596
                                                          1749597
                                                          1749598
                                                          1749871

                                                          Happened after only a couple of hundred seconds.
                                                          ____________

                                                          Profile [VENETO] boboviz

                                                          Joined: Apr 9 08
                                                          Posts: 474
                                                          ID: 4205
                                                          Credit: 681,030
                                                          RAC: 132
                                                          Message 5081 - Posted 26 Feb 2010 8:05:31 UTC

                                                            Last modified: 26 Feb 2010 8:10:59 UTC

                                                            1544269
                                                            1544268
                                                            1544270
                                                            1544271

                                                            After different calculation times, all the same error:

                                                            ERROR: [ERROR] Unable to open constraints file: aqp9_.dist_csts
                                                            ERROR:: Exit from: ..\..\src\core\scoring\constraints\ConstraintIO.cc line: 332
                                                            BOINC:: Error reading and gzipping output datafile: default.out
                                                            called boinc_finish

                                                            AdeB

                                                            Joined: Dec 22 07
                                                            Posts: 61
                                                            ID: 3888
                                                            Credit: 99,745
                                                            RAC: 34
                                                            Message 5082 - Posted 26 Feb 2010 23:06:22 UTC - in response to Message 5081.

                                                              1544269
                                                              1544268
                                                              1544270
                                                              1544271

                                                              After different calculation times, all the same error:

                                                              ERROR: [ERROR] Unable to open constraints file: aqp9_.dist_csts
                                                              ERROR:: Exit from: ..\..\src\core\scoring\constraints\ConstraintIO.cc line: 332
                                                              BOINC:: Error reading and gzipping output datafile: default.out
                                                              called boinc_finish


                                                              Same error in task 1750870.

                                                              AdeB

                                                              Profile [VENETO] boboviz

                                                              Joined: Apr 9 08
                                                              Posts: 474
                                                              ID: 4205
                                                              Credit: 681,030
                                                              RAC: 132
                                                              Message 5083 - Posted 3 Mar 2010 6:45:02 UTC

                                                                A lot of validate errors:

                                                                1755041
                                                                1755043
                                                                1755051
                                                                etc, etc, etc

                                                                Cyph3r

                                                                Joined: Oct 25 08
                                                                Posts: 1
                                                                ID: 4844
                                                                Credit: 286,789
                                                                RAC: 0
                                                                Message 5087 - Posted 11 Mar 2010 14:46:36 UTC

                                                                  Last modified: 11 Mar 2010 15:10:25 UTC

                                                                  I had the following error message:
                                                                  1757476

                                                                  SIGSEGV: segmentation violation
                                                                  Stack trace (17 frames):
                                                                  [0x96c49b3]
                                                                  [0x96ee888]
                                                                  [0xf7f90400]
                                                                  [0x8d2a923]
                                                                  [0x8d2aaf2]
                                                                  [0x8d6efb3]
                                                                  [0x8c49cb5]
                                                                  [0x8f95cf0]
                                                                  [0x88b5050]
                                                                  [0x88b97a6]
                                                                  [0x873763f]
                                                                  [0x812a54a]
                                                                  [0x812b82d]
                                                                  [0x86aa16b]
                                                                  [0x8049a26]
                                                                  [0x974c15c]
                                                                  [0x8048121]

                                                                  Exiting...
                                                                  ------------//------------
                                                                  and:
                                                                  1757331

                                                                  *** glibc detected *** free(): invalid pointer: 0xec562de1 ***
                                                                  SIGABRT: abort called
                                                                  Stack trace (20 frames):
                                                                  [0x96c49b3]
                                                                  [0x96ee888]
                                                                  [0xf7fef400]
                                                                  [0x97532d4]
                                                                  [0x9768fc2]
                                                                  [0x976def3]
                                                                  [0x976e3bb]
                                                                  [0x973e0c1]
                                                                  [0x81bdfb1]
                                                                  [0x902371b]
                                                                  [0x8486f0d]
                                                                  [0x900bc5c]
                                                                  [0x8f9b5d9]
                                                                  [0x8627747]
                                                                  [0x812a167]
                                                                  [0x812b82d]
                                                                  [0x86aa16b]
                                                                  [0x8049a26]
                                                                  [0x974c15c]
                                                                  [0x8048121]

                                                                  Exiting...


                                                                  Other WUs on the same machine (Linux) have similar errors:

                                                                  1758427
                                                                  1757675

                                                                  Evan

                                                                  Joined: Dec 23 07
                                                                  Posts: 75
                                                                  ID: 3893
                                                                  Credit: 69,584
                                                                  RAC: 0
                                                                  Message 5088 - Posted 11 Mar 2010 18:02:44 UTC

                                                                    100% validate errors (11 out of 11) on Windows XP, and the hard drive runs continuously while it processes 4 units at a time. The work units are fcDE-W3...

                                                                    Evan

                                                                    Joined: Dec 23 07
                                                                    Posts: 75
                                                                    ID: 3893
                                                                    Credit: 69,584
                                                                    RAC: 0
                                                                    Message 5089 - Posted 11 Mar 2010 20:32:27 UTC

                                                                      I wouldn't be surprised if there is a chorus of complaints from the Rosetta users about the way these fcDE-W3... units are hogging the hard drive. I have found that if you want to open files, access the internet, etc., you have to put BOINC on snooze because the hard drive is otherwise occupied. The task manager shows the processors swinging between 0 and 100% at very frequent intervals. In the past the indicator showed a steady 100%. To top it all, the validate error rate is still at a constant 100%.

                                                                      Profile Conan

                                                                      Joined: Feb 16 06
                                                                      Posts: 344
                                                                      ID: 145
                                                                      Credit: 1,309,534
                                                                      RAC: 0
                                                                      Message 5090 - Posted 12 Mar 2010 0:04:01 UTC

                                                                        While the validate errors seem to have been corrected (at least the ones I am processing now), they all have quite short run times and ALL of them process 10,000 Decoys then finish.

                                                                        This is a new record; it is the most decoys I have seen since I started this project (I am fairly certain).

                                                                        Looks like this is an inbuilt maximum number of decoys that can be processed for a work unit.
                                                                        ____________

                                                                        Profile [VENETO] boboviz

                                                                        Joined: Apr 9 08
                                                                        Posts: 474
                                                                        ID: 4205
                                                                        Credit: 681,030
                                                                        RAC: 132
                                                                        Message 5091 - Posted 12 Mar 2010 9:27:07 UTC - in response to Message 5090.

                                                                          While the validate errors seem to have been corrected (at least the ones I am processing now)


                                                                          I hope.....

                                                                          Tonno

                                                                          Joined: Nov 23 06
                                                                          Posts: 16
                                                                          ID: 2269
                                                                          Credit: 49,841
                                                                          RAC: 0
                                                                          Message 5092 - Posted 12 Mar 2010 11:51:21 UTC - in response to Message 5091.

                                                                            Something is wrong with the "fcDE-W3" WUs.
                                                                            The graphics show only one of the three structures, and the stage and energy are always at "zero".

                                                                            Profile [VENETO] boboviz

                                                                            Joined: Apr 9 08
                                                                            Posts: 474
                                                                            ID: 4205
                                                                            Credit: 681,030
                                                                            RAC: 132
                                                                            Message 5093 - Posted 13 Mar 2010 6:40:13 UTC

                                                                              A gunn-fragments error:

                                                                              1759776

                                                                              ERROR: ct == final_atoms
                                                                              ERROR:: Exit from: ..\..\src\core\scoring\rms_util.cc line: 397
                                                                              BOINC:: Error reading and gzipping output datafile: default.out
                                                                              called boinc_finish


                                                                              and, as usual, tons of validate errors in fcDE-W3 WUs.

                                                                              strauch

                                                                              Joined: Mar 15 10
                                                                              Posts: 1
                                                                              ID: 15285
                                                                              Credit: 4,730
                                                                              RAC: 0
                                                                              Message 5094 - Posted 15 Mar 2010 19:07:06 UTC - in response to Message 5092.

                                                                                Thanks for pointing this one out. We are working on a fix for those jobs.

                                                                                Profile [VENETO] boboviz

                                                                                Joined: Apr 9 08
                                                                                Posts: 474
                                                                                ID: 4205
                                                                                Credit: 681,030
                                                                                RAC: 132
                                                                                Message 5096 - Posted 16 Mar 2010 20:58:35 UTC

                                                                                  As usual, validate errors
                                                                                  1765734
                                                                                  1765727
                                                                                  1765728

                                                                                  Most of the errors are on my Phenom II X4 (with Windows 7). Problems with the L3 cache? SSE extensions?
                                                                                  No problems with the Turion X2 (Windows 7) or the AMD mobile single core (Windows XP).

                                                                                  coturnix

                                                                                  Joined: Jan 5 10
                                                                                  Posts: 9
                                                                                  ID: 15073
                                                                                  Credit: 196,185
                                                                                  RAC: 0
                                                                                  Message 5097 - Posted 21 Mar 2010 8:23:26 UTC

                                                                                    Quite a few placestub_alt_denovo_1zvy_****_ProteinInterfaceDesign_19Mar2010_14558_*** work units report

                                                                                    ERROR: Value of inactive option accessed: -holes:dalphaball
                                                                                    (e.g. placestub_alt_denovo_1zvy_2amh_ProteinInterfaceDesign_19Mar2010_14558_2)

                                                                                    Evan

                                                                                    Joined: Dec 23 07
                                                                                    Posts: 75
                                                                                    ID: 3893
                                                                                    Credit: 69,584
                                                                                    RAC: 0
                                                                                    Message 5099 - Posted 21 Mar 2010 12:16:09 UTC

                                                                                      validate error for
                                                                                      placestub_alt_denovo_1zvy_2r0j_ProteinInterfaceDesign_19Mar2010_14558_1_0

                                                                                      50% (6 out of 12) of these work units have failed with this error

                                                                                      Snagletooth

                                                                                      Joined: May 4 07
                                                                                      Posts: 65
                                                                                      ID: 3020
                                                                                      Credit: 112,601
                                                                                      RAC: 3
                                                                                      Message 5100 - Posted 21 Mar 2010 13:51:21 UTC

                                                                                        Compute error on my Mac: placestub_alt_denovo_1zvy_2quo_ProteinInterfaceDesign_19Mar2010_14558_2_0
                                                                                        ERROR: Value of inactive option accessed: -holes:dalphaball
                                                                                        SIGSEGV: segmentation violation

                                                                                        Billy

                                                                                        Joined: Jan 29 07
                                                                                        Posts: 13
                                                                                        ID: 2592
                                                                                        Credit: 5,855
                                                                                        RAC: 0
                                                                                        Message 5101 - Posted 22 Mar 2010 3:53:28 UTC

                                                                                          Result 1773378

                                                                                          It got stuck and kept running for 4 hours; tasks usually take 1 hour. It ended in a crash.

                                                                                          Intel Mac OSX 10.4

                                                                                          Profile [VENETO] boboviz

                                                                                          Joined: Apr 9 08
                                                                                          Posts: 474
                                                                                          ID: 4205
                                                                                          Credit: 681,030
                                                                                          RAC: 132
                                                                                          Message 5102 - Posted 22 Mar 2010 7:19:36 UTC

                                                                                            As usual, between 80% and 90% validate errors (after a correct run) on placestub_alt_denovo_*:
                                                                                            1767609
                                                                                            1767608
                                                                                            1767608
                                                                                            etc

                                                                                            I hope the validate errors will be resolved in version 2.06.

                                                                                            pwrguru

                                                                                            Joined: Mar 19 10
                                                                                            Posts: 1
                                                                                            ID: 15299
                                                                                            Credit: 301,909
                                                                                            RAC: 0
                                                                                            Message 5103 - Posted 22 Mar 2010 13:05:05 UTC

                                                                                              I am new to this project and I must say that I am far from impressed so far. The bulk of the work units I have run so far have completed fine only to be tagged as VALIDATE ERROR. How can you justify doing valid work only to have it rejected by the server?

                                                                                              Evan

                                                                                              Joined: Dec 23 07
                                                                                              Posts: 75
                                                                                              ID: 3893
                                                                                              Credit: 69,584
                                                                                              RAC: 0
                                                                                              Message 5104 - Posted 22 Mar 2010 14:13:48 UTC - in response to Message 5103.

                                                                                                This site is designed to find and rectify any problems before the work units are sent to R@H. Don't expect to have all the work units working properly. If they do then it is a bonus!

                                                                                                Profile [VENETO] boboviz

                                                                                                Joined: Apr 9 08
                                                                                                Posts: 474
                                                                                                ID: 4205
                                                                                                Credit: 681,030
                                                                                                RAC: 132
                                                                                                Message 5105 - Posted 22 Mar 2010 16:28:13 UTC - in response to Message 5103.

                                                                                                  I am new to this project and I must say that I am far from impressed so far. The bulk of the work units I have run so far have completed fine only to be tagged as VALIDATE ERROR. How can you justify doing valid work only to have it rejected by the server?


                                                                                                  Well, the "validate error" is a well-known problem.
                                                                                                  This is a beta project, so it's normal to have problems with WUs.
                                                                                                  But I agree with you, these problems are annoying.

                                                                                                  Tonno

                                                                                                  Joined: Nov 23 06
                                                                                                  Posts: 16
                                                                                                  ID: 2269
                                                                                                  Credit: 49,841
                                                                                                  RAC: 0
                                                                                                  Message 5106 - Posted 23 Mar 2010 1:03:02 UTC - in response to Message 5105.

                                                                                                    1774657
                                                                                                    1776020
                                                                                                    1776022

                                                                                                    ERROR: in::file::zip minirosetta_database.zip does not exist!
                                                                                                    ERROR:: Exit from: ..\..\src\apps\public\boinc\minirosetta.cc line: 137
                                                                                                    BOINC:: Error reading and gzipping output datafile: default.out
                                                                                                    called boinc_finish

                                                                                                    Tonno

                                                                                                    Joined: Nov 23 06
                                                                                                    Posts: 16
                                                                                                    ID: 2269
                                                                                                    Credit: 49,841
                                                                                                    RAC: 0
                                                                                                    Message 5107 - Posted 23 Mar 2010 1:34:12 UTC - in response to Message 5106.

                                                                                                      I checked some validate errors and found that they happen in WUs that were suspended by the BOINC manager (6.10.36) to let other WUs start.
                                                                                                      This is a common problem that I see in the recent versions (6.x): the BOINC manager doesn't let the WU finish, but suspends it to start another one from the same project.
                                                                                                      I have 10 RALPH WUs that were suspended at 60-80% (taking a lot of memory, making the computer really slow and sometimes crashing the BOINC manager). When resumed, they finished quickly and gave validate errors.
                                                                                                      On the other PC, with version 5.10.45 of the BOINC manager, I don't have this problem and have had no validate errors (out of a total of 27 WUs).

                                                                                                      Can someone else confirm that?

                                                                                                      Profile [VENETO] boboviz

                                                                                                      Joined: Apr 9 08
                                                                                                      Posts: 474
                                                                                                      ID: 4205
                                                                                                      Credit: 681,030
                                                                                                      RAC: 132
                                                                                                      Message 5108 - Posted 23 Mar 2010 7:59:51 UTC - in response to Message 5107.

                                                                                                        I have only 6.10.36 clients, but I will download an older version now and try it.


                                                                                                        Copyright © 2017 University of Washington
