RALPH@home

Bug reports for Ralph 5.25

  UW Seal
 
[ Home ] [ Join ] [ About ] [ Participants ] [ Community ] [ Statistics ]
  [ login/out ]


Advanced search

Message boards : RALPH@home bug list : Bug reports for Ralph 5.25

AuthorMessage
Rhiju
Forum moderator
Project developer
Project scientist

Joined: Feb 14 06
Posts: 161
ID: 4
Credit: 3,725
RAC: 0
Message 1875 - Posted 29 Jun 2006 22:43:29 UTC

    This version has a very slight change that will fix a bug in checkpointing that occurs a small fraction of the time in ABRELAX workunits.
    ____________

    Pieface

    Joined: Feb 16 06
    Posts: 64
    ID: 234
    Credit: 203,513
    RAC: 0
    Message 1876 - Posted 1 Jul 2006 19:16:08 UTC

      Hmmmm here\'s an odd one...
      WU 204303 .
      it\'s been so long since i had to do this I forgot how to format it!
      Keep up the good work!!!!!!!!

      Profile Carlos_Pfitzner
      Avatar

      Joined: Feb 16 06
      Posts: 182
      ID: 296
      Credit: 22,792
      RAC: 0
      Message 1877 - Posted 3 Jul 2006 11:35:36 UTC

        Last modified: 3 Jul 2006 11:48:21 UTC

        Incorrect function. (0x1) - exit code 1 (0x1)
        ERROR:: Exit at: .\\fragments.cc line:689

        http://ralph.bakerlab.org/result.php?resultid=207053
        SIGSEGV: segmentation violation
        http://ralph.bakerlab.org/result.php?resultid=205541

        ____________
        Click signature for global team stats

        Pieface

        Joined: Feb 16 06
        Posts: 64
        ID: 234
        Credit: 203,513
        RAC: 0
        Message 1878 - Posted 3 Jul 2006 15:58:45 UTC

          Last modified: 3 Jul 2006 16:00:07 UTC

          Just noticed that on the last one I reported it was sent to three folks and all three got bitten by the watch-dog. Here\'s another from today:
          WU 180623

          Profile Carlos_Pfitzner
          Avatar

          Joined: Feb 16 06
          Posts: 182
          ID: 296
          Credit: 22,792
          RAC: 0
          Message 1879 - Posted 3 Jul 2006 23:49:55 UTC - in response to Message 1878.

            Just noticed that on the last one I reported it was sent to three folks and all three got bitten by the watch-dog. Here\'s another from today:
            WU 180623


            The real error was this one
            WARNING! attempt to gzip file .\\aat329.out failed: file does not exist.

            May be someone will \"fix\" the WU generator script ? (server side)

            Thanks
            ____________
            Click signature for global team stats

            Profile Carlos_Pfitzner
            Avatar

            Joined: Feb 16 06
            Posts: 182
            ID: 296
            Credit: 22,792
            RAC: 0
            Message 1880 - Posted 4 Jul 2006 3:46:07 UTC

              stuck at 1.044 %
              Stage Ab initio + relax
              step 341224

              http://ralph.bakerlab.org/result.php?resultid=207703 -:(
              ____________
              Click signature for global team stats

              Wabbit98

              Joined: Feb 17 06
              Posts: 1
              ID: 378
              Credit: 1,934
              RAC: 0
              Message 1881 - Posted 4 Jul 2006 4:42:56 UTC

                I do not know if this is a bug but all my current WU\'s seem to be stuck at 1.623% until about 90 minutes in and then they finished successfully.

                Ni
                ____________

                Profile Conan
                Avatar

                Joined: Feb 16 06
                Posts: 345
                ID: 145
                Credit: 1,328,309
                RAC: 299
                Message 1882 - Posted 4 Jul 2006 9:40:01 UTC

                  Last modified: 4 Jul 2006 9:44:43 UTC

                  Had 3 out of 4 workunits fail in a very short time with the error \"the system failed to find the path specified\".
                  All workunits started with \"FRA_t329_CASP7\".
                  The workunits are :-
                  http://ralph.bakerlab.org/workunit.php?wuid=185176
                  http://ralph.bakerlab.org/workunit.php?wuid=185177
                  http://ralph.bakerlab.org/workunit.php?wuid=185178

                  Other workunit seems to be ok, it is a t347_CASP7.
                  Hope this is of help as the correction from 5.24 needs correcting.


                  ____________

                  Profile Conan
                  Avatar

                  Joined: Feb 16 06
                  Posts: 345
                  ID: 145
                  Credit: 1,328,309
                  RAC: 299
                  Message 1883 - Posted 4 Jul 2006 11:43:13 UTC - in response to Message 1882.

                    Had 3 out of 4 workunits fail in a very short time with the error \"the system failed to find the path specified\".
                    All workunits started with \"FRA_t329_CASP7\".
                    The workunits are :-
                    http://ralph.bakerlab.org/workunit.php?wuid=185176
                    http://ralph.bakerlab.org/workunit.php?wuid=185177
                    http://ralph.bakerlab.org/workunit.php?wuid=185178

                    Other workunit seems to be ok, it is a t347_CASP7.
                    Hope this is of help as the correction from 5.24 needs correcting.

                    >>> Spoke to soon,
                    workunit type t347_CASP7 http://ralph.bakerlab.org/workunit.php?wuid=177477
                    also failed but with error \"unhandled exception detected\" access violation.



                    ____________

                    Profile Astro

                    Joined: Feb 16 06
                    Posts: 141
                    ID: 48
                    Credit: 32,977
                    RAC: 0
                    Message 1884 - Posted 5 Jul 2006 11:22:09 UTC

                      Woke up to a screensaver and a \"runtime\" error box on my AMD64 3700. It\'s the first time I\'ve seen this one. it looked like this



                      So I hit printscreen and pasted it to Photoshop, before I could finish editing the photo it happened again as can be seen below.



                      Here\'s what my Boinc Manager looked like:



                      And the WUs were wuid=185158 and wuid=185157. I noticed one other user had an error with these same WUs before they were issued to me. Here\'s the Result ID\'s:


                      Result ID 209014
                      Name FRA_t329_CASP7_hom001_8_858_3_1
                      Workunit 185157
                      Created 5 Jul 2006 4:25:27 UTC
                      Sent 5 Jul 2006 4:25:38 UTC
                      Received 5 Jul 2006 11:00:31 UTC
                      Server state Over
                      Outcome Client error
                      Client state Computing
                      Exit status 3 (0x3)
                      Computer ID 2172
                      Report deadline 9 Jul 2006 4:25:38 UTC
                      CPU time 99.3125
                      stderr out <core_client_version>5.5.4</core_client_version>
                      <message>
                      The system cannot find the path specified. (0x3) - exit code 3 (0x3)
                      </message>
                      <stderr_txt>
                      # cpu_run_time_pref: 14400
                      # random seed: 2970327

                      </stderr_txt>


                      Validate state Invalid
                      Claimed credit 0.389839542825192
                      Granted credit 0
                      application version 5.25

                      and

                      Result ID 209015
                      Name FRA_t329_CASP7_hom001_8_858_4_1
                      Workunit 185158
                      Created 5 Jul 2006 4:25:27 UTC
                      Sent 5 Jul 2006 4:25:38 UTC
                      Received 5 Jul 2006 11:00:31 UTC
                      Server state Over
                      Outcome Client error
                      Client state Computing
                      Exit status 3 (0x3)
                      Computer ID 2172
                      Report deadline 9 Jul 2006 4:25:38 UTC
                      CPU time 72.921875
                      stderr out <core_client_version>5.5.4</core_client_version>
                      <message>
                      The system cannot find the path specified. (0x3) - exit code 3 (0x3)
                      </message>
                      <stderr_txt>
                      # random seed: 2970326

                      </stderr_txt>


                      Validate state Invalid
                      Claimed credit 0.286246247068151
                      Granted credit 0
                      application version 5.25

                      Profile Fuzzy Hollynoodles
                      Avatar

                      Joined: Feb 19 06
                      Posts: 37
                      ID: 585
                      Credit: 2,089
                      RAC: 0
                      Message 1885 - Posted 5 Jul 2006 18:30:11 UTC

                        Windows Runtime Error.

                        http://ralph.bakerlab.org/workunit.php?wuid=185165

                        Result: http://ralph.bakerlab.org/result.php?resultid=209160


                        ____________

                        "I'm trying to maintain a shred of dignity in this world." - Me

                        Profile Astro

                        Joined: Feb 16 06
                        Posts: 141
                        ID: 48
                        Credit: 32,977
                        RAC: 0
                        Message 1886 - Posted 5 Jul 2006 21:53:01 UTC

                          I just got the third one of these. The three WUs were:

                          FRA_t329_CASP7_hom001_8_858_3
                          FRA_t329_CASP7_hom001_8_858_4
                          FRA_t329_CASP7_hom001_8_858_5

                          I think I see a pattern. lol

                          Each WU I did had previously failed for one other user, prior to them failing for me.

                          tony

                          Profile Carlos_Pfitzner
                          Avatar

                          Joined: Feb 16 06
                          Posts: 182
                          ID: 296
                          Credit: 22,792
                          RAC: 0
                          Message 1887 - Posted 9 Jul 2006 13:53:19 UTC

                            Last modified: 9 Jul 2006 13:55:02 UTC

                            The system cannot find the path specified. (0x3) - exit code 3 (0x3)
                            http://ralph.bakerlab.org/result.php?resultid=213243

                            Along with a windows popup that left my pc IDLE for hours !

                            Hours cause this pc is monitored eventually.

                            *in case of real remote pc trully unmonitored
                            this could left that pc not crunching anything more forever,
                            -or- maybe only until that pc nobreak breaks, and a reboot do occurs.

                            *in case of a crunching pc on a commercial company,
                            that popup may cause the bigboss ask to stop crunching anything
                            on all u own co. pcs, cause this popup disturbs their employee works!

                            So,
                            please, avoid that windows popups triggered by app errors


                            Thanks
                            ____________
                            Click signature for global team stats

                            Profile Carlos_Pfitzner
                            Avatar

                            Joined: Feb 16 06
                            Posts: 182
                            ID: 296
                            Credit: 22,792
                            RAC: 0
                            Message 1888 - Posted 9 Jul 2006 14:08:43 UTC

                              WU download error: couldn\'t get input files:
                              http://ralph.bakerlab.org/result.php?resultid=212229
                              *Any problem with ralph servers ???

                              [B^S] suguruhirahara

                              Joined: Mar 5 06
                              Posts: 40
                              ID: 992
                              Credit: 6,001
                              RAC: 0
                              Message 1889 - Posted 12 Jul 2006 22:54:25 UTC

                                http://ralph.bakerlab.org/result.php?resultid=216027

                                WARNING! attempt to gzip file .\\xxt319.out failed: file does not exist.

                                ____________

                                [B^S] suguruhirahara

                                Joined: Mar 5 06
                                Posts: 40
                                ID: 992
                                Credit: 6,001
                                RAC: 0
                                Message 1890 - Posted 13 Jul 2006 2:45:21 UTC

                                  Last modified: 13 Jul 2006 2:45:38 UTC

                                  same thing happened here:
                                  http://ralph.bakerlab.org/result.php?resultid=216029
                                  http://ralph.bakerlab.org/result.php?resultid=216028

                                  Both contain:

                                  WARNING! attempt to gzip file .\\xxt319.out failed: file does not exist.

                                  ____________

                                  STE\/E

                                  Joined: Feb 16 06
                                  Posts: 27
                                  ID: 166
                                  Credit: 576,975
                                  RAC: 2
                                  Message 1891 - Posted 13 Jul 2006 14:44:53 UTC

                                    WARNING! attempt to gzip file .\\xxt319.out failed: file does not exist.

                                    Same thing here, I\'ve had 11 of them reporting the same message the last few days after they seemingly have run their full course of about 1 hour ... O_o

                                    STE\/E

                                    Joined: Feb 16 06
                                    Posts: 27
                                    ID: 166
                                    Credit: 576,975
                                    RAC: 2
                                    Message 1892 - Posted 14 Jul 2006 22:05:57 UTC

                                      The App seems to be okay now, out of 154 WU\'s run the last 2 days only 2 have Erred out, and both of them were at the beginning of the 154 WU\'s ...

                                      Profile feet1st

                                      Joined: Mar 7 06
                                      Posts: 312
                                      ID: 1028
                                      Credit: 110,522
                                      RAC: 0
                                      Message 1893 - Posted 19 Jul 2006 14:34:14 UTC

                                        WU failed with this pop-up


                                        BOINC ran a single thread on a dual-core until I clicked OK, then this message was displayed in BOINC Manager, and another WU began.

                                        7/19/2006 9:23:25 AM|ralph@home|Unrecoverable error for result t353_LOOPRELAX_hom002_S_00001_0004344_0_1030_4_1 (The system cannot find the path specified. (0x3) - exit code 3 (0x3))

                                        WU report shows:
                                        The system cannot find the path specified. (0x3) - exit code 3 (0x3)

                                        ____________

                                        Profile anders n

                                        Joined: Feb 16 06
                                        Posts: 166
                                        ID: 91
                                        Credit: 131,419
                                        RAC: 0
                                        Message 1894 - Posted 20 Jul 2006 16:30:26 UTC

                                          Got 2 failiurs \"WU download error: couldn\'t get input files:\"

                                          http://ralph.bakerlab.org/results.php?hostid=118

                                          Rosetta works fine.

                                          Anders n
                                          ____________

                                          Profile paul and kirsty yates
                                          Avatar

                                          Joined: Feb 16 06
                                          Posts: 11
                                          ID: 310
                                          Credit: 949
                                          RAC: 0
                                          Message 1895 - Posted 21 Jul 2006 20:55:04 UTC

                                            Last modified: 21 Jul 2006 20:58:31 UTC

                                            just got this one
                                            21/07/2006 21:39:02|ralph@home|Giving up on download of hom011_S_00001_0000033_1.obligate_loopfile.gz: file was not found on server

                                            21/07/2006 21:39:02|ralph@home|Checksum or signature error for hom011_S_00001_0000033_1.obligate_loopfile.gz

                                            21/07/2006 21:39:04|ralph@home|Unrecoverable error for result t353_LOOPRELAX_hom011_S_00001_0000033_1_1030_15_2 (WU download error: couldn\'t get input files:<file_xfer_error> <file_name>hom011_S_00001_0000033_1.obligate_loopfile.gz</file_name> <error_code>-163</error_code> <error_message>file was not found on server</error_message></file_xfer_error>)

                                            ____________

                                            bt1228

                                            Joined: Mar 22 06
                                            Posts: 7
                                            ID: 1167
                                            Credit: 9,385
                                            RAC: 0
                                            Message 1902 - Posted 24 Jul 2006 3:42:13 UTC

                                              What\'s the difference between Rosetta 5.25 and Ralph\'s Rosetta_Beta 5.25 ?

                                              --- bt

                                              [B^S] suguruhirahara

                                              Joined: Mar 5 06
                                              Posts: 40
                                              ID: 992
                                              Credit: 6,001
                                              RAC: 0
                                              Message 1903 - Posted 24 Jul 2006 6:42:12 UTC

                                                Last modified: 24 Jul 2006 6:43:53 UTC

                                                Isn\'t it almost same one with a little configuration change?

                                                Wait for answer until developers finish the work on CASP.
                                                ____________

                                                blackbird

                                                Joined: Feb 19 06
                                                Posts: 2
                                                ID: 575
                                                Credit: 12,029
                                                RAC: 0
                                                Message 1904 - Posted 24 Jul 2006 17:54:46 UTC

                                                  Ralph 5.25 has terminated this WU on 96.64%.
                                                  stderr.log:


                                                  Exiting...
                                                  Graphics are disabled due to configuration...
                                                  # cpu_run_time_pref: 345600
                                                  Graphics are disabled due to configuration...
                                                  # cpu_run_time_pref: 345600
                                                  Graphics are disabled due to configuration...
                                                  # cpu_run_time_pref: 345600
                                                  SIGSEGV: segmentation violation
                                                  Stack trace (20 frames):
                                                  [0x8849d1b]
                                                  [0x8861dcc]
                                                  [0xffffe420]
                                                  [0x88e40a9]
                                                  [0x88b2de7]
                                                  [0x88b51d1]
                                                  [0x809f31c]
                                                  [0x86b4609]
                                                  [0x86bab32]
                                                  [0x84b14f8]
                                                  [0x84b343b]
                                                  [0x84b6573]
                                                  [0x84b8231]
                                                  [0x87e6e77]
                                                  [0x86c3ac7]
                                                  [0x805f7c1]
                                                  [0x846e09d]
                                                  [0x8470594]
                                                  [0x88c12b4]
                                                  [0x8048111]


                                                  stdout.txt:

                                                  [T/F OPT]Default FALSE value for [-minimize_exclude_helix]
                                                  [T/F OPT]Default FALSE value for [-minimize_exclude_strand]
                                                  CYCLES::number is 1 x total_residue: 86
                                                  initializing full atom coordinates
                                                  BOINC :: [2006-07-23 23:31:01] :: checkpoint_decoys() :: saved decoy info :: attempted_decoys: 305 :: num_decoys: 305 :: farlx_stage: 10
                                                  dump_fullatom_pdb: farlxcheck
                                                  starting score 4429.49316 rms 0
                                                  starting full atom minimization
                                                  [T/F OPT]Default FALSE value for [-infinite_loop]
                                                  score_filter: tag= relax_score_filter1 score= -112.872 rank= 91 max_rank= 95 nscores= 190 filter_score= -112.28


                                                  Suse Linux 10.1 on Athlon 2400+
                                                  ____________

                                                  Pieface

                                                  Joined: Feb 16 06
                                                  Posts: 64
                                                  ID: 234
                                                  Credit: 203,513
                                                  RAC: 0
                                                  Message 1905 - Posted 25 Jul 2006 13:02:41 UTC

                                                    Last modified: 25 Jul 2006 13:05:06 UTC

                                                    not exactly sure this is a bug, but I noticed that a couple of the t372 units ran longer than expected on my 1ghz win xp machine:
                                                    resid 231586 ran for 19,675 secs and
                                                    resid 231585 ran for 22,102 secs.
                                                    Both were still on the first structure, so maybe these guys need to be restricted to the faster machines?

                                                    [edit:] I guess I should have mentioned - run-time pref is set to 4 hours (14,400 secs)
                                                    ____________

                                                    Pieface

                                                    Joined: Feb 16 06
                                                    Posts: 64
                                                    ID: 234
                                                    Credit: 203,513
                                                    RAC: 0
                                                    Message 1906 - Posted 25 Jul 2006 16:02:45 UTC

                                                      As an update to my last, I have another of those t372 units running on the same machine. CPU time 4:18, pct complete 1.044, est to completion 10:12 and climbing. i\'m running 50/50 with rosie, but machine is in EDF mode running ralph\'s exclusively as I have four other projects set to no-new-work so the scheduler thinks ralph/rosie are only getting 16-2/3 pct of the cpu each and won\'t be able to finish two days work in the next week or so... oh wellll...

                                                      Mahray

                                                      Joined: Feb 17 06
                                                      Posts: 1
                                                      ID: 417
                                                      Credit: 1,945
                                                      RAC: 0
                                                      Message 1907 - Posted 27 Jul 2006 3:50:52 UTC

                                                        I just had three workunits with screwed up downloads, all the same error. It looks like everyone else downloading had the same problem.

                                                        Units are:

                                                        http://ralph.bakerlab.org/workunit.php?wuid=205131
                                                        http://ralph.bakerlab.org/workunit.php?wuid=205144
                                                        http://ralph.bakerlab.org/workunit.php?wuid=205143

                                                        Messages below

                                                        ************************************************************************

                                                        27/07/2006 1:32:14 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                        27/07/2006 1:32:14 PM|ralph@home|Reason: To fetch work
                                                        27/07/2006 1:32:14 PM|ralph@home|Requesting 43200 seconds of new work
                                                        27/07/2006 1:32:34 PM|ralph@home|Scheduler request succeeded
                                                        27/07/2006 1:32:36 PM|ralph@home|Started download of file bq_cterm_hom001_t386_.fasta.gz
                                                        27/07/2006 1:32:36 PM|ralph@home|Started download of file bq_cterm_hom001_t386_.psipred_ss2.gz
                                                        27/07/2006 1:32:41 PM|ralph@home|Finished download of file bq_cterm_hom001_t386_.fasta.gz
                                                        27/07/2006 1:32:41 PM|ralph@home|Throughput 37 bytes/sec
                                                        27/07/2006 1:32:41 PM|ralph@home|Finished download of file bq_cterm_hom001_t386_.psipred_ss2.gz
                                                        27/07/2006 1:32:41 PM|ralph@home|Throughput 286 bytes/sec
                                                        27/07/2006 1:32:41 PM|ralph@home|Started download of file boinc_bq_cterm_hom001_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:32:41 PM|ralph@home|Started download of file boinc_bq_cterm_hom001_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:33:09 PM|ralph@home|Finished download of file boinc_bq_cterm_hom001_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:33:09 PM|ralph@home|Throughput 7287 bytes/sec
                                                        27/07/2006 1:33:09 PM|ralph@home|Started download of file bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:33:15 PM|ralph@home|Incomplete read of less than 5KB for bq_cterm_hom001_killlocal.bar.gz - truncating
                                                        27/07/2006 1:33:15 PM|ralph@home|Finished download of file bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:33:15 PM|ralph@home|Throughput 33 bytes/sec
                                                        27/07/2006 1:33:15 PM|ralph@home|Started download of file casp7.description.shorter.txt
                                                        27/07/2006 1:33:15 PM|ralph@home|Checksum or signature error for bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:33:16 PM|ralph@home|Unrecoverable error for result t386__CASP7_ABRELAX_SAVE_ALL_OUT_BARCODE_bq_cterm_hom001__1060_5_2 (WU download error: couldn\'t get input files:<file_xfer_error> <file_name>bq_cterm_hom001_killlocal.bar.gz</file_name> <error_code>-200</error_code></file_xfer_error>)
                                                        27/07/2006 1:33:24 PM|ralph@home|Finished download of file casp7.description.shorter.txt
                                                        27/07/2006 1:33:24 PM|ralph@home|Throughput 10 bytes/sec
                                                        27/07/2006 1:33:24 PM|ralph@home|Started download of file bq_cterm_hom002_t386_.fasta.gz
                                                        27/07/2006 1:33:32 PM|ralph@home|Finished download of file bq_cterm_hom002_t386_.fasta.gz
                                                        27/07/2006 1:33:32 PM|ralph@home|Throughput 20 bytes/sec
                                                        27/07/2006 1:33:32 PM|ralph@home|Started download of file bq_cterm_hom002_t386_.psipred_ss2.gz
                                                        27/07/2006 1:33:39 PM|ralph@home|Finished download of file bq_cterm_hom002_t386_.psipred_ss2.gz
                                                        27/07/2006 1:33:39 PM|ralph@home|Throughput 155 bytes/sec
                                                        27/07/2006 1:33:39 PM|ralph@home|Started download of file boinc_bq_cterm_hom002_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:33:41 PM|ralph@home|Finished download of file boinc_bq_cterm_hom001_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:33:41 PM|ralph@home|Throughput 13205 bytes/sec
                                                        27/07/2006 1:33:41 PM|ralph@home|Started download of file boinc_bq_cterm_hom002_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:34:09 PM|ralph@home|Finished download of file boinc_bq_cterm_hom002_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:34:09 PM|ralph@home|Throughput 7793 bytes/sec
                                                        27/07/2006 1:34:09 PM|ralph@home|Started download of file bq_cterm_hom002_killlocal.bar.gz
                                                        27/07/2006 1:34:17 PM|ralph@home|Incomplete read of less than 5KB for bq_cterm_hom002_killlocal.bar.gz - truncating
                                                        27/07/2006 1:34:17 PM|ralph@home|Finished download of file bq_cterm_hom002_killlocal.bar.gz
                                                        27/07/2006 1:34:17 PM|ralph@home|Throughput 27 bytes/sec
                                                        27/07/2006 1:34:17 PM|ralph@home|Checksum or signature error for bq_cterm_hom002_killlocal.bar.gz
                                                        27/07/2006 1:34:18 PM|ralph@home|Unrecoverable error for result t386__CASP7_ABRELAX_SAVE_ALL_OUT_BARCODE_bq_cterm_hom002__1060_5_2 (WU download error: couldn\'t get input files:<file_xfer_error> <file_name>bq_cterm_hom002_killlocal.bar.gz</file_name> <error_code>-200</error_code></file_xfer_error>)
                                                        27/07/2006 1:34:38 PM|ralph@home|Finished download of file boinc_bq_cterm_hom002_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:34:38 PM|ralph@home|Throughput 13514 bytes/sec
                                                        27/07/2006 1:35:03 PM|rosetta@home|Sending scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi
                                                        27/07/2006 1:35:03 PM|rosetta@home|Reason: Requested by user
                                                        27/07/2006 1:35:03 PM|rosetta@home|Reporting 1 tasks
                                                        27/07/2006 1:35:13 PM|rosetta@home|Scheduler request succeeded
                                                        27/07/2006 1:35:18 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                        27/07/2006 1:35:18 PM|ralph@home|Reason: Requested by user
                                                        27/07/2006 1:35:18 PM|ralph@home|Requesting 43200 seconds of new work, and reporting 2 completed tasks
                                                        27/07/2006 1:35:28 PM|ralph@home|Scheduler request succeeded
                                                        27/07/2006 1:35:28 PM|ralph@home|Message from server: Not sending work - last request too recent: 181 sec
                                                        27/07/2006 1:39:34 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                        27/07/2006 1:39:34 PM|ralph@home|Reason: To fetch work
                                                        27/07/2006 1:39:34 PM|ralph@home|Requesting 43200 seconds of new work
                                                        27/07/2006 1:39:59 PM|ralph@home|Scheduler request succeeded
                                                        27/07/2006 1:40:02 PM|ralph@home|Started download of file bq_cterm_hom001_t386_.fasta.gz
                                                        27/07/2006 1:40:03 PM|ralph@home|Started download of file bq_cterm_hom001_t386_.psipred_ss2.gz
                                                        27/07/2006 1:40:09 PM|ralph@home|Finished download of file bq_cterm_hom001_t386_.fasta.gz
                                                        27/07/2006 1:40:09 PM|ralph@home|Throughput 29 bytes/sec
                                                        27/07/2006 1:40:09 PM|ralph@home|Finished download of file bq_cterm_hom001_t386_.psipred_ss2.gz
                                                        27/07/2006 1:40:09 PM|ralph@home|Throughput 222 bytes/sec
                                                        27/07/2006 1:40:09 PM|ralph@home|Started download of file boinc_bq_cterm_hom001_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:40:09 PM|ralph@home|Started download of file boinc_bq_cterm_hom001_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:40:37 PM|ralph@home|Finished download of file boinc_bq_cterm_hom001_aat386_09_05.200_v1_3.gz
                                                        27/07/2006 1:40:37 PM|ralph@home|Throughput 7517 bytes/sec
                                                        27/07/2006 1:40:37 PM|ralph@home|Started download of file bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:40:44 PM|ralph@home|Incomplete read of less than 5KB for bq_cterm_hom001_killlocal.bar.gz - truncating
                                                        27/07/2006 1:40:44 PM|ralph@home|Finished download of file bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:40:44 PM|ralph@home|Throughput 32 bytes/sec
                                                        27/07/2006 1:40:44 PM|ralph@home|Started download of file casp7.description.shorter.txt
                                                        27/07/2006 1:40:44 PM|ralph@home|Checksum or signature error for bq_cterm_hom001_killlocal.bar.gz
                                                        27/07/2006 1:40:45 PM|ralph@home|Unrecoverable error for result t386__CASP7_ABRELAX_SAVE_ALL_OUT_BARCODE_bq_cterm_hom001__1060_3_3 (WU download error: couldn\'t get input files:<file_xfer_error> <file_name>bq_cterm_hom001_killlocal.bar.gz</file_name> <error_code>-200</error_code></file_xfer_error>)
                                                        27/07/2006 1:40:50 PM|ralph@home|Finished download of file casp7.description.shorter.txt
                                                        27/07/2006 1:40:50 PM|ralph@home|Throughput 16 bytes/sec
                                                        27/07/2006 1:40:50 PM|ralph@home|Started download of file nohistag_hom001_t363_.fasta.gz
                                                        27/07/2006 1:40:58 PM|ralph@home|Finished download of file nohistag_hom001_t363_.fasta.gz
                                                        27/07/2006 1:40:58 PM|ralph@home|Throughput 19 bytes/sec
                                                        27/07/2006 1:40:58 PM|ralph@home|Started download of file nohistag_hom001_t363_.psipred_ss2.gz
                                                        27/07/2006 1:41:06 PM|ralph@home|Finished download of file nohistag_hom001_t363_.psipred_ss2.gz
                                                        27/07/2006 1:41:06 PM|ralph@home|Throughput 136 bytes/sec
                                                        27/07/2006 1:41:06 PM|ralph@home|Started download of file boinc_nohistag_hom001_aat363_03_05.200_v1_3.gz
                                                        27/07/2006 1:41:10 PM|ralph@home|Finished download of file boinc_bq_cterm_hom001_aat386_03_05.200_v1_3.gz
                                                        27/07/2006 1:41:10 PM|ralph@home|Throughput 13346 bytes/sec
                                                        27/07/2006 1:41:10 PM|ralph@home|Started download of file boinc_nohistag_hom001_aat363_09_05.200_v1_3.gz
                                                        27/07/2006 1:41:12 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                        27/07/2006 1:41:12 PM|ralph@home|Reason: Requested by user
                                                        27/07/2006 1:41:12 PM|ralph@home|Reporting 1 tasks
                                                        27/07/2006 1:41:27 PM|ralph@home|Scheduler request succeeded
                                                        27/07/2006 1:41:41 PM|ralph@home|Finished download of file boinc_nohistag_hom001_aat363_09_05.200_v1_3.gz
                                                        27/07/2006 1:41:41 PM|ralph@home|Throughput 6360 bytes/sec
                                                        27/07/2006 1:41:41 PM|ralph@home|Started download of file hom020_S_00004_0000864_0.pdb.gz
                                                        27/07/2006 1:41:53 PM|ralph@home|Finished download of file hom020_S_00004_0000864_0.pdb.gz
                                                        27/07/2006 1:41:53 PM|ralph@home|Throughput 972 bytes/sec
                                                        27/07/2006 1:41:53 PM|ralph@home|Started download of file hom020_S_00004_0000864_0.loopfile.gz
                                                        27/07/2006 1:42:00 PM|ralph@home|Finished download of file hom020_S_00004_0000864_0.loopfile.gz
                                                        27/07/2006 1:42:00 PM|ralph@home|Throughput 13 bytes/sec
                                                        27/07/2006 1:42:00 PM|ralph@home|Started download of file hom020_S_00004_0000864_0.obligate_loopfile.gz
                                                        27/07/2006 1:42:07 PM|ralph@home|Finished download of file boinc_nohistag_hom001_aat363_03_05.200_v1_3.gz
                                                        27/07/2006 1:42:07 PM|ralph@home|Throughput 13009 bytes/sec
                                                        27/07/2006 1:42:07 PM|ralph@home|Finished download of file hom020_S_00004_0000864_0.obligate_loopfile.gz
                                                        27/07/2006 1:42:07 PM|ralph@home|Throughput 9 bytes/sec
                                                        27/07/2006 1:42:08 PM||Rescheduling CPU: files downloaded
                                                        27/07/2006 1:42:08 PM|QMC@HOME|Pausing task 03B_stdna_nodelete.1749_0 (left in memory)
                                                        27/07/2006 1:42:09 PM|ralph@home|Starting task t363_LOOPRELAX_hom020_S_00004_0000864_0_1077_1_0 using rosetta_beta version 525

                                                        ____________

                                                        Profile feet1st

                                                        Joined: Mar 7 06
                                                        Posts: 312
                                                        ID: 1028
                                                        Credit: 110,522
                                                        RAC: 0
                                                        Message 1908 - Posted 28 Jul 2006 21:47:41 UTC

                                                          There\'s a Rosetta user getting this:
                                                          Incomplete read of less than 5KB for...
                                                          error as well. If a resolution is found, please inform them as well.
                                                          ____________

                                                          Profile paul and kirsty yates
                                                          Avatar

                                                          Joined: Feb 16 06
                                                          Posts: 11
                                                          ID: 310
                                                          Credit: 949
                                                          RAC: 0
                                                          Message 1910 - Posted 30 Jul 2006 11:05:39 UTC

                                                            Last modified: 30 Jul 2006 11:07:48 UTC

                                                            looks like all 3 of us that got this w/u have had the same error
                                                            bad w/u?

                                                            <core_client_version>5.4.9</core_client_version>
                                                            <message>
                                                            Incorrect function. (0x1) - exit code 1 (0x1)
                                                            </message>
                                                            <stderr_txt>
                                                            ERROR:: Exit at: .\\pack.cc line:7839

                                                            ____________

                                                            dainenyu

                                                            Joined: Feb 19 06
                                                            Posts: 6
                                                            ID: 565
                                                            Credit: 7,772
                                                            RAC: 0
                                                            Message 1911 - Posted 31 Jul 2006 21:12:39 UTC

                                                              Well, at least WatchDog is working.

                                                              Two people that got this WU got the Watchdog error. The third got the Incorrect function. (0x1) - exit code 1 (0x1) before a minute had passed.
                                                              ____________

                                                              Profile Conan
                                                              Avatar

                                                              Joined: Feb 16 06
                                                              Posts: 345
                                                              ID: 145
                                                              Credit: 1,328,309
                                                              RAC: 299
                                                              Message 1912 - Posted 1 Aug 2006 15:33:49 UTC

                                                                I am running Linux on an AMD Opteron machine that had 2 lots of file downloads fail within seconds of the work units starting.
                                                                All had the Error \" process exited with code 1 (0x1)
                                                                ERROR:: Exit at: pack.cc line.7839 \"

                                                                Had 24 fail on the 29/7/06 and 14 fail on the 1/8/06. There were no successful units processed, all failed.
                                                                All started with \"t347_CASP7_ABRELAX_SAVE_ALL_OUT_hom001_1087_\"
                                                                then the last parts are :-- 14_2, 15_2, 16_2, 17_2, 20_2, 132_1, 133_1, 21_2, 31_2, 32_2, 134_1, 22_2, 23_2, 24_2, 136_1, 137_1, 138_1, 37_2, 139_1, 141_1, 143_1, 144_1, 145_1, 146_1.
                                                                Second lot started the same but ended with :-- 163_2, 164_2, 165_2, 166_2, 167_2, 168_2, 169_2, 183_2, 184_2, 185_2, 186_2, 187_2, 188_2, 189_2.


                                                                ____________

                                                                Profile Conan
                                                                Avatar

                                                                Joined: Feb 16 06
                                                                Posts: 345
                                                                ID: 145
                                                                Credit: 1,328,309
                                                                RAC: 299
                                                                Message 1999 - Posted 12 Aug 2006 7:32:35 UTC

                                                                  Got another 2 Segmentation Violation errors
                                                                  http://ralph.bakerlab.org/workunit.php?wuid=211463
                                                                  http://ralph.bakerlab.org/workunit.php?wuid=211462

                                                                  Also had problem with another WU as well
                                                                  http://ralph.bakerlab.org/workunit.php?wuid=212403
                                                                  it had Process exit code 1
                                                                  ERROR:Exit at dock_structure.cc line:401

                                                                  ____________

                                                                  Profile Conan
                                                                  Avatar

                                                                  Joined: Feb 16 06
                                                                  Posts: 345
                                                                  ID: 145
                                                                  Credit: 1,328,309
                                                                  RAC: 299
                                                                  Message 2011 - Posted 12 Aug 2006 15:03:28 UTC

                                                                    Last modified: 12 Aug 2006 15:04:44 UTC

                                                                    Project people, when you updated your Ralph@home project to show the new Credit System totals, myself and a number of other testers have been unable to upload WU results. The WU uploads but the results do not. Our time is running out to have these returned on time and I have about 46 to return.
                                                                    Please see thread about \"Internal Server Error\".
                                                                    I, myself am getting a \"No Schedulers Responded\" error, as are a few others, but some are getting an \"Internal Server\" error.

                                                                    Please be prompt in repairing this as the deadline is only a day or so away.
                                                                    It has been 2 days now since the fault started.
                                                                    ____________

                                                                    STE\/E

                                                                    Joined: Feb 16 06
                                                                    Posts: 27
                                                                    ID: 166
                                                                    Credit: 576,975
                                                                    RAC: 2
                                                                    Message 2012 - Posted 12 Aug 2006 15:31:06 UTC

                                                                      I\'ve Uploaded @ Reported 10 WU\'s Just this morning ... are you sure it\'s on Ralph\'s end & not yours ... ????

                                                                      Profile Conan
                                                                      Avatar

                                                                      Joined: Feb 16 06
                                                                      Posts: 345
                                                                      ID: 145
                                                                      Credit: 1,328,309
                                                                      RAC: 299
                                                                      Message 2026 - Posted 13 Aug 2006 0:50:22 UTC

                                                                        Thanks PoorBoy, the problem was at the Ralph end. It has taken 2 days to fix. I was not the only one having the problem and it has now been fixed between my last post and this one. All my WU\'s have now uploaded.
                                                                        ____________

                                                                        Profile Conan
                                                                        Avatar

                                                                        Joined: Feb 16 06
                                                                        Posts: 345
                                                                        ID: 145
                                                                        Credit: 1,328,309
                                                                        RAC: 299
                                                                        Message 2030 - Posted 13 Aug 2006 2:19:43 UTC

                                                                          Have another 3 work units with Error SIGSEGV : Segmentation Violation
                                                                          http://ralph.bakerlab.org/workunit.php?wuid=212912
                                                                          http://ralph.bakerlab.org/workunit.php?wuid=212913
                                                                          http://ralph.bakerlab.org/workunit.php?wuid=212914

                                                                          Also another 1 with \"Process exit code 1\"
                                                                          \"ERROR:Exit at:dock_structure.cc:line:401\"
                                                                          http://ralph.bakerlab.org/workunit.php?wuid=212903

                                                                          ____________

                                                                          Profile [B^S] thierry@home
                                                                          Avatar

                                                                          Joined: Feb 15 06
                                                                          Posts: 20
                                                                          ID: 12
                                                                          Credit: 17,624
                                                                          RAC: 0
                                                                          Message 2258 - Posted 29 Aug 2006 18:56:15 UTC

                                                                            Last modified: 29 Aug 2006 18:44:52 UTC

                                                                            I just get a WU that crashes immediately:

                                                                            8/29/2006 8:44:32 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                                            8/29/2006 8:44:32 PM|ralph@home|Reason: To fetch work
                                                                            8/29/2006 8:44:32 PM|ralph@home|Requesting 19008 seconds of new work
                                                                            8/29/2006 8:44:37 PM|ralph@home|Scheduler request succeeded
                                                                            8/29/2006 8:44:39 PM|ralph@home|Started download of file aa2int_03_05.200_v1_3.gz
                                                                            8/29/2006 8:44:39 PM|ralph@home|Started download of file aa2int_09_05.200_v1_3.gz
                                                                            8/29/2006 8:45:10 PM|ralph@home|Finished download of file aa2int_03_05.200_v1_3.gz
                                                                            8/29/2006 8:45:10 PM|ralph@home|Throughput 54665 bytes/sec
                                                                            8/29/2006 8:45:10 PM|ralph@home|Started download of file 2int_.fasta.gz
                                                                            8/29/2006 8:45:11 PM|ralph@home|Finished download of file 2int_.fasta.gz
                                                                            8/29/2006 8:45:11 PM|ralph@home|Throughput 475 bytes/sec
                                                                            8/29/2006 8:45:11 PM|ralph@home|Started download of file 2int.loop_file.gz
                                                                            8/29/2006 8:45:12 PM|ralph@home|Finished download of file 2int.loop_file.gz
                                                                            8/29/2006 8:45:12 PM|ralph@home|Throughput 240 bytes/sec
                                                                            8/29/2006 8:45:12 PM|ralph@home|Started download of file 2int_1_model_12_idl.pdb.gz
                                                                            8/29/2006 8:45:15 PM|ralph@home|Finished download of file 2int_1_model_12_idl.pdb.gz
                                                                            8/29/2006 8:45:15 PM|ralph@home|Throughput 32415 bytes/sec
                                                                            8/29/2006 8:45:15 PM|ralph@home|Started download of file paths_200_2int.txt
                                                                            8/29/2006 8:45:16 PM|ralph@home|Finished download of file paths_200_2int.txt
                                                                            8/29/2006 8:45:16 PM|ralph@home|Throughput 3689 bytes/sec
                                                                            8/29/2006 8:45:20 PM|ralph@home|Finished download of file aa2int_09_05.200_v1_3.gz
                                                                            8/29/2006 8:45:20 PM|ralph@home|Throughput 96325 bytes/sec
                                                                            8/29/2006 8:45:21 PM||Rescheduling CPU: files downloaded
                                                                            8/29/2006 8:45:21 PM|rosetta@home|Pausing task BENCH_ABRELAX_SAVE_ALL_OUT_1iibA_BARCODE_R55_filters_1214_6701_0 (left in memory)
                                                                            8/29/2006 8:45:22 PM|ralph@home|Starting task NMR_2int_CASPR_1_2int_1_model_12IGNORE_THE_REST_idl_1266_6_0 using rosetta_beta version 525
                                                                            8/29/2006 8:45:59 PM||Rescheduling CPU: application exited
                                                                            8/29/2006 8:45:59 PM|ralph@home|Computation for task NMR_2int_CASPR_1_2int_1_model_12IGNORE_THE_REST_idl_1266_6_0 finished
                                                                            8/29/2006 8:45:59 PM|rosetta@home|Resuming task BENCH_ABRELAX_SAVE_ALL_OUT_1iibA_BARCODE_R55_filters_1214_6701_0 using rosetta version 525
                                                                            8/29/2006 8:46:00 PM|ralph@home|Unrecoverable error for result NMR_2int_CASPR_1_2int_1_model_12IGNORE_THE_REST_idl_1266_6_0 (<file_xfer_error> <file_name>NMR_2int_CASPR_1_2int_1_model_12IGNORE_THE_REST_idl_1266_6_0_0</file_name> <error_code>-161</error_code></file_xfer_error>)
                                                                            8/29/2006 8:48:41 PM|ralph@home|Sending scheduler request to http://ralph.bakerlab.org/ralph_cgi/cgi
                                                                            8/29/2006 8:48:41 PM|ralph@home|Reason: To fetch work
                                                                            8/29/2006 8:48:41 PM|ralph@home|Requesting 19008 seconds of new work, and reporting 1 completed tasks
                                                                            8/29/2006 8:48:46 PM|ralph@home|Scheduler request succeeded
                                                                            8/29/2006 8:48:48 PM|ralph@home|Started download of file 2int_1_model_11_idl.pdb.gz
                                                                            8/29/2006 8:48:51 PM|ralph@home|Finished download of file 2int_1_model_11_idl.pdb.gz
                                                                            8/29/2006 8:48:51 PM|ralph@home|Throughput 18677 bytes/sec
                                                                            8/29/2006 8:48:52 PM||Rescheduling CPU: files downloaded
                                                                            8/29/2006 8:48:52 PM|rosetta@home|Pausing task BENCH_ABRELAX_SAVE_ALL_OUT_1iibA_BARCODE_R55_filters_1214_6701_0 (left in memory)
                                                                            8/29/2006 8:48:53 PM|ralph@home|Starting task NMR_2int_CASPR_1_2int_1_model_11IGNORE_THE_REST_idl_1266_7_1 using rosetta_beta version 525
                                                                            8/29/2006 8:49:30 PM||Rescheduling CPU: application exited
                                                                            8/29/2006 8:49:30 PM|ralph@home|Computation for task NMR_2int_CASPR_1_2int_1_model_11IGNORE_THE_REST_idl_1266_7_1 finished

                                                                            Windows XP pro sp2
                                                                            P4 3.0 HT (on)
                                                                            1 Gb RAM
                                                                            BOINC 5.4.11

                                                                            Message boards : RALPH@home bug list : Bug reports for Ralph 5.25


                                                                            Home | Join | About | Participants | Community | Statistics

                                                                            Copyright © 2017 University of Washington

                                                                            Last Modified: 20 Nov 2008 19:41:56 UTC
                                                                            Back to top ^