Message boards : RALPH@home bug list : Bug reports for 5.49-5.51
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Yea, that's a big one -- I better not send it out again! I'm tracking down a few other RNA workunits that have crashed. On the whole, things are looking good, though I'll probably need to run more tests, and do another app update this week. This Wu was atleast at step 1.000.000 when it timed out! |
anders n Send message Joined: 16 Feb 06 Posts: 166 Credit: 131,419 RAC: 0 |
process exited with code 1 (0x1) On all my new Wu-s like this one. https://ralph.bakerlab.org/result.php?resultid=446166 Anders n |
Rhiju Volunteer moderator Project developer Project scientist Send message Joined: 14 Feb 06 Posts: 161 Credit: 3,725 RAC: 0 |
Thanks for continuing to post. It took a little effort to find this last bug in this set of new WUs -- I had to trap it on my laptop and read the stdout.txt. I think I fixed it, so I'm sending out a few new jobs. process exited with code 1 (0x1) |
genes Send message Joined: 16 Feb 06 Posts: 45 Credit: 43,706 RAC: 20 |
Wow! The graphics are awesome! (not a bug) :-) |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
Something is strange with these Wu-s. Me too... v5.50, 0 decoys and yet exactly 30 nstructs, just as "anders n", and it appears to have run for the standard 10,000 seconds rather then my 24hr preference. https://ralph.bakerlab.org/result.php?resultid=445489 |
anders n Send message Joined: 16 Feb 06 Posts: 166 Credit: 131,419 RAC: 0 |
Yea, that's a big one -- I better not send it out again! No problem with me to send it again. :) If possibel the big ones should be sent to the "faster" computers. Anders n |
anders n Send message Joined: 16 Feb 06 Posts: 166 Credit: 131,419 RAC: 0 |
- Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x008F2D4C read attempt to address 0x00000E51 On a number fo Wu-s Like this one. https://ralph.bakerlab.org/result.php?resultid=450894 Anders n |
Viromancy Send message Joined: 20 Jan 07 Posts: 7 Credit: 1,425 RAC: 0 |
|
j2satx Send message Joined: 17 Feb 06 Posts: 42 Credit: 168,797 RAC: 0 |
Computer Project Date ID Message 6100M902 ralph@home 3/7/2007 4:52:11 AM 518 Reason: Unrecoverable error for result DOCKING_1rhj_SYMM_11rhj_1_d.s036_bigrun.out.85_1826_4_1 (process exited with code 131 (0x83)) |
Michael Stoeter Send message Joined: 20 Feb 06 Posts: 1 Credit: 1,097,989 RAC: 0 |
From my last 47 WU ends 41 WU with Error Exit status -1073741819 (0xffffffffc0000005) https://ralph.bakerlab.org/workunit.php?wuid=398542 |
Viromancy Send message Joined: 20 Jan 07 Posts: 7 Credit: 1,425 RAC: 0 |
Another very rapid access violation in 5.50, this time without any attempt to view the graphics when the WU started: 451729 Half the WUs my machine has processed under 5.50 have now failed within seconds of starting. The "incorrect function" errors with the first two ab-initio RNA folding WUs seem to have stopped, but both of the access violation errors today have been with DOCKING_1rhj_SYMM_11rhj_1_d.s036_bigrun.out. units. |
Ingemar Volunteer moderator Project developer Project scientist Send message Joined: 7 Mar 07 Posts: 9 Credit: 76 RAC: 0 |
Hi, the jobs named DOCK_SYMM unraveled a bug in the 5.50 release and all fail shortly after they appear. This problem will be fixed in the next release and no more jobs causing this problem will be submitted. Sorry for the inconvenience! |
feet1st Send message Joined: 7 Mar 06 Posts: 313 Credit: 116,623 RAC: 0 |
One comment on the graphics. As a new model begins, the graphic shows the strand and gradually scales down during the first few steps... as that happens, the text in the box scales down as well. With the DOC and HINGE WUs, those text labels (Searching, Low Energy, Native...) can get PRETTY small. |
Michael.L Send message Joined: 26 Nov 06 Posts: 5 Credit: 1,173 RAC: 0 |
08/03/2007 22:46:10|ralph@home|Unrecoverable error for result 1ywz_1_NMRREF_1_1ywz_1_idid_model_12IGNORE_THE_REST_idl_1831_8_0 ( - exit code -1073741819 (0xc0000005)) WU ran for only 48 seconds before failing. i think the new graphics are great, no problems. |
Michael.L Send message Joined: 26 Nov 06 Posts: 5 Credit: 1,173 RAC: 0 |
08/03/2007 23:47:48|ralph@home|Unrecoverable error for result 1ywz_1_NMRREF_1_1ywz_1_idid_model_11IGNORE_THE_REST_idl_1831_8_0 ( - exit code -1073741819 (0xc0000005)) 08/03/2007 23:58:39|ralph@home|Unrecoverable error for result 1ywz_1_NMRREF_1_1ywz_1_idid_model_01IGNORE_THE_REST_idl_1831_2_2 ( - exit code -1073741819 (0xc0000005)) The above two ran for about 46 and 47 seconds each |
Michael.L Send message Joined: 26 Nov 06 Posts: 5 Credit: 1,173 RAC: 0 |
Sends feet1st my spare reading glasses. |
[B^S] sTrey Send message Joined: 15 Feb 06 Posts: 58 Credit: 15,430 RAC: 0 |
|
genes Send message Joined: 16 Feb 06 Posts: 45 Credit: 43,706 RAC: 20 |
|
Thomas Leibold Send message Joined: 25 Feb 07 Posts: 27 Credit: 77,464 RAC: 0 |
400678 400776 400748 On the same workunits that windows users are getting "exit code -1073741819 (0xc0000005)" and "- Unhandled Exception Record - Reason: Access Violation (0xc0000005)" I'm getting on Linux "process exited with code 131 (0x83)" and a segmentation violation: ERROR:: Unable to determine sequence length from pdb file # random seed: 2719130 SIGSEGV: segmentation violation Stack trace (13 frames): [0x8b95623] [0x8bb146c] [0xffffe420] [0x8857337] [0x861bad6] [0x8620366] [0x86222e3] [0x8973172] [0x8529c73] [0x8641c32] [0x8641cdc] [0x8c10a94] [0x8048111] Exiting... |
RickH Send message Joined: 10 Aug 06 Posts: 5 Credit: 7,260 RAC: 0 |
Just noticed WU ID 400842, DOC_1DFJ_R070309_pose_b_pert_fixbb_score12_1832_6 has been stuck doing no work for over an hour, while the RALPH science app repeatedly tries to directly connect to 207.46.212.122:80 for some reason. I don't know why a science app would be directly using the internet connection instead of letting BOINC handle the file transfers like usual, but it's not a good idea. My software firewall (Comodo) is set to require individual app-by-app approval for internet access (to prevent trojans from phoning home), and since rosetta_beta_5.50_windows_intelx86.exe is not on the approved list, it keeps denying the program's access and the app is apparently stuck spinning its wheels. Currently at 20+ access errors logged and counting, repeated every 4 minutes or so. I could of course approve the app for internet access, but that won't work long term, since soon enough it'll be upgraded to rosetta_beta_5.51... and the game will start all over again. Even if I wanted to do this, each app upgrade will result in hours of wasted CPU time waiting for me to notice each new version's approval request popups, along with filling up the firewall's access list with dozens of sequential app names. Blech. I guess I'll have to approve this one, since it's either that or abort it, but 5.51 needs to either not require direct internet access, or at least fail gracefully if it's denied. |
Message boards :
RALPH@home bug list :
Bug reports for 5.49-5.51
©2024 University of Washington
http://www.bakerlab.org