Message boards : RALPH@home bug list : Report - Previously Unclassified Work Unit Errors
Author | Message |
---|---|
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
SIGSEGV: segmentation violation Stack trace (11 frames): https://ralph.bakerlab.org/result.php?resultid=18472 SIGSEGV: segmentation violation Stack trace (11 frames): https://ralph.bakerlab.org/result.php?resultid=18966 Exit status 0 (0x0) https://ralph.bakerlab.org/result.php?resultid=19471 Rosetta_beta 4.84 Linux for all results above |
hugothehermit Send message Joined: 17 Feb 06 Posts: 17 Credit: 2,170 RAC: 0 |
This WU hasn't been doing anything (it's not stuck on 1%, it's stuck on 0%) for, I would guess, about 9 hours. I can't find when it started in the messages, as I had a power outage. stderr.txt: # random seed: 3985987 No heartbeat from core client for 31 sec - exiting. I would guess it never got around to properly exiting, as the other (HT) CPU is working away, no worries. It's probably just an error on my end. |
hugothehermit Send message Joined: 17 Feb 06 Posts: 17 Credit: 2,170 RAC: 0 |
I just did a reboot and the WU is now working. |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
*** glibc detected *** corrupted double-linked list: 0x0894a300 *** https://ralph.bakerlab.org/result.php?resultid=21152 Rosetta_beta 4.84 Linux |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
SIGSEGV: segmentation violation Stack trace (11 frames): https://ralph.bakerlab.org/result.php?resultid=19920 https://ralph.bakerlab.org/result.php?resultid=20503 Rosetta_beta 4.84 Linux |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
Exit status 2 (0x2) https://ralph.bakerlab.org/result.php?resultid=49678 Rosetta_beta 4.85 Linux |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
Stuck at 78.47%: https://ralph.bakerlab.org/result.php?resultid=49653 Rosetta_beta 4.85 Linux. Load average: 0.00, 0.00, 0.17. On restarting BOINC, the following message appears on the Linux console: *** glibc detected *** double free or corruption (fasttop): 0x0914b110 *** |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
Exit status 131 (0x83) *** glibc detected *** corrupted double-linked list: 0x0986c7e0 *** SIGSEGV: segmentation violation Stack trace (12 frames): https://ralph.bakerlab.org/result.php?resultid=49639 Rosetta_beta 4.85 Linux |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
Exit status 139 (0x8b) process got signal 11 SIGSEGV: segmentation violation Stack trace (10 frames): https://ralph.bakerlab.org/result.php?resultid=49708 Rosetta_beta 4.85 Linux |
Snake Doctor Send message Joined: 16 Feb 06 Posts: 37 Credit: 998,880 RAC: 0 |
Every RALPH WU that has hit my system since the release of Mac version 4.86 has crashed. Up till now I had only seen one WU fail. The errors say that BOINC library 5.2.27 was used to compile the application, like this one here. I am running BOINC 5.1.13, which is the current release version. Some of the errors are also for a missing file, such as this one here. Whatever was changed for version 4.86 on the Mac is clearly not working. Regards, Phil |
Carlos_Pfitzner Send message Joined: 16 Feb 06 Posts: 182 Credit: 22,792 RAC: 0 |
Rosetta_beta 4.87 Linux Exit status 131 (0x83) SIGSEGV https://ralph.bakerlab.org/result.php?resultid=68531 https://ralph.bakerlab.org/result.php?resultid=68618 |
casio7131 Send message Joined: 20 Mar 06 Posts: 15 Credit: 12,660 RAC: 0 |
8/04/2006 2:22:05 PM|ralph@home|Unrecoverable error for result HBLR_1.0_2tif_375_18_0 ( - exit code -1073741819 (0xc0000005)) resultid=79533 died after about 40 min. 8/04/2006 2:33:01 PM|ralph@home|Unrecoverable error for result HBLR_1.0_1b72_375_87_0 ( - exit code -1073741819 (0xc0000005)) resultid=80087 died after about 11 min. Note: I've had similar problems with these HB work units in Rosetta too (see https://boinc.bakerlab.org/rosetta/forum_thread.php?id=1106#13206). |
casio7131 Send message Joined: 20 Mar 06 Posts: 15 Credit: 12,660 RAC: 0 |
And now another HB failure: 8/04/2006 3:38:24 PM|ralph@home|Unrecoverable error for result HBLR_1.0_1di2_375_67_0 ( - exit code -1073741819 (0xc0000005)) https://ralph.bakerlab.org/result.php?resultid=79922 |
Nuadormrac Send message Joined: 22 Feb 06 Posts: 68 Credit: 11,362 RAC: 0 |
OK, this was a WU that was stuck at about 19%, so it does not classify as a 1% hang. It was also the newer type (not the HB...): 7449_largescale*. All other units of this type that I've received so far have completed successfully and without incident, so this seems to be a lone one. It got up to model 2, step 0, and then just sat there, hung, without any progress at all. It ran for longer than some of the longest-running RALPH units I've had so far, but exhibited one other oddity. On the accepted energy graph on the left-hand side of the screen, the thing was a complete blur for the most part, with no line or data points in the least bit visible. I've never seen the graph become a blurred, washed-out mess like that, and it didn't seem normal. I let it run a bit longer (so that its run time went beyond about the longest run times of other units that completed successfully). The thing was just searching wildly, and the graph portion was blurring into an indistinguishable mess... I aborted then. Here's the result: https://ralph.bakerlab.org/result.php?resultid=82276 |
KB7RZF Send message Joined: 16 Feb 06 Posts: 7 Credit: 1,426 RAC: 0 |
Posted this in the bug reporting thread for 4.97 and above, but I'll post it here too. This WU shows the error below. Result ID: 80333; Name: HBLR_1.0_2tif_375_118_0; Workunit: 75272; Created: 7 Apr 2006 22:20:57 UTC; Sent: 8 Apr 2006 7:05:46 UTC; Received: 9 Apr 2006 23:58:50 UTC; Server state: Over; Outcome: Client error; Client state: Computing; Exit status: -1073741819 (0xffffffffc0000005); Computer ID: 65; Report deadline: 22 Apr 2006 7:05:46 UTC; CPU time: 2920.796875; stderr out: <core_client_version>5.3.12.tx36</core_client_version> <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> # random seed: 3893951 # cpu_run_time_pref: 7200 ***UNHANDLED EXCEPTION**** Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x0EAFFDA4 Dump of the Worker(offending) thread: 1: 04/09/06 16:58:34 1: SymGetLineFromAddr(): GetLastError = 126 Dump of the Timer thread: 2: 04/09/06 16:58:34 Dump of the Graphics thread: 3: 04/09/06 16:58:34 Exiting... </stderr_txt>; Validate state: Invalid; Claimed credit: 13.8767261204283; Granted credit: 0; Application version: 4.97 |
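
For anyone trying to read the exit statuses quoted in the reports above, here is a rough Python sketch of how they might be decoded. It is not part of BOINC or Rosetta; `decode_exit_status` and `LINUX_SIGNALS` are hypothetical names made up for illustration. It assumes the Linux statuses follow the common "128 + signal number" convention (the 139 report, which also says "process got signal 11", is consistent with that, but the client may report some statuses differently), and that the Windows value is the 32-bit access violation code named in the stderr output above.

```python
# Hypothetical helper, not a BOINC API: a best-effort reading of the
# exit statuses quoted in this thread.

# Assumption: standard Linux x86 signal numbers for the cases seen here.
LINUX_SIGNALS = {3: "SIGQUIT", 6: "SIGABRT", 11: "SIGSEGV"}

# Windows code named "Access Violation (0xc0000005)" in the stderr output.
WINDOWS_ACCESS_VIOLATION = 0xC0000005


def decode_exit_status(status: int) -> str:
    """Return a short human-readable guess for an exit status."""
    # The result pages show the Windows code as a signed decimal
    # (-1073741819) or sign-extended to 64 bits (0xffffffffc0000005);
    # masking to 32 bits maps both forms back to 0xc0000005.
    unsigned32 = status & 0xFFFFFFFF
    if unsigned32 == WINDOWS_ACCESS_VIOLATION:
        return "Windows access violation (0xc0000005)"

    # Assumed convention on Linux: 128 + N means "ended by signal N".
    if 128 < status < 256:
        signum = status - 128
        name = LINUX_SIGNALS.get(signum, "unknown signal")
        return f"128 + {signum}: {name}"

    # Anything else is reported as a plain exit status.
    return f"exit status {status} (0x{unsigned32:x})"


if __name__ == "__main__":
    for code in (0, 2, 131, 139, -1073741819, 0xFFFFFFFFC0000005):
        print(code, "->", decode_exit_status(code))
```

Read this way, the two spellings of the Windows code in the last report are the same value, and a status of 139 lines up with the SIGSEGV messages in the Linux reports. The reading of 131 as signal 3 is only what the assumed convention gives; the reports that show 131 also print SIGSEGV, so treat that one with caution.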
©2024 University of Washington
http://www.bakerlab.org