Report - Previously Unclassified Work Unit Errors

Message boards : RALPH@home bug list : Report - Previously Unclassified Work Unit Errors


Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 897 - Posted: 17 Mar 2006, 19:27:18 UTC
Last modified: 17 Mar 2006, 19:32:22 UTC

SIGSEGV: segmentation violation
Stack trace (11 frames):
https://ralph.bakerlab.org/result.php?resultid=18472
SIGSEGV: segmentation violation
Stack trace (11 frames):
https://ralph.bakerlab.org/result.php?resultid=18966
Exit status 0 (0x0)
https://ralph.bakerlab.org/result.php?resultid=19471
Rosetta_beta 4.84 Linux for all results above
Click signature for global team stats
ID: 897
hugothehermit
Joined: 17 Feb 06
Posts: 17
Credit: 2,170
RAC: 0
Message 902 - Posted: 18 Mar 2006, 6:25:48 UTC

This WU hasn't been doing anything (it's not stuck on 1%, it's stuck on 0%) for, I would guess, about 9 hours. I can't find when it started in the messages because I had a power outage.

stderr.txt
# random seed: 3985987
No heartbeat from core client for 31 sec - exiting


I would guess it never got around to exiting properly, as the other (HT) CPU is working away, no worries. It's probably just an error on my end.
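
For context on the stderr line above: the message suggests the science application tracks when it last heard from the BOINC core client and gives up once the silence passes a limit. A minimal Python sketch of that kind of watchdog follows. The 30-second limit is an assumption inferred from the "31 sec" in the message, and `HeartbeatWatchdog` and its methods are hypothetical names, not BOINC's actual API.

```python
import time


class HeartbeatWatchdog:
    """Tracks the time since the last heartbeat (illustrative, not BOINC code)."""

    def __init__(self, timeout_sec: float = 30.0, clock=time.monotonic):
        # timeout_sec=30.0 is an assumed limit, not a value confirmed by the thread.
        self.timeout_sec = timeout_sec
        self.clock = clock
        self.last_beat = clock()

    def beat(self) -> None:
        """Record that a heartbeat has just arrived."""
        self.last_beat = self.clock()

    def seconds_silent(self) -> float:
        """Return how long it has been since the last heartbeat."""
        return self.clock() - self.last_beat

    def expired(self) -> bool:
        """Return True once the silence exceeds the timeout."""
        return self.seconds_silent() > self.timeout_sec


# Simulate 31 seconds of silence with a fake clock instead of really waiting.
fake_now = [0.0]
watchdog = HeartbeatWatchdog(timeout_sec=30.0, clock=lambda: fake_now[0])
fake_now[0] = 31.0
if watchdog.expired():
    print(f"No heartbeat from core client for {watchdog.seconds_silent():.0f} sec - exiting")
```

If the process logs the message but never actually terminates, it would sit idle at its current progress, which would be consistent with the 0% hang described here.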







ID: 902
hugothehermit
Joined: 17 Feb 06
Posts: 17
Credit: 2,170
RAC: 0
Message 903 - Posted: 18 Mar 2006, 6:33:18 UTC

I just did a reboot and the WU is now working.
ID: 903
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 910 - Posted: 19 Mar 2006, 1:35:57 UTC

*** glibc detected *** corrupted double-linked list: 0x0894a300 ***
https://ralph.bakerlab.org/result.php?resultid=21152
Rosetta_beta 4.84 Linux

Click signature for global team stats
ID: 910
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 911 - Posted: 19 Mar 2006, 6:17:31 UTC

SIGSEGV: segmentation violation
Stack trace (11 frames):
https://ralph.bakerlab.org/result.php?resultid=19920
https://ralph.bakerlab.org/result.php?resultid=20503
Rosetta_beta 4.84 Linux


Click signature for global team stats
ID: 911
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 912 - Posted: 19 Mar 2006, 6:17:49 UTC

SIGSEGV: segmentation violation
Stack trace (11 frames):
https://ralph.bakerlab.org/result.php?resultid=19920
https://ralph.bakerlab.org/result.php?resultid=20503
Rosetta_beta 4.84 Linux

Click signature for global team stats
ID: 912
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 940 - Posted: 22 Mar 2006, 2:34:34 UTC
Last modified: 22 Mar 2006, 2:37:53 UTC


Click signature for global team stats
ID: 940
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 941 - Posted: 22 Mar 2006, 2:35:03 UTC

Exit status 2 (0x2)
https://ralph.bakerlab.org/result.php?resultid=49678
Rosetta_beta 4.85 Linux
Click signature for global team stats
ID: 941
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 948 - Posted: 22 Mar 2006, 7:34:19 UTC
Last modified: 22 Mar 2006, 7:38:40 UTC

stuck at 78.47%
https://ralph.bakerlab.org/result.php?resultid=49653
Rosetta_beta 4.85 Linux

load average: 0.00, 0.00, 0.17

*Restarting BOINC, the following message appears on the Linux console:
*** glibc detected *** double free or corruption (fasttop): 0x0914b110 ***

Click signature for global team stats
ID: 948
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 950 - Posted: 22 Mar 2006, 12:40:05 UTC
Last modified: 22 Mar 2006, 12:42:27 UTC

Exit status 131 (0x83)
*** glibc detected *** corrupted double-linked list: 0x0986c7e0 ***
SIGSEGV: segmentation violation
Stack trace (12 frames):

https://ralph.bakerlab.org/result.php?resultid=49639
Rosetta_beta 4.85 Linux

Click signature for global team stats
ID: 950
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 952 - Posted: 22 Mar 2006, 15:30:10 UTC
Last modified: 22 Mar 2006, 15:32:14 UTC

Exit status 139 (0x8b)
process got signal 11
SIGSEGV: segmentation violation
Stack trace (10 frames):

https://ralph.bakerlab.org/result.php?resultid=49708
Rosetta_beta 4.85 Linux
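
A note for readers decoding the status values in these Linux reports: 139 is 0x8b and 131 is 0x83, and under both common conventions (a raw wait status with the core-dump bit 0x80 set, or the shell's 128 + signal number) the low seven bits give the signal number. The Python sketch below assumes BOINC reports one of those two encodings, which this thread does not confirm. `decode_exit_status` and the small signal table are illustrative names, using Linux signal numbers.

```python
# Linux signal numbers; only a few that are relevant to crash reports.
LINUX_SIGNALS = {3: "SIGQUIT", 6: "SIGABRT", 11: "SIGSEGV"}


def decode_exit_status(status: int) -> str:
    """Describe an exit status, assuming values >= 128 encode a signal."""
    if status < 128:
        return f"exit code {status}"
    signum = status & 0x7F  # low seven bits hold the signal number
    name = LINUX_SIGNALS.get(signum, f"signal {signum}")
    return f"killed by {name} (signal {signum})"


# Status values reported in this thread.
for status in (0, 2, 131, 139):
    print(f"{status} (0x{status:x}): {decode_exit_status(status)}")
```

Under this assumption 139 maps to signal 11, matching the "process got signal 11" line, while 131 maps to signal 3 (SIGQUIT) even though the accompanying stderr mentions SIGSEGV, so the mapping is best treated as a hint.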


Click signature for global team stats
ID: 952
Snake Doctor
Joined: 16 Feb 06
Posts: 37
Credit: 998,880
RAC: 0
Message 957 - Posted: 22 Mar 2006, 22:54:28 UTC

Every RALPH WU that has hit my system since the release of Mac version 4.86 has crashed. Up till now I had only seen one WU fail. The errors say that BOINC library 5.2.27 was used to compile the application. Like this one here


I am running BOINC 5.1.13, which is the current release version. Also, some of the errors are for a missing file. Such as this one here

Whatever was changed for version 4.86 on the Mac is clearly not working.

Regards
Phil

ID: 957
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 1006 - Posted: 28 Mar 2006, 11:08:38 UTC

Rosetta_beta 4.87 Linux
Exit status 131 (0x83) SIGSEGV
https://ralph.bakerlab.org/result.php?resultid=68531
https://ralph.bakerlab.org/result.php?resultid=68618


Click signature for global team stats
ID: 1006
casio7131
Joined: 20 Mar 06
Posts: 15
Credit: 12,660
RAC: 0
Message 1044 - Posted: 8 Apr 2006, 4:44:50 UTC
Last modified: 8 Apr 2006, 4:48:32 UTC

8/04/2006 2:22:05 PM|ralph@home|Unrecoverable error for result HBLR_1.0_2tif_375_18_0 ( - exit code -1073741819 (0xc0000005))
resultid=79533
died after about 40 min

8/04/2006 2:33:01 PM|ralph@home|Unrecoverable error for result HBLR_1.0_1b72_375_87_0 ( - exit code -1073741819 (0xc0000005))
resultid=80087
died after about 11 min

Note: I've had similar problems with these HB work units in Rosetta too:
(see https://boinc.bakerlab.org/rosetta/forum_thread.php?id=1106#13206)

8/04/2006 1:44:51 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1hz6_420_4766_0 ( - exit code -1073741811 (0xc000000d))
resultid=16362541
3/04/2006 11:38:45 PM|rosetta@home|Unrecoverable error for result HB_BARCODE_30_4ubpA_351_49332_0 ( - exit code -1073741811 (0xc000000d))
resultid=15780509
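
The negative exit codes in these Windows reports are 32-bit NTSTATUS values printed as signed integers. A small Python sketch of the conversion follows. The two status names come from Microsoft's NTSTATUS documentation, while `to_unsigned32`, `describe_exit_code` and the lookup table are illustrative names that cover only the codes seen in this thread.

```python
# NTSTATUS values seen in this thread, with their documented names.
NTSTATUS_NAMES = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC000000D: "STATUS_INVALID_PARAMETER",
}


def to_unsigned32(code: int) -> int:
    """Reinterpret a signed (or sign-extended) exit code as unsigned 32-bit."""
    return code & 0xFFFFFFFF


def describe_exit_code(code: int) -> str:
    """Return the hex form of an exit code plus its NTSTATUS name, if known."""
    value = to_unsigned32(code)
    name = NTSTATUS_NAMES.get(value, "unknown")
    return f"0x{value:08x} {name}"


print(describe_exit_code(-1073741819))
print(describe_exit_code(-1073741811))
```

The same mask also handles a 64-bit sign-extended form such as 0xffffffffc0000005, which reduces to 0xc0000005.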

ID: 1044
casio7131
Joined: 20 Mar 06
Posts: 15
Credit: 12,660
RAC: 0
Message 1045 - Posted: 8 Apr 2006, 5:25:59 UTC
Last modified: 8 Apr 2006, 5:26:33 UTC

And now another HB failure.

8/04/2006 3:38:24 PM|ralph@home|Unrecoverable error for result HBLR_1.0_1di2_375_67_0 ( - exit code -1073741819 (0xc0000005))

https://ralph.bakerlab.org/result.php?resultid=79922
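
For anyone collecting these failures from a BOINC message log, the lines quoted above share a pipe-separated layout that can be picked apart with a regular expression. This is a rough Python sketch based only on the lines quoted in this thread. `parse_error_line` and the field names are hypothetical, and the timestamp is left as a string because its format depends on the machine's locale.

```python
import re

# Pattern derived from the "Unrecoverable error" lines quoted in this thread.
LOG_LINE = re.compile(
    r"^(?P<timestamp>[^|]+)\|(?P<project>[^|]+)\|"
    r"Unrecoverable error for result (?P<result>\S+) "
    r"\( - exit code (?P<code>-?\d+) \((?P<hex>0x[0-9a-fA-F]+)\)\)$"
)


def parse_error_line(line: str):
    """Return a dict of fields from one error line, or None if it does not match."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    return {
        "timestamp": match.group("timestamp"),  # kept as text (locale-dependent)
        "project": match.group("project"),
        "result": match.group("result"),
        "exit_code": int(match.group("code")),
        "exit_hex": match.group("hex"),
    }


sample = (
    "8/04/2006 3:38:24 PM|ralph@home|Unrecoverable error for result "
    "HBLR_1.0_1di2_375_67_0 ( - exit code -1073741819 (0xc0000005))"
)
print(parse_error_line(sample))
```

Grouping the parsed results by the prefix of the result name (for example HBLR) would make it easier to see whether one work unit family accounts for most of the failures.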
ID: 1045
Nuadormrac
Joined: 22 Feb 06
Posts: 68
Credit: 11,362
RAC: 0
Message 1055 - Posted: 9 Apr 2006, 7:23:34 UTC
Last modified: 9 Apr 2006, 7:24:05 UTC

OK, this was a WU that was stuck at about 19%, so it does not classify as a 1% hang. It was also the newer type (not the HB...) 7449_largescale*. All other units of this type that I've received so far have completed successfully and without incident, so this seems to be a lone one.

What it did was get up to model 2, step 0, and then it just sat there, hung, without any progress at all. It ran for longer than some of the longest-running RALPH units I have had so far, and it exhibited one other oddity. On the accepted energy graph on the left-hand side of the screen, the plot was for the most part a complete blur, with no line or data points visible at all. I've never seen the graph become a blurred, washed-out mess like that before, and it didn't seem normal.

I let it run a bit longer, so that its run time went beyond about the longest run times of the other units that completed successfully. The thing was just searching wildly, and the graph portion was blurring into an indistinguishable mess, so I aborted it then. Here's the result:

https://ralph.bakerlab.org/result.php?resultid=82276
ID: 1055
Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 1064 - Posted: 9 Apr 2006, 20:04:30 UTC

Son Goku

What do you think about this URL for a forum signature?
http://www.boincstats.com/signature/user_42702_banner.gif

Thanks
Click signature for global team stats
ID: 1064
KB7RZF
Joined: 16 Feb 06
Posts: 7
Credit: 1,426
RAC: 0
Message 1069 - Posted: 10 Apr 2006, 3:00:50 UTC

Posted this in the Bug reporting thread for 4.97 and above. But I'll post it here too.

This WU shows this error below:

Result ID: 80333
Name: HBLR_1.0_2tif_375_118_0
Workunit: 75272
Created: 7 Apr 2006 22:20:57 UTC
Sent: 8 Apr 2006 7:05:46 UTC
Received: 9 Apr 2006 23:58:50 UTC
Server state: Over
Outcome: Client error
Client state: Computing
Exit status: -1073741819 (0xffffffffc0000005)
Computer ID: 65
Report deadline: 22 Apr 2006 7:05:46 UTC
CPU time: 2920.796875
stderr out:
<core_client_version>5.3.12.tx36</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# random seed: 3893951
# cpu_run_time_pref: 7200

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x0EAFFDA4


Dump of the Worker(offending) thread:
1: 04/09/06 16:58:34
1: SymGetLineFromAddr(): GetLastError = 126


Dump of the Timer thread:
2: 04/09/06 16:58:34


Dump of the Graphics thread:
3: 04/09/06 16:58:34


Exiting...

</stderr_txt>


Validate state: Invalid
Claimed credit: 13.8767261204283
Granted credit: 0
application version: 4.97
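
The "Reason:" line in the stderr output above carries the most useful detail for comparing crashes: the exception code, the faulting instruction address, and the address that was being read. A hedged Python sketch for extracting those fields follows. The pattern is modelled on the single line shown in this post, `parse_reason` is a hypothetical helper, and the `write` alternative is an assumption, since only a `read` attempt appears in this thread.

```python
import re

# Modelled on the one "Reason:" line in this post; "write" is an assumed variant.
REASON = re.compile(
    r"Reason: (?P<kind>.+?) \((?P<code>0x[0-9a-fA-F]+)\) "
    r"at address (?P<pc>0x[0-9a-fA-F]+) "
    r"(?P<access>read|write) attempt to address (?P<target>0x[0-9a-fA-F]+)"
)


def parse_reason(stderr_text: str):
    """Extract the exception details from stderr text, or return None."""
    match = REASON.search(stderr_text)
    if match is None:
        return None
    return {
        "kind": match.group("kind"),
        "code": int(match.group("code"), 16),
        "instruction_address": int(match.group("pc"), 16),
        "access": match.group("access"),
        "target_address": int(match.group("target"), 16),
    }


stderr_sample = (
    "***UNHANDLED EXCEPTION****\n"
    "Reason: Access Violation (0xc0000005) at address 0x00599FF4 "
    "read attempt to address 0x0EAFFDA4\n"
)
info = parse_reason(stderr_sample)
print(info["kind"], hex(info["instruction_address"]), info["access"])
```

If several failed results report the same instruction address, that would hint at a single bug in the application rather than unrelated machine problems.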

ID: 1069



©2024 University of Washington
http://www.bakerlab.org