Report - Previously Unclassified Work Unit Errors

Message boards : RALPH@home bug list : Report - Previously Unclassified Work Unit Errors

STE\/E
Joined: 16 Feb 06
Posts: 27
Credit: 2,226,442
RAC: 783
Message 623 - Posted: 25 Feb 2006, 14:11:16 UTC

Those WUs should have been aborted, Tony. I see you still have 1 WU on host 103 that will error out too (BARCODE_30_1iibA_219_2) if you run it. It already says Canceled on the WU ID page ...
ID: 623

Astro
Joined: 16 Feb 06
Posts: 141
Credit: 32,977
RAC: 0
Message 624 - Posted: 25 Feb 2006, 14:33:52 UTC - in response to Message 623.  
Last modified: 25 Feb 2006, 14:35:34 UTC

Those WUs should have been aborted, Tony. I see you still have 1 WU on host 103 that will error out too (BARCODE_30_1iibA_219_2) if you run it. It already says Canceled on the WU ID page ...

I thought it was abort earlier than 4.87, not earlier than or equal to? OK, off I go to abort them.

Thanks Poorboy with many machines.

I just looked and it says "February 24, 2006
Correction Please abort work units on Windows machines that are currently running versions earlier than 4.87. "

ID: 624

STE\/E
Joined: 16 Feb 06
Posts: 27
Credit: 2,226,442
RAC: 783
Message 625 - Posted: 25 Feb 2006, 14:37:37 UTC - in response to Message 624.  

Those WUs should have been aborted, Tony. I see you still have 1 WU on host 103 that will error out too (BARCODE_30_1iibA_219_2) if you run it. It already says Canceled on the WU ID page ...

I thought it was abort earlier than 4.87, not earlier than or equal to? OK, off I go to abort them.

Thanks Poorboy with many machines.


No, the ones they wanted aborted were the v4.87 that showed they were Canceled already in the WU ID & any v4.86 or earlier that you still had ... :)
ID: 625

Astro
Joined: 16 Feb 06
Posts: 141
Credit: 32,977
RAC: 0
Message 627 - Posted: 25 Feb 2006, 14:40:14 UTC
Last modified: 25 Feb 2006, 14:40:55 UTC

I went to my results and see several that "comp errored" out. Hmmm, must have happened while I slept (strange, I don't remember sleeping that much, short nap here, short nap there, that's about it for me). LOL
ID: 627

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 655 - Posted: 25 Feb 2006, 21:39:26 UTC
Last modified: 25 Feb 2006, 22:00:03 UTC

Access Violation - Rosetta_beta_4.89
https://ralph.bakerlab.org/result.php?resultid=10919
*PC undisturbed when this occurred - I was out for about 1 hour ...
*no one else has access to this pc
ID: 655

Nuadormrac
Joined: 22 Feb 06
Posts: 68
Credit: 11,362
RAC: 0
Message 662 - Posted: 25 Feb 2006, 22:53:21 UTC
Last modified: 25 Feb 2006, 22:55:34 UTC

Maximum disk usage space exceeded error, in 4.89 result

https://ralph.bakerlab.org/result.php?resultid=10997

stderr out

<core_client_version>5.2.13</core_client_version>
<message>Maximum disk usage exceeded
</message>
<stderr_txt>
# random seed: 3989828
# cpu_run_time_pref: 28800

Comp was running when I was in bed last night, and when I got up to do some things. Errored out when I was away from it. Computer behind locked door (in my apartment) and no one else has access...

The only other thing going on is that I'm downloading a torrent via Azureus, albeit the download is being written to a separate (external) hard drive on a separate controller (USB for the external drive vs. a SCSI adapter for the system/BOINC drive), which neither BOINC nor the operating system itself is installed on or run from...

Any files that are needed for this one?
ID: 662

Marky-UK
Joined: 16 Feb 06
Posts: 5
Credit: 1,530
RAC: 0
Message 725 - Posted: 28 Feb 2006, 8:07:34 UTC
Last modified: 28 Feb 2006, 8:07:56 UTC

WU 11498 failed after 44 seconds for me. It looks like it also failed after 83 seconds for someone else.

<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
ID: 725

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 733 - Posted: 28 Feb 2006, 12:37:46 UTC
Last modified: 28 Feb 2006, 12:48:06 UTC

Access Violation Rosetta_beta_4.89
https://ralph.bakerlab.org/result.php?resultid=12011

Before that access violation occurred ...

Date Host Project ID Message
2/28/2006 9:01:40 AM carlos.cp3 ralph@home 21 Aborting result HOMSb7_homDB030_1b72__226_4_0: exceeded disk limit: 200022677.000000 > 200000000.000000
2/28/2006 9:01:40 AM carlos.cp3 ralph@home 22 Unrecoverable error for result HOMSb7_homDB030_1b72__226_4_0 (Maximum disk usage exceeded)
2/28/2006 9:01:41 AM carlos.cp3 --- 23 request_reschedule_cpus: process exited
2/28/2006 9:01:41 AM carlos.cp3 ralph@home 24 Computation for result HOMSb7_homDB030_1b72__226_4_0 finished
2/28/2006 9:01:41 AM carlos.cp3 ralph@home 25 Starting result HOMSb7_homDB001_1b72__226_5_0 using rosetta_beta version 489
2/28/2006 9:02:41 AM carlos.cp3 ralph@home 26 Sending scheduler request to https://ralph.bakerlab.org/ralph_cgi/cgi
2/28/2006 9:02:41 AM carlos.cp3 ralph@home 27 Reason: To report results
2/28/2006 9:02:41 AM carlos.cp3 ralph@home 28 Reporting 1 results
2/28/2006 9:03:02 AM carlos.cp3 ralph@home 29 Scheduler request to https://ralph.bakerlab.org/ralph_cgi/cgi succeeded
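
A note on reading the log above: both numbers in the "exceeded disk limit" message are bytes, so the result went past a 200,000,000-byte limit by about 22 KB. The message text suggests this is a limit attached to the result itself rather than to free disk space, though that reading is an assumption here. A minimal Python sketch that pulls the two numbers out of the message; `disk_overage` is a hypothetical helper written for this post, not part of BOINC:

```python
import re

# Log message copied from the post above.
LINE = ("Aborting result HOMSb7_homDB030_1b72__226_4_0: "
        "exceeded disk limit: 200022677.000000 > 200000000.000000")

# Hypothetical pattern: matches the "used > limit" part of the message.
PATTERN = re.compile(r"exceeded disk limit: ([\d.]+) > ([\d.]+)")

def disk_overage(message):
    """Return (used, limit, overage) in bytes, or None if the text does not match."""
    match = PATTERN.search(message)
    if match is None:
        return None
    used, limit = (int(float(value)) for value in match.groups())
    return used, limit, used - limit

print(disk_overage(LINE))
```

The third value is how far over the limit the result was when the client aborted it.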

I have more than 2 GB of free disk space,
and my preferences for disk usage are set high enough to support a GB result file. Oh! ???
Why not finish the WU when it hits your size limit, instead of aborting it?


ID: 733

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 735 - Posted: 28 Feb 2006, 13:09:12 UTC
Last modified: 28 Feb 2006, 13:11:42 UTC

Exit status -1073741819 (0xffffffffc0000005) Rosetta_Beta_4.89
https://ralph.bakerlab.org/result.php?resultid=12012
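
For readers comparing this with the "Access Violation" reports earlier in the thread: the negative exit status is the same 32-bit value as the hex shown next to it, and its low 32 bits are 0xC0000005, the Windows status code for an access violation. A short Python sketch of the conversion; `as_unsigned32` is a hypothetical helper name:

```python
# Windows NTSTATUS value for an access violation.
STATUS_ACCESS_VIOLATION = 0xC0000005

def as_unsigned32(exit_status):
    """Reinterpret a signed exit status as an unsigned 32-bit value."""
    return exit_status & 0xFFFFFFFF

code = as_unsigned32(-1073741819)
print(hex(code))
print(code == STATUS_ACCESS_VIOLATION)
```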

I see in other threads that we are testing 4.90 !?

Why am I not ???

This WU was sent to me today
and when I try refreshing the project I get

Date Host Project ID Message
2/28/2006 10:12:29 AM carlos.cp3 ralph@home 81 No work from project

Thus, I am assuming that there are only 4.89 WUs -- at least for me -:(
Now I'll play a game - NO WUs to test -:(

ID: 735

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 737 - Posted: 28 Feb 2006, 13:33:48 UTC

stuck at 80.41% Rosetta_beta_4.84 Linux
https://ralph.bakerlab.org/result.php?resultid=13007

not using CPU -> load average: 0.00, 0.00, 0.07

crobertp [/home/boinc/BOINC] > ps xu
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
boinc 27682 0.0 0.4 2616 1036 ? SN Feb17 0:00 /bin/bash ./yasuc.sh
boinc 23482 0.0 1.5 5868 3740 ? S 05:58 0:02 ./boinc -redirectio -allow_remote_gui_rpc -return_results_imme
boinc 23621 44.8 27.4 172164 68060 ? SN 06:35 108:08 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 23622 0.0 27.4 172164 68060 ? SN 06:35 0:00 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 23623 0.0 27.4 172164 68060 ? SN 06:35 0:00 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 24075 0.0 0.9 7180 2356 ? S 08:42 0:00 /usr/sbin/sshd
boinc 24076 0.0 0.9 3476 2336 pts/4 S 08:42 0:00 -bash
boinc 24567 0.0 0.2 2084 624 ? SN 10:27 0:00 sleep 600
boinc 24603 0.0 0.2 2548 672 pts/4 R 10:36 0:00 ps xu
crobertp [/home/boinc/BOINC] >

Restarting boinc ...
ID: 737

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 740 - Posted: 28 Feb 2006, 15:19:58 UTC - in response to Message 737.  

Update - after the boinc restart the WU ran for some time, and is now stuck at 85.48%.
This message appeared on my remote ssh Linux console:
crobertp [/home/boinc/BOINC] > *** glibc detected *** malloc(): memory corruption: 0x40237a45 ***

crobertp [/home/boinc/BOINC] > w
12:23pm up 14 days, 19:04, 2 users, load average: 0.00, 0.00, 0.09
USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
saigam pts/0 matrix.cp3 Mon10pm 2:24m 0.13s 0.13s -bash
boinc pts/4 200.149.209.246 8:42am 0.00s 8:29 0.01s w
crobertp [/home/boinc/BOINC] > ps xu
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
boinc 27682 0.0 0.4 2616 1036 ? SN Feb17 0:00 /bin/bash ./yasuc.sh
boinc 24075 0.0 0.9 7220 2380 ? S 08:42 0:00 /usr/sbin/sshd
boinc 24076 0.0 0.9 3476 2340 pts/4 S 08:42 0:00 -bash
boinc 24631 0.0 1.4 5732 3544 pts/4 S 10:45 0:00 ./boinc -redirectio -allow_remote_gui_rpc -return_results_imme
boinc 24875 20.8 24.5 118684 60992 pts/4 SN 11:43 8:28 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 24876 0.0 24.5 118684 60992 pts/4 SN 11:43 0:00 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 24877 0.0 24.5 118684 60992 pts/4 SN 11:43 0:00 rosetta_beta_4.84_i686-pc-linux-gnu xx 1dcj _ -abrelax -string
boinc 24990 0.0 0.2 2084 624 ? SN 12:17 0:00 sleep 600
boinc 25012 0.0 0.2 2548 672 pts/4 R 12:24 0:00 ps xu
crobertp [/home/boinc/BOINC] >

Restarting boinc again ...

BTW: When will the Linux app be statically linked?
-or-
maybe the purpose is to keep anyone who owns a Linux PC w/ Kernel 2.4.x from
crunching rosetta?


ID: 740

Spare_Cycles
Joined: 16 Feb 06
Posts: 17
Credit: 12,942
RAC: 0
Message 747 - Posted: 28 Feb 2006, 17:33:15 UTC - in response to Message 740.  

This message appeared on my remote ssh Linux console:
crobertp [/home/boinc/BOINC] > *** glibc detected *** malloc(): memory corruption: 0x40237a45 ***


Have you checked this computer with programs like memtest86 and Super Pi?

maybe the purpose is to keep anyone who owns a Linux PC w/ Kernel 2.4.x from
crunching rosetta?


I have five Linux PC Kernel 2.4.x computers running RALPH. So far they have completed all WUs successfully except for a few that nobody seems able to crunch.
ID: 747

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 749 - Posted: 28 Feb 2006, 21:47:57 UTC
Last modified: 28 Feb 2006, 21:49:23 UTC

SIGSEGV11 Rosetta_Beta_4.84 Linux
https://ralph.bakerlab.org/result.php?resultid=13093
ID: 749

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 750 - Posted: 28 Feb 2006, 21:54:21 UTC

process got signal 11 Rosetta_beta_4.84 Linux
https://ralph.bakerlab.org/result.php?resultid=13267
ID: 750

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 751 - Posted: 28 Feb 2006, 21:59:38 UTC

Exit status 139 (0x8b) Rosetta_beta_4.84 Linux
https://ralph.bakerlab.org/result.php?resultid=12969
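
This one looks like the same failure as the two "signal 11" reports just above. On Linux, a process killed by signal N is commonly reported with exit status 128 + N, and 139 - 128 = 11, which is SIGSEGV. That the client follows this convention is an assumption, but the numbers line up. A small Python sketch; `signal_from_exit_status` is a hypothetical helper name:

```python
import signal

def signal_from_exit_status(status):
    """Return the signal number if status follows the 128+N convention, else None."""
    if status > 128:
        return status - 128
    return None

signum = signal_from_exit_status(139)
print(signum)
print(signum == signal.SIGSEGV)
```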
ID: 751

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 752 - Posted: 28 Feb 2006, 22:06:34 UTC

pure virtual method called Rosetta_beta_4.84 Linux
https://ralph.bakerlab.org/result.php?resultid=9280

ID: 752

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 754 - Posted: 28 Feb 2006, 22:47:15 UTC - in response to Message 747.  
Last modified: 28 Feb 2006, 22:49:09 UTC

This message appeared on my remote ssh Linux console:
crobertp [/home/boinc/BOINC] > *** glibc detected *** malloc(): memory corruption: 0x40237a45 ***


Have you checked this computer with programs like memtest86 and Super Pi?

maybe the purpose is to keep anyone who owns a Linux PC w/ Kernel 2.4.x from
crunching rosetta?


I have five Linux PC Kernel 2.4.x computers running RALPH. So far they have completed all WUs successfully except for a few that nobody seems able to crunch.


Yes, I looked at a few of your results, and they seem OK (randomly chosen results)

Maybe I used the wrong wording ... English is not my native language.
What I really mean is that it is possible to have the same Kernel 2.4.x
built with the original libc.so.6 -or- a newer libc.so.6 -:)

*It all depends on the date you bought your Linux CD, or the magazine containing it for free.

My system is commercial, and was built some years ago ...
These are the libc I am using ...
crobertp [/home/boinc/BOINC] > ls /lib/libc* -lha
-rw-r--r-- 1 root root 1.2M Oct 13 2004 /lib/libc-2.3.2.so
lrwxrwxrwx 1 root root 13 Oct 18 2004 /lib/libc.so.6 -> libc-2.3.2.so
lrwxrwxrwx 1 root root 14 May 3 2003 /lib/libcap.so.1 -> libcap.so.1.10
-rw-r--r-- 1 root root 9.2k Jan 31 2003 /lib/libcap.so.1.10
lrwxrwxrwx 1 root root 17 May 3 2003 /lib/libcom_err.so.2 -> libcom_err.so.2.0
-rw-r--r-- 1 root root 5.3k Jan 6 2003 /lib/libcom_err.so.2.0
-rw-r--r-- 1 root root 18k Oct 13 2004 /lib/libcrypt-2.3.2.so
lrwxrwxrwx 1 root root 17 Oct 18 2004 /lib/libcrypt.so.1 -> libcrypt-2.3.2.so
crobertp [/home/boinc/BOINC] >
note the year of 2003 - in almost all of them ...
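
For comparing machines, the useful part of the listing above is the version embedded in the file that `libc.so.6` points to (here `libc-2.3.2.so`, so glibc 2.3.2). A minimal Python sketch for turning that file name into a comparable version tuple; `glibc_version_from_filename` is a hypothetical helper and assumes the `libc-X.Y.Z.so` naming seen in the listing:

```python
import re

def glibc_version_from_filename(name):
    """Parse 'libc-2.3.2.so' into (2, 3, 2); return None if the name does not match."""
    match = re.fullmatch(r"libc-(\d+(?:\.\d+)*)\.so", name)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))

print(glibc_version_from_filename("libc-2.3.2.so"))
```

On a live system, Python's `platform.libc_ver()` reports the C library the interpreter itself was linked against, which can serve as a quick cross-check.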

*It is very hard to update libc.so.6 - better to build a new system
to replace the current one, which I cannot do for now.
*Lack of spare hardware resources and time.

My conclusion: IF the executable cannot be statically linked to avoid these
libc.so.6 incompatibilities, I will have to find another project to crunch.

*In fact rosetta worked well on this PC for some time ...
Then came a rosetta for Linux version update ... (compiled with a newer libc)
and the problems started ...

Oh well! At least simap is working on this PC without any errors,
so, you see, it is not difficult to find another project ... perhaps others too,
*only I have not yet tested them all - what works vs what does not work
Thanks
ID: 754

Spare_Cycles
Joined: 16 Feb 06
Posts: 17
Credit: 12,942
RAC: 0
Message 759 - Posted: 1 Mar 2006, 1:36:16 UTC
Last modified: 1 Mar 2006, 1:37:26 UTC

built with the original libc.so.6 -or- a newer libc.so.6 -:)

My Gentoo installs would be the newer libc.so.6. The "2.4.20-4GB-athlon" install is SuSE 8.2, and most of the libs are dated March 13 2003 and seem to be version 2.3.2.

I don't know why these would work but not yours.
ID: 759

Mistral
Joined: 16 Feb 06
Posts: 1
Credit: 352
RAC: 0
Message 772 - Posted: 1 Mar 2006, 17:49:40 UTC - in response to Message 725.  
Last modified: 1 Mar 2006, 17:51:01 UTC

Same (0x1) - exit code 1 (0x1) error in WU 11509

WU's name: cap_24_fullatom_cap4_dec00_10_231_4_1

stderr out:

<core_client_version>5.2.13</core_client_version>
<message>Fonction incorrecte. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>

</stderr_txt>

Win 2k SP4, application version = 4.90, all presets = default
ID: 772

Carlos_Pfitzner
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 784 - Posted: 2 Mar 2006, 13:06:42 UTC
Last modified: 2 Mar 2006, 13:11:38 UTC

MacOS Error -43 occured in Mac_Lib.c line 64 Rosetta 4.82
https://boinc.bakerlab.org/rosetta/result.php?resultid=12137116

While looking at the results of the "user of day" on rosetta web site,
I found this error (result was considered valid) though -:(

These incompatibilities between C lib versions are occurring not only on Linux
... as we see above, on Mac/Darwin too.

imho: All applications need to be made static
if we want the app running on *all* PCs ...

Otherwise we can start downloading the source code and building the app
on each PC before running it.
-or- alternatively start writing the app in another language that does not
depend on "shared library(s)", e.g. Assembly

ID: 784

©2024 University of Washington
http://www.bakerlab.org