Old - Bug reports for Windows Ver - 5.00 (and higher)

Message boards : RALPH@home bug list : Old - Bug reports for Windows Ver - 5.00 (and higher)

To post messages, you must log in.

AuthorMessage
Moderator9
Volunteer moderator

Send message
Joined: 16 Feb 06
Posts: 251
Credit: 0
RAC: 0
Message 1189 - Posted: 16 Apr 2006, 2:51:12 UTC
Last modified: 19 Apr 2006, 13:37:26 UTC

This thread is for reporting Bugs in windows Version 5.0 (or higher). Please include the type of error in your report, and a link to the RESULT in your stats page.

Please do not place long listing of you error logs in your posts especially if they repeat. A lot of people are using modem connections or paying packet rates. Long repeating text posts will be removed.

Moderator9
RALPH@home FAQs
RALPH@home Guidelines
Moderator Contact
ID: 1189 · Report as offensive    Reply Quote
Profile Carlos_Pfitzner
Avatar

Send message
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 1206 - Posted: 17 Apr 2006, 19:27:09 UTC
Last modified: 17 Apr 2006, 19:32:00 UTC

ID: 1206 · Report as offensive    Reply Quote
rbpeake

Send message
Joined: 16 Feb 06
Posts: 19
Credit: 3,370
RAC: 0
Message 1207 - Posted: 17 Apr 2006, 19:32:21 UTC - in response to Message 1206.  

My question is: When 5.00 windows will go to replace the buggy Rosetta 4.98 ?

I have the same question. My experience with 5.00 shows it to be much better than 4.98, which shows an occasional bug now and then.

ID: 1207 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 1217 - Posted: 18 Apr 2006, 15:21:47 UTC

I attempted to display the graphics for a WU and it blinked the graphic window and killed it, and seems to have crashed the WU. I tried to display graphic both at 9:02 and at 10:34, I had THOUGH I was displaying the Ralph graphic in both cases.

4/18/2006 7:00:00 AM||Suspending network activity - time of day
4/18/2006 9:02:38 AM|rosetta@home|Unrecoverable error for result 7565_largescale_large_fullatom_relax_dec7565_1_06_6.pdb_440_88_0 (There are no child processes to wait for. (0x80) - exit code 128 (0x80))
4/18/2006 9:02:42 AM||request_reschedule_cpus: process exited
4/18/2006 9:02:42 AM|rosetta@home|Computation for result 7565_largescale_large_fullatom_relax_dec7565_1_06_6.pdb_440_88_0 finished
4/18/2006 9:02:47 AM|rosetta@home|Starting result 7565_largescale_large_fullatom_relax_dec7565_1_01_9.pdb_440_88_0 using rosetta version 498
4/18/2006 10:34:07 AM|ralph@home|Unrecoverable error for result FACONTACTS_NOFILTERS_1bm8__383_7_0 (There are no child processes to wait for. (0x80) - exit code 128 (0x80))
4/18/2006 10:34:09 AM||request_reschedule_cpus: process exited
4/18/2006 10:34:09 AM|ralph@home|Computation for result FACONTACTS_NOFILTERS_1bm8__383_7_0 finished

Link to the WU
ID: 1217 · Report as offensive    Reply Quote
Profile feet1st

Send message
Joined: 7 Mar 06
Posts: 313
Credit: 116,623
RAC: 0
Message 1219 - Posted: 18 Apr 2006, 15:30:30 UTC

So I then enabled my network access to report the WU, and (as has often happened when I enable my network access) I get errors in all active projects: climate and in Rosetta & Ralph:
4/18/2006 10:42:47 AM||Resuming network activity
4/18/2006 10:43:37 AM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
4/18/2006 10:43:37 AM|rosetta@home|Reason: To fetch work
4/18/2006 10:43:37 AM|rosetta@home|Requesting 132831 seconds of new work, and reporting 3 results
4/18/2006 10:43:40 AM|climateprediction.net|Result sulphur_imrd_100869305_0 exited with zero status but no 'finished' file
4/18/2006 10:43:40 AM|climateprediction.net|If this happens repeatedly you may need to reset the project.
4/18/2006 10:43:40 AM|rosetta@home|Result 7565_largescale_large_fullatom_relax_dec7565_1_01_9.pdb_440_88_0 exited with zero status but no 'finished' file
4/18/2006 10:43:40 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
4/18/2006 10:43:40 AM|ralph@home|Result FACONTACTS_NOFILTERS_1bq9A_383_7_0 exited with zero status but no 'finished' file
4/18/2006 10:43:40 AM|ralph@home|If this happens repeatedly you may need to reset the project.
4/18/2006 10:43:40 AM||request_reschedule_cpus: process exited
4/18/2006 10:43:41 AM|rosetta@home|Restarting result 7565_largescale_large_fullatom_relax_dec7565_1_01_9.pdb_440_88_0 using rosetta version 498
4/18/2006 10:43:42 AM|ralph@home|Restarting result FACONTACTS_NOFILTERS_1bq9A_383_7_0 using rosetta_beta version 500
4/18/2006 10:43:46 AM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded

While this happens BOINC seems to hang (i.e. unresponsive to user clicks). I've also seen it sometimes loop through many of these zero status but no finished file messages. Eventually a BOINC reschedule came through and seemed to halt the loop.


This is WIN XP Pro, SP1
This is the box where I moved BOINC to a network drive, so perhaps the delay in accessing disk causes such problems?
ID: 1219 · Report as offensive    Reply Quote
Profile Astro

Send message
Joined: 16 Feb 06
Posts: 141
Credit: 32,977
RAC: 0
Message 1220 - Posted: 18 Apr 2006, 15:47:53 UTC

Feet1st, The "no finished file" message means Boinc.exe (the daemon), lost contact with Boincmgr.exe (the manager). The move to a network drive may have something to do with it. I assume the manager in on a local puter and the daemon is on the network drive.

tony
ID: 1220 · Report as offensive    Reply Quote
Profile Carlos_Pfitzner
Avatar

Send message
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 1223 - Posted: 18 Apr 2006, 19:06:50 UTC - in response to Message 1220.  

Feet1st, The "no finished file" message means Boinc.exe (the daemon), lost contact with Boincmgr.exe (the manager). The move to a network drive may have something to do with it. I assume the manager in on a local puter and the daemon is on the network drive.

tony


Sure ? I see this occurs on *all* projects I crunch when my 2gbit Internet link
is overloaded by excess of network traffic.

However is the boinc.exe (the daemon) that lost contact with
the science crunching application, cause the O/S was unable to
service the loopback link. Normally IP 127.0.0.1

Boincmgr.exe is irrelevant to this occurrence.
In fact I run boincview.exe instead of boincmgr.exe

And If this is a bug , is a bug of boinc , not of rosetta.

Also all those jobs that exited, with zero status, but no 'finished' file,
later are automatically re-started by boinc. No loss !

I did STOPed crunching Rosetta 4.98 on my PC's cause that 4.98 errors !

So now, on a thread to report 5.00 bugs, 4.98 bugs are mixed.

While furter delays on upgrading Rosetta are achieved,
I crunch other projects, no problems. lol

btw: Where is the required link for that rosetta 4.98 result that erroed ?
Click signature for global team stats
ID: 1223 · Report as offensive    Reply Quote
Nuadormrac
Avatar

Send message
Joined: 22 Feb 06
Posts: 68
Credit: 11,362
RAC: 0
Message 1239 - Posted: 19 Apr 2006, 11:05:58 UTC - in response to Message 1206.  

ps: I have not yet tested it on Linux, cause lack of enough WUs,
but at least the windows 5.00 I consider GOOD !


Not sure about 5.0 on Linux, but 4.98 did seem to go without incident on Linux, the problems we were seeing, being specific to the Windows app...

5.0 has been running without error for me as well...
ID: 1239 · Report as offensive    Reply Quote
[B^S] sTrey
Avatar

Send message
Joined: 15 Feb 06
Posts: 58
Credit: 15,430
RAC: 0
Message 1251 - Posted: 19 Apr 2006, 17:26:31 UTC
Last modified: 19 Apr 2006, 18:04:07 UTC

Old, yes, but since I didn't see problems reported (and was not able to get any 5.01 wus), I'll add my crashed 5.00 results:

90042
90134
90295
90278

The first two exited with incorrect function (tether.cc line 411; the last two with the familiar exit code -1073741819
ID: 1251 · Report as offensive    Reply Quote
Profile Carlos_Pfitzner
Avatar

Send message
Joined: 16 Feb 06
Posts: 182
Credit: 22,792
RAC: 0
Message 1257 - Posted: 19 Apr 2006, 20:00:32 UTC

By now, I have too 2 results that erred out using 5.00
The familiar Exit status 1 (0x1)
Less than 3% of error, and not stuck. Much better than 4.98 -:)
https://ralph.bakerlab.org/result.php?resultid=90016
https://ralph.bakerlab.org/result.php?resultid=90137
ps: The latest debugger from microsoft *is* installed on this PC
Click signature for global team stats
ID: 1257 · Report as offensive    Reply Quote
casio7131

Send message
Joined: 20 Mar 06
Posts: 15
Credit: 12,660
RAC: 0
Message 1266 - Posted: 20 Apr 2006, 0:58:29 UTC

20/04/2006 4:18:44 AM|ralph@home|Unrecoverable error for result LOOPFARLX_NO_FARLX_dec7553_1_00_10.pdb_394_1_1 ( - exit code -1073741819 (0xc0000005))
https://ralph.bakerlab.org/result.php?resultid=90355
ID: 1266 · Report as offensive    Reply Quote
rbpeake

Send message
Joined: 16 Feb 06
Posts: 19
Credit: 3,370
RAC: 0
Message 1278 - Posted: 20 Apr 2006, 15:09:45 UTC

I know the application has been upgraded to 5.01, but I got several failures on 5.00 of which the below are typical, in case there are of some use:


90320

90318

90185
ID: 1278 · Report as offensive    Reply Quote

Message boards : RALPH@home bug list : Old - Bug reports for Windows Ver - 5.00 (and higher)



©2024 University of Washington
http://www.bakerlab.org