Bug Reports for Rosetta Mini Versions 1.+

Message boards : RALPH@home bug list : Bug Reports for Rosetta Mini Versions 1.+

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 5 · Next

AuthorMessage
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3620 - Posted: 15 Jan 2008, 18:05:30 UTC

As many are aware, we are starting to test a new application version called Rosetta Mini. This application is a complete restructuring of the rosetta application that is currently running on Rosetta@home.

Please post any bugs regarding the first alpha test versions (versions 1.+) of this application here. We'll provide more information about this app in the near future.

ID: 3620 · Report as offensive    Reply Quote
Luuklag

Send message
Joined: 5 Jan 08
Posts: 15
Credit: 80
RAC: 0
Message 3622 - Posted: 15 Jan 2008, 19:12:28 UTC

is this a kind of clean code of rosetta? removed all the things that have become unusefull and rewritten parts into clearer code??
ID: 3622 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3626 - Posted: 16 Jan 2008, 0:10:51 UTC - in response to Message 3622.  

is this a kind of clean code of rosetta? removed all the things that have become unusefull and rewritten parts into clearer code??


Yes. exactly correct. The old code was converted from fortran to C++ resulting in fortran-like C++ code that is difficult to develop and contains a lot of duplicate code. Mini is object oriented and designed to simplify development.
ID: 3626 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 3630 - Posted: 16 Jan 2008, 8:07:09 UTC - in response to Message 3626.  

The old code was converted from fortran to C++ resulting in fortran-like C++ code that is difficult to develop and contains a lot of duplicate code. Mini is object oriented and designed to simplify development.

Well, good luck, this is indeed a good news. Especially concerming mainainablilty and extensibility of the code.

Do you expect this transition to influence the memory footprint of large Rosetta simulations? Or will it be possible to split the large simulations into smaller ones?

Peter
ID: 3630 · Report as offensive    Reply Quote
Luuklag

Send message
Joined: 5 Jan 08
Posts: 15
Credit: 80
RAC: 0
Message 3637 - Posted: 16 Jan 2008, 15:35:09 UTC

Peter, you mean like it can be ported over to I.e. the ps3 in the future?
ID: 3637 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 3639 - Posted: 16 Jan 2008, 22:51:34 UTC - in response to Message 3637.  

Peter, you mean like it can be ported over to I.e. the ps3 in the future?

How the heck should I know this? Ported - probably yes, for sure. Let it effectively use some/all of the SPEs - would be nice. Get an SPE SDK, plenty of dedicated time, some SPE-experienced programmer(s), cook it long enough and...

Peter
ID: 3639 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3654 - Posted: 20 Jan 2008, 14:48:06 UTC

Here are another 3 'mini' WU problems, all on Windows machines.
Running under Boinc version 5.10.38, after the first WU's froze computer, now seems to be ok as latest 'mini' failures did not stop computer.

WU 736284
WU 736500
WU 735381

<core_client_version>5.8.15</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
Failed to load GDI32.DLL...
Failed to load OPENGL32.DLL...
Failed to load GLU32.DLL...
Graphics are disabled due to configuration...
Invalid parameter detected in function (null). File: (null) Line: 0
Expression: (null)


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x7C901230

Engaging BOINC Windows Runtime Debugger...
ID: 3654 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3676 - Posted: 30 Jan 2008, 13:46:52 UTC

>> Problems with "mini" WU's version 1.04.
Have just had 16 of them fail in a row and none of them getting past a few seconds. Missing a file or something like that, error code -1.

Two examples are WU 739359 and WU 740086.

Hope this helps.
ID: 3676 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3677 - Posted: 30 Jan 2008, 18:15:56 UTC

There was a small batch sent out that was configured incorrectly and thus all the corresponding tasks failed with the missing file message.
ID: 3677 · Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 07
Posts: 75
Credit: 69,584
RAC: 0
Message 3678 - Posted: 30 Jan 2008, 19:41:48 UTC

This one also seems to be one of them. No graphics and so it is hard to tell if it is working or not. The only indication is from task manager.

mini_abinitio-1ten_-test_james_3074_6
ID: 3678 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3679 - Posted: 31 Jan 2008, 21:34:11 UTC
Last modified: 31 Jan 2008, 21:36:34 UTC

G'Day dekim,
all the "score" 1.05 WU's are failing after 5 seconds with this error
I have had about 4 so far and 4 more to go which I expect will fail as well.

Example this WU

<core_client_version>5.10.21</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
ERROR:: Exit from: src/core/fragment/ConstantLengthFragSet.cc line: 60
called boinc_finish
ID: 3679 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 3680 - Posted: 31 Jan 2008, 22:45:58 UTC - in response to Message 3679.  
Last modified: 31 Jan 2008, 22:53:41 UTC

all the "score" 1.05 WU's are failing after 5 seconds with this error
I have had about 4 so far and 4 more to go which I expect will fail as well.

Example this WU

Yours is on a Linux host.

My score_13_hb_env_test62_A_1louA_3083_1_0 WU on Windows is already running 45 minutes and does not complain. 1 more to go.

And I see 2 more are waiting on my Linux host. To check for such fast death - I can do it immediately now...

...the score_13_hb_env_test62_A_1bkrA_3084_5_0 WU on Linux is running 5 minutes without crash. Hope to see it continuing this way. 1 more to go.

Peter
ID: 3680 · Report as offensive    Reply Quote
Profile dekim
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 20 Jan 06
Posts: 250
Credit: 543,579
RAC: 0
Message 3682 - Posted: 31 Jan 2008, 23:21:18 UTC

Another bad batch of work units went out. Please just ignore these failures.
ID: 3682 · Report as offensive    Reply Quote
Thomas Leibold

Send message
Joined: 25 Feb 07
Posts: 27
Credit: 77,464
RAC: 0
Message 3686 - Posted: 5 Feb 2008, 6:57:35 UTC - in response to Message 3682.  

Another bad batch of work units went out. Please just ignore these failures.


I'm assuming the bad batch were the Rosetta Mini 1.04 workunits ? All of the ones I got failed immediately. I only got a single 1.05 workunit and it was successful.

Currently my system is working on a 1.06 workunit and so far it seems to be running ok, with just a little oddity in the task display of Boinc Manager.
I have Ralph configured for 4 hour preferred runtime and the task display shows 2:37:51 cpu time, 63.244%, 7:09:26 to completion.
ID: 3686 · Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 16 Feb 06
Posts: 364
Credit: 1,368,421
RAC: 0
Message 3691 - Posted: 7 Feb 2008, 8:22:32 UTC

> Version "mini 1.06" seemed to have completed OK but received a "Validation Error" and then said invalid with no credit.
This WU

ID: 3691 · Report as offensive    Reply Quote
Profile KSMarksPsych
Avatar

Send message
Joined: 16 Feb 06
Posts: 40
Credit: 8,226
RAC: 0
Message 3693 - Posted: 7 Feb 2008, 10:17:27 UTC - in response to Message 3691.  

> Version "mini 1.06" seemed to have completed OK but received a "Validation Error" and then said invalid with no credit.
This WU



See here. Looks like we found a bug :-)
Kathryn :o)
The BOINC FAQ Service
The Unofficial BOINC Wiki
The Trac System
More BOINC information than you can shake a stick of RAM at.
ID: 3693 · Report as offensive    Reply Quote
Faux

Send message
Joined: 22 Jul 07
Posts: 1
Credit: 16,165
RAC: 0
Message 3763 - Posted: 20 Feb 2008, 3:05:04 UTC - in response to Message 3626.  

Mini is object oriented and designed to simplify development.



This should make it reeeal easy to make us a nice, shiny x86_64 build, right? :)
ID: 3763 · Report as offensive    Reply Quote
Sadir

Send message
Joined: 21 Feb 06
Posts: 6
Credit: 1,419
RAC: 0
Message 3765 - Posted: 20 Feb 2008, 8:29:50 UTC

WU 690472 give me this messages:
20/02/2008 09:29:02|ralph@home|Restarting task score13_hb_envtest62_A_5croA_3299_3594_1 using minirosetta version 108
20/02/2008 09:29:03|ralph@home|Task score13_hb_envtest62_A_5croA_3299_3594_1 exited with zero status but no 'finished' file
20/02/2008 09:29:03|ralph@home|If this happens repeatedly you may need to reset the project.

and CPU was oscilating betveen 19 and 32 % with 0 Progress and ---- CPU time.
ID: 3765 · Report as offensive    Reply Quote
LEONARI

Send message
Joined: 12 Mar 06
Posts: 5
Credit: 108,342
RAC: 0
Message 3770 - Posted: 20 Feb 2008, 12:47:09 UTC

I have been getting this kind of report continuously over the last few days!

20/02/2008 12:37:45|ralph@home|Restarting task score13_hb_envtest62_A_1tig__3299_3942_0 using minirosetta version 108
20/02/2008 12:37:46|ralph@home|Task score13_hb_envtest62_A_1tig__3299_3942_0 exited with zero status but no 'finished' file
20/02/2008 12:37:46|ralph@home|If this happens repeatedly you may need to reset the project.
20/02/2008 12:37:47|ralph@home|Restarting task score13_hb_envtest62_A_1tig__3299_3942_0 using minirosetta version 108
20/02/2008 12:37:48|ralph@home|Computation for task score13_hb_envtest62_A_1tig__3299_3942_0 finished
20/02/2008 12:39:22|ralph@home|Sending scheduler request: To fetch work
20/02/2008 12:39:22|ralph@home|Requesting 9 seconds of new work, and reporting 1 completed tasks
20/02/2008 12:39:27|ralph@home|Scheduler RPC succeeded [server version 509]
20/02/2008 12:39:27|ralph@home|Deferring communication for 4 min 2 sec
20/02/2008 12:39:27|ralph@home|Reason: requested by project
20/02/2008 12:39:29|ralph@home|[file_xfer] Started download of file frags83_1a19.pdb.gz
20/02/2008 12:39:29|ralph@home|[file_xfer] Started download of file boinc_frags83_aa1a19A03_05.200_v1_3.gz
20/02/2008 12:39:32|ralph@home|[file_xfer] Finished download of file frags83_1a19.pdb.gz
20/02/2008 12:39:32|ralph@home|[file_xfer] Throughput 6209 bytes/sec
20/02/2008 12:39:32|ralph@home|[file_xfer] Started download of file boinc_frags83_aa1a19A09_05.200_v1_3.gz
20/02/2008 12:39:44|ralph@home|[file_xfer] Finished download of file boinc_frags83_aa1a19A09_05.200_v1_3.gz
20/02/2008 12:39:44|ralph@home|[file_xfer] Throughput 17080 bytes/sec
20/02/2008 12:40:15|ralph@home|[file_xfer] Finished download of file boinc_frags83_aa1a19A03_05.200_v1_3.gz
20/02/2008 12:40:15|ralph@home|[file_xfer] Throughput 17258 bytes/sec
20/02/2008 12:40:16|ralph@home|Starting score13_hb_envtest62_A_1a19A_3299_3939_0
20/02/2008 12:40:16|ralph@home|Starting task score13_hb_envtest62_A_1a19A_3299_3939_0 using minirosetta version 108
20/02/2008 12:40:18|ralph@home|Task score13_hb_envtest62_A_1a19A_3299_3939_0 exited with zero status but no 'finished' file
20/02/2008 12:40:18|ralph@home|If this happens repeatedly you may need to reset the project.
20/02/2008 12:40:18|ralph@home|Restarting task score13_hb_envtest62_A_1a19A_3299_3939_0 using minirosetta version 108
20/02/2008 12:40:19|ralph@home|Task score13_hb_envtest62_A_1a19A_3299_3939_0 exited with zero status but no 'finished' file
20/02/2008 12:40:19|ralph@home|If this happens repeatedly you may need to reset the project.

ID: 3770 · Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 8 Sep 06
Posts: 104
Credit: 36,890
RAC: 0
Message 3772 - Posted: 20 Feb 2008, 13:00:03 UTC - in response to Message 3770.  

I have been getting this kind of report continuously over the last few days!

Could you point to any of your failed results? (Hidden computers.)

Peter
ID: 3772 · Report as offensive    Reply Quote
1 · 2 · 3 · 4 . . . 5 · Next

Message boards : RALPH@home bug list : Bug Reports for Rosetta Mini Versions 1.+



©2024 University of Washington
http://www.bakerlab.org