Rosetta_beta 4.0+

Message boards : RALPH@home bug list : Rosetta_beta 4.0+

[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6508 - Posted: 25 Mar 2018, 17:55:11 UTC - in response to Message 6506.  

All my 4.07 work units have failed, either immediately (Windows XP 32 bit) or after 1 minute (Linux 64 bit)


No problems here with Win10 64 bit
I hope the Rosetta admins are on their way to dropping XP (and, in the future, 32 bit) support.
ID: 6508
Mumps [MM]
Joined: 10 Mar 15
Posts: 2
Credit: 10,763,841
RAC: 78,731
Message 6509 - Posted: 25 Mar 2018, 19:32:07 UTC

On my predominantly AMD based fleet, almost all 64 bit 4.07 tasks are ending in error after about 1 minute. All hosts seem to be fine running the 32 bit tasks. The logs report an exit signal of 11, which I believe translates to SIGSEGV, normally an invalid memory reference.

It looks like it may be related to a missing/corrupted download, because a rare host does seem to be able to complete tasks run with the 64 bit app, while all seem to be able to do so with the 32 bit app. Any suggestions?

Failed:
https://ralph.bakerlab.org/result.php?resultid=4264728

Good host:
https://ralph.bakerlab.org/result.php?resultid=42645268
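
For anyone who wants to double-check the signal number from the logs, here is a minimal Python sketch that maps a signal number to its symbolic name. It assumes a POSIX-style host such as Linux; `signal_name` is a made-up helper for illustration, not part of BOINC or Rosetta.

```python
import signal

def signal_name(number: int) -> str:
    """Return the symbolic name for a signal number, or 'UNKNOWN' if this platform has none."""
    try:
        return signal.Signals(number).name
    except ValueError:
        # The number does not correspond to a signal on this platform.
        return "UNKNOWN"

# Exit signal reported in the task logs.
print(signal_name(11))
```

On Linux this prints SIGSEGV, which matches the reading above. It only confirms the name of the signal and says nothing about what caused the invalid memory reference.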
ID: 6509
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6511 - Posted: 27 Mar 2018, 18:26:28 UTC

4281026

File: C:\cygwin\home\boinc\Rosetta\main\source\src\core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan(ind)
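
The `-nan(ind)` in that message is how the Microsoft C runtime prints an indeterminate NaN, so the chi angle was probably already NaN before the range check ran. NaN fails every comparison, which is why such a check trips. A minimal Python sketch of that behaviour follows; it is not the Rosetta code, and `chi_in_range` is a hypothetical helper.

```python
import math

def chi_in_range(chi_degrees: float) -> bool:
    """Range check of the kind the error message describes: -180 <= chi <= 180."""
    return -180.0 <= chi_degrees <= 180.0

# An indeterminate operation such as inf - inf yields NaN.
bad_chi = math.inf - math.inf

print(chi_in_range(179.5))    # a normal angle passes the check
print(math.isnan(bad_chi))    # the indeterminate result is NaN
print(chi_in_range(bad_chi))  # NaN compares false against both bounds
```

If this reading is right, the real problem lies upstream of the check, wherever the angle is computed, and the message is only the first place it becomes visible.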
ID: 6511
Trotador
Joined: 7 May 10
Posts: 26
Credit: 12,991,156
RAC: 102,384
Message 6512 - Posted: 27 Mar 2018, 19:37:48 UTC - in response to Message 6511.  

4281026

File: C:\cygwin\home\boinc\Rosetta\main\source\src\core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan(ind)


Several of those ones as well.
ID: 6512
Trotador
Joined: 7 May 10
Posts: 26
Credit: 12,991,156
RAC: 102,384
Message 6513 - Posted: 28 Mar 2018, 8:01:45 UTC - in response to Message 6512.  

4281026

File: C:\cygwin\home\boinc\Rosetta\main\source\src\core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan(ind)


Several of those ones as well.


A lot of them actually
ID: 6513
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6517 - Posted: 28 Mar 2018, 15:26:21 UTC - in response to Message 6513.  

A lot of them actually

+1

4281114
4281023
etc....

But I don't know if admins read this thread.
ID: 6517
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6520 - Posted: 6 Apr 2018, 9:28:50 UTC

Some WUs are using over 1 GB of RAM each.
ID: 6520
Conan
Joined: 16 Feb 06
Posts: 350
Credit: 1,354,536
RAC: 455
Message 6522 - Posted: 9 Apr 2018, 4:05:36 UTC - in response to Message 6507.  
Last modified: 9 Apr 2018, 4:06:14 UTC

Same here.

Win XP 32 bit.

Application version	Rosetta v4.07 
windows_intelx86

Stderr output
<core_client_version>7.9.2</core_client_version>
<![CDATA[
<message>
couldn't start app: CreateProcess() failed - (unknown error)</message>
]]>


Still getting this error on ALL 32 bit Windows XP tasks, NONE of them work.

Same thing happened on the main project, obviously released too early before being fixed here on RALPH.

Conan
ID: 6522
Conan
Joined: 16 Feb 06
Posts: 350
Credit: 1,354,536
RAC: 455
Message 6529 - Posted: 13 Apr 2018, 7:05:17 UTC - in response to Message 6522.  

Same here.

Win XP 32 bit.

Application version	Rosetta v4.07 
windows_intelx86

Stderr output
<core_client_version>7.9.2</core_client_version>
<![CDATA[
<message>
couldn't start app: CreateProcess() failed - (unknown error)</message>
]]>


Still getting this error on ALL 32 bit Windows XP tasks, NONE of them work.

Same thing happened on the main project, obviously released too early before being fixed here on RALPH.

Conan


I am getting the same error still. On both Ralph and Rosetta, for these XP 32 bit tasks, ALL fail.
Linux works OK.

Conan
ID: 6529
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6530 - Posted: 14 Apr 2018, 9:27:08 UTC - in response to Message 6529.  

I am getting the same error still. On both Ralph and Rosetta, for these XP 32 bit tasks, ALL fail.
Linux works OK.

Conan


All the big software houses (MS, Apple, many Linux distros, etc.) are abandoning 32 bit.
All the big hardware makers (Nvidia, AMD, etc.) are dropping 32 bit from their drivers.
I think it's time for BOINC projects to start phasing out 32 bit (no debugging, no new apps, etc.).
ID: 6530
Conan
Joined: 16 Feb 06
Posts: 350
Credit: 1,354,536
RAC: 455
Message 6531 - Posted: 14 Apr 2018, 10:45:05 UTC - in response to Message 6530.  
Last modified: 14 Apr 2018, 10:47:00 UTC

I am getting the same error still. On both Ralph and Rosetta, for these XP 32 bit tasks, ALL fail.
Linux works OK.

Conan


All the big software houses (MS, Apple, many Linux distros, etc.) are abandoning 32 bit.
All the big hardware makers (Nvidia, AMD, etc.) are dropping 32 bit from their drivers.
I think it's time for BOINC projects to start phasing out 32 bit (no debugging, no new apps, etc.).


That may or may not be the case, however it suits my needs and I don't need 64 bits to do what I do with this computer.

Check the application listing, 32 Bit is supposed to work, it does NOT.

My question is this: the "Rosetta Mini" work units run without an issue on both 32 bit and 64 bit, but the new "Rosetta" work units fail on 32 bit. So I am asking what has changed to cause the problem.

Conan
ID: 6531
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6532 - Posted: 14 Apr 2018, 17:28:40 UTC - in response to Message 6531.  

That may or may not be the case, however it suits my needs and I don't need 64 bits to do what I do with this computer.

Check the application listing, 32 Bit is supposed to work, it does NOT.

My question is this: the "Rosetta Mini" work units run without an issue on both 32 bit and 64 bit, but the new "Rosetta" work units fail on 32 bit. So I am asking what has changed to cause the problem.

Conan


Don't misunderstand me, Conan.
I think you're right: you have a 32 bit system, and the project supports 32 bit systems.
But I don't know if they have the strength (or the will) to support XP 32 bit.
ID: 6532
Trotador
Joined: 7 May 10
Posts: 26
Credit: 12,991,156
RAC: 102,384
Message 6533 - Posted: 16 Apr 2018, 18:28:11 UTC

Many units are taking over 1 GB of RAM (again); I've seen up to 1.6 GB.
ID: 6533
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6534 - Posted: 16 Apr 2018, 18:47:22 UTC

4597437

<message>
Incorrect function.
(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/ralph.bakerlab.org/rosetta_4.07_windows_intelx86.exe -run:protocol jd2_scripting @flags_rb_04_15_74_125__t000__0_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_04_15_74_125__t000__0_C1_robetta.zip -nstruct 10000 -cpu_run_time 3600 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2163117
Starting watchdog...
Watchdog active.
ID: 6534
Trotador
Joined: 7 May 10
Posts: 26
Credit: 12,991,156
RAC: 102,384
Message 6535 - Posted: 16 Apr 2018, 20:17:58 UTC - in response to Message 6533.  

Many units are taking over 1 GB of RAM (again); I've seen up to 1.6 GB.

some examples:

https://ralph.bakerlab.org/result.php?resultid=4610974
https://ralph.bakerlab.org/result.php?resultid=4611547
https://ralph.bakerlab.org/result.php?resultid=4610940
https://ralph.bakerlab.org/result.php?resultid=4611488
https://ralph.bakerlab.org/result.php?resultid=4613637
https://ralph.bakerlab.org/result.php?resultid=4613672
https://ralph.bakerlab.org/result.php?resultid=4613678
https://ralph.bakerlab.org/result.php?resultid=4613261
https://ralph.bakerlab.org/result.php?resultid=4616237
https://ralph.bakerlab.org/result.php?resultid=4615718
https://ralph.bakerlab.org/result.php?resultid=4614760
https://ralph.bakerlab.org/result.php?resultid=4614886
https://ralph.bakerlab.org/result.php?resultid=4614837
https://ralph.bakerlab.org/result.php?resultid=4616251
https://ralph.bakerlab.org/result.php?resultid=4616952
https://ralph.bakerlab.org/result.php?resultid=4614243
https://ralph.bakerlab.org/result.php?resultid=4614244
ID: 6535
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6536 - Posted: 17 Apr 2018, 5:45:12 UTC - in response to Message 6533.  

Many units are taking over 1 GB of RAM (again); I've seen up to 1.6 GB.


Same here. 2 of my computers are "waiting for memory".
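
A rough way to see why hosts end up "waiting for memory": a back-of-the-envelope sketch in Python. It assumes the client holds tasks back once their combined memory would exceed the share of RAM allowed in the computing preferences. The 8 GB host and the 50% limit are made-up numbers, and `tasks_that_fit` is a hypothetical helper.

```python
def tasks_that_fit(total_ram_gb: float, allowed_fraction: float, task_ram_gb: float) -> int:
    """How many tasks of a given size fit inside the RAM share the client may use."""
    budget_gb = total_ram_gb * allowed_fraction
    return int(budget_gb // task_ram_gb)

# Hypothetical host: 8 GB of RAM, client allowed to use 50% of it,
# and tasks at the 1.6 GB peak reported in this thread.
print(tasks_that_fit(8.0, 0.5, 1.6))
```

Under those assumed numbers only two tasks fit, so any further tasks on a machine with more cores than that would sit waiting.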
ID: 6536
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6537 - Posted: 17 Apr 2018, 15:18:06 UTC - in response to Message 6534.  

4597437

<message>
Incorrect function.
(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/ralph.bakerlab.org/rosetta_4.07_windows_intelx86.exe -run:protocol jd2_scripting @flags_rb_04_15_74_125__t000__0_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_04_15_74_125__t000__0_C1_robetta.zip -nstruct 10000 -cpu_run_time 3600 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2163117
Starting watchdog...
Watchdog active.


A lot of these errors (some after 4 hours)

https://ralph.bakerlab.org/result.php?resultid=4623903
https://ralph.bakerlab.org/result.php?resultid=4622701
https://ralph.bakerlab.org/result.php?resultid=4623848

etc, etc...
ID: 6537
Trotador
Joined: 7 May 10
Posts: 26
Credit: 12,991,156
RAC: 102,384
Message 6538 - Posted: 18 Apr 2018, 5:28:44 UTC

More errors

<core_client_version>7.6.31</core_client_version>
<![CDATA[
<message>
process got signal 11
</message>
<stderr_txt>
command: ../../projects/ralph.bakerlab.org/rosetta_4.07_i686-pc-linux-gnu -run:protocol jd2_scripting @flags_rb_04_15_85_134__t000__0_C4_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_04_15_85_134__t000__0_C4_robetta.zip -nstruct 10000 -cpu_run_time 3600 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2126819
Starting watchdog...
Watchdog active.
BOINC:: CPU time: 18506.1s, 14400s + 3600s[2018- 4-18 2:40:41:] :: BOINC
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE :: 1 starting structures 18506.1 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================
02:40:41 (20103): called boinc_finish(0)

</stderr_txt>
]]>
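
One detail in that log is the line `BOINC:: CPU time: 18506.1s, 14400s + 3600s`. My reading, which is an assumption and not something the log states, is that the 3600s is the `-cpu_run_time` value from the command line and the 14400s is an extra allowance on top of it. A small Python sketch that pulls the numbers out of such a line and compares them; `parse_cpu_time` is a hypothetical helper.

```python
import re

# The line as it appears in the stderr output above.
LINE = "BOINC:: CPU time: 18506.1s, 14400s + 3600s[2018- 4-18 2:40:41:] :: BOINC"

PATTERN = re.compile(r"CPU time: ([\d.]+)s, (\d+)s \+ (\d+)s")

def parse_cpu_time(line: str):
    """Return (cpu_seconds_used, combined_limit_seconds), or None if the line does not match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    used = float(match.group(1))
    # Assumed meaning: allowance plus target run time.
    limit = int(match.group(2)) + int(match.group(3))
    return used, limit

used, limit = parse_cpu_time(LINE)
print(used, limit, used > limit)
```

For this task the CPU time used (18506.1 s) is above the combined 18000 s, which would fit with the task being cut off for running too long. Whether that is related to the signal 11 is not something the log shows.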
ID: 6538
[VENETO] boboviz
Joined: 9 Apr 08
Posts: 557
Credit: 923,594
RAC: 3,986
Message 6539 - Posted: 18 Apr 2018, 14:05:34 UTC - in response to Message 6538.  

More errors


I'm quite skeptical that administrators read the forum....
ID: 6539



©2018 University of Washington
http://www.bakerlab.org