Task 5462440

Name RF_SAVE_ALL_OUT_NOJRAN_IGNORE_THE_REST_validation_env_f_pred_236_16902_5_1
Workunit 4848030
Created 14 Jun 2024, 5:06:02 UTC
Sent 14 Jun 2024, 8:09:05 UTC
Report deadline 15 Jun 2024, 8:09:05 UTC
Received 14 Jun 2024, 9:01:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 12 (0x0000000C) Unknown error code
Computer ID 50047
Run time 3 min 29 sec
CPU time 3 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 3.65 GFLOPS
Application version Generalized biomolecular modeling and design with RoseTTAFold All-Atom v0.02 (nvidia_alpha) windows_x86_64
Peak working set size 279.30 MB
Peak swap size 7,849.83 MB
Peak disk usage 2.09 MB

Stderr output

<core_client_version>7.16.20</core_client_version>
<![CDATA[
<message>
Code d - exit code 12 (0xc)</message>
<stderr_txt>
Traceback (most recent call last):
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\cv2\rf2aa\predict.py", line 692, in <module>
    pred = Predictor(args)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\cv2\rf2aa\predict.py", line 282, in __init__
    checkpoint = torch.load(args.checkpoint, map_location=self.device)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 607, in load
    return _load(opened_zipfile, map_location, pickle_module, **pickle_load_args)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 882, in _load
    result = unpickler.load()
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 857, in persistent_load
    load_tensor(data_type, size, key, _maybe_decode_ascii(location))
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 846, in load_tensor
    loaded_storages[key] = restore_location(storage, location)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 824, in restore_location
    return default_restore_location(storage, map_location)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 175, in default_restore_location
    result = fn(storage, location)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\serialization.py", line 157, in _cuda_deserialize
    return obj.cuda(device)
  File "E:\BOINC\DATA\projects\ralph.bakerlab.org\ev0\lib\site-packages\torch\_utils.py", line 79, in _cuda
    return new_type(self.size()).copy_(self, non_blocking)
RuntimeError: CUDA error: out of memory
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

</stderr_txt>
]]>
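
The traceback above ends inside `torch.load(args.checkpoint, map_location=self.device)`, where copying checkpoint tensors onto the CUDA device raised "CUDA error: out of memory". One possible mitigation, sketched below under stated assumptions, is to retry the load on a less constrained device. The helper names `is_cuda_oom` and `load_with_fallback` are hypothetical and are not part of rf2aa or PyTorch; the sketch also assumes the application can tolerate running from a CPU-resident checkpoint, which the log does not confirm.

```python
from typing import Callable, Optional, Sequence, Tuple, TypeVar

T = TypeVar("T")


def is_cuda_oom(exc: BaseException) -> bool:
    """Heuristic: a RuntimeError whose message mentions 'out of memory'."""
    return isinstance(exc, RuntimeError) and "out of memory" in str(exc).lower()


def load_with_fallback(
    load: Callable[[str], T],
    devices: Sequence[str] = ("cuda:0", "cpu"),
) -> Tuple[T, str]:
    """Try `load(device)` for each device in order.

    Out-of-memory errors move on to the next device; any other
    exception is re-raised immediately. Returns the loaded object
    together with the device string that succeeded.
    """
    last_error: Optional[BaseException] = None
    for device in devices:
        try:
            return load(device), device
        except RuntimeError as exc:
            if not is_cuda_oom(exc):
                raise  # unrelated failure: do not hide it
            last_error = exc
    if last_error is None:
        raise ValueError("no devices were given")
    raise last_error  # every device ran out of memory
```

With PyTorch installed, the loader passed in could be something like `lambda dev: torch.load(checkpoint_path, map_location=dev)`, where `checkpoint_path` stands in for `args.checkpoint` from the traceback. Matching on the message text is a deliberate simplification, since the log shows only a plain `RuntimeError`.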

©2024 University of Washington
http://www.bakerlab.org