Ralph support OpenCL ?

Message boards : Number crunching : Ralph support OpenCL ?

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 8 · Next

AuthorMessage
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5787 - Posted: 1 Sep 2014, 7:38:45 UTC
Last modified: 1 Sep 2014, 7:39:13 UTC

HSA Foundation releases specification, compiler and driver HSA Spec

AMD Announces Heterogeneous C++ AMP Language for Developers here

PyOpenCL 2014.1 here

Fedora 21 supports OpenCl "out of the box" here
ID: 5787 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5805 - Posted: 27 Dec 2014, 17:57:23 UTC
Last modified: 27 Dec 2014, 17:58:13 UTC

OpenCl 2.0 SDK AMD (beta) with Bolt 1.3: SDK 3.0
OpenCl 2.0 SDK Intel: SDK 2014
New version of CodeXL
ID: 5805 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5809 - Posted: 4 Feb 2015, 14:18:30 UTC

Some AVX docs
Introduction to AVX
Matrix transpose with AVX
Overview of AVX
Visual Studio 2013 supports AVX2
ID: 5809 · Report as offensive    Reply Quote
sgaboinc

Send message
Joined: 8 Jul 14
Posts: 20
Credit: 4,159
RAC: 0
Message 5931 - Posted: 18 Nov 2015, 18:03:51 UTC
Last modified: 18 Nov 2015, 18:07:47 UTC

a muse on vectorized computing SSE/AVX/AVX2/OpenCL e.g. GPU

there has been quite a bit of talk about vectorized computing i.e. the use os GPU and AVX2 for highly vectorized computing etc

i actually did a little bit of experiment, i'm running on haswell i7 4771 (non-k)
asus h87-pro motherboard & 16G 1600ghz ram

i tried openblas
http://www.openblas.net/
https://github.com/xianyi/OpenBLAS

./dlinpack.goto
these are the benchmarks
   SIZE       Residual     Decompose            Solve           Total
   100 :   4.529710e-14      503.14 MFlops    2000.00 MFlops     514.36 MFlops
   500 :   1.103118e-12     8171.54 MFlops    3676.47 MFlops    8112.38 MFlops
  1000 :   5.629275e-12    45060.27 MFlops    2747.25 MFlops   43075.87 MFlops
  5000 :   1.195055e-11   104392.81 MFlops    3275.04 MFlops  102495.20 MFlops
 10000 :   1.529443e-11   129210.71 MFlops    3465.54 MFlops  127819.77 MFlops


ok quite impressive ~128 Gflops on a haswell i7 desktop PC running at only 3.7ghz!
that almost compare to an 'old' supercomputer Numerical wind tunnel in Tokyo
https://en.wikipedia.org/wiki/Numerical_Wind_Tunnel_%28Japan%29

but what become immediately very apparent is also that only very large matrices 10,000 x 10,000 benefits from the vectorized codes (i.e. AVX2) in the *decompose* part. if you have tiny matrices say 100x100 in size that gives a paltry 514.36 Mflops, less than 100 times (or could say 1/200) of that speed of 10,000 x 10,000.

The other apparent thing is the *solve* part of the computation, you could see that while the decompose part which involves a matrix multiplication (e.g. DGEMM) can reach speeds of 128 Ghz, *but* the *solve* part *did not benefit* from all that AVX2 vectorized codes showing little improvements for different matrices sizes!

this has major implications, it means that whether you have a good cpu with AVX2 etc or that you have a large GPU that can process say vectorized / parallel 1000s floating point calcs per clock cycle.

But if your problems are small (e.g. 100x100) or that it cannot benefit from such vectorized codes much of that GPU capacity and even for this instance AVX2 may simply be *unused* and will *not benefit* from all that expensive vectorized hardware (e.g. AVX2 and expensive GPU cards capable of processing possibly thousands of vector computation per cycle, e.g. thousands of gpu simd cores)

i'd guess this reflect in a way Amdahl's law
https://en.wikipedia.org/wiki/Amdahl%27s_law

Gene Amdahl passed away recently & perhaps this could be a little tribute to him for having 'seen so far ahead' from back then.
http://www.latimes.com/local/obituaries/la-me-gene-amdahl-20151116-story.html
ID: 5931 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5932 - Posted: 21 Nov 2015, 14:58:05 UTC

This week, during SC15, Kronos Group released OpenCl 2.1 specifications, tools and compilers (with SPIR 1.0). Soon, the release of OpenCl C++ kernel.
ID: 5932 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5933 - Posted: 21 Nov 2015, 15:01:16 UTC

CodeXL 1.8 has a lot of new features:

Support for Microsoft Windows® 10
HSA Profiler
Power Profiling discrete GPUs
Power Profiling API
Offline build and analysis of OpenGL shaders
New Static Analyzer enhanced ISA view
GPU Debugger support up to OpenGL 4.5
64-bit offline build for OpenCLâ„¢ kernels
CPU Profiling for the “Carrizo” APU
CPU Profiling on VMWare
ID: 5933 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 5935 - Posted: 27 Nov 2015, 9:59:19 UTC

New CodeXl is out.
CodeXL 1.9 includes:
Build and Static Analysis of these types of OpenGL® shaders on Microsoft Windows® and Linux®:
Vertex
Tessellation Evaluation
Geometric
Fragment
Compute
Faster processing of CPU profile data;
Support for longer CPU profiling sessions;
API for controlling CPU Profiling data collection from app code;
HSAIL Debugger (beta)
Support for Ubuntu 15.04
Many user-experience enhancements plus more.
ID: 5935 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6052 - Posted: 19 Feb 2016, 18:40:18 UTC

ID: 6052 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6078 - Posted: 21 Apr 2016, 16:26:22 UTC
Last modified: 21 Apr 2016, 16:28:12 UTC

Code XL 2.0 is here (and now is open source!!).
ID: 6078 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6079 - Posted: 8 Jun 2016, 15:49:22 UTC

Now CodeXl is 2.1 and is on GitHub
ID: 6079 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6083 - Posted: 5 Sep 2016, 7:46:10 UTC

CodeXl 2.2 here
Intel CodeBuilder OpenCl for Visual Studio here
Visual Studio 15 preview 4 here
Gcc 6.2 here
ID: 6083 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6094 - Posted: 8 Nov 2016, 15:59:55 UTC

Hip is now 1.0, with a lot of new features
PyOpenCl 2016.2 supports Spir-V
ID: 6094 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6095 - Posted: 22 Nov 2016, 10:23:12 UTC - in response to Message 6083.  

Visual Studio 15 preview 4 here


Visual Studio 2017 is now in RC phase
ID: 6095 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6139 - Posted: 10 Mar 2017, 17:04:31 UTC - in response to Message 6095.  

Visual Studio 2017 is here.
And Codeplay releases the first open beta of ComputeCpp, their implementation of SYCL to bring C++ to OpenCl
ID: 6139 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6149 - Posted: 22 Mar 2017, 7:53:19 UTC
Last modified: 22 Mar 2017, 7:53:45 UTC

New KDevelop 5.1 supports OpenCl (and c/c++/c#,java,bash,fortran,pascal,python,etc...) and runs also on Windows and not only on Linux.
ID: 6149 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6161 - Posted: 5 Apr 2017, 16:28:42 UTC

Arm opens up Compute Library with OpenCl
Arm Compute Library
ID: 6161 · Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 13 Jan 09
Posts: 103
Credit: 331,865
RAC: 0
Message 6176 - Posted: 15 Apr 2017, 3:02:18 UTC

I'll consider starting to make some BOINC applications for OpenCL AFTER I find and take an online class in OpenCL that is intended for programming GPUs, not FPGAs. I cannot travel enough to use in-person classes instead.

BOINC now supports use of CUDA and OpenCL; I've seen no information on whether it can also handle the various other ways of programming GPUs that have already been mentioned in this thread. I'd expect library compatibility problems to block use of some of them at least until future versions of BOINC build in replacements for the incompatible sections of the libraries.
ID: 6176 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6191 - Posted: 16 May 2017, 15:03:10 UTC

OpenCl 2.2 is official.
- Support to OpencCl C++
- Spir 1.2
- SyCl 2.2
and other intersting things
ID: 6191 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6193 - Posted: 10 Jun 2017, 10:35:09 UTC

During IWOCL Kronos Group has annunced that they wants to unify OpenCl and Vulkan Api. That's very interesting!
ID: 6193 · Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 9 Apr 08
Posts: 912
Credit: 1,892,541
RAC: 294
Message 6194 - Posted: 1 Jul 2017, 15:55:02 UTC

ROCm is now at 1.6 version
ID: 6194 · Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 8 · Next

Message boards : Number crunching : Ralph support OpenCL ?



©2024 University of Washington
http://www.bakerlab.org