#OpenCL - kbin.social

toxi, 6 months ago to opensource

A week ago was the 1st anniversary of this solo instance & more generally of my fulltime move to Mastodon. A good time for a more detailed intro, partially intended as CV thread (pinned to my profile) which I will add to over time (also to compensate for the ongoing lack of a proper website)... Always open to consulting offers, commissions and/or suitable remote positions...

Hi, I'm Karsten 👋 - indie software engineer, researcher, #OpenSource author of hundreds of projects (since ~1999), computational/generative artist/designer, landscape photographer, lecturer, outdoor enthusiast, on the ND spectrum. Main interest in transdisciplinary research, tool making, exploring techniques, projects & roles amplifying the creative, educational, expressive and inspirational potential of (personal) computation, code as material, combining this with generative techniques of all forms (quite different to what is now called and implied by "generative AI").

Much of my own practice & philosophy is about #BottomUpDesign, interconnectedness, simplicity and composability as key enablers of emergent effects (also in terms of workflow & tool/system design). Been adopting a round-robin approach to cross-pollinate my work & learning, spending periods going deep into various fields to build up and combine experience in (A-Z order): API design, audio/DSP, baremetal (mainly STM32), computer vision/image processing, compiler/DSL/VM impl, databases/linked data/query engines, data structures impl, dataviz, fabrication (3DP, CNC, knit, lasercut), file formats & protocols (as connective tissue), "fullstack" webdev (front/back/AWS), generative & evolutionary algorithms/art/design/aesthetics/music, geometry/graphics, parsers, renderers, simulation (agents/CFD/particles/physics), shaders, typography, UI/UX/IxD...

Since 2018 my main endeavor has been https://thi.ng/umbrella, a "jurassic" (as it's been called) monorepo of ~185 code libraries, addressing many of the above topics (plus ~150 examples to illustrate usage). More generally, for the past decade my OSS work has been focused on #TypeScript, #C, #Zig, #WebAssembly, #Clojure, #ClojureScript, #GLSL, #OpenCL, #Forth, #Houdini/#VEX. Earlier on, mainly Java (~15 years, since 1996).

Formative years in the deep end of the #Atari 8bit demoscene (Chip Special Software) & game dev (eg. The Brundles, 1993), B&W dark room lab (since age 10), music production/studio (from 1993-2003), studied media informatics, moved to London initially as web dev, game dev (Shockwave 3D, ActionScript), interaction designer, information architect. Branched out, more varied clients/roles/community for my growing collection of computational design tools, which I've been continuously expanding/updating for the past 20+ years, and which have been the backbone of 99% of my work since ~2006 (and which helped countless artists/designers/students/studios/startups). Creator of thi.ng (since 2011), toxiclibs (2006-2013), both large-scale, multi-faceted library collections. Early contributor to Processing (2003-2005, pieces of core graphics API).

Worked on dozens of interactive installations/exhibitions, public spaces & media facades (own projects and many collabs, several award winning), large-scale print on-demand projects (>250k unique outputs), was instrumental in creating some of the first generative brand identity systems (incl. cloud infrastructure & asset management pipelines), collaborated with architects, artists, agencies, hardware engineers, had my work shown at major galleries/museums worldwide, taught 60+ workshops at universities, institutions and companies (mainly in EMEA). Was algorithm design lead at Nike's research group for 5 years, working on novel internal design tools, workflows, methods of make, product design (footwear & apparel) and team training. After 23 years in London, my family decided on a lifestyle change and so I'm currently based in the beautiful Allgäu region in Southern Germany.


karolherbst, 7 months ago to random

DaVinci Resolve (and other #OpenCL applications using gl_sharing) should work on rusticl on AMD and Intel hardware as of today.

Other drivers need to implement a few optional features, but that should be rather trivial.

Thanks a lot to Antonio for working on all of this!


giuseppebilotta, 7 months ago to random

OK so I'm ready for today's #GPGPU lesson with the new laptop. My only gripe for the lesson will be that #Rusticl in #Mesa 23.2 doesn't support #profiling information. Apparently the feature was merged at a later commit
https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24101
and I even tried upgrading to my distro's experimental 23.3-rc1 packages, but trying to use rusticl on those packages segfaults. So either I've messed up something with this mixed upgrade, or I've hit an actual bug.


giuseppebilotta, 7 months ago

I'm still moderately annoyed by the fact that there's no single #OpenCL platform to drive all compute devices on this machine. #PoCL comes close because it supports both the CPU and the #NVIDIA dGPU through #CUDA, but not the #AMD iGPU (there's an #HSA device, but). #Rusticl supports the iGPU (radeonsi) and the CPU (llvmpipe), but not the dGPU (partly because I'm running that on proprietary drivers for CUDA). Everything else has at best one supported device out of three available.


giuseppebilotta, 7 months ago to random

Whenever I can, I try to end each lesson with a provocation.

When we finished our first trivial #OpenCL program, I showed them how the kernel runtime plus data transfer runtime actually made GPUs “not convenient”, as a prelude to illustrating how memory pinning and buffer (un)mapping improve data transfer efficiency, and why transfers should be avoided when possible.

We're still working on that trivial program, so today I showed them how number of elements affects performance.

1/2
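To make the point about transfer cost and element count concrete, here is a minimal back-of-the-envelope sketch. It is not the code from the lesson: the `OffloadModel` class and every number in it are made-up assumptions, chosen only to show how a fixed launch overhead and the host/device transfer can outweigh a fast kernel.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class OffloadModel:
    """Toy cost model for a trivial element-wise kernel (hypothetical numbers)."""
    launch_overhead_s: float  # fixed cost paid once per offload
    bus_bytes_per_s: float    # host <-> device transfer bandwidth
    gpu_elems_per_s: float    # device throughput for this kernel
    cpu_elems_per_s: float    # host throughput for the same loop
    bytes_per_elem: int = 4   # one float per element

    def gpu_cost_per_elem(self, transfer: bool) -> float:
        cost = 1.0 / self.gpu_elems_per_s
        if transfer:
            # Each element is copied to the device and back again.
            cost += 2 * self.bytes_per_elem / self.bus_bytes_per_s
        return cost

    def gpu_time(self, n: int, transfer: bool = True) -> float:
        return self.launch_overhead_s + n * self.gpu_cost_per_elem(transfer)

    def cpu_time(self, n: int) -> float:
        return n / self.cpu_elems_per_s

    def break_even(self, transfer: bool = True) -> Optional[int]:
        """Smallest n for which the GPU is faster, or None if it never is."""
        gain = 1.0 / self.cpu_elems_per_s - self.gpu_cost_per_elem(transfer)
        if gain <= 0:
            return None
        return math.ceil(self.launch_overhead_s / gain)


if __name__ == "__main__":
    model = OffloadModel(
        launch_overhead_s=50e-6,
        bus_bytes_per_s=8e9,
        gpu_elems_per_s=1e10,
        cpu_elems_per_s=1e9,
    )
    print("break-even with transfers:   ", model.break_even(transfer=True))
    print("break-even without transfers:", model.break_even(transfer=False))
    for n in (1_000, 100_000, 10_000_000):
        print(n, model.cpu_time(n), model.gpu_time(n), model.gpu_time(n, transfer=False))
```

With these assumed numbers the GPU never beats the CPU once the copies are counted, and even without them it only pays off above a certain element count. Real measurements will differ per device and per kernel.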


giuseppebilotta, 7 months ago to random

This year I managed to squeeze the introduction to #OpenCL in at the end of the lesson before we started writing code, so we managed to make the first complete OpenCL example in one lesson (it usually takes us two lessons). The code compiled and ran correctly on the first go. Everybody was surprised (including me!); the students are third years, yet they are already familiar with the principle that if it seems to work, there's a subtle bug that will rear its head at the worst of times.


Brett_E_Carlock, 7 months ago to random

Do I have anyone in my wider network with skills in programming CUDA, SYCL, and OpenCL?

We want to determine feasibility of migrating CUDA-only code to SYCL (via SYCLomatic?): OpenCV feature detection/extraction modules (SIFT, HAGOG, ORB, AKAZE).

The intent is to upstream all feasible work.

This, hopefully, should stand to benefit everyone instead of being limited to NVIDIA.

Currently in info gathering/people connecting phase, not yet funded & ready to go.

#CUDA #SYCL #OpenCL #OpenCV


karolherbst, 7 months ago to random

Does anybody know of any #opencl application making use of the provisional cl_khr_external_memory extension(s)?


igb, 8 months ago to Java

Got #TornadoVM installed and running on my local Linux laptop, a #Lenovo 14s ThinkPad with a 10th-generation Intel® Core™ CPU and an integrated Intel® UHD graphics card.

Took a bit of futzing around with runtime dependencies, but the required packages (for Ubuntu Jammy) were:

intel-basekit
intel-oneapi-runtime-opencl
intel-opencl-icd

I also had to explicitly set

LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/intel/oneapi/compiler/2023.2.1/linux/compiler/lib/intel64_lin

#Java #Graal #JIT #ML #OpenCL


tk, 9 months ago to photography

Fun times with #Darktable and #OpenCL. :blobfoxsad: #photography #photog

Darktable is unable to use ROCm and crashes X with rusticl.


pekka, 10 months ago to random

chipStar 1.0 released! It's a tool for compiling and running CUDA/HIP applications on SPIR-V-supported OpenCL or LevelZero platforms. v1.0 can already run various HPC applications correctly. See: https://github.com/CHIP-SPV/chipStar/releases/tag/v1.0
#opencl #levelzero #spirv #cuda #hip


toxi, 10 months ago (edited 10 months ago) to ZigLang

As I've been updating the build files for my various #ziglang projects & templates, I also learned that quite a few of them have to be overhauled/refactored due to syntax changes and a stricter compiler. One example is this #WASM #voxel #renderer from 1.5 years ago which doesn't build anymore without major code updates, but the old build still works:

https://demo.thi.ng/zig/voxel-trace/

Reload for random views. Press x to export current frame. The renderer is incremental (never finishes) and slowly reduces pixel size from 8 down to 1. It would be much faster, but I had some ideas for creating a more stylistic output and in this current state it only renders a fixed area per frame...

The 2-bit 512^3 voxel model was generated with a custom fork of @R4_Unit's voxel automata... 🥰

Other renders & process on my old Twitter:

https://twitter.com/search?q=from%3A%40toxi+voxel+ziglang

Ps. This renderer is heavily using this #SIMD vector library:

https://github.com/thi-ng/zig-thing/tree/main/vectors

...and is a rewrite of my 2013 hybrid #OpenCL #Clojure voxel renderer:

https://github.com/thi-ng/raymarchcl

#GenerativeArt #ThingUmbrella


kandid, 11 months ago to random

Even more particle traces.

#openFrameworks #OpenCL

Colorful traces appear on a dark background and fade away again.


kandid, 11 months ago to random

Light trails.

#openFrameworks #OpenCL

Colored stripes moving slowly, red tones predominate. The colors originate from brightly colored film material.


sri, 11 months ago to rust

For those who have waited eagerly for the recording of our #oneapi meetup with @karolherbst - here it is! https://youtu.be/KUze0JbPSy8
#sycl #opencl #rustlang - if you are interested in joining our oneAPI meetup - feel free to subscribe here - https://www.meetup.com/oneapi-community-us/

Next time we will be meeting with Stephano Cetola who will be talking about RISC-V, oneAPI, and other things.


kandid, 11 months ago to random

Losing a somewhat stable state.

#openFrameworks #OpenCL

White figures move with steady speed. Then the uniform speed gets lost.


ProjectPhysX, 11 months ago to hpc

My ChEESE CoE webinar talk on #OpenCL #GPU programming for #HPC applications is now uploaded to my YouTube channel and provided with timestamps. Enjoy! 🖖😎🧀
👉 https://youtu.be/w4HEwdpdTns


kandid, 11 months ago to random

The trace of thousands of simulated particles. Flowing. Deflected by a movie of a rotating thing.

#openFrameworks #OpenCL

Short moving traces in gray scale. The thing rotating in the background appears vaguely.


kandid, 11 months ago to random

The stream of particles is deflected by the color of the background.

#openFrameworks #openCL

White particles flowing mostly horizontally, except in the area where the movie is visible. The colors in the background deflect the path of these particles.


karolherbst, 11 months ago to random

was bored today and the heat is wearing me down and everything, but at least I was able to work on optional image format support for #Rusticl

I've also added sRGB support, which is an #OpenCL 2.0 feature

The only formats not handled yet are packed formats (like CL_UNORM_SHORT_565), CL_DEPTH and CL_RGBx

CL_Rx and CL_RGx will probably stay unsupported forever unless there is a strong reason to add them

https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23714


jbzfn, 1 year ago to linux

「 Mesa’s own OpenCL implementation Rusticl is now officially supported for AMD Radeon graphics cards. A bunch of Asahi fixes are present as well in Mesa 23.1, which also brings various updates to the PanVK, LLVMpipe, RadeonSI, and Zink drivers 」
— @9to5linux

#Linux #Mesa #OpenCL #Vulkan
https://9to5linux.com/mesa-23-1-linux-graphics-stack-arrives-with-many-radv-improvements-more


penguin42, 1 year ago to random

However one #opencl weirdness is that for Radeon there are two OpenCL sets; the one from #Mesa and the ROCm set from AMD - and they're giving me wildly different behaviours on different data sets. For some data sets the Mesa one is much faster, but for others the ROCm set is much faster. That's going to be 'interesting'


penguin42, 1 year ago to random

My #OpenCL stuff is getting somewhere; 'profiling' is fairly nice; you set events and can read the times at the events - so at least I can tell which of the 3 kernels I'm running is slow. With some vectorisation I'm up from ~200lps in the slow case to about 1100lps - which means it's no longer painful.


penguin42, 1 year ago to random

my reason for looking in #mesa was that I was trying to use profiling in some #opencl code and while it worked with AMD's ROCm user code, it failed on Mesa with a 'PROFILING_INFO_NOT_AVAILABLE' and it looks like there are a whole bunch of reasons that can happen - in the end it turned out to be because the event was still in the queue, but I only figured that out by adding debug prints into Mesa.
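For readers hitting the same error: as far as I know, the OpenCL specification makes `clGetEventProfilingInfo` return `CL_PROFILING_INFO_NOT_AVAILABLE` when the queue was not created with `CL_QUEUE_PROFILING_ENABLE`, when the command's execution status is not yet `CL_COMPLETE`, or when the event is a user event. The sketch below is only a toy model of that rule in plain Python. `ToyEvent` and its methods are hypothetical and are not part of Mesa, ROCm or any OpenCL binding.

```python
from enum import Enum


class Status(Enum):
    """Command execution states, using the OpenCL numeric values."""
    COMPLETE = 0
    RUNNING = 1
    SUBMITTED = 2
    QUEUED = 3


class ProfilingInfoNotAvailable(Exception):
    """Stand-in for the CL_PROFILING_INFO_NOT_AVAILABLE error code."""


class ToyEvent:
    """Hypothetical model of an event on a (possibly) profiling-enabled queue."""

    def __init__(self, profiling_enabled: bool = True):
        self.profiling_enabled = profiling_enabled
        self.status = Status.QUEUED
        self.start_ns = None
        self.end_ns = None

    def complete(self, start_ns: int, end_ns: int) -> None:
        # Stand-in for the device finishing the command; in real host code
        # you would wait on the event (or finish the queue) at this point.
        self.start_ns = start_ns
        self.end_ns = end_ns
        self.status = Status.COMPLETE

    def elapsed_ns(self) -> int:
        # Timestamps only become readable once the command has completed
        # and only if the queue was created with profiling enabled.
        if not self.profiling_enabled or self.status is not Status.COMPLETE:
            raise ProfilingInfoNotAvailable(self.status.name)
        return self.end_ns - self.start_ns


if __name__ == "__main__":
    evt = ToyEvent()
    try:
        evt.elapsed_ns()  # queried too early: the event is still queued
    except ProfilingInfoNotAvailable as err:
        print("not available yet, status:", err)
    evt.complete(start_ns=1_000, end_ns=4_500)
    print("elapsed ns:", evt.elapsed_ns())
```

In real host code the equivalent fix is to wait for the event before asking for its timestamps.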


ProjectPhysX, 1 year ago to random German

5 years ago I had this wild idea to write my own #CFD software from scratch in #OpenCL. I wanted to know how fluid simulations work, and to make them ridiculously fast on any #GPU. Today, #FluidX3D has ⭐1.4k stars on #GitHub: https://github.com/ProjectPhysX/FluidX3D

how it started: how it's going:


penguin42, 1 year ago to random

Hmm my #opencl code is running about 4x slower on #fedora 38 than f37 (on Radeon) - not figured out which component though; opencl is hard enough to profile at the best of times; and there are a lot of components. Ideas welcome.
