#rocm


#AMD splits #ROCm toolkit into two parts – ROCm #AMDGPU drivers get their own branch under Instinct #datacenter #GPU moniker
The new #datacenter Instinct driver is a renamed version of the #Linux AMDGPU driver packages that are already distributed and documented with ROCm. Previously, everything related to ROCm (including the amdgpu driver) existed as part of the ROCm software stack.
tomshardware.com/pc-components

Continued thread

Then your Docker Compose service should have:

```
image-name:
  build:
    context: .
  devices:
    - /dev/dri
    - /dev/kfd
  group_add:
    - video
  shm_size: 4G
  environment:
    - PYTORCH_HIP_ALLOC_CONF=garbage_collection_threshold:0.9,max_split_size_mb:512
    - PYTORCH_CUDA_ALLOC_CONF=garbage_collection_threshold:0.9,max_split_size_mb:512
    - HSA_OVERRIDE_GFX_VERSION=11.0.0
```
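
Once the container is up, a quick way to confirm the override took and PyTorch can actually see the iGPU (a minimal check of my own, not from the original thread):

```
import torch

# ROCm builds of PyTorch expose HIP devices through the torch.cuda API.
print(torch.cuda.is_available())      # True if /dev/kfd is visible and the override worked
print(torch.version.hip)              # HIP runtime version string; None on CUDA builds
print(torch.cuda.get_device_name(0))  # should report the gfx1103 iGPU
```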

#rocm #gfx1103 #780M

So, good news. ROCm 6.3.4 and PyTorch 2.4.0 seem stable enough with gfx1103 if I use the HSA override for 11.0.0, with the latest firmware blobs and kernel 6.13.10 on Fedora 41.

In your Dockerfile, build your AI app from:
```
FROM rocm/pytorch:rocm6.3.4_ubuntu24.04_py3.12_pytorch_release_2.4.0
```
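
If you're on bare metal instead of Docker, the same override has to be in the environment before the HIP runtime initializes. A minimal sketch under that assumption (the override value comes from the post; everything else is illustrative):

```
import os

# Must be set before torch loads the HIP/HSA runtime:
# the override makes gfx1103 (780M) masquerade as the supported gfx1100.
os.environ.setdefault("HSA_OVERRIDE_GFX_VERSION", "11.0.0")

import torch

x = torch.randn(1024, 1024, device="cuda")  # "cuda" targets HIP on ROCm builds
print((x @ x).sum().item())                 # small matmul as a freeze/stability smoke test
```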

Been fighting the whole day trying to get ROCm to play nice with the 780M and PyTorch. Using the latest #rocm, my laptop just freezes with gfx1103, whether I set the HSA override to 11.0.0 or to 10.3.0 :blobcatknife:

#amd really needs to fix this crap for their GPUs. Using Docker and their provided ROCm images. I know, 780M is not supported. But c’mon, ALL Nvidia cards can run #CUDA just fine. #rant

The B-17 bomber was amazing and helped win WWII. I flew on one in 2002 as a tourist - I have family members who were ball turret gunners - a bad place to be.

This video was shot on Hi-8, and thankfully I digitized it (at 720x480) back in the day. Now I've upscaled it with local AI (to 1408x954) and the improvement is astounding.

Sadly, this actual B-17 crashed in 2019: en.wikipedia.org/wiki/2019_Boe

#localai
#stablediffusion
#rocm
#amd
#b17
#flyingfortress

Continued thread

Even now, Thrust as a dependency is one of the main reasons why we have a #CUDA backend, a #HIP / #ROCm backend and a pure #CPU backend in #GPUSPH, but not a #SYCL or #OneAPI backend (which would allow us to extend hardware support to #Intel GPUs). <doi.org/10.1002/cpe.8313>

This is also one of the reasons why we implemented our own #BLAS routines when we introduced the semi-implicit integrator. A side effect of this choice is that it allowed us to develop the improved #BiCGSTAB that I've had the opportunity to mention before <doi.org/10.1016/j.jcp.2022.111>. Sometimes I do wonder if it would be appropriate to “excorporate” it into its own library for general use, since it's something that would benefit others. OTOH, this one was developed specifically for GPUSPH and is tightly integrated with the rest of it (including its support for multi-GPU), and refactoring it into a standalone library like cuBLAS is

a. too much effort
b. probably not worth it.

Again, following @eniko's original thread: it's really not that hard to roll your own, and probably less time-consuming than trying to wrangle your way through an API that may or may not fit your needs.
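
To make the roll-your-own point concrete, here's a textbook unpreconditioned BiCGSTAB in plain Python/NumPy. This is my illustrative sketch of the standard algorithm, not GPUSPH's improved multi-GPU variant:

```
import numpy as np

def bicgstab(A, b, x0=None, tol=1e-8, max_iter=1000):
    """Textbook unpreconditioned BiCGSTAB for solving A @ x = b."""
    n = b.shape[0]
    x = np.zeros(n) if x0 is None else x0.astype(float).copy()
    r = b - A @ x
    r_hat = r.copy()          # fixed shadow residual
    rho = alpha = omega = 1.0
    v = np.zeros(n)
    p = np.zeros(n)
    for _ in range(max_iter):
        rho_new = r_hat @ r
        beta = (rho_new / rho) * (alpha / omega)
        rho = rho_new
        p = r + beta * (p - omega * v)
        v = A @ p
        alpha = rho / (r_hat @ v)
        s = r - alpha * v     # intermediate residual
        t = A @ s
        omega = (t @ s) / (t @ t)
        x = x + alpha * p + omega * s
        r = s - omega * t
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            break
    return x

# Tiny usage example on a diagonally dominant system:
rng = np.random.default_rng(0)
A = rng.standard_normal((100, 100)) + 100.0 * np.eye(100)
b = rng.standard_normal(100)
x = bicgstab(A, b)
print(np.linalg.norm(A @ x - b))  # residual should be tiny
```

Even the naive version is only ~30 lines; as the thread notes, the hard part in GPUSPH's case was the tight multi-GPU integration, not the algorithm itself.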

6/