AMD ROCm does not support the AMD Ryzen AI 300 Series GPUs

I have been experimenting to ROCm on a FW16 7840HS.

The most recent experiments have been with TheRock. It used ROCm version 7.x

I was able to compile ROCm (The TheROCK version) for my particular GPU.

I had to make sure all other ROCm was removed from my system, otherwise the compile fails.

My tests have been with complex number matrix multiply using ROCBLAS.

I get similar failures that you get.

I am still trying to debug the GPU code. I have not found any good tools yet.

I did find something out that might help you.

If I hipMalloc a 50000 x 50000 matrix

and then use cgemm to multiple the 50000 x 50000 matrix, it crashes, similar to what you see.

If I hipMalloc a 50512 x 50512 matrix

and then use cgemm to multiple the 50000 x 50000 matrix, it works.

So, I think there is a buffer overflow bug in ROCM.

I mention this, because if you also over hipMalloc like me, at least your application might finish and work. Seems an OK workaround for now.

1 Like