gpt-oss 120b large context stalls during llama.cpp checkpoints

I’m telling you the kernel package you have installed doesn’t have the fix for long-running compute jobs, which as far as I can tell is the reason you raised this issue.

If you’re using Ubuntu, you need the OEM 6.14 kernel, or 6.17.2 or later.
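
If it helps, here is a rough sketch for checking which kernel you are actually running against those two cases. The helper names (`parse_release`, `kernel_looks_fixed`) are made up for illustration, and it assumes Ubuntu OEM kernels carry an `-oem` suffix in their release string (something like `6.14.0-1005-oem`). Note that Ubuntu's own kernels usually report `x.y.0` in the release string regardless of the upstream point release, so treat a "no" from this as a prompt to check the package changelog, not a final answer.

```python
import platform
import re


def parse_release(release):
    """Extract (major, minor, patch) from a kernel release string."""
    m = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", release)
    if m is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    # A missing patch component is treated as 0.
    return tuple(int(g or 0) for g in m.groups())


def kernel_looks_fixed(release):
    """Heuristic: True for 6.17.2 or later, or a 6.14 OEM kernel."""
    version = parse_release(release)
    if version >= (6, 17, 2):
        return True
    # Assumption: Ubuntu OEM kernels have "-oem" in the release string.
    return version[:2] == (6, 14) and "-oem" in release


if __name__ == "__main__":
    running = platform.release()  # same string as `uname -r` on Linux
    verdict = "looks OK" if kernel_looks_fixed(running) else "may lack the fix"
    print(f"{running}: {verdict}")
```

Run it on the machine that stalls. If it reports a generic 6.14 or anything below 6.17.2, that matches what I described above.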