An NPU is nothing more than a section of the CPU dedicated to being very fast at multiplying matrices, which is a mathematical operation used in a lot of AI models.
Notably the iGPU is actually faster than the NPU at performing matrix multiplication (at least in 7040 series). The NPU is purely about doing it without consuming a ton of battery.