NexaML: NPU- Aware Inference Engine for Fast, Multimodal On-Device AI