
NexaQuant: Llama.cpp-Compatible Model Compression with 100%+ Accuracy Recovery

Dec 31, 2024

NexaQuant makes models 3x lighter while recovering 100%+ of the original accuracy

TL;DR

NexaQuant makes your AI use cases run 3x faster and uses 4x less energy and storage on your devices, with 100%+ accuracy recovery. It supports multimodality (text, audio, vision, image generation, and video), making diverse AI use cases more context-aware, running them more efficiently on device, and unlocking cloud-level AI use cases on power-constrained devices without compromise.

  • As a bonus, it is compatible with llama.cpp and supports diverse hardware platforms, from mobile to desktop.

Why Compress a Language Model?

2024 marks a breakthrough for small-scale language models, with Gemma, Llama, and Qwen model families achieving remarkable capabilities under 10B parameters. However, even smaller models like Llama 3.2 1B and 3B still require significant computational resources, with storage requirements of 2.30GB and 6.00GB, and RAM usage of 3.08GB and 7.11GB respectively. These requirements often exceed the capabilities of typical consumer devices for efficient on-device deployment.

This creates a practical dilemma. Most consumer devices, from smartphones to laptops, struggle to run these models effectively. While cloud deployment offers a potential solution, it comes with significant drawbacks: increased latency, dependency on internet connectivity, substantial scaling costs, and critical privacy concerns when processing sensitive data. To unlock the full potential of AI on everyday devices, we need a better approach to model compression.

Balancing Performance Against Accuracy Remains a Key Challenge

Standard model quantization offers a path forward - using 4-bit weights-only quantization, Llama 3.2 1B can be compressed to just 0.73GB storage and requires 1.38GB RAM. This enables much more efficient and manageable local inference on consumer hardware.
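To make the arithmetic concrete, here is a minimal NumPy sketch of generic symmetric, per-group 4-bit weight quantization. It is illustrative only - llama.cpp's actual Q4_0 format packs weights into fixed-size blocks with its own layout - but the group size of 128 mirrors the gs=128 setting used in the benchmark tables below.

```python
import numpy as np

def quantize_q4_groupwise(weights: np.ndarray, group_size: int = 128):
    """Symmetric 4-bit weights-only quantization, one scale per group.

    Illustrative sketch of the arithmetic only, not llama.cpp's Q4_0 layout.
    """
    flat = weights.astype(np.float32).reshape(-1, group_size)
    # One scale per group of 128 weights, chosen so the absmax maps to the
    # edge of the signed 4-bit range [-8, 7].
    scales = np.abs(flat).max(axis=1, keepdims=True) / 7.0
    scales = np.where(scales == 0, 1.0, scales)
    q = np.clip(np.round(flat / scales), -8, 7).astype(np.int8)
    return q, scales

def dequantize(q: np.ndarray, scales: np.ndarray) -> np.ndarray:
    return (q.astype(np.float32) * scales).reshape(-1)

w = np.random.randn(4096 * 128).astype(np.float32)
q, s = quantize_q4_groupwise(w)
w_hat = dequantize(q, s)
print("mean abs error:", np.abs(w - w_hat).mean())
# 4 bits per weight plus one fp16 scale per 128 weights ~= 4.125 bits,
# roughly a 3.9x reduction versus 16-bit weights.
```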

However, current compression methods significantly degrade model reliability: our experiments show Llama 3.2 1B drops to 86.45% of its original FP16 performance after Q4_0 quantization. This degradation affects critical tasks such as document summarization and accurate question answering.

Introducing NexaQuant

How NexaQuant Works

NexaQuant represents a breakthrough in model compression technology, delivering an advanced optimization pipeline that maintains model intelligence while significantly reducing computational requirements.

At its core, NexaQuant introduces a novel quantization approach specifically designed for transformer-based neural networks. The key innovation lies in its robust handling of outlier values during the quantization process. By incorporating in-house calibration data during compression, NexaQuant optimizes model performance for production environments.
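As a conceptual illustration only (not the actual NexaQuant pipeline, which is not described in this post), calibration-aware quantization is commonly implemented by searching, per weight channel, for a clipping threshold that minimizes reconstruction error on calibration activations, which keeps outliers from dominating the quantization scale. A minimal NumPy sketch of that generic idea:

```python
import numpy as np

def search_clip(w_col: np.ndarray, x_calib: np.ndarray, n_grid: int = 20) -> float:
    """Pick a clipping threshold for one weight column by minimizing output
    error on calibration activations.

    Generic calibration-aware outlier handling; illustrative only.
    """
    ref = x_calib @ w_col                       # full-precision output
    absmax = np.abs(w_col).max()
    best_err, best_clip = np.inf, absmax
    for frac in np.linspace(0.5, 1.0, n_grid):  # try progressively tighter clips
        clip = absmax * frac
        scale = clip / 7.0                      # signed 4-bit range [-8, 7]
        q = np.clip(np.round(np.clip(w_col, -clip, clip) / scale), -8, 7)
        err = np.square(x_calib @ (q * scale) - ref).mean()
        if err < best_err:
            best_err, best_clip = err, clip
    return best_clip

rng = np.random.default_rng(0)
w_col = rng.normal(size=4096)
w_col[:4] *= 20                                 # inject a few outlier weights
x_calib = rng.normal(size=(256, 4096))          # stand-in calibration batch
print("chosen clipping threshold:", search_clip(w_col, x_calib))
```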

Our benchmarks demonstrate NexaQuant's effectiveness: when applied to Llama 3.1/3.2 models (1B, 3B, and 8B variants), it achieves 100% or more of the original BF16 model performance across standard evaluation metrics. This slight improvement over the baseline is consistently reproducible across our test suite.

The technology supports any transformer-based model, including multimodal systems that process vision and audio inputs. While NexaQuant can scale to handle models of any size, we've optimized it specifically for models under 10 billion parameters, a range we've identified as the optimal balance between computational efficiency and practical deployment requirements.

Model Compression Benchmark & Examples

Text Models

3X Lighter Without Accuracy Compromise

Our benchmarks demonstrate NexaQuant's breakthrough in efficiency-performance balance across a comprehensive suite of evaluations. We test models on diverse capabilities including mathematical reasoning (GSM8K), complex instruction following (IFEVAL), reading comprehension (OpenBookQA), and multi-step logical reasoning (ARC).

Llama 3.2 3B Instruct (HF)

| Benchmark | BF16 | Q4_0 (gs=128) | Nexa Q4_0 (gs=128) | Q4_0 retention (% of BF16) | NexaQuant restoration (% of BF16) | Improvement over Q4_0 (%) |
| --- | --- | --- | --- | --- | --- | --- |
| IFEVAL | 60.82 | 57.62 | 62.77 | 94.74% | 103.21% | 8.94% |
| MMLU (5-shot) | 60.77 | 57.63 | 59.07 | 94.83% | 97.20% | 2.5% |
| HellaSwag | 53.22 | 52.62 | 53.57 | 98.87% | 100.66% | 1.81% |
| ARC Challenge | 43.43 | 40.87 | 42.24 | 94.11% | 97.26% | 3.35% |
| ARC Easy | 75.34 | 71.72 | 74.75 | 95.20% | 99.22% | 4.22% |
| PIQA | 76.82 | 76.12 | 77.2 | 99.35% | 100.76% | 1.42% |
| OpenBookQA | 28.8 | 29 | 29.2 | 100.69% | 101.39% | 0.69% |
| GSM8K | 63.92 | 58.99 | 64.75 | 92.29% | 101.30% | 9.76% |
| Total | | | | 96.26% | 100.12% | 4.09% |
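The percentage columns are simple ratios of the benchmark scores. For example, the IFEVAL row of the 3B table above works out as follows:

```python
# IFEVAL scores for Llama 3.2 3B, taken from the table above
bf16, q4_0, nexa_q4_0 = 60.82, 57.62, 62.77

retained_after_q4 = q4_0 / bf16            # ~94.74% of BF16 accuracy kept by plain Q4_0
restoration = nexa_q4_0 / bf16             # ~103.21% of BF16 accuracy with NexaQuant
improvement_over_q4 = nexa_q4_0 / q4_0 - 1 # ~8.94% gain over plain Q4_0

print(f"{retained_after_q4:.2%}  {restoration:.2%}  {improvement_over_q4:.2%}")
```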

Llama 3.1 8B Instruct (HF)

| Benchmark | BF16 | Q4_0 (gs=128) | Nexa Q4_0 (gs=128) | Q4_0 retention (% of BF16) | NexaQuant restoration (% of BF16) | Improvement over Q4_0 (%) |
| --- | --- | --- | --- | --- | --- | --- |
| IFEVAL | 56.2 | 53.79 | 61.86 | 95.71% | 110.07% | 15.00% |
| MMLU (5-shot) | 68.2 | 66.02 | 65.06 | 96.80% | 95.40% | -1.45% |
| HellaSwag | 59.77 | 58.99 | 60.55 | 98.69% | 101.31% | 2.64% |
| ARC Challenge | 52.73 | 52.22 | 52.65 | 99.03% | 99.85% | 0.82% |
| ARC Easy | 81.06 | 81.19 | 82.07 | 100.16% | 101.25% | 1.08% |
| PIQA | 80.2 | 79.76 | 81.45 | 99.45% | 101.56% | 2.12% |
| OpenBookQA | 36 | 33.8 | 36 | 93.89% | 100.00% | 6.51% |
| GSM8K | 74.22 | 71.19 | 72.9 | 95.92% | 98.22% | 2.40% |
| Total | | | | 97.46% | 100.96% | 3.64% |

1B NexaQuant Model for IoT and Wearables

For Llama 3.2 1B, compared to the original model (FP16):

  • 68% smaller storage footprint: 2.30GB → 730MB
  • 55% reduced runtime memory: 3.08GB → 1.38GB

Performance metrics compared to standard Q4_0 quantization:

  • 101.75% accuracy restoration relative to the original FP16 model

NexaQuant unlocks sophisticated AI capabilities on resource-constrained devices while surpassing original model performance.

3B NexaQuant Model for Mobile Devices and Laptops

For Llama 3.2 3B, compared to the original model (FP16):

  • 73% smaller storage footprint: 6.00GB → 1.60GB
  • 66% reduced runtime memory: 7.11GB → 2.44GB

Performance metrics compared to standard Q4_0 quantization:

  • 100.12% accuracy restoration relative to the original FP16 model

This balance of size and capability enables advanced language features on mobile devices without sacrificing computational efficiency.

8B NexaQuant Model for Desktops, Robots and Automobiles

For Llama 3.1 8B, compared to the original model (BF16):

  • 71% smaller storage footprint: 15GB → 4.3GB
  • 67% reduced runtime memory: 15.52GB → 5.09GB

Performance metrics compared to standard Q4_0 quantization:

  • 100.96% accuracy restoration relative to the original BF16 model

NexaQuant makes enterprise-grade language models accessible on standard computing hardware, enabling advanced AI applications without specialized infrastructure.
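The storage and memory reductions quoted for the 1B, 3B, and 8B models are straightforward before/after ratios; a quick check in Python:

```python
# (label, original FP16/BF16 size in GB, NexaQuant size in GB) from the sections above
sizes = [
    ("Llama 3.2 1B storage", 2.30, 0.73),
    ("Llama 3.2 3B storage", 6.00, 1.60),
    ("Llama 3.1 8B storage", 15.0, 4.30),
    ("Llama 3.1 8B runtime memory", 15.52, 5.09),
]

for name, before, after in sizes:
    print(f"{name}: {before:.2f} GB -> {after:.2f} GB "
          f"({1 - after / before:.0%} smaller)")
```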

Vision Models

Multimodal models are typically larger than text-only models, requiring both more parameters and additional components such as visual/audio projectors. This leads to substantially higher storage requirements and runtime memory consumption. NexaQuant works seamlessly with multimodal models such as vision-language models (VLMs), as demonstrated across benchmarks in visual question answering (DocVQA), multimodal reasoning (MMBench), and cross-modal understanding (MMMU).

For Qwen2-VL-2B (visual projector included), compared to the original model (BF16):

  • 49% smaller storage footprint: 4.42GB → 2.27GB
  • 33% reduced runtime memory: 4.40GB → 2.94GB

Performance metrics compared to standard Q4_0 quantization:

  • 97.4% overall accuracy restoration

Examples

In these challenging visual document QA tasks, such as reading invoice numbers from complex tables and extracting population figures from historical documents, NexaQuant-compressed Qwen2-VL-2B maintains perfect accuracy while even the FP16 model struggles with precise number extraction.

Real world VQA cases for vision model with NexaQuant

Audio Language Models

Voice-based assistants are highly sought after, requiring the ability to understand tone, emotion, and nuanced sound for personalized interactions. Instant, real-time feedback is essential, but current audio models are too slow and traditional compression sacrifices accuracy. NexaQuant transforms audio language models, enabling real-time conversations with instant feedback while maintaining advanced capabilities to understand tone, emotion, and rich audio context.

For Qwen2-Audio-7.8B, compared to the original model (BF16), the NexaQuant-compressed model achieves:

  • 29% of the original file size: 14.5 GB → 4.2 GB
  • 25% of the runtime memory needed: 16.80 GB → 4.2 GB
  • 95% overall accuracy restoration compared to the full-sized model
  • 3x faster inference speed

Example 1 (audio clip: audio_115):

NexaQuant

The subject of the sentence is 'many people'

bnb load_in_4bit

The subject of the sentence is 'many people watch TV shows'

Example 2 (audio clip: audio_3):

NexaQuant

The area of a triangle is given by the formula: A = (p * q) / 2, where p and q are the lengths of the two sides of the triangle. In this case, p = 15 cm and q = 30 cm. Substituting these values into the formula gives:

A = (15 * 30) / 2
A = 450 / 2
A = 225 cm^2

Therefore, the area of the triangle is 225 square centimeters.

bnb load_in_4bit

The area of a triangle is given by the formula (1/2) * b * h, where b is the base and h is the height. In this case, the base is 15 units and the height is 30 units, so the area of the triangle is:

(1/2) * 15 * 30 = 450 square units

Therefore, the area of the triangle is 450 square units.

Video Understanding Models

Understanding video unlocks vast potential for vision-based devices, smart cameras, and screen-aware AI. Due to significant privacy concerns, on-device video processing is crucial for driving adoption of more context-aware AI use cases. However, current video language models are too slow for practical use. NexaQuant accelerates video language models by 4x while preserving high accuracy, enabling faster and more secure on-device AI applications.

Compared to the original model (BF16), the NexaQuant-compressed model achieves:

  • 84% of the original file size: 1.79 GB → 1.5 GB
  • 50% of the runtime memory needed: 5.36 GB → 2.72 GB

Performance metrics compared to standard Q4_0 quantization:

  • 3x faster inference speed

Example 1:

Prompt:

Describe what the person is doing in the video in detail.

NexaQuant

The person is walking on a wooden deck, moving their feet in a rhythmic motion. The background features a railing and a body of water, with a bridge and some buildings visible in the distance.

Image Generation Model

Image generation models hold immense potential for creative applications, design tools, and personalized content creation. However, their high computational demands make them slow and less practical for real-time use. On-device processing is key to ensuring privacy and cost-efficiency. With NexaQuant, we can accelerate image generation models by 4x while maintaining high-quality outputs, enabling faster, more secure, and privacy-friendly creative experiences.

For FLUX.1-dev, compared to the original model (BF16), the NexaQuant-compressed model achieves:

  • 27.9% of the original file size: 23.8 GB → 6.64 GB
  • 36% of the runtime memory needed: 34.66 GB → 12.61 GB

Performance metrics compared to standard Q4_0 quantization:

  • 9.6x faster inference speed

Examples with FLUX.1-dev

Prompt 1:

On Mars, an astronaut holds a board that reads NEXA AI

Output:

Example 1 with flux1.dev

Prompt 2:

A dragon perched on a cliff overlooking a futuristic city, with glowing neon lights and flying cars.

Output:

Example 2 flux1.dev

Prompt 3:

A close-up shot of a vintage leather-bound book lying open on a rustic wooden desk, with a quill pen and ink bottle beside it.

Output:

Example 3 flux1.dev

Edge Inference on Any Device and Platform

With Nexa SDK, NexaQuant is compatible with popular local inference frameworks like llama.cpp, enabling efficient deployment across diverse computing platforms. We conducted comprehensive performance testing using Qwen2.5-1.5B-Instruct across multiple consumer devices and platforms.
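Because NexaQuant output is compatible with llama.cpp, a compressed model can be loaded like any other GGUF file. The sketch below uses the community llama-cpp-python bindings with a placeholder file name; the actual file, context size, and thread count depend on your export and device.

```python
from llama_cpp import Llama

# Placeholder path: point this at your NexaQuant-compressed GGUF file.
llm = Llama(
    model_path="qwen2.5-1.5b-instruct-nexaquant-q4_0.gguf",
    n_ctx=2048,      # context window
    n_threads=8,     # CPU threads; tune for your device
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the benefits of on-device AI."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```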

Time to First Token (TTFT)

  • iPhone: 0.48 seconds
  • Samsung S24 Ultra: 0.25 seconds
  • HP Laptop with AMD Ryzen AI 9 HX370 CPU: 1.138 seconds
  • MacBook Pro M4 Metal: 0.232 seconds

Decoding (Inference) Speed

  • iPhone: 29.78 tok/s
  • Samsung S24 Ultra: 18.45 tok/s
  • HP Laptop with AMD Ryzen AI 9 HX370 CPU: 51.78 tok/s
  • MacBook Pro M4 Metal: 148.19 tok/s
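Time to first token and decoding speed of the kind reported above can be measured with simple wall-clock timing around a streaming generation call. This is a generic sketch, not our benchmarking harness, reusing the placeholder model file from the previous example.

```python
import time
from llama_cpp import Llama

# Placeholder model file; see the loading example above.
llm = Llama(model_path="qwen2.5-1.5b-instruct-nexaquant-q4_0.gguf", n_ctx=2048)

prompt = "Explain why model quantization reduces memory usage."
start = time.perf_counter()
first_token_at = None
n_tokens = 0

# Stream the completion so the arrival time of the first token can be recorded;
# each streamed chunk corresponds to roughly one generated token.
for chunk in llm(prompt, max_tokens=128, stream=True):
    if first_token_at is None:
        first_token_at = time.perf_counter()
    n_tokens += 1
end = time.perf_counter()

print(f"TTFT: {first_token_at - start:.3f} s")
print(f"Decode speed: {n_tokens / max(end - first_token_at, 1e-9):.1f} tok/s")
```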

These performance metrics demonstrate that tiny but mighty AI is now deployable across existing software and hardware infrastructure. With sub-second startup and robust processing speeds, organizations and developers can integrate multimodal language models directly into their applications - from mobile apps to desktop software - enabling real-time AI capabilities throughout their technology stack.

Unlock More Business Value with Domain-Adapted AI

Smaller, efficient models are ideal for domain-specific fine-tuning, enabling organizations to create tailored AI solutions with their proprietary data.

NexaQuant's compression pipeline is specifically designed to work with custom data, not only preserving but often enhancing performance in specialized domains while reducing hallucination risks.

We're launching an early access program for businesses and developers to explore these capabilities. Contact us to bring powerful, accurate, and efficient AI directly to your edge devices and applications.

Contact Us to Get Started

Looking to optimize Gen AI on your edge devices? Connect with us to explore how NexaQuant can transform your on-device AI with high speed and accuracy.

Contact our team today to access the tools, models, and support to make on-device Gen AI your competitive edge.

Fill out the NexaQuant early access program form
Or, schedule a call with us for a FREE POC 👇
