
Nexa Blog

NexaQuant: Llama.cpp-Compatible Model Compression with 100%+ Accuracy Recovery

Works with both text and multimodal models and can be deployed on any device

Developer
Model
Nexa AI 2024 Year Review

News

Nexa AI's 2024 Milestones and Highlights at a Glance

Visit AMD and Nexa AI at CES 2025: Transforming On-Device AI with Multimodal Capabilities

News

Transforming On-Device AI with Multimodal Capabilities

OmniAudio-2.6B: World's Fastest Audio Language Model for Edge Deployment

Model
Research

Compact model with text and audio input, with performance and size optimized for edge devices

For The First Time, You Can Run Qwen2-Audio On Your Device

Developer
News

Run Qwen2-Audio on edge devices with Nexa SDK

AMD: Efficient Local RAG for Document Intelligence

Success Story

End-to-end Local RAG System Powered by AMD Hardware

OmniVision-968M: World's Smallest Vision Language Model

Model
Research

Pocket-size multimodal model with 9x token reduction for on-device deployment

Octopus v3

Model

Compact (Sub-Billion) Multimodal Action Model for On-Device AI Agents

Squid

Model

Revolutionizing On-Device Language Models for Long Contexts

Octopus v2

Model

On-device 0.5B LLM with voice/text input and action output, outperforming GPT-4 in function calling

Octo-planner

Model

A 3.8B Model for AI Agent Action Planning with 98%+ Accuracy

Lenovo: Local Personalized AI Agent

Success Story

Voice-Enabled Personal AI Assistant Run Entirely on Lenovo AI PC

Vivaia: Agentic Workflow for Influencer Marketing

Success Story

AI Agent System Automates Influencer Discovery, Outreach and Campaign Management

PIN AI: Local-Cloud Hybrid Mobile LLM OS

Success Story

LLM OS Empowering Seamless, Private, and Lightning-fast Interaction Across Mobile Apps

Nexa AI x PIN AI

News

Nexa AI Partners with PIN AI to Bring Secure, On-Device AI to Mobile

What can you do with tiny (1B/3B) LLMs in a local RAG system?

Developer

A practical exploration of on-device AI for chatting with documents: from basic Q&A to specialized tasks with LoRA

Nexa SDK Tutorial: A Comprehensive On-Device AI Inference Toolkit

Developer

Run Multimodal AI Models on Your Local Devices

Join 8,000+ developers

Stay up to date with the best in on-device AI