Hacker News new | past | comments | ask | show | jobs | submit | from login
On-Device Gen AI Multimodal Benchmarks Across Devices (nexa.ai)
1 point by jinqueeny 1 day ago | past | discuss
NexaQuant: Llama.cpp-Compatible Model Compression with 100%+ Accuracy Recovery (nexa.ai)
3 points by BUFU 41 days ago | past | 1 comment
How to unify Gemma and Whisper to build a super fast local voice LLM (nexa.ai)
2 points by alanzhuly 58 days ago | past
OmniAudio-2.6B: Fastest Audio Language Model for Edge Deployment (nexa.ai)
2 points by BUFU 62 days ago | past | 1 comment
Run Qwen Audio Language Model on Local Devices for Voice Chat and Audio Analysis (nexa.ai)
4 points by BUFU 80 days ago | past
Omnivision-968M: Vision Language Model with 9x Tokens Reduction for Edge Devices (nexa.ai)
69 points by BUFU 3 months ago | past | 12 comments
Tiny (1B/3B) LLMs in a local RAG system (nexa.ai)
2 points by jinqueeny 3 months ago | past
What can you do with tiny (1B/3B) LLMs in a local RAG system? (nexa.ai)
3 points by jinqueeny 3 months ago | past
What you can do with tiny (1B/3B) LLMs in a local RAG system? (nexa.ai)
1 point by alanzhuly 3 months ago | past

Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: