Launches
Q
QuantaLLM
Intelligence. Powered by Your Handhelds. Anywhere
1 votes·1 comment·May 15, 2026
About
Run large language models entirely on your Android phone. Powered by llama.cpp with Hexagon NPU acceleration and ONNX Runtime — 100% offline, fully private inference on ARM64.