QuantaLLM

Intelligence. Powered by Your Handhelds. Anywhere.

1 vote·1 comment·May 15, 2026

About

Run large language models entirely on your Android phone. QuantaLLM is powered by llama.cpp with Hexagon NPU acceleration and by ONNX Runtime, so inference on ARM64 runs 100% offline and stays fully private.