Run large language models
on your phone.
No cloud, no internet, no tracking.
Private by construction. Fast by design.
The model weights live on your phone. Prompts, drafts and answers never touch a server — there isn't one to touch.
Fully offline
Airplane mode? Subway? Flight? No difference. Inference happens on your SoC, never on someone else's server.
Your roles, saved
Save reusable system prompts once and pick the right one for any model. Keep tone, role and output format consistent across every session.
Tune every knob
Temperature, Top-P, Top-K, Min-P, repetition penalty, seed, context size — each model remembers its own settings.
Save anywhere
Models are multi-gigabyte. Download them to any folder — internal storage, SD card, or an external drive. Move them between locations any time.
Pick a brain. Tap download. Chat.
Browse community-made models optimized for mobile — compact enough to fit on your phone, powerful enough to be useful.
Three taps from app open to first token.
Pick a model
Browse the curated list or paste your own model URL. Sizes range from 267 MB to 5.4 GB.
Download and resume
Reliable background downloads with notifications, speed, and ETA — with automatic resume when the connection drops.
Chat locally
Your conversation history stays on your device. Reasoning is shown inline. No accounts, no API keys.
Speaks your language.
Right out of the box.
The whole app is translated into 28 languages. On first launch, we'll pick a model that speaks yours.
What you actually
wanted to know.
Still stuck? Open an issue on GitHub or reach out directly.
Any model in your pocket.
Seconds to install. Minutes to download a model. Then you're done with the cloud.