v1.5.0 · System prompts + 28 languages

Run large language models
on your phone.

No cloud, no internet, no tracking.

MIT: free & open source · 25+ supported models · 0 network calls at inference
Gemma 4 E2B
3.41 GB
What is machine learning?
Thinking...
120 tokens · 1.5s · 80 tok/s
Runs open-weight models from
Qwen · Gemma · Meta Llama · Nemotron · Phi-4 · Granite · Mistral · DeepSeek · Liquid AI
Why on-device

Private by construction. Fast by design.

The model weights live on your phone. Prompts, drafts and answers never touch a server — there isn't one to touch.

01

Fully offline

Airplane mode? Subway? No difference. Inference happens on your phone's SoC, never on someone else's server.

02

Your roles, saved

Save reusable system prompts once and pick the right one for any model. Keep tone, role and output format consistent across every session.

03

Tune every knob

Temperature, Top-P, Top-K, Min-P, repetition penalty, seed, context size — each model remembers its own settings.
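For readers new to these knobs, here is a minimal Python sketch of how such sampling settings typically combine to pick the next token. It is an illustration, not the app's actual sampler: the function name `sample_token`, the default values, and the order of the filters are assumptions, and the repetition penalty follows one common convention. Context size is not shown because it limits how much text the model sees rather than how a token is chosen.

```python
import math
import random

def sample_token(logits, temperature=0.8, top_k=40, top_p=0.95, min_p=0.05,
                 repetition_penalty=1.1, recent_tokens=(), seed=None):
    """Pick one token id from raw logits (hypothetical helper, defaults assumed)."""
    # A fixed seed makes this single draw reproducible. A real generator
    # would keep one RNG for the whole response instead of one per call.
    rng = random.Random(seed)
    logits = list(logits)

    # Repetition penalty: make recently used tokens less likely.
    for t in set(recent_tokens):
        if logits[t] > 0:
            logits[t] /= repetition_penalty
        else:
            logits[t] *= repetition_penalty

    # Temperature (must be > 0): below 1 sharpens the distribution, above 1 flattens it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]
    total = sum(weights)
    probs = sorted(((w / total, i) for i, w in enumerate(weights)), reverse=True)

    # Top-K: keep only the k most likely tokens.
    probs = probs[:top_k]
    # Min-P: drop tokens below min_p times the best token's probability.
    probs = [(p, i) for p, i in probs if p >= min_p * probs[0][0]]
    # Top-P: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for p, i in probs:
        kept.append((p, i))
        cumulative += p
        if cumulative >= top_p:
            break

    # Sample from what is left; random.choices renormalizes the weights.
    return rng.choices([i for _, i in kept], weights=[p for p, _ in kept])[0]
```

Setting `top_k=1` turns this into greedy decoding, and passing the same `seed` with the same inputs returns the same token.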

04

Save anywhere

Models can be multi-gigabyte. Download them to any folder: internal storage, SD card, or an external drive. Move them between locations at any time.

Model library · 25+

Pick a brain. Tap download. Chat.

Browse community-made models optimized for mobile — compact enough to fit on your phone, powerful enough to be useful.

0 pkts/s
Zero network traffic during inference. After the model is downloaded, you can pull the SIM, turn off Wi-Fi, and keep chatting. There is no telemetry, no analytics, no "helpful" background sync.
[Chart: bytes sent to server (y-axis) vs. tokens generated (x-axis)]
How it works

Three taps from app open to first token.

STEP 01

Pick a model

Browse the curated list or paste your own model URL. Sizes range from 267 MB to 5.4 GB.

Qwen 3 1.7B
Gemma 3 1B · 806 MB
Llama 3.2 3B · 2.02 GB
Phi-4 mini · 2.49 GB
DeepSeek R1 1.5B · 1.12 GB
STEP 02

Download and resume

Reliable background downloads with notifications, speed, and ETA — with automatic resume when the connection drops.

Qwen 3 1.7B · 73%
12.4 MB/s · ETA 0:28
Wi-Fi · Home · Resuming...
→ /storage/emulated/0/LMPlayground/
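
Resumable downloads are commonly built on HTTP range requests, and the ETA is just remaining bytes divided by current speed. The Python sketch below shows both pieces under that assumption. The helper names `resume_headers` and `format_eta` are hypothetical, and the app's real downloader may work differently.

```python
import math
import os

def resume_headers(partial_path):
    """Build HTTP headers that ask the server for only the missing bytes."""
    already = os.path.getsize(partial_path) if os.path.exists(partial_path) else 0
    # "bytes=N-" means: send everything from byte offset N onward.
    # With nothing on disk yet, send no Range header and fetch the whole file.
    return {"Range": f"bytes={already}-"} if already else {}

def format_eta(remaining_bytes, bytes_per_second):
    """Format the remaining time as M:SS, or '--:--' when the speed is unknown."""
    if bytes_per_second <= 0:
        return "--:--"
    seconds = math.ceil(remaining_bytes / bytes_per_second)
    return f"{seconds // 60}:{seconds % 60:02d}"
```

A server that honors the range replies with status 206 and the client appends to the partial file. A plain 200 reply means the range was ignored, so the download has to start over.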
STEP 03

Chat locally

Your conversation history stays on your device. Reasoning is shown inline. No accounts, no API keys.

Summarize this email.
The sender is moving Thursday's sync to Friday 10am PT — confirm or propose another time.
● on-device · 38 tok/s
Languages · 28

Speaks your language.
Right out of the box.

The whole app is translated into 28 languages. On first launch, we'll pick a model that speaks yours.

28 locales
Spotted a wrong translation? Please open a PR on GitHub.
FAQ

What you actually
wanted to know.

Still stuck? Open an issue on GitHub or reach out directly.

Any model in your pocket.

Seconds to install. Minutes to download a model. Then you're done with the cloud.
