autonomous ops layer

The agent runs the release suite.

For mobile teams that ship faster than their competitors. Mini owns test authoring, maintenance, triage, and the green light. No QA bench required.

100% AndroidWorld pass rate

6wk → days feature cycle reduction

0 scripts to maintain

web

2 days per feature

Cursor, Claude, and the whole AI stack got here. Web dev ships fast.

mobile

6 weeks per feature

The same AI tools can't see the screen. Can't test a gesture. Can't verify a build. Until now.

what mini does

Authors tests

Give mini a Jira ticket and a Figma link. It writes, runs, and maintains the full suite — zero scripts, zero YAML. Just a criterion.

Self-heals on changes

Vision-based understanding survives any UI refactor. No brittle selectors to update. Coverage adapts as your product evolves.

Real-device triage

Every failure includes a session replay, device logs, and a Cursor-ready fix prompt. 'Can't reproduce' stops being an answer.

Decides to ship

Mini runs your critical flows against real iOS and Android targets before every release. Green light or the bug report — that's the output.

how it works

From criterion to green light

Attach a spec

Paste a Jira ticket, a Figma link, or just a one-liner: "a user can sign in and complete a $10 payment." Mini understands the intent.

Mini authors the suite

The agent explores the app, defines critical flows, and builds coverage — no scripting, no YAML, no engineers in the loop for authoring.

Runs every release

Coverage auto-runs on every PR. As the app changes, mini adapts — re-exploring, re-covering, self-healing as it goes.

Failure arrives with the fix

When something breaks, you get a session replay, device logs, repro steps, and a Cursor prompt. You ship the fix, not the investigation.

androidworld benchmark

100%

Mini is the only agent to saturate the AndroidWorld leaderboard — the public standard for evaluating AI-controlled mobile devices.

Agent      Company            Pass@1
mini       Minitap            100.0%
agi        The AGI Company    97.4%
askui      askui              94.8%
surfer 2   H Company          87.1%
gbox       gbox.ai            86.2%
z.ai       Z.AI               80.2%

The companies running 10x more mobile experiments will dominate their categories. Mini is the infrastructure that makes that pace possible.

The agent runs the release suite.