Hacker News Viewer

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps

by ashish004 on 4/7/2026, 2:33:48 PM

I wanted to test mobile apps in plain English instead of relying on brittle selectors like XPath or accessibility IDs.<p>With a vision-based agent, that part actually works well. It can look at the screen, understand intent, and perform actions across Android and iOS.<p>The bigger problem showed up around how tests are defined and maintained.<p>When test flows are kept outside the codebase (written manually or generated from PRDs), they quickly go out of sync with the app. Keeping them updated becomes a lot of effort, and they lose reliability over time.<p>I then tried generating tests directly from the codebase (via MCP). That improved sync, but introduced high token usage and slower generation.<p>The shift for me was realizing test generation shouldn’t be a one-off step. Tests need to live alongside the codebase so they stay in sync and have more context.<p>I kept the execution vision-based (no brittle selectors), but moved test generation closer to the repo.<p>I’ve open sourced the core pieces:<p>1. generate tests from codebase context 2. YAML-based test flows 3. Vision-based execution across Android and iOS<p>Repo: <a href="https:&#x2F;&#x2F;github.com&#x2F;final-run&#x2F;finalrun-agent" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;final-run&#x2F;finalrun-agent</a> Demo: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;rJCw3p0PHr4" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;rJCw3p0PHr4</a><p>In the Demo video, you’ll see the &quot;post-development hand-off.&quot; An AI builds a feature in an IDE, and Finalrun immediately generates and executes a vision-based test for it verifying the feature developed by AI.

https://github.com/final-run/finalrun-agent

Comments

by: avikaa

This solves a massive headache. The drift between externally generated tests and an active codebase is a brutal problem to maintain.<p>Using vision-based execution instead of brittle XPaths is a great baseline, but moving the test definitions to live directly alongside the repo context is definitely the real win here.<p>Did you find that generating the YAML from the codebase context entirely eliminated the &quot;stale test&quot; issue, or do developers still need to manually tweak the generated YAML when mobile UI layouts change drastically? Great project!

4/7/2026, 5:45:30 PM


by: sahilahuja

Agentic testing. Kudos to your decision to open-source it!

4/7/2026, 5:01:26 PM


by: arnold_laishram

Looks pretty cool. How does your agent understand plain english?

4/7/2026, 2:57:17 PM


by: rootally7

[dead]

4/7/2026, 5:59:54 PM


by:

4/7/2026, 2:34:21 PM


by: arbaaz

[dead]

4/7/2026, 3:19:45 PM