What was your motivation to use AI for this instead of simple image analysis?
The test should be passing now.
Instead of AI looking at your code and browser and writing Playwright scripts, here the AI directly controls the browser and makes the test assertions itself. Do we have to wait for on-prem, multimodal, low-latency AI for this to be viable?
Also, nice "smoke test"; it makes me curious about your product.
How much did you end up spending on API credits for Flash 2.0?
Alternative version: check for dings on my phone from every news outlet sending a notification about it.