Hacker News

We created the first open source implementation of Meta's TestGen-LLM

by gronky_ on 5/21/2024, 11:37:27 AM with 13 comments

by data-ottawa on 5/21/2024, 12:12:14 PM
How do people feel about LLM generated tests?
I tried creating some on a personal project just using ChatGPT and it saved me a lot of toil on tests I probably wouldn’t have written. I did find I had low trust in refactoring my code, but higher than if I’d had no tests.
It seemed like a net positive for low risk cases.
by throwanem on 5/21/2024, 5:12:08 PM
Per the cited real world figures, that's about 1 in 40 tests that pass human review, or a success rate of about 2.5%.
It's hard to see value in spending resources this way right now - most notably, engineer time to review the generated tests. Improve the hit rate by an order of magnitude, and I suspect I'd feel differently.
by rohitpaulk on 5/21/2024, 3:44:06 PM
Tried this out on a Ruby codebase and it generated Python tests: https://github.com/Codium-ai/cover-agent/issues/17. Is there any data available on whether this actually works?
by darknoon on 5/21/2024, 12:15:06 PM
Why does this webpage have auto-playing audio?
by ryoshu on 5/21/2024, 12:29:05 PM
The audio track on load that has no obvious way to stop playing prevents me from reading this content. Please don't do that.
by _pdp_ on 5/21/2024, 1:11:26 PM
Using ChatGPT to generate unit tests works great almost out of the box, but I guess this system solves the remaining 5% to make it fully automated end-to-end. I believe this will work and help us write better software, given that I have experienced numerous cases where the generated tests (even with inferior models) catch not-so-obvious bugs.
by joeberg8 on 5/21/2024, 1:49:49 PM
Seems decent enough for boilerplate. But if my code is incorrect, won’t an LLM generate a test for incorrect code?
by Havoc on 5/21/2024, 2:19:33 PM
Interesting idea. I generally don’t run tests at all (hobbyist) so even mediocre LLM tests may actually be a win
by muglug on 5/21/2024, 2:26:27 PM
Don't see any actual output measurement in the conclusion — it seems like the effort may not have really borne fruit.
by EGreg on 5/21/2024, 1:57:02 PM
To the OP:
Is your name a reference to Gronky Scripples? https://www.youtube.com/watch?v=4KG3v365mq4
by yuvalkarmi on 5/21/2024, 6:41:48 PM
Love that you took something that Meta wrote about but didn't actually release and then... did it for them haha :)
by wocka on 5/21/2024, 2:37:11 PM
I get redirected to an oops 404 page when I try to create an account using GitHub.
by jrawlings on 5/21/2024, 6:50:20 PM
Any chance of supporting integrations with AWS, Azure, GCP APIs?