How does this differ from using JSON output with OpenAI?
I think it can be useful for some cases, but you'll always be limited by model size when competing against large closed-source models.
can't speak to the actual service, but can confirm they have excellent bananas (not kidding)
Nice, makes sense to chain models so you don't waste the attention of the big smart model on the grunt work of JSON structure.
I bet there are some good post-processing heuristics you could also apply for hallucination: flag certain fields as "this should appear in the text verbatim", then string-match whether the answer the model outputted is actually a substring of the source text.
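A minimal sketch of that string-matching heuristic (field names and the `check_verbatim_fields` helper are hypothetical, just to illustrate the idea):

```python
def check_verbatim_fields(source_text: str, extracted: dict, verbatim_fields: set) -> dict:
    """For each field flagged as verbatim, return whether its extracted
    value appears as an exact substring of the source text."""
    results = {}
    for field, value in extracted.items():
        if field in verbatim_fields:
            # Exact-substring match; could be loosened with normalization
            # (case folding, whitespace collapsing) if needed.
            results[field] = isinstance(value, str) and value in source_text
    return results

source = "Invoice #4521 was issued to Acme Corp on 2024-03-01."
extracted = {"invoice_id": "Invoice #4521", "customer": "Acme Inc"}
flags = check_verbatim_fields(source, extracted, {"invoice_id", "customer"})
# "invoice_id" matches the source; "customer" does not, so it gets
# flagged as a possible hallucination.
```

Fields that fail the check could be routed back to the model for a retry or surfaced to a human for review.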