Interesting/unfortunate/expected that GPT-5 isn't touted as AGI or some other outlandish claim. It's just improved reasoning etc. I know it's not the actual announcement and it's just a single page accidentally released, but it at least seems more grounded...? Have to wait and see what the actual announcement entails.
The actual announcement (now deleted on GitHub's blog): https://archive.is/IoMEg
Did they photoshop the screenshot from https://github.blog/changelog/2025-05-19-github-models-built... ? Other than the model id, it’s identical.
> GPT-5 will have "enhanced agentic capabilities” and can handle “complex coding tasks with minimal prompting.”
this seems to be directly targeted at anthropic/claude, wonder if it leads anywhere or if claude keeps it's mystical advantage (especially with new claude models coming out this week as well).
> GPT-5 will have four model variants, according to GitHub...
i also find it interesting that the primary model is the logic-focused one (likely very long and deep reasoning), whereas the conversational mainstream model is now a variant. seems like a fundamental shift in how they want these tools to be used, as opposed to today's primary 4o and the more logical GPT-4.1, o4-mini, and o3.
Damn interns!
they are comparing it to llama 4 and cohere v2 in the image…
Is the announcement implying that "mainline" GPT-5 is now a reasoning model?
> gpt-5: Designed for logic and multi-step tasks.
sama posted a picture of the death star yesterday
[dead]
[flagged]
> It handles complex coding tasks with minimal prompting...
I find it interesting how marketers are trying to make minimal prompting a good thing, a direction to optimize. Even if i talk to a senior engineer, i'm trying to be specific as possible to avoid ambiguities etc. Pushing the models to just do what they think its best is a weird direction. There are so many subtle things/understandings of the architecture that are just in my head or a colleagues head. Meanwhile, i found that a very good workflow is asking claude code to come back with clarifying questions and then a plan, before just starting to execute.