Hacker News
archeantus 83 days ago | on: GPT-4.1 in the API
“GPT‑4.1 scores 54.6% on SWE-bench Verified, improving by 21.4%abs over GPT‑4o and 26.6%abs over GPT‑4.5—making it a leading model for coding.”
4.1 is 26.6% better at coding than 4.5. Got it. Also…see the em dash
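For anyone untangling the "%abs" wording in the quote: it most likely means percentage points, not a relative improvement. A minimal sketch of the arithmetic follows, using only the three numbers in the quote. The baseline scores are derived by subtraction (an assumption about how "%abs" is meant, not figures taken from the announcement), and the function names are made up for illustration.

```python
# Sketch: absolute (percentage-point) gain vs. relative gain.
# Inputs are the three numbers from the quoted sentence; everything else is derived.

def implied_baseline(new_score: float, abs_gain: float) -> float:
    """Score the older model would need for the stated percentage-point gain."""
    return new_score - abs_gain


def relative_gain(new_score: float, abs_gain: float) -> float:
    """Relative improvement over the implied baseline, in percent."""
    baseline = implied_baseline(new_score, abs_gain)
    return abs_gain / baseline * 100.0


GPT_41_SCORE = 54.6  # quoted SWE-bench Verified score, in percent

# (model name, quoted percentage-point gain of GPT-4.1 over that model)
QUOTED_GAINS = [("GPT-4o", 21.4), ("GPT-4.5", 26.6)]

for name, gain in QUOTED_GAINS:
    base = implied_baseline(GPT_41_SCORE, gain)
    rel = relative_gain(GPT_41_SCORE, gain)
    print(f"{name}: implied baseline {base:.1f}%, "
          f"+{gain:.1f} points, {rel:.1f}% relative")
```

Taken at face value, the quoted figures imply baselines of about 33.2% and 28.0%, so the relative gains would be roughly 64% and 95%. Neither reading gives "26.6% better".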
pdabbadabba 83 days ago
What's wrong with the em-dash? That's just...the typographically correct dash AFAIK.
clbrmbr 82 days ago
Maybe a reference to the OpenAI models loving to output em-dashes?
drexlspivey 83 days ago
Should have named it 4.10
clbrmbr 82 days ago
But it’s so much weaker than 4.5 in broader tasks… maybe more optimized against benchmarks but it’s just no replacement for a huge model.