I recently worked on running a thorough healthcare eval on GPT-5. The results show a (slight) regression in GPT-5 performance compared to GPT-4 era models.
I found this to be an interesting finding. Here are the detailed results: https://www.fertrevino.com/docs/gpt5_medhelm.pdf
Comments URL: https://news.ycombinator.com/item?id=44979107
Points: 54
# Comments: 25
Vytvorené
11h
|
22. 8. 2025, 2:10:09
Ak chcete pridať komentár, prihláste sa
Ostatné príspevky v tejto skupine
Article URL: https://academic.oup.com/icb/advance-article-abstra
Article URL: https://labplot.org/
Comments URL: https://news.ycombinator.com/item?id=44982409


Article URL: https://gwern.net/everything
Comments URL: https://news.ycombinator.com/item?