Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's a test on which (apparently until now) the vast majority of humans have far outperformed all machine systems.


But it’s not a test that directly shows general intelligence.

I am excited no less! This is huge improvement.

How does this do on SWE Bench?


>How does this do on SWE Bench?

71.7%


I've seen this figure on a few tech news websites and reddit but can't find an official source. If it was in the video I must have missed it, where is this coming from?


It was in the video. I don't know if Open ai have a page up yet




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: