Well, no, we have the HumanEval results for the June release. | Hacker News


refulgentis on Aug 26, 2023 | on: Beating GPT-4 on HumanEval with a fine-tuned CodeL...

Well, no, we have the HumanEval results for the June release.

somenameforme on Aug 27, 2023

Which is both (1) a subjective selection to measure the effectiveness of various chatbots and (2) now subject to gaming from companies using opaque/closed/inaccessible/unverifiable systems, like OpenAI.
