Reflexion Agent - Search News

GPT4 With Reflexion Has a Superior Coding Score

A slightly improved Reflexion-based GPT-4 agent achieves state-of-the-art pass@1 results (88%) on HumanEval, outperforming GPT-4 (67.0%) and CodeT: Code Generation with Generated Tests (65.8%), which ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

GPT4 With Reflexion Has a Superior Coding Score

Trending now