News
A new benchmark can test how much LLMs become sycophants, and found that GPT-4o was the most sycophantic of the models tested.
Some results have been hidden because they may be inaccessible to you
Show inaccessible resultsSome results have been hidden because they may be inaccessible to you
Show inaccessible results