Lmarena, Artificial Analysis

Do you trust benchmarks like lmarena, Artificial Analysis, and how should one use them as a guide? How do you determine which AI to use for a specific task, and how do you choose AI for different tasks? Do you read posts or watch expert videos? I would appreciate any recommendations or resources.

1 Like

For example, do you trust these results?

https://lmarena.ai/

https://beta.lmarena.ai/leaderboard