Chatbot Arena – if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why
Chatbot Arena – if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why

Chatbot Arena – if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why

Chatbot Arena - if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why

If we exclude the refusals (e.g., "I cannot answer") ,and only tally votes for actual responses, Claude 3 Opus continues to be marginally superior to the new GPT-4 Turbo.

Yes, you might think it’s pure bias on my part, but if you’re looking to compare the chatbots based on the quality of their responses when they do provide an answer, then excluding refusals might be a reasonable approach. This could give you a clearer picture of how well each chatbot performs when it is able to engage in a conversation.

https://preview.redd.it/868h3r3p5quc1.png?width=1801&format=png&auto=webp&s=35b52face87ff90d405b85db74fc92e048f0e657

submitted by /u/ok373737
[link] [comments]