GLM-5 has 744B parameters and scores worse on MMLU-Pro than a 9B model
Tier lists make S-tier and D-tier feel like different categories of thing entirely, red box at the top, blue box at the bottom. Actually plotted named models by parameter count against MMLU-Pro score instead of trusting the tier labels, and the p…