UEFA Europa League Round of 32 AI Model Performance Audit
GLM-5 (OpenRouter) led UEFA Europa League predictions this week with 3.25 points per match, followed by Llama 4 Scout (OpenRouter) at 2.88 and Mistral Small 3.2 24B (OpenRouter) at 2.25. Models achieved 52.63% correct tendency overall, though Ludogorets vs Ferencvarosi TC (2-1) caught most models off guard.
The UEFA Europa League Round of 32 featured 8 knockout matches. This audit examines how well the models predicted each fixture, from overall tendency accuracy to exact score hits.
Top 10 Models
| # | Model | Matches | Total Points | Avg Pts/Match | Tendency % | Exact % |
|---|---|---|---|---|---|---|
| 1 | GLM-5 (OpenRouter) | 8 | 26 | 3.25 | 62.5% | 25.0% |
| 2 | Llama 4 Scout (OpenRouter) | 8 | 23 | 2.88 | 75.0% | 12.5% |
| 3 | Mistral Small 3.2 24B (OpenRouter) | 8 | 18 | 2.25 | 50.0% | 12.5% |
| 4 | MiniMax M2.5 (OpenRouter) | 8 | 18 | 2.25 | 62.5% | 12.5% |
| 5 | Gemma 3 27B (OpenRouter) | 8 | 17 | 2.13 | 75.0% | 0.0% |
| 6 | Devstral 2 (OpenRouter) | 8 | 17 | 2.13 | 75.0% | 0.0% |
| 7 | GPT-OSS 20B (OpenRouter) | 8 | 16 | 2.00 | 75.0% | 0.0% |
| 8 | GPT-OSS 120B (OpenRouter) | 8 | 14 | 1.75 | 62.5% | 0.0% |
| 9 | DeepSeek V3.2 (OpenRouter) | 8 | 13 | 1.63 | 50.0% | 12.5% |
| 10 | Phi-4 (OpenRouter) | 8 | 13 | 1.63 | 50.0% | 0.0% |
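The Avg Pts/Match column is total points divided by matches played. A minimal sketch of that arithmetic follows, assuming the published values are rounded half up to two decimals (an inference from rows such as 17 / 8 = 2.125 being shown as 2.13). The function name `avg_points` is hypothetical and not part of kroam.xyz.

```python
from decimal import Decimal, ROUND_HALF_UP


def avg_points(total_points: int, matches: int) -> Decimal:
    """Average points per match, rounded half up to two decimals (assumed rounding)."""
    if matches <= 0:
        raise ValueError("matches must be positive")
    raw = Decimal(total_points) / Decimal(matches)
    return raw.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


print(avg_points(26, 8))  # GLM-5 row: 26 points over 8 matches
print(avg_points(17, 8))  # Gemma 3 27B row: 17 points over 8 matches
```

`Decimal` with `ROUND_HALF_UP` is used here because Python's built-in `round` rounds exact halves to the nearest even digit, which would not reproduce every value in the table.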
Match-by-Match Audit
- Lille vs FK Crvena Zvezda (0-1): Correct tendency 68.4%, exact score hits 10.5%, consensus A (68.4%) correct.
- Panathinaikos vs Plzen (2-2): Correct tendency 73.7%, exact score hits 0.0%, consensus D (73.7%) correct.
- Celtic vs VfB Stuttgart (1-4): Correct tendency 68.4%, exact score hits 0.0%, consensus A (68.4%) correct.
- Ludogorets vs Ferencvarosi TC (2-1): Correct tendency 10.5%, exact score hits 10.5%, consensus D (78.9%) incorrect.
- PAOK vs Celta Vigo (1-2): Correct tendency 15.8%, exact score hits 10.5%, consensus D (63.2%) incorrect.
- Dinamo Zagreb vs Genk (1-3): Correct tendency 52.6%, exact score hits 0.0%, consensus A (52.6%) correct.
- Fenerbahçe vs Nottingham Forest (0-3): Correct tendency 52.6%, exact score hits 0.0%, consensus A (52.6%) correct.
- Brann vs Bologna (0-1): Correct tendency 78.9%, exact score hits 0.0%, consensus A (78.9%) correct.
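The overall figures quoted in this audit (52.63% correct tendency, 3.95% exact score hits) can be recovered from the per-match rates above. The sketch below assumes 19 models predicted every match, which is inferred from the H/D/A counts reported for the consensus misses (2 + 15 + 2) rather than stated outright. The names `N_MODELS`, `MATCHES`, and `overall_rate` are hypothetical.

```python
N_MODELS = 19  # assumption: inferred from the H/D/A counts, not stated in the audit

# (correct tendency %, exact score %) for each of the 8 matches listed above
MATCHES = [
    (68.4, 10.5),  # Lille vs FK Crvena Zvezda
    (73.7, 0.0),   # Panathinaikos vs Plzen
    (68.4, 0.0),   # Celtic vs VfB Stuttgart
    (10.5, 10.5),  # Ludogorets vs Ferencvarosi TC
    (15.8, 10.5),  # PAOK vs Celta Vigo
    (52.6, 0.0),   # Dinamo Zagreb vs Genk
    (52.6, 0.0),   # Fenerbahce vs Nottingham Forest
    (78.9, 0.0),   # Brann vs Bologna
]


def overall_rate(percentages: list[float], n_models: int = N_MODELS) -> float:
    """Convert per-match percentages back to hit counts, then pool them."""
    hits = sum(round(p / 100 * n_models) for p in percentages)
    total_predictions = len(percentages) * n_models
    return round(100 * hits / total_predictions, 2)


tendency_rate = overall_rate([t for t, _ in MATCHES])
exact_rate = overall_rate([e for _, e in MATCHES])
print(tendency_rate, exact_rate)
```

Pooling hit counts instead of averaging the rounded percentages avoids compounding the one-decimal rounding in the per-match figures.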
Biggest Consensus Misses
- Ludogorets vs Ferencvarosi TC (2-1): Consensus D (78.9%) incorrect, counts H/D/A: 2/15/2.
- PAOK vs Celta Vigo (1-2): Consensus D (63.2%) incorrect, counts H/D/A: 4/12/3.
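The consensus label and its share follow directly from the H/D/A counts. A minimal sketch, assuming consensus simply means the most frequently predicted outcome; the function name `consensus` and the tie-breaking behavior are illustrative choices, not documented kroam.xyz behavior.

```python
def consensus(counts: dict[str, int]) -> tuple[str, float]:
    """Return the most-predicted outcome and its share of predictions in percent."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no predictions")
    # On a tie, max() keeps the first key in insertion order (an arbitrary choice here).
    label = max(counts, key=lambda outcome: counts[outcome])
    return label, round(100 * counts[label] / total, 1)


ludogorets = consensus({"H": 2, "D": 15, "A": 2})
paok = consensus({"H": 4, "D": 12, "A": 3})
print(ludogorets, paok)
```

For both misses the consensus was a draw, while the actual results were a home win (Ludogorets) and an away win (Celta Vigo).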
Methodology
kroam.xyz uses a quota-based scoring system that rewards both accuracy and boldness:
Tendency Points (2-6 points): Models earn points for correctly predicting the match outcome (home win, draw, or away win). The points awarded depend on how rare the prediction was: if most models predicted a home win but the away team won, the models that correctly predicted the away win earn more points (up to 6). Common predictions earn fewer points (minimum 2).
Goal Difference Bonus (+1 point): If a model predicts the correct goal difference (e.g., predicted 2-1 and the result was 3-2, both a +1 difference), it earns a bonus point.
Exact Score Bonus (+3 points): Predicting the exact final score earns 3 additional points.
Maximum: 10 points per prediction (6 tendency + 1 goal diff + 3 exact).
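The rules above can be sketched as a small scoring function. The exact mapping from prediction rarity to tendency points is not given here, so the sketch takes the tendency quota (2 to 6) as an input instead of deriving it. It also assumes the goal difference bonus applies to draws, which the rules do not spell out. The names `tendency` and `score_prediction` are hypothetical.

```python
def tendency(home: int, away: int) -> str:
    """Classify a scoreline as home win (H), draw (D), or away win (A)."""
    if home > away:
        return "H"
    if home < away:
        return "A"
    return "D"


def score_prediction(pred: tuple[int, int], result: tuple[int, int], quota: int) -> int:
    """Score one prediction; `quota` is the rarity-based tendency value (2-6)."""
    if not 2 <= quota <= 6:
        raise ValueError("quota must be between 2 and 6")
    if tendency(*pred) != tendency(*result):
        return 0  # wrong outcome: no points at all
    points = quota
    if pred[0] - pred[1] == result[0] - result[1]:
        points += 1  # goal difference bonus (assumed to apply to draws too)
    if pred == result:
        points += 3  # exact score bonus
    return points


print(score_prediction((2, 1), (3, 2), quota=2))  # tendency + goal difference
print(score_prediction((2, 1), (2, 1), quota=6))  # rare outcome, exact score
```

An exact score always has the correct goal difference as well, which is why the stated maximum is 6 + 1 + 3 = 10.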
This system ensures that models taking calculated risks on unlikely outcomes are rewarded when correct, while also recognizing precision in exact score predictions.
Frequently Asked Questions
Q: Which AI model performed best in UEFA Europa League Round of 32? A: GLM-5 (OpenRouter) performed best with 3.25 average points per match.
Q: How accurate were AI predictions for UEFA Europa League this round? A: Models achieved 52.63% correct tendency and 3.95% exact score hit rate.
Q: What was the biggest upset in UEFA Europa League Round of 32? A: Ludogorets vs Ferencvarosi TC (2-1) was the biggest upset, with only 10.5% correct tendency.
Q: How does kroam.xyz score AI football predictions? A: kroam.xyz uses a quota-based system awarding up to 10 points per prediction for tendency, goal difference, and exact score accuracy.