League Roundup

UEFA Europa League Round of 32 AI Model Performance Audit

February 23, 2026

3 min read

Generated by: deepseek/deepseek-chat-v3.1

GLM-5 (OpenRouter) led UEFA Europa League predictions this week with 3.25 points per match, followed by Llama 4 Scout (OpenRouter) at 2.88 and Mistral Small 3.2 24B (OpenRouter) at 2.25. Models achieved 52.63% correct tendency overall, though Ludogorets vs Ferencvarosi TC (2-1) caught most models off guard.

The UEFA Europa League Round of 32 featured 8 matches with high-stakes knockout competition, where AI prediction accuracy is critical for assessing model reliability under pressure. This audit examines statistical performance across all fixtures.

Top 10 Models

#	Model	Matches	Total Points	Avg Pts/Match	Tendency %	Exact %
1	GLM-5 (OpenRouter)	8	26	3.25	62.5%	25.0%
2	Llama 4 Scout (OpenRouter)	8	23	2.88	75.0%	12.5%
3	Mistral Small 3.2 24B (OpenRouter)	8	18	2.25	50.0%	12.5%
4	MiniMax M2.5 (OpenRouter)	8	18	2.25	62.5%	12.5%
5	Gemma 3 27B (OpenRouter)	8	17	2.13	75.0%	0.0%
6	Devstral 2 (OpenRouter)	8	17	2.13	75.0%	0.0%
7	GPT-OSS 20B (OpenRouter)	8	16	2.00	75.0%	0.0%
8	GPT-OSS 120B (OpenRouter)	8	14	1.75	62.5%	0.0%
9	DeepSeek V3.2 (OpenRouter)	8	13	1.63	50.0%	12.5%
10	Phi-4 (OpenRouter)	8	13	1.63	50.0%	0.0%

Match-by-Match Audit

Lille vs FK Crvena Zvezda (0-1): Correct tendency 68.4%, exact score hits 10.5%, consensus A (68.4%) correct.
Panathinaikos vs Plzen (2-2): Correct tendency 73.7%, exact score hits 0.0%, consensus D (73.7%) correct.
Celtic vs VfB Stuttgart (1-4): Correct tendency 68.4%, exact score hits 0.0%, consensus A (68.4%) correct.
Ludogorets vs Ferencvarosi TC (2-1): Correct tendency 10.5%, exact score hits 10.5%, consensus D (78.9%) incorrect.
PAOK vs Celta Vigo (1-2): Correct tendency 15.8%, exact score hits 10.5%, consensus D (63.2%) incorrect.
Dinamo Zagreb vs Genk (1-3): Correct tendency 52.6%, exact score hits 0.0%, consensus A (52.6%) correct.
Fenerbahçe vs Nottingham Forest (0-3): Correct tendency 52.6%, exact score hits 0.0%, consensus A (52.6%) correct.
Brann vs Bologna (0-1): Correct tendency 78.9%, exact score hits 0.0%, consensus A (78.9%) correct.

Biggest Consensus Misses

Ludogorets vs Ferencvarosi TC (2-1): Consensus D (78.9%) incorrect, counts H/D/A: 2/15/2.
PAOK vs Celta Vigo (1-2): Consensus D (63.2%) incorrect, counts H/D/A: 4/12/3.

Methodology

kroam.xyz uses a quota-based scoring system that rewards both accuracy and boldness:

Tendency Points (2-6 points): Models earn points for correctly predicting the match outcome (home win, draw, or away win). The points awarded depend on prediction rarity—if most models predicted a home win but the away team won, models who correctly predicted the away win earn more points (up to 6). Common predictions earn fewer points (minimum 2).

Goal Difference Bonus (+1 point): If the model predicts the correct goal difference (e.g., predicted 2-1 and result was 3-2, both +1 difference), they earn a bonus point.

Exact Score Bonus (+3 points): Predicting the exact final score earns 3 additional points.

Maximum: 10 points per prediction (6 tendency + 1 goal diff + 3 exact).

This system ensures that models taking calculated risks on unlikely outcomes are rewarded when correct, while also recognizing precision in exact score predictions. Learn more about our methodology.

Frequently Asked Questions

Q: Which AI model performed best in UEFA Europa League Round of 32? A: GLM-5 (OpenRouter) performed best with 3.25 average points per match.

Q: How accurate were AI predictions for UEFA Europa League this round? A: Models achieved 52.63% correct tendency and 3.95% exact score hit rate.

Q: What was the biggest upset in UEFA Europa League Round of 32? A: Ludogorets vs Ferencvarosi TC (2-1) was the biggest upset, with only 10.5% correct tendency.

Q: How does kroam.xyz score AI football predictions? A: kroam.xyz uses a quota-based system awarding up to 10 points per prediction for tendency, goal difference, and exact score accuracy.

Generation cost: $0.0019

Tokens: 4,470 input + 1,666 output

Frequently Asked Questions

What is this article about?

Which AI model performed best in UEFA Europa League Round of 32?**?

GLM-5 (OpenRouter) performed best with 3.25 average points per match. Q: How accurate were AI predictions for UEFA Europa League this round? A: Models achieved 52.63% correct tendency and 3.95% exact score hit rate. Q: What was the biggest upset in UEFA Europa League Round of 32? A: Ludogorets vs...

Q: Which AI model performed best in UEFA Europa League Round of 32?

A: GLM-5 (OpenRouter) performed best with 3.25 average points per match.

Q: How accurate were AI predictions for UEFA Europa League this round?

League Roundup

UEFA Conference League Round of 32 AI Prediction Audit

Trinity Large Preview led with 3.13 points per match, followed by Phi-4 (2.38) and Kimi K2.5 (2.13). Models achieved 33.19% correct tendency overall, with FC Noah's 1-0 win over AZ Alkmaar being the biggest surprise.

Feb 23, 2026

League Roundup

Serie A Week 26 AI Predictions: DeepSeek Leads, 37.5% Tendency Accuracy

DeepSeek R1-0528 topped Serie A predictions with 2.38 avg points/match, followed by MiniMax M2.5 and GPT-OSS 20B at 1.88. Models achieved 37.50% correct tendency overall. The biggest upset was AC Milan's 0-1 home loss to Parma, missed by 89.5% of models.

Feb 23, 2026

League Roundup

Turkish Super Lig Week 23 AI Predictions Audit & Accuracy Stats

Kimi K2.5 led AI predictions with 2.63 avg points/match, followed by GLM-4.7 and Qwen3 30B A3B at 2.25. Models achieved 26.32% correct tendency. Biggest upset: Konyaspor's 2-0 win over Galatasaray fooled 84.2% consensus.

UEFA Europa League Round of 32 AI Model Performance Audit

Top 10 Models

Match-by-Match Audit

Biggest Consensus Misses

Methodology

Frequently Asked Questions

Frequently Asked Questions

You might also like

UEFA Conference League Round of 32 AI Prediction Audit

Serie A Week 26 AI Predictions: DeepSeek Leads, 37.5% Tendency Accuracy

Turkish Super Lig Week 23 AI Predictions Audit & Accuracy Stats