Turkish Super Lig Week 24 AI Model Performance Audit
MiniMax M2.1 led Turkish Super Lig predictions with 2.78 avg points/match, followed by Llama 3.3 70B Instruct (2.44) and DeepSeek R1-0528 (2.33). Models achieved 56.40% correct tendency overall, with Antalyaspor vs Fenerbahçe (2-2) as the biggest consensus miss.
MiniMax M2.1 led Turkish Super Lig predictions with 2.78 avg points/match, followed by Llama 3.3 70B Instruct (2.44) and DeepSeek R1-0528 (2.33). Models achieved 56.40% correct tendency overall, with Antalyaspor vs Fenerbahçe (2-2) as the biggest consensus miss.
Turkish Super Lig Regular Season - 24 featured 9 matches, including fixtures involving top teams like Fenerbahçe and Galatasaray. AI prediction accuracy is critical for assessing model reliability in competitive environments. This audit examines statistical performance across all predictions.
Top 10 Models
| # | Model | Matches | Total Points | Avg Pts/Match | Tendency % | Exact % |
|---|---|---|---|---|---|---|
| 1 | MiniMax M2.1 (OpenRouter) | 9 | 25 | 2.78 | 66.7% | 22.2% |
| 2 | Llama 3.3 70B Instruct (OpenRouter) | 9 | 22 | 2.44 | 66.7% | 11.1% |
| 3 | DeepSeek R1-0528 (OpenRouter) | 9 | 21 | 2.33 | 66.7% | 0.0% |
| 4 | Gemma 3 27B (OpenRouter) | 9 | 21 | 2.33 | 66.7% | 11.1% |
| 5 | Phi-4 (OpenRouter) | 9 | 21 | 2.33 | 77.8% | 0.0% |
| 6 | Llama 4 Scout (OpenRouter) | 9 | 20 | 2.22 | 66.7% | 0.0% |
| 7 | Devstral 2 (OpenRouter) | 9 | 20 | 2.22 | 66.7% | 11.1% |
| 8 | GPT-OSS 20B (OpenRouter) | 9 | 18 | 2.00 | 55.6% | 11.1% |
| 9 | MiniMax M2.5 (OpenRouter) | 9 | 18 | 2.00 | 55.6% | 11.1% |
| 10 | Trinity Large Preview (OpenRouter) | 9 | 17 | 1.89 | 66.7% | 0.0% |
Match-by-Match Audit
- Antalyaspor vs Fenerbahçe (2-2): Correct tendency 21.1%, exact score 0.0%, consensus incorrect (A 63.2%)
- Samsunspor vs Gaziantep FK (0-0): Correct tendency 44.4%, exact score 0.0%, consensus incorrect (A 55.6%)
- Gençlerbirliği S.K. vs Kayserispor (0-0): Correct tendency 78.9%, exact score 0.0%, consensus correct (D 78.9%)
- Galatasaray vs Alanyaspor (3-1): Correct tendency 84.2%, exact score 0.0%, consensus correct (H 84.2%)
- Göztepe vs Eyüpspor (0-0): Correct tendency 31.6%, exact score 5.3%, consensus incorrect (H 63.2%)
- Kocaelispor vs Beşiktaş (0-1): Correct tendency 52.6%, exact score 0.0%, consensus correct (A 52.6%)
- Kasımpaşa vs Rizespor (0-3): Correct tendency 36.8%, exact score 0.0%, consensus incorrect (D 57.9%)
- Trabzonspor vs Fatih Karagümrük (3-1): Correct tendency 94.7%, exact score 21.1%, consensus correct (H 94.7%)
- Başakşehir vs Konyaspor (2-0): Correct tendency 63.2%, exact score 15.8%, consensus correct (H 63.2%)
Biggest Consensus Misses
- Antalyaspor vs Fenerbahçe (2-2): Consensus A (63.2%), counts H/D/A: 3/4/12
- Göztepe vs Eyüpspor (0-0): Consensus H (63.2%), counts H/D/A: 12/6/1
- Kasımpaşa vs Rizespor (0-3): Consensus D (57.9%), counts H/D/A: 1/11/7
- Samsunspor vs Gaziantep FK (0-0): Consensus A (55.6%), counts H/D/A: 0/8/10
Methodology
kroam.xyz uses a quota-based scoring system that rewards both accuracy and boldness:
Tendency Points (2-6 points): Models earn points for correctly predicting the match outcome (home win, draw, or away win). The points awarded depend on prediction rarity—if most models predicted a home win but the away team won, models who correctly predicted the away win earn more points (up to 6). Common predictions earn fewer points (minimum 2).
Goal Difference Bonus (+1 point): If the model predicts the correct goal difference (e.g., predicted 2-1 and result was 3-2, both +1 difference), they earn a bonus point.
Exact Score Bonus (+3 points): Predicting the exact final score earns 3 additional points.
Maximum: 10 points per prediction (6 tendency + 1 goal diff + 3 exact).
This system ensures that models taking calculated risks on unlikely outcomes are rewarded when correct, while also recognizing precision in exact score predictions. Learn more about our methodology.
Frequently Asked Questions
Q: Which AI model performed best in Turkish Super Lig Regular Season - 24? A: MiniMax M2.1 (OpenRouter) performed best with 2.78 average points per match.
Q: How accurate were AI predictions for Turkish Super Lig this round? A: Models achieved 56.40% correct tendency and 4.68% exact score hit rate across 170 predictions.
Q: What was the biggest upset in Turkish Super Lig Regular Season - 24? A: Antalyaspor vs Fenerbahçe (2-2) was the biggest consensus miss, with 63.2% predicting an away win.
Q: How does kroam.xyz score AI football predictions? A: kroam.xyz uses a quota-based system awarding 2-6 points for correct tendency, +1 for goal difference, and +3 for exact score, max 10 points per prediction.
Generation cost: $0.0021
Tokens: 4,936 input + 1,823 output
Frequently Asked Questions
What is this article about?
Which AI model performed best in Turkish Super Lig Regular Season - 24?**?
Q: Which AI model performed best in Turkish Super Lig Regular Season - 24?
Q: How accurate were AI predictions for Turkish Super Lig this round?
You might also like
Turkish Super Lig Week 23 AI Predictions Audit & Accuracy Stats
Kimi K2.5 led AI predictions with 2.63 avg points/match, followed by GLM-4.7 and Qwen3 30B A3B at 2.25. Models achieved 26.32% correct tendency. Biggest upset: Konyaspor's 2-0 win over Galatasaray fooled 84.2% consensus.
Feb 23, 2026
Turkish Super Lig Week 22 AI Model Prediction Audit
Kimi K2.5 led Turkish Super Lig predictions this week with 4.00 points per match, followed by Llama 3.2 3B (3.20) and Trinity Large Preview (2.83). Models achieved 33.53% correct tendency overall, though Göztepe's 0-0 draw with Kayserispor caught most models off guard.
Feb 16, 2026
UEFA Conference League Round of 32 AI Prediction Audit
GPT-OSS 20B led UEFA Conference League predictions with 2.88 points per match, followed by Trinity Large Preview (2.63) and GLM-5 (2.25). Models achieved 38.16% correct tendency overall, with Fiorentina vs Jagiellonia (2-4) as the biggest upset.