📊 Full opportunity report: OpenEuroLLM. The third path. on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

OpenEuroLLM is a European consortium developing open-source multilingual large language models. Despite progress, it faces critical compute resource limitations that could impact its success. The first models are expected in July 2026.

OpenEuroLLM, a major European consortium developing open-source multilingual large language models, is facing significant challenges in securing enough computational resources to complete its models, according to project leaders.

The project, funded by €20.6 million from the EU’s Digital Europe Programme and involving 20 organizations across academia, industry, and high-performance computing centers, aims to produce a pan-European sovereign LLM. Coordinated by Jan Hajič at Charles University and co-led by Peter Sarlin of Silo AI, the initiative is part of Europe’s broader strategy to develop independent AI capabilities. Despite achieving initial milestones, the project’s first-year progress report highlights that securing additional compute power remains a critical obstacle. This bottleneck echoes similar issues faced by national projects like Italy’s Minerva and Portugal’s AMÁLIA, which also operate at resource-constrained scales. The first models are scheduled for release by July 31, 2026, but whether the consortium can meet this deadline depends on overcoming current resource limitations. The absence of Mistral, a leading French AI company, further complicates the landscape, as the consortium struggles to bring all major European players into the fold.

OpenEuroLLM · The Third Path.

DISPATCH / MAY 2026 ESSAY · EUROPEAN SOVEREIGN LLMs · OPENEUROLLM · CONSORTIUM

▲ Standalone Essay EU Sovereign AI · Pan-EU · May 2026

Standalone Essay 03 · European Sovereign AI · The Consortium Case Study

OpenEuroLLM.
The third
path.

€37.4M EU budget, 20 organizations, four major EuroHPC supercomputers, 35 target languages. And the project’s coordinator says: “significant challenges in securing more compute still remain.”

Italy bet national. Portugal bet continuation. The EU bet consortium. OpenEuroLLM — coordinated by Jan Hajič at Charles University Prague, co-led by Peter Sarlin at AMD-owned Silo AI — is what the pan-European pooled-resources answer looks like in operational form. And the project lead is publicly stating that even at pan-European pooled scale, compute is the bottleneck. Each of the three sovereign-LLM answers, examined honestly, surfaces a complication the press coverage downplays.

Thorsten Meyer / ThorstenMeyerAI.com / May 2026 · Standalone Essay 03

▲ The structural editorial finding

The European sovereign-LLM movement’s three answers — Minerva from-scratch, AMÁLIA continuation, OpenEuroLLM consortium — are now operating at sufficient scale and duration that their structural limits are visible. None of them is the answer. Each of them is an answer. The strategic discourse benefits from treating all three as complementary data points in the same empirical experiment about what European sovereign-AI development actually requires.

— standalone essay 03 · the OpenEuroLLM case study · may 2026

€37.4M

EU consortium budget · €20.6M from Digital Europe Programme · grant 101195233

“a pittance compared with the $100B US Stargate first tranche” — Fortune · STEP Seal awarded

Partner organizations · 12 universities · 6 companies · 3 HPC centers

Charles University coordinator · AMD Silo AI co-lead · Mistral notably absent

4.5M+

GPU hours secured · Leonardo BOOSTER (3M) + LUMI (1.5M) + strategic across 4 EuroHPC

“significant challenges in securing more compute still remain” — Hajič, March 2026

Jul2026

First models deliverable · the strategic moment · 6 weeks from now

2 of 11 deliverables shipped · final models January 2028

● OPENEUROLLM €37.4M EU BUDGET · 20 ORGANIZATIONS · CHARLES UNIVERSITY + AMD SILO AI LEADS · STARTED FEB 1 2025 ● HAJIČ MARCH 2026 “SIGNIFICANT CHALLENGES IN SECURING MORE COMPUTE FOR FINAL MODELS STILL REMAIN” · STRUCTURAL FINDING ● COMPUTE 3M GPU HOURS LEONARDO BOOSTER + 1.5M LUMI + STRATEGIC 4 EUROHPC SYSTEMS · $7B EUROHPC CONTEXT ● THREE-WAY MINERVA FROM-SCRATCH · AMÁLIA CONTINUATION · OPENEUROLLM CONSORTIUM · ALL THREE OPERATIONAL SUMMER 2026 ● YEAR ONE OUTPUTS MIXTUREVITAE · HPLT 38 REFERENCE MODELS · OPEN-SCI-REF 0.01 · TRAINING DATA CATALOGUE · MULTISYNT ● vs MINERVA ITALY 128 GPUS LEONARDO · €100M+ PNRR · OPENEUROLLM 4.5M GPU HOURS · €37.4M EU BUDGET · ORDER OF MAGNITUDE LARGER POOLED ● JULY 31 2026 FIRST MODELS · INITIAL DATASET · EVALUATION CODE · STRATEGIC MOMENT FOR EU SOVEREIGN-LLM MOVEMENT

The structural editorial anchor · Hajič’s compute statement

Even at pan-European scale, compute is the bottleneck.

From the OpenEuroLLM first-year progress report, March 6, 2026. The single most important sentence in the public documentation of the project. The pan-European consortium answer — explicitly designed as the response to individual national projects’ resource constraints — is itself constrained by the same resource that limits national projects.

Jan Hajič · OpenEuroLLM coordinator · first-year progress report

Charles University · Institute of Formal and Applied Linguistics (ÚFAL) · OpenEuroLLM coordinator · also coordinator of the HPLT (High Performance Language Technologies) project since 2022. The most quoted public statement about OpenEuroLLM’s structural constraints.

▲ On-record · OpenEuroLLM blog · March 6, 2026

Creating an open source multilingual LLM in the public space and within a large consortium is a challenging task. I am proud that thanks to the expertise, enthusiasm, commitment and hard work of especially the core partners the project has achieved its first-year goals. However, significant challenges, especially in securing more compute for creating the final models, still remain.

— Jan Hajič · Charles University · OpenEuroLLM coordinator
First-year progress and next steps · March 6, 2026

The structural significance: OpenEuroLLM has secured 3M GPU hours on Leonardo BOOSTER, 1.5M GPU hours on LUMI, and strategic compute allocations on four EuroHPC supercomputers through project end. This is real frontier-class scale. Hajič’s statement that it is insufficient for the final models means the pan-European consortium answer, as currently funded, may not produce final models at the parameter scale required to compete with US frontier developers on general capability. Position 1 (frontier-match) may need to be recalibrated to Position 2 + Position 3.

The consortium architecture · what 20 organizations actually looks like

HKUXZR C612 NAS Motherboard LGA2011-3, 10x SATA 6Gbps, 4X 2.5GbE Intel i226-V, 2X M.2 NVMe, 2X PCIe x16, DDR4, Server Workstation ITX Mainboard for Xeon E5 V3/V4 24 * 24cm

【High Performance Processor Support】 Supports Intel Xeon E5-V3/V4 series processors (LGA2011-3 socket), as well as Core i7/i9 series…

As an affiliate, we earn on qualifying purchases.

12 universities. 6 companies. 3 HPC centers. One conspicuous absence.

The OpenEuroLLM consortium combines academic NLP research, commercial AI capability, and EuroHPC supercomputing infrastructure across multiple European nations. The breadth is the strategic bet. The breadth is also the operational complication.

OpenEuroLLM consortium · 20 organizations · three categories

From the official partner list. Project coordinator Jan Hajič at Charles University Prague. Co-lead Peter Sarlin at AMD-owned Silo AI Finland. Started February 1, 2025 with EU Digital Europe Programme funding under grant agreement 101195233.

▲ COORDINATOR

Jan Hajič

Charles University Prague · Institute of Formal and Applied Linguistics (ÚFAL) · Czech computational linguist · HPLT predecessor project coordinator since 2022

▲ CO-LEAD

Peter Sarlin

AMD Silo AI · CEO and co-founder · Finnish AI lab · acquired by AMD for $665M in 2024 · brings hyperscaler-adjacent compute access and commercial discipline

▲ Universities and Research Organizations

Charles University Prague (coordinator) · AI Sweden · ALT-EDIC (France) · University of Tübingen · ELLIS Institute Tübingen · Fraunhofer IAIS (Germany) · Barcelona Supercomputing Center / BSC · Forschungszentrum Jülich · Eindhoven University · University of Helsinki · University of Oslo · University of Turku

▲ Companies

Aleph Alpha (Germany) · AMD Silo AI (Finland · co-lead) · Ellamind (Germany) · LightOn (France) · ELDA (Evaluations and Language resources Distribution Agency, France) · Prompsit Language Engineering (Spain)

▲ HPC Centres

CINECA (Italy) · operating Leonardo, the supercomputer that trained Minerva · CSC (Finland) · operating LUMI, one of Europe’s top supercomputers · SURF (Netherlands)

The conspicuous absence: Mistral, the French AI unicorn, is not in the consortium. From TechCrunch’s launch coverage, Hajič stated: “I tried to approach them, but it hasn’t resulted in a focused discussion about their participation.” Mistral has positioned itself as Europe’s commercial open-source alternative to US frontier developers — and its absence from the official EU sovereign-LLM consortium reflects a strategic-positioning divergence between consortium-led and commercial-led European AI development. The next standalone essay in this track examines that divergence directly.

The deliverables roadmap · 2 of 11 shipped · July 2026 is the strategic moment

The AI-Native Cloud: Architecting and Optimizing Multi-Cloud Systems for Generative AI, Edge Computing, and FinOps

As an affiliate, we earn on qualifying purchases.

Eleven deliverables. Two shipped. Nine pending.

From the official deliverables roadmap. As of mid-May 2026, only two of eleven deliverables have shipped — both from July 2025. The July 31, 2026 cluster — first models, initial dataset, evaluation code — is when OpenEuroLLM becomes empirically comparable to Minerva and AMÁLIA.

Deliverables timeline · 11-item roadmap through January 2028

From openeurollm.eu/deliverables. Status as of mid-May 2026. Each deliverable has a defined due date and a defined scope. The July 31, 2026 cluster is the strategic moment that makes OpenEuroLLM operationally comparable to Minerva (since November 2024) and AMÁLIA (June 2026 final target).

31 Jul 2025

D3.1 · Initial training data catalogue and analytics reports

SHIPPED

31 Jul 2025

D6.1 · Communication, Dissemination and Exploitation Strategy

SHIPPED

31 Jul 2026

Initial dataset release · texts with metadata used to train OpenEuroLLM at mid-project

6 WEEKS

31 Jul 2026

First models · initial release of LLM models · tokenizers + model weights

6 WEEKS

31 Jul 2026

Evaluation Code package · Python package for model evaluation procedures

6 WEEKS

31 Jul 2027

Final dataset release · texts with metadata for final OpenEuroLLM model(s)

PENDING

31 Jan 2028

Stakeholder Report · strategic advice from OSPB and community feedback

FINAL

31 Jan 2028

Final models · final release of LLM models · tokenizers + model weights

FINAL

31 Jan 2028

LLM training report · open publishing and regulatory compliance details

FINAL

31 Jan 2028

Evaluation Report · multilingual and regulatory aspects findings

FINAL

31 Jan 2028

Evaluation Report of Communication, Dissemination and Exploitation Strategy

FINAL

For approximately six weeks between AMÁLIA’s June 2026 final release and OpenEuroLLM’s July 2026 first models, all three answers will have operational artifacts for the first time. This is the moment the structural comparison becomes empirically tractable.

The three-way comparison · the essay track closes

Large-Scale AI Engineering: Design, Train, and Optimize Foundation Models on NVIDIA GPU Clusters

As an affiliate, we earn on qualifying purchases.

Three answers. Three structural findings.

The Minerva from-scratch path. The AMÁLIA continuation path. The OpenEuroLLM consortium path. Each project surfaces an empirical complication the press coverage downplays. Each finding is harder than the framing it’s wrapped in.

Three operational answers · three structural findings

Italy’s national from-scratch investment. Portugal’s national continuation pre-training. The pan-European consortium pooled-resources approach. The strategic discourse benefits from treating all three as complementary experiments rather than competing national-prestige projects.

▲ ITALY · ESSAY 02

Minerva · national from-scratch

FundingPNRR via MUR · large national

ArchitectureFrom scratch · Mistral arch · custom IT tokenizer

Native data1.14T Italian (50%) of 2.5T total

Compute128 GPUs Leonardo · weeks

OpennessTruly-open · day one

FINDINGMinerva-3B: 4.9% on INVALSI Italian school exam · data volume + params crucial above composition alone

▲ PORTUGAL · ESSAY 01

AMÁLIA · national continuation

Funding€5.5M Portuguese gov

ArchitectureContinuation · EuroLLM-derived · inherited tokenizer

Native data5.8B pt-PT (5.5%) of 107B mid-training

ComputeNot publicly detailed

OpennessPartially open · in progress

FINDING“Fully open” claim runs ahead of release · 5.5% pt-PT in model that prioritizes pt-PT

▲ PAN-EU · ESSAY 03

OpenEuroLLM · consortium

Funding€37.4M EU · €20.6M Digital Europe

ArchitectureFrom scratch · methodology developing

Native dataTBD · MultiSynt synthetic primary

Compute4.5M+ GPU hours · 4 EuroHPC

OpennessTruly-open commitment · some EU-copyright caveats

FINDINGHajič: “significant challenges in securing more compute still remain” · pan-EU pooled still constrained

Three projects. Three findings. Each one harder than the framing it’s wrapped in. Each answer is valid for its specific positioning and resource context. None of the three is “the right answer” in the abstract. The strategic discourse benefits from treating all three as data points in the same empirical experiment.

What July 2026 will determine · three scenarios

Corsair AI Workstation 300 Desktop PC – AMD Ryzen AI Max 385 CPU – AMD Radeon 8050S iGPU (Up to 48GBs vRAM) – 64GB LPDDR5X 8000MHz Memory – 1TB M.2 SSD – Black

AI-Optimized Compact Workstation: Experience AI performance out of the box with the compact 4.4L form factor, built for…

As an affiliate, we earn on qualifying purchases.

First models in six weeks. Three scenarios.

The July 31, 2026 first-models deliverable is the strategic moment for OpenEuroLLM specifically and for the European sovereign-LLM movement broadly. Three scenarios are plausible. The structurally honest framing will require acknowledging whatever the empirical results actually show.

Three scenarios for the July 2026 OpenEuroLLM first models

In all three scenarios, the discourse that O.Carmo’s analysis of AMÁLIA modeled and that this essay track has attempted to extend is what the moment requires. Holding competing views simultaneously: the work is real AND the empirical findings are harder than the press coverage suggests. Both can be true at once.

Afrontier-match

First models are capability-competitive at their parameter scale

If OpenEuroLLM’s 8B model demonstrates competitive performance against frontier developers’ similar-scale models on multilingual benchmarks, the pan-European consortium answer is validated. Position 1 + 2 + 3 combination. The strongest outcome for the European sovereign-LLM movement broadly — demonstrates pan-European pooling produces results individual national projects cannot.

Brecalibration

First models are methodologically interesting but capability-limited

If the 8B model demonstrates strong multilingual capability but lags frontier developers on general benchmarks, the project converges toward Position 2 + Position 3 — sovereignty/openness/compliance combined with multilingual specialization. The most likely outcome given Hajič’s compute statement and the structural funding asymmetry. Strategic ambition recalibration becomes explicit.

Ccomplication

First models surface a finding that complicates the simple narrative

Each of the prior two European sovereign-LLM projects surfaced a structural finding the press coverage downplayed (Minerva’s INVALSI 4.9%, AMÁLIA’s 5.5% pt-PT share). OpenEuroLLM’s first models will likely surface their own version. Very uneven performance across the 35-language portfolio is one likely complication. Strong results for high-resource languages, weak for lower-resource. The compute statement is already one such finding.

OpenEuroLLM is one valid answer to the European sovereign-LLM question. AMÁLIA is another. Minerva is a third. Mistral is potentially a fourth — the commercial-frontier answer this essay track examines next. The strategic discourse benefits from treating all of them as complementary experiments in the same empirical question. More analysis like this is needed. Not less.

— Standalone Essay 03 · The OpenEuroLLM case study · May 2026

Impact of Compute Limitations on European Sovereign LLMs

This development underscores a fundamental challenge in Europe’s AI strategy: even large, well-funded collaborations face infrastructural limits that threaten to delay or diminish the quality of sovereign language models. The outcome will influence Europe’s ability to develop independent, multilingual AI systems and reduce reliance on US or Chinese models. It also raises questions about the scalability of pooled resources and the future of pan-European AI initiatives, making the July 2026 model release a key milestone for assessing Europe’s AI sovereignty progress.

European Sovereign-LLM Strategies and Resource Constraints

Europe’s approach to developing sovereign large language models has taken three main paths: Italy’s Minerva, Portugal’s AMÁLIA, and the collective OpenEuroLLM project. Minerva is a from-scratch national effort, while AMÁLIA is a continuation of existing Portuguese models. OpenEuroLLM represents a pooled-resource, pan-European strategy, aiming to leverage collective infrastructure and expertise. However, all three initiatives are limited by the availability of compute resources, which has been a recurring challenge. The first-year progress report for OpenEuroLLM confirms that despite progress, securing sufficient computational capacity remains a bottleneck. The project’s first models are expected in July 2026, but the outcome depends heavily on resolving these resource issues. This situation highlights the broader tension within Europe’s AI development: the need for large-scale infrastructure investment to match ambitious language modeling goals. Learn more about national efforts like Minerva.

“Significant challenges, especially in securing more compute for creating the final models, still remain.”
— Jan Hajič, Charles University

Unresolved Challenges and Future Model Performance

It is not yet clear whether OpenEuroLLM will secure sufficient compute resources by the July 2026 deadline to meet its model release targets. The extent to which resource limitations will impact the quality and scale of the final models remains uncertain. Additionally, the potential involvement of Mistral or other major European AI firms is still unresolved, which could influence the consortium’s capacity and strategic direction.

Upcoming Milestone: First Models and Resource Allocation

The next critical step for OpenEuroLLM is the release of its first models by July 31, 2026. The project’s success hinges on overcoming current compute bottlenecks, which will be assessed through these models’ quality and scale. The upcoming months will also reveal whether additional funding or infrastructure investments are needed to meet project goals. For more context, see Minerva, Italy’s national AI project. Monitoring the consortium’s progress and resource commitments will be essential for understanding Europe’s broader AI sovereignty trajectory.

Key Questions

What is the main goal of OpenEuroLLM?

OpenEuroLLM aims to develop open-source, multilingual large language models for Europe, enhancing AI independence and linguistic diversity across the continent.

What are the key challenges facing the project?

The primary challenge is securing enough computational resources to train and finalize the models, which could delay or limit their scale and quality.

When will the first models be available?

The first models are scheduled for release by July 31, 2026, but this depends on overcoming current resource limitations.

How does this project compare to national efforts like Minerva or AMÁLIA?

Unlike national projects, OpenEuroLLM is a pooled-resource initiative intended to leverage pan-European infrastructure, but it faces similar resource constraints that could impact all three approaches.

What does the absence of Mistral imply for the project?

The lack of participation from Mistral, a leading French AI company, may limit the consortium’s strategic capacity and influence the overall success of the European sovereign-LLM effort.

Source: ThorstenMeyerAI.com

Nothing in this article is financial or investment advice. Cryptocurrency and precious-metal investments carry significant risk — do your own research and consider a licensed advisor.

OpenEuroLLM. The third path.

Up next

Software engineering. The canonical case.

Author

DreamRidiculous Team

Share article

OpenEuroLLM.
The third
path.

Even at pan-European scale, compute is the bottleneck.

HKUXZR C612 NAS Motherboard LGA2011-3, 10x SATA 6Gbps, 4X 2.5GbE Intel i226-V, 2X M.2 NVMe, 2X PCIe x16, DDR4, Server Workstation ITX Mainboard for Xeon E5 V3/V4 24 * 24cm

12 universities. 6 companies. 3 HPC centers. One conspicuous absence.

The AI-Native Cloud: Architecting and Optimizing Multi-Cloud Systems for Generative AI, Edge Computing, and FinOps

Eleven deliverables. Two shipped. Nine pending.

Large-Scale AI Engineering: Design, Train, and Optimize Foundation Models on NVIDIA GPU Clusters

Three answers. Three structural findings.

Corsair AI Workstation 300 Desktop PC – AMD Ryzen AI Max 385 CPU – AMD Radeon 8050S iGPU (Up to 48GBs vRAM) – 64GB LPDDR5X 8000MHz Memory – 1TB M.2 SSD – Black

First models in six weeks. Three scenarios.

Impact of Compute Limitations on European Sovereign LLMs

European Sovereign-LLM Strategies and Resource Constraints

Unresolved Challenges and Future Model Performance

Upcoming Milestone: First Models and Resource Allocation

Key Questions

What is the main goal of OpenEuroLLM?

What are the key challenges facing the project?

When will the first models be available?

How does this project compare to national efforts like Minerva or AMÁLIA?

What does the absence of Mistral imply for the project?

The Roblox Cheat That Broke Vercel.

The Agent Trap: Why 90% of AI “Launches” Are Infrastructure Liars

The Enforcement Countdown: 89 Days Until the EU AI Act’s GPAI Penalty Phase Begins

Software engineering. The canonical case.

Data processing agreement tracker for micro SaaS teams

5 Best PC Tablets Prime Day Deals in 2026

6 Best PC Motherboards Prime Day Deals in 2026

Aleph Alpha. The retrospective case.

OpenEuroLLM. The third path.

Up next

Author

DreamRidiculous Team

Share article

Even at pan-European scale, compute is the bottleneck.

HKUXZR C612 NAS Motherboard LGA2011-3, 10x SATA 6Gbps, 4X 2.5GbE Intel i226-V, 2X M.2 NVMe, 2X PCIe x16, DDR4, Server Workstation ITX Mainboard for Xeon E5 V3/V4 24 * 24cm

12 universities. 6 companies. 3 HPC centers. One conspicuous absence.

The AI-Native Cloud: Architecting and Optimizing Multi-Cloud Systems for Generative AI, Edge Computing, and FinOps

Eleven deliverables. Two shipped. Nine pending.

Large-Scale AI Engineering: Design, Train, and Optimize Foundation Models on NVIDIA GPU Clusters

Three answers. Three structural findings.

Corsair AI Workstation 300 Desktop PC – AMD Ryzen AI Max 385 CPU – AMD Radeon 8050S iGPU (Up to 48GBs vRAM) – 64GB LPDDR5X 8000MHz Memory – 1TB M.2 SSD – Black

First models in six weeks. Three scenarios.

Impact of Compute Limitations on European Sovereign LLMs

European Sovereign-LLM Strategies and Resource Constraints

Unresolved Challenges and Future Model Performance

Upcoming Milestone: First Models and Resource Allocation

Key Questions

What is the main goal of OpenEuroLLM?

What are the key challenges facing the project?

When will the first models be available?

How does this project compare to national efforts like Minerva or AMÁLIA?

What does the absence of Mistral imply for the project?

You May Also Like