Handbook of European HPC projects

Open Euro LLM

Open European Family of Large Language Models

Europe’s leading AI companies and research institutions combine their forces and expertise to develop next-generation open-source language models in an unprecedented collaboration to advance European AI capabilities, the OpenEuroLLM project.

A consortium of 20 leading European research institutions, companies and EuroHPC centres coordinated by Jan Hajič (Charles University, Czechia) and co-led by Peter Sarlin (AMD Silo AI, Finland) will build a family of performant, multilingual, large language foundation models for commercial, industrial and public services.

The transparent and compliant open-source models will democratize access to high-quality AI technologies and strengthen the ability of European companies to compete on a global market and public organizations to produce impactful public services.

The OpenEuroLLM project is aligned with the imperative to improve Europe’s competitiveness and digital sovereignty. The project is a prime example of the type of technology infrastructure needed to lower thresholds for European AI product development and refinement, demonstrating the strength of transparency, openness and community involvement, values largely recognized across the European tech ecosystem.

The models will be developed within Europe’s robust regulatory framework, ensuring alignment with European values while maintaining technological excellence. Cooperating with open-source and open science communities like LAION, open-sci and OpenML, and additional experts in the field assembled in the project’s Open Strategic Partnership Board, OpenEuroLLM will ensure that the models, software, data and evaluation will be fully open and can be fine-tuned and instruction-tuned for specific industry and public sector needs. These performant multilingual models preserve both linguistic and cultural diversity, enabling European companies to develop high-quality products and services in the era of AI.

The project, which has been awarded the STEP (Strategic Technologies for Europe Platform) seal, leverages support from previous European projects and the experience of the partners and their results, including large repositories of high-quality data and pilot LLMs developed previously.

PROJECT’S CONTACT:

Jan Hajič

Support to HPC-powered AI

Call:
DIGITAL-2024-AI-06

Coordinating Organization:
Charles University, Czechia

Project Timespan
2025-02-01 – 2028-01-31

Other Partners:
  • ALT-EDIC – Alliance for Language Technologies EDIC, France
  • Technische Universiteit Eindhoven, Netherlands
  • ELLIS Institute Tübingen, Germany
  • Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V., Germany
  • FZJ – Forschungszentrum Jülich GmbH, Germany
  • Lindholmen Science Park, Sweden
  • University of Helsinki, Finland
  • University of Oslo, Norway
  • University of Turku, Finland
  • Universität Tübingen, Germany
  • Silo GenAI, Finland
  • Aleph Alpha Research, Germany
  • ellamind, Germany
  • LightOn, France
  • Prompsit Language Engineering, Spain
  • BSC – Barcelona Supercomputing Center, Spain
  • CINECA – Consorzio Interuniversitario, Italy
  • CSC-IT Centre for Science Ltd, Finland
  • SURF, Netherlands