Europe will have its own AI: OpenEuroLLM

raskolnikov@lemm.ee · 9 hours ago

bruce965 · edit-2 9 hours ago

I am conflicted about this choice. I am happy that the EU Commission will invest funds into open source technologies, but at the same time the US and China are already investing enough into “free as in free beer” models. Is it really worth it building yet another model?

Why not fund open source software development instead of funding machine learning? €20 million would do miracles divided between a few teams of developers, but it might be mere breadcrumbs for machine learning training.

DavidGarcia@feddit.nl · 5 hours ago

2 main issues with the lack of Euro models: 1) Performance of all SOTA models is much better in English. 2) US models have US values. It’s yet another tool to culturally assimilate Europe (and the rest of the world too)

Anyone@slrpnk.net · 8 hours ago

Is it really worth it building yet another model?

Yes, it is, for independence and many other reasons. It’ll be multilingual and legally compliant, it comes without Chinese or other censorship, and it is open source, unlike DeepSeek, ChatGPT, and others.

bruce965 · 8 hours ago

Mmh, okay that makes sense. Especially the multilinguality would be pretty important. As for the legality, we’ll see how it goes. Do we even know if it’s really possible to build a good model with only legally acquired data?

As for the censorship, as far as I know, for DeepSeek’s models it’s injected in the prompt after the training is completed, so it shouldn’t really be censored if you run it locally.

But yeah, you have raised good points. Thanks.

Anyone@slrpnk.net · 8 hours ago

No, DeepSeek isn’t uncensored if you run it locally.

Everything that comes from China is censored, because private companies must comply with Chinese censorship laws.

RVGamer06@sh.itjust.works · 5 hours ago

The local version could theoretically be uncensored through abliteration tho
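
For anyone wondering what abliteration means in practice, here is a toy sketch, assuming the usual description of the technique: estimate a "refusal direction" as the difference between mean activations on prompts the model refuses and prompts it answers, then project that direction out. The function names and the tiny 3-dimensional vectors are made up for illustration; a real run would work on a model's hidden states and weights, not hand-written lists.

```python
import math

def mean_vector(rows):
    # Column-wise mean of a list of equal-length vectors.
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def refusal_direction(refused_acts, answered_acts):
    # Difference of means between the two groups, normalized to unit length.
    diff = [r - a for r, a in zip(mean_vector(refused_acts),
                                  mean_vector(answered_acts))]
    norm = math.sqrt(sum(x * x for x in diff))
    return [x / norm for x in diff]

def ablate(vector, direction):
    # Remove the component of `vector` that lies along `direction`.
    dot = sum(v * d for v, d in zip(vector, direction))
    return [v - dot * d for v, d in zip(vector, direction)]

# Hypothetical activations: the two groups differ only in the first dimension.
refused = [[2.0, 1.0, 0.0], [2.0, 3.0, 0.0]]
answered = [[0.0, 1.0, 0.0], [0.0, 3.0, 0.0]]

direction = refusal_direction(refused, answered)
cleaned = ablate([5.0, 2.0, 1.0], direction)
print(direction, cleaned)
```

In this toy case the direction comes out as the first axis, so ablation zeroes that component and leaves the rest alone. Whether this works well on a given model is an empirical question.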

bruce965 · 8 hours ago

Understood, thanks 👍

BestBouclettes@jlai.lu · 8 hours ago

That money should definitely go towards funding sovereign cloud infrastructure and open source software instead of vaporware AI bullshit. Where will you run your LLMs if you have no infra…

albert180@discuss.tchncs.de · 2 hours ago

We already have sovereign Cloud Infrastructure (OVH, Scaleway etc…)

Most people use AWS, Azure and Google Cloud because of Resume Driven Development and the “nobody got fired for buying AWS” mindset, and most of them probably don’t need them.

BestBouclettes@jlai.lu · 2 hours ago

Nobody in Europe can realistically compete with AWS, GCP or Azure. Especially not OVH. They mostly focus on small and medium businesses and I wouldn’t trust them for large scale operations like the ones you can do on AWS. They had one too many dumb problems caused by poor design/decisions.
Maybe I should have been more precise: we don’t have sovereign hyperscalers in Europe.

albert180@discuss.tchncs.de · 2 hours ago

You don’t need those companies to run a big LLM in the Cloud.

You can do that on OVH, Scaleway etc…

BestBouclettes@jlai.lu · 1 hour ago

Fair enough!

Language model: OpenEuroLLM aims to make AI in EU more independent and diverse