Llemma: An Open Language Model For Mathematics

blog.eleuther.ai

cross-posted to:
models@lemmy.intai.tech

Posted by ylai to Free Open-Source Artificial Intelligence@lemmy.world · English · 1 year ago

ArXiv | Models | Data | Code | Blog | Sample Explorer

Today we release Llemma: 7 billion and 34 billion parameter language models for mathematics. The Llemma models were initialized with Code Llama weights, then trained on the Proof-Pile II, a 55 billion token dataset of mathematical and scientific documents. The resulting models show improved mathematical capabilities, and can be adapted to various tasks through prompting or additional fine-tuning.
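As a rough illustration of the "adapted through prompting" route, here is a minimal few-shot prompting sketch. It is not taken from the Llemma release: the worked examples, the `Problem:`/`Solution:` layout, and the helper names `build_prompt` and `generate` are all hypothetical, and the Hugging Face model id `EleutherAI/llemma_7b` is an assumption. Model loading uses the standard `transformers` calls and is imported lazily, so the prompt builder works on its own.

```python
from typing import List, Tuple

# Hypothetical worked examples for illustration; not from the Llemma release.
FEW_SHOT: List[Tuple[str, str]] = [
    ("What is 12 * 7?", "12 * 7 = 84. Final answer: 84"),
    ("Solve x + 3 = 10 for x.", "Subtract 3 from both sides: x = 7. Final answer: 7"),
]


def build_prompt(question: str, examples: List[Tuple[str, str]] = FEW_SHOT) -> str:
    """Format worked examples followed by the new, unanswered question."""
    parts = [f"Problem: {q}\nSolution: {a}" for q, a in examples]
    parts.append(f"Problem: {question}\nSolution:")
    return "\n\n".join(parts)


def generate(question: str,
             model_id: str = "EleutherAI/llemma_7b",  # assumed hub id
             max_new_tokens: int = 128) -> str:
    """Complete the few-shot prompt with a causal language model."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens and decode only the newly generated continuation.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `build_prompt("What is the derivative of x^2?")` returns the prompt text only; `generate(...)` additionally downloads the model weights, so it needs network access and enough memory for a 7B model.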
