hok@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 5 days ago

Llama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?

6

18

Llama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?

hok@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 5 days ago

6

People are talking about the new Llama 3.3 70b release, which has generally better performance than Llama 3.1 (approaching 3.1’s 405b performance): https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_3

However, something to note:

Llama 3.3 70B is provided only as an instruction-tuned model; a pretrained version is not available.

Is this the end of open-weight pretrained models from Meta, or is Llama 3.3 70b instruct just a better-instruction-tuned version of a 3.1 pretrained model?

Comparing the model cards: 3.1: https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md 3.3: https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md

The same knowledge cutoff, same amount of training data, and same training time give me hope that it’s just a better finetune of maybe Llama 3.1 405b.

Chat

Mechanize@feddit.it
link
fedilink
English
arrow-up
1·
5 days ago
deleted by creator

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
33 users / week
58 users / month
246 users / 6 months
125 local subscribers
2.29K subscribers
224 Posts
869 Comments
Modlog