manitcor@lemmy.intai.tech to

Learn Machine Learning@sh.itjust.worksEnglish · 1 year ago

OpenChat_8192 - The first model to beat 100% of ChatGPT-3.5


cross-posted from: https://lemmy.intai.tech/post/40699

Models

openchat

openchat_8192

opencoderplus

Datasets

openchat_sharegpt4_dataset

Repos

openchat

Related Papers

LIMA: Less Is More for Alignment

ORCA

Credit:

Tweet

Archive:

@Yampeleg The first model to beat 100% of ChatGPT-3.5 Available on Huggingface

🔥 OpenChat_8192

🔥 105.7% of ChatGPT (Vicuna GPT-4 Benchmark)
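For readers unfamiliar with how a figure like "105.7% of ChatGPT" comes about: in the Vicuna-style evaluation, a judge model (GPT-4) scores each answer, and the headline number is usually reported as the candidate's total score divided by the reference model's total. The sketch below assumes that aggregation; the function name and the scores in the usage note are hypothetical, not taken from the OpenChat evaluation.

```python
from typing import Sequence


def relative_score(candidate: Sequence[float], reference: Sequence[float]) -> float:
    """Return the candidate's total judge score as a percentage of the reference's.

    Assumption: both lists hold per-question judge scores for the same
    questions, in the same order.
    """
    if len(candidate) != len(reference):
        raise ValueError("score lists must cover the same questions")
    total_reference = sum(reference)
    if total_reference == 0:
        raise ValueError("reference scores must not sum to zero")
    return 100.0 * sum(candidate) / total_reference
```

With made-up scores, `relative_score([10, 10], [8, 8])` gives 125.0, meaning the candidate collected 25% more judge points than the reference. A value above 100 says nothing about any single question; it only compares totals.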

Less than a month ago, the world watched ORCA [1] become the first model ever to outpace ChatGPT on Vicuna’s benchmark.

Today, the race to replicate these results in open source comes to an end.

Minutes ago OpenChat scored 105.7% of ChatGPT.

But wait! There is more!

Not only did OpenChat beat Vicuna’s benchmark, it did so by pulling off a LIMA [2] move!

Training was done using 6K GPT-4 conversations out of the ~90K ShareGPT conversations.
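A rough sketch of what that kind of selection could look like, keeping only conversations attributed to GPT-4. The post does not describe how the authors actually filtered the data, so the record layout (a `model` field on each conversation) and the helper names are hypothetical.

```python
from typing import Any, Dict, Iterable, List


def select_gpt4_conversations(
    records: Iterable[Dict[str, Any]],
    model_field: str = "model",  # hypothetical field name
    target: str = "gpt-4",
) -> List[Dict[str, Any]]:
    """Keep records whose model label matches the target, ignoring case."""
    return [r for r in records if str(r.get(model_field, "")).lower() == target]


def kept_fraction(kept: int, total: int) -> float:
    """Share of the original data that survives the filter."""
    if total <= 0:
        raise ValueError("total must be positive")
    return kept / total
```

Using the post's round numbers, `kept_fraction(6_000, 90_000)` is about 0.067, so roughly one conversation in fifteen is kept.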

The model comes in three versions: the basic OpenChat model, OpenChat-8192, and OpenCoderPlus (code generation: 102.5% of ChatGPT).

This is a significant achievement, considering that it’s the first (released) open-source model to surpass ChatGPT on the Vicuna benchmark. 🎉🎉

OpenChat: https://huggingface.co/openchat/openchat

OpenChat_8192: https://huggingface.co/openchat/openchat_8192 (best chat)

OpenCoderPlus: https://huggingface.co/openchat/opencoderplus (best coder)

Dataset: https://huggingface.co/datasets/openchat/openchat_sharegpt4_dataset

Code: https://github.com/imoneoi/openchat

Congratulations to the authors!!

[1] - Orca: The first model to cross 100% of ChatGPT: https://arxiv.org/pdf/2306.02707.pdf

[2] - LIMA: Less Is More for Alignment - TL;DR: Using a small number of VERY high-quality samples (1000 in the paper) can be as powerful as much larger datasets: https://arxiv.org/pdf/2305.11206


simple@lemmy.world · 1 year ago
Big if true. Is there a Hugging Face link to try the model?
- manitcor@lemmy.intai.tech (OP) · 1 year ago
  Links in the body
  - simple@lemmy.world · 1 year ago
    Seems like the model is too big to try for free on Hugging Face. I guess I’ll wait until someone hosts this for others to try.
    - manitcor@lemmy.intai.tech (OP) · 1 year ago
      Give it 1-2 weeks; someone will post a free one, and I’ll post it here.
