Vicuna v1.5 Has Been Released!

@Blaed@lemmy.world · 9 months ago

Vicuna v1.5 Has Been Released!

@Kerfuffle@sh.itjust.works · 9 months ago

Is anyone using these small models for anything? I feel like an LLM snob but I don’t feel motivation to even look at anything less than 70-40B when it’s possible to use those models.

@Blaed@lemmy.world · edit-2 9 months ago

I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.

Granted, it wasn’t anything I’d deploy to production - but using the smaller models to prototype quick ideas is great before having to rent a gpu and spend time working with the bigger models.

Give a few models a try! You might be pleasantly surprised. There’s plenty to choose from too. You will get wildly different results depending on your use case and prompting approach.

Let us know if you end up finding one you like! I think it is only a matter of time before we’re running 40B+ parameters at home (casually).

@Kerfuffle@sh.itjust.works · 9 months ago

It is only a matter of time before we’re running 40B+ parameters at home (casually).

I guess that’s kind of my problem. :) With 64GB RAM you can run 40, 65, 70B parameter quantized models pretty casually. It’s not super fast, but I don’t really have a specific “use case” so something like 600ms/token is acceptable. That being the case, how do I get excited about a 7B or 13B? It would have to be doing something really special that even bigger models can’t.

I assume they’ll be working on a Vicuna-70B 1.5 based on LLaMA to so I’ll definitely try that one out when it’s released assuming it performs well.

Vicuna v1.5 Has Been Released!

Vicuna v1.5 Has Been Released!

Vicuna v1.5 Has Been Released!

Starting off with Vicuna v1.5

Vicuna v1.5 GPTQ

7B

13B

Vicuna Model Card

Model Details

Developed by: LMSYS

Model Sources

Uses

How to Get Started with the Model

Training Details

Evaluation Results