My personal collection of interesting models I've quantized from the past week (yes, just one week)

noneabove1182@sh.itjust.works · 11 months ago

will_a113 · 11 months ago

Do you do any kind of before/after testing of these to measure performance/accuracy changes? I’ve always wondered if there is some way to generalize the expected performance changes at different quantizations.

noneabove1182@sh.itjust.works · 11 months ago

You can get the resulting PPL (perplexity), but that's only going to give you a sanity check at best. In an ideal world there would be something like lmsys' chat arena that could compare unquantized vs. quantized models head to head, but that doesn't exist yet.
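For anyone wondering what that PPL sanity check boils down to, here is a minimal sketch. It assumes you already have per-token natural-log probabilities for the same text from both the unquantized and the quantized model; the `perplexity` helper and the numbers below are made up for illustration and are not measurements of any real model.

```python
import math


def perplexity(token_logprobs):
    """Perplexity = exp(-mean log-probability) over the evaluated tokens."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


# Hypothetical per-token natural-log probabilities for the same text,
# invented for this example (not real model output).
full_precision_logprobs = [-1.20, -0.85, -2.10, -0.40]
quantized_logprobs = [-1.25, -0.90, -2.30, -0.45]

ppl_full = perplexity(full_precision_logprobs)
ppl_quant = perplexity(quantized_logprobs)

# Relative increase in perplexity after quantization (lower PPL is better).
relative_increase = (ppl_quant - ppl_full) / ppl_full

print(f"full: {ppl_full:.3f}  quantized: {ppl_quant:.3f}  "
      f"increase: {relative_increase:.1%}")
```

A small relative increase suggests the quant didn't break anything obvious, but it says little about how the model actually behaves in chat, which is why it's only a sanity check.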

x.com