• melroy@kbin.melroy.org
    link
    fedilink
    arrow-up
    1
    arrow-down
    1
    ·
    2 days ago

    I see ok. I only want to add that DeepSeek is not the first or the only model that is using mixture-of-experts (MoE).