• QuadratureSurfer@lemmy.world
    6 months ago

    Similar use cases to what I’m doing right now: running LLMs like Mixtral 8x7B (or something better by the time we start seeing these), Whisper for speech-to-text (STT), or Stable Diffusion.

    I use a fine-tuned version of Mixtral (dolphin-Mixtral) for coding purposes.

    Transcribing live audio for notes/search, or translating audio from different languages using Whisper. This is especially useful for verifying claimed translations from Russian/Ukrainian/Hebrew/Arabic, given all of the fake information being thrown around.

    Combine the 2 models above with a text-to-speech (TTS) system, a vision model like LLaVA, and some animatronics, and then I’ll have my own personal GLaDOS: https://github.com/dnhkng/GlaDOS

    And then there’s Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.

    • Alphane Moon
      6 months ago

      Thank you! I currently use my 3080 dGPU for Stable Diffusion. I wonder to what extent NPUs will be usable with Stable Diffusion XL.