• fidodo@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      3
      ·
      9 months ago

      The llm is executing a function on a diffusion image model. The llm does not generate the image itself

      • kelvie@lemmy.ca
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        2
        ·
        9 months ago

        This doesn’t contradict what the OP said. ChatGPT is now an interface to both an LLM and a diffusion-based image generator.

      • tsonfeir@lemm.ee
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        2
        ·
        9 months ago

        You’re being pedantic—and confidently ignorant. The product is called “ChatGPT” and through that you can access multiple models. Like ChatGPT 3.5, or DALL•E.

      • CrayonRosary@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 months ago

        ChatGPT is just a front-end that maintains a session that gets fed to an LLM each time you add a reply, and now has access to image gen, too, so I was wrong.

    • h3rm17@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      3
      ·
      9 months ago

      Yeah, but the model that does the images is actually Dall-e, you are just using gpt’s interface to create them

    • Nexz@feddit.nl
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      2
      ·
      9 months ago

      I mean, the GPT model is a LLM and ChatGPT uses DALL-E in the background to create images. So depending on definition you’re both correct :-)

      • tsonfeir@lemm.ee
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        9 months ago

        Depending on how I define anything means I’m always correct I guess. 🤷‍♂️