• average650@lemmy.world
    link
    fedilink
    arrow-up
    6
    ·
    2 years ago

    That isn’t actually what’s important. It’s the frequency of the token, which could be as simple as single characters. The frequency of those is certainly not zero.

    LLMs absolutely can make up new words, word combinations, or sentences.

    That’s not to say chatgpt can actually give you good windows keys, but it isn’t a fundamental limitation of LLMs.

    • Demonen
      link
      fedilink
      arrow-up
      2
      ·
      2 years ago

      Okay, I’ll take your word for it.

      I’ve never ever, in many hours of playing with ChatGPT as a toy, had it make up a word. Hallucinate wildly, yes, but not stogulate a word out of nothing.

      I’d love to know more, though. How does it combine new words? Do you have any examples of words ChatGPT has made up? This is fascinating to me, as it means the model is much less chained to the training data than I thought.

      • gun
        link
        fedilink
        arrow-up
        3
        ·
        2 years ago

        It can create new words, I just verified this. First word it gave me: flumjangle. Google gives me 0 results. Maybe Google is missing something and it exists in some data out there, Idk.

        I’m not sure what is so impressive about this though. Language models can string words together in unique ways, why would it be different for characters?

        • Christian
          link
          fedilink
          arrow-up
          4
          ·
          2 years ago

          I’m just surprised to hear that google hasn’t found out about my flumjangles yet.

      • relevants@feddit.de
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        A lot of compound words are actually multiple tokens so there’s nothing stopping the LLM from generating the tokens in a new order thereby creating a new word.