• lloram239@feddit.de
    1 year ago

    The difference is that repeating a quote does not need new information; it’s all already in the text prompt. The current direction, on the other hand, is not in the text; it has to be derived from the instructions. If you ask GPT to break the problem down into steps, you shrink the size of the problem dramatically. It can handle one or two turns in a single step; it’s only when you increase the number of turns that it gets it wrong and can’t answer in one step.

    It’s really not much different from humans here. If I read all those turn instructions at once, I have no idea where things will end up either. I have to break the problem down and keep track of the direction at each step.
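
    To make "keep track of the direction at each step" concrete, here is a rough Python sketch. The heading names, the turn vocabulary ("left", "right", "around") and the function name `track_heading` are my own assumptions for illustration, not the exact wording of the prompt discussed in this thread. The point is only that each step needs nothing but the previous heading, which is the state the model has to write out.

```python
# Hypothetical sketch: headings in clockwise order, so a turn is index arithmetic mod 4.
HEADINGS = ["north", "east", "south", "west"]


def track_heading(start, turns):
    """Return the heading after every turn, i.e. the intermediate state written out."""
    idx = HEADINGS.index(start)
    trace = []
    for turn in turns:
        if turn == "right":
            idx = (idx + 1) % 4
        elif turn == "left":
            idx = (idx - 1) % 4
        elif turn == "around":
            idx = (idx + 2) % 4
        else:
            raise ValueError(f"unknown turn: {turn!r}")
        # Recording the heading here is the "talk it through" part:
        # the next step only has to look at this one value.
        trace.append(HEADINGS[idx])
    return trace


steps = track_heading("north", ["right", "right", "left", "around", "left"])
print(steps)  # ['east', 'south', 'east', 'west', 'south']
```

    Jumping straight to the final answer means doing all of those updates without ever writing down the trace, and that gets harder as the list of turns grows.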

    How, if you’re saying that this is just an inherent limitation of LLMs, can GPT-4 do it?

    GPT-4 is just bigger, meaning it can handle larger problems in one step. It should still fail if you give it the same simple problem but make it longer.

    • mo_ztt ✅@lemmy.world
      1 year ago

      Hm… yeah, I see what you’re saying. It’s not capable of maintaining “hidden” state as it goes step by step through the output, but if you have it talk its way through that hidden state explicitly, it can do it. I can agree with that.