• Anticorp
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    1
    ·
    edit-2
    1 年前

    The section about the sky would mention it, so then you go to the index in the R book, find the entry for that phenomenon, and read about Raleigh Scattering. The internet is definitely easier for finding random information though, although it’s harder now than it was like 10 years ago. ChatGPT is amazing for finding random information, but you have to verify what it tells you, since it will just randomly lie for no reason.

    • merc@sh.itjust.works
      link
      fedilink
      arrow-up
      6
      arrow-down
      1
      ·
      1 年前

      It doesn’t “lie” though, it just generates a plausible sequence of words. The sort-of fortunate thing is that facts are often plausible, and it’s going to be trained on a lot of facts. But, facts aren’t the only word-sequences that are plausible, and LLMs are trained to be creative, and that means sometimes choosing a next-word that isn’t the best fit, which might end up meaning the generated sentence isn’t factual.

      Calling it a “lie” suggests that it knows the truth, or that it is being deceptive. But, that’s giving “spicy autocomplete” too much credit. It simply generates word salads that may or may not contain truths.

      • Anticorp
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 年前

        The industry word for it is “hallucination”, but I’m not sure that fits either.

        • merc@sh.itjust.works
          link
          fedilink
          arrow-up
          2
          ·
          1 年前

          It’s better than lying, but it still implies consciousness. It also implies that it’s doing something different than what it normally does.

          In reality, it’s always just generating plausible words.

          • Anticorp
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            edit-2
            1 年前

            It is certainly more complex than a predictive text machine. It does seem to understand the concept of objective truth, and facts, vs interpretation and inaccurate information. It never intentionally provides false information, but sometimes it thinks it is giving factual information when really it is using an abundance of inaccurate information that it was trained with. I’m honestly surprised at how accurate it usually is, considering it was trained with public data from places like Reddit, where common inaccuracies have reached the level of folklore.

            • merc@sh.itjust.works
              link
              fedilink
              arrow-up
              2
              ·
              1 年前

              It is certainly more complex than a predictive text machine

              No, it literally isn’t. That’s literally all it is.

              It does seem to understand

              Because people are easily fooled, but what it seems like isn’t what’s actually happening.

              but sometimes it thinks it is giving factual information

              It’s incapable of thinking. All it does is generate a plausible sequence of words.

      • Anticorp
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        1 年前

        The internet wasn’t allowed for school reports until after I was through with college the first time around. The World Wide Web didn’t even exist for the first half of my life.

        Edit: it’s kind of crazy that my career revolves around something that didn’t even exist when people were still asking me what I wanted to be when I grew up. Although, “engineer” was a frequent answer to that question, and that’s certainly in my title now, but it’s an entirely different kind of engineering than I meant back then.