• hottari
    link
    fedilink
    arrow-up
    12
    arrow-down
    2
    ·
    1 year ago

    Isn’t ChatGPT’s launch only less than 6 months old or something…

    • Peanut@sopuli.xyz
      link
      fedilink
      arrow-up
      3
      arrow-down
      1
      ·
      1 year ago

      Reminds me of the article saying open ai is doomed because it can only last about thirty years with its current level of expenditure.

        • diffuselight@lemmy.world
          link
          fedilink
          arrow-up
          2
          ·
          1 year ago

          Cost reduction in the field is orders of magnitude potential. Look at llama running on everything down to a raspy pi after 2 months.

          There are massive gains to be made - once we have dedicated hardware for transformers, that’s orders of magnitude more.

          See your phone being able to playback 24h of video but die after 3h of browsing? Dedicated hardware codec support

          • hottari
            link
            fedilink
            arrow-up
            1
            arrow-down
            4
            ·
            1 year ago

            Yeah but Llama’s quality cannot compete with ChatGPT models (Doesn’t matter what model you use, if you want good and FAST results, you require serious compute). We do have commercial dedicated AI chips from NVDA, last time I checked you had to make an order to even get a price. George Hotz who is also working on something similar, by his account from a Lex Fridman podcast mentioned that a personal AI rig would have to be closer to a mainframe’s size.

            There’s nothing I have seen so far that leads me to believe that generative AI gets more efficient with weaker hardware.

            • diffuselight@lemmy.world
              link
              fedilink
              arrow-up
              3
              ·
              edit-2
              1 year ago

              The trajectory is such that current L2 70B models are easily beating 3.5 and are approaching GPT4 performance - an A6000 can run them comfortably and this is a few months only after release.

              Nah the trajectory is not in favor of proprietary, especially since they will have to dumb down due to alignment more and more

              https://www.anyscale.com/blog/llama-2-is-about-as-factually-accurate-as-gpt-4-for-summaries-and-is-30x-cheaper?trk=feed_main-feed-card_feed-article-content

              • hottari
                link
                fedilink
                arrow-up
                1
                arrow-down
                2
                ·
                edit-2
                1 year ago

                An A6000 ranges between $4500 and $7000 . We are a long long way from reaching efficiency on affordable consumer grade hardware.

                • diffuselight@lemmy.world
                  link
                  fedilink
                  arrow-up
                  1
                  ·
                  1 year ago

                  A 30B model which will be fine for specialized tasks runs on a 3090 or any modern mac today.

                  We are months away from being affordable at current trajectory

                  • hottari
                    link
                    fedilink
                    arrow-up
                    1
                    arrow-down
                    2
                    ·
                    1 year ago

                    Which modern Mac are you talking about and how much does that cost? Again, I doubt any of the opensource 30B models can compete even with ChatGPT 3.5. Which is the point I started with earlier.

                    Seems to me like you are riding this whole efficiency thing on nothing more than hopium.