• bjorney@lemmy.ca
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    2
    ·
    2 hours ago

    I’m sorry but this says nothing about how they lied about the training cost - nor does their citation. Their argument boils down to “that number doesn’t include R&D and capital expenditures” but why would that need to be included - the $6m figure was based on the hourly rental costs of the hardware, not the cost to build a data center from scratch with the intention of burning it to the ground when you were done training.

    It’s like telling someone they didn’t actually make $200 driving Uber on the side on a Friday night because they spent $20,000 on their car, but ignoring the fact that they had to buy the car either way to get to their 6 figure day job

    • ebu@awful.systems
      link
      fedilink
      English
      arrow-up
      3
      ·
      32 minutes ago

      i think you’re missing the point that “Deepseek was made for only $6M” has been the trending headline for the past while, with the specific point of comparison being the massive costs of developing ChatGPT, Copilot, Gemini, et al.

      to stretch your metaphor, it’s like someone rolling up with their car, claiming it only costs $20 (unlike all the other cars that cost $20,000), when come to find out that number is just how much it costs to fill the gas tank up once

  • veroxii@aussie.zone
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    2
    ·
    5 hours ago

    banned from use by government employees in Australia

    So is every other AI except copilot built into Microsoft products. Government employees can’t use chatgpt directly. So this point is a bit disingenuous.

  • leisesprecher@feddit.org
    link
    fedilink
    English
    arrow-up
    20
    arrow-down
    2
    ·
    7 hours ago

    Even if they greatly underreported costs and their services are banned: the models are out there, open source and way more efficient than anything Meta and OpenAI could produce.

    So it’s pretty obvious that the tech giants are burning money for mediocre output.

    • tyler@programming.dev
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      2
      ·
      2 hours ago

      I’m very confused by this, I had the same discussion with my coworker. I understand what the benchmarks are saying about these models, but have any of y’all actually used deepseek? I’ve been running it since it came out and it hasn’t managed to solve a single problem yet (70b param model, I have downloaded the 600b param model but haven’t tested it yet). It essentially compares to gpt-3 for me, which only cost OpenAI like $4-9 million to train (can’t remember the exact number right now).

      I just do not see the “efficiency” here.

      • self@awful.systems
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 hours ago

        what if none of it’s good, all of it’s fraud (especially the benchmarks), and having a favorite grifter in this fuckhead industry is just too precious

      • Pup Biru@aussie.zone
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        4
        ·
        41 minutes ago

        i haven’t seen another reasoning model that’s open and works as well… it’s LLM base is for sure about GPT-3 levels (maybe a bit better?) but like the “o” in GPT-4o

        the “thinking” part definitely works for me - ask it to do maths for example, and it’s fascinating to see it break down the problem into simple steps and then solve each step

        • blakestacey@awful.systems
          link
          fedilink
          English
          arrow-up
          1
          ·
          14 minutes ago

          the “thinking” part definitely works for me

          [bites tongue, tries really hard to avoid the obvious riposte]

  • Empricorn@feddit.nl
    link
    fedilink
    English
    arrow-up
    8
    ·
    6 hours ago

    I’m sure the next AI will be the ethical, uncensored, environmentally sustainable one…

  • skillissuer@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    12
    ·
    8 hours ago

    wait, 2021 was when crypto was still a thing vcs poured money into, so that might be yet another case of crypto to ai pivot

    • ikt@aussie.zone
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      5
      ·
      edit-2
      1 hour ago

      Jesus you still think AI is comparable to crypto? What year are you in 2022?

        • skillissuer@discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          4
          ·
          29 minutes ago

          ai is pushed by the same people as crypto, uses the same resources as crypto, captures attention of the same libertarian-brained vcs wanting to build their neofeudal empires, gives result equally as useless, unwanted and aggressively pushed by people that bought into it, not to mention crimes against environment, logic, abuse of workforce or general waste of everyone’s time and attention. but nOo iTs CoMpLeTeLy dIfFeReNt tHiS tImE