• notfromhere
      link
      fedilink
      English
      arrow-up
      4
      ·
      12 hours ago

      Any is very hard to benchmark and is also not how humans are tested.