Response from Martin Woodward, GitHub’s VP of Developer Relations:

Sorry for the inconvenience @koepnick - while searching across all repos has required being logged in for a long time, when we enhanced the search capabilities earlier in the 2023 we had to extend this to repos as well (see https://github.blog/changelog/2023-06-07-code-search-now-requires-login/).

This is primarily to ensure we can support the load for developers on GitHub and help protect the servers from being overwhelmed by anonymous requests from bots etc.

  • OsrsNeedsF2P
    link
    fedilink
    arrow-up
    4
    ·
    7 months ago

    They are probably doing it to limit AI LLM bots from hoovering up the code they’ve already hoovered up.

    Why is this a bad thing, when M$ is already training on it themselves? If your code is permissively licensed, it seems logical or even desired to be scraped for LLMs

    • cybersandwich@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      7 months ago

      It’s not a bad thing. I was just saying that’s probably why they are doing it.

      Everyone is getting super protective about “their” data now.

      Oh yea, GitHub copilot is pretty nice too. (trained on all those repos!)

      I realize this is a “hate on GitHub” thread so I’m gonna get downvoted for this post too but it does everything I need it to do, the documentation is fantastic, and it’s the “defacto” repo for a lot of stuff.