For those unaware, YaCy is an open source indexing software with some p2p search capabilities. However, if you’ve run it yourself, you’d know that the search results are very bad. Therefore I am wondering if this is a resources issue or a YaCy software issue, or both.

Had YaCy had really huge resources for the crawling and indexing, would it have been a good enough search engine to replace google/ddg?

  • onlinepersona@programming.dev
    link
    fedilink
    English
    arrow-up
    10
    ·
    8 months ago

    I think the YaCy dev hopes so 🤔 But I’m not sure it’s written to be “hyperscale” or scalable at all. It seems to be a monolithic thing with the indexer, search enginer, crawler, etc. all in one. My guess is: no

    After some searching, there seems to have been an effort to rewrite YaCy in to microservers called YaCy Grid. Soooo maybe?

    It’s unfortunate that all the new “privacy searchengines” like duckduckgo, startpage, ecosia, qwant, and others, all decided to write their own new solution on top of Bing instead of participating in YaCy or some other opensource solution. Doesn’t make sense to me and makes me trust them much less (but still more than google and bing individually).

    Anti Commercial AI thingy

    CC BY-NC-SA 4.0

    • Faresh
      link
      fedilink
      English
      arrow-up
      9
      ·
      8 months ago

      Anti Commercial AI thingy

      I don’t think a license will prevent language models from using your post. If anything, you are allowing people to use your post for more stuff it couldn’t otherwise be used, since a license is you giving someone permission to use your work in a certain way, but if you don’t give a license, copyright law assumes that you haven’t given permission.

    • UraniumBlazer@lemm.eeOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      8 months ago

      That’s sad… The YaCy grid seems to have no commits for at least 2 years. Yeah, that’s disappointing…