Heyas, wondering if there’s an open sourced piece of software or the like, that could scrape media platforms for a specific topic. Platforms like YT, X, Lemmy, News Media, etc., perhaps using RSS? But, a program I can host on my server, that only I have access too, via webpage, CLI, whatever…
Thanks for any info…
FreshRSS has been working great for me! It even has the ability for web scraping if you need it.
Right when I saw you reply, I saw a post about it. Digging in to it now. Thanks!
Seconding the recommendation for FreshRSS, it’s the one I ended up hosting when I looked into this a while back and it’s been really great. Takes a minute to get everything setup, especially if you want to have different settings for different types of feeds, but once it’s all set it’s perfect (for my needs at least).
I’ve also got it setup with my domain so I can access the feed from anywhere and that’s been one of my favourite features.
Everyone is suggesting readers. I think you are looking for something like https://docs.rsshub.app it’s capable of generating RSS feeds from pretty much everything.
This is it. Exactly what I was looking for. Thanks much!
You will also be interested into https://wiki.archlinux.org/title/Web_feed#Obtaining_web_feeds
I use Miniflux, it’s a lightweight RSS reader
That looks great as well. I like that I can integrate with my own domain.
And they just added Omnivore integration, which I’m so excited for.
Oh, this looks nice! I need to try this!
Freshrss is really great!
I use rss-bridge for scraping sites that don’t offer rss feeds: https://rss-bridge.github.io/rss-bridge/index.html
seconded. the built-in custom css selection is excellent. ive been strongly considering self-hosting an RSS bridge, but i think my server has too much unpredictable downtime for it
New Lemmy Post: Self Hosting an RSS feed for news/media/etc? (https://lemmy.world/post/9805996)
Tagging: #SelfHosted(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)
I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md
I use tt-rss.
Check out https://awesome-selfhosted.net/tags/feed-readers.html
YouTube has RSS feeds you can access without scraping, but it’s per channel so if you follow a lot of channels you’ll be following a lot of RSS feeds.
Lemmy also has RSS feeds for each community.
Are you looking for a reader instead? A reader aggregates the feeds and displays them. Usually it keeps track of which items you’ve already read.
Jumping in to ask if there’d be a good reason to use a stand alone feed reader instead of using the nextcloud “news” app?