No.72551
Make your own scraper. Start easy with a site with shit security then scale.
Have a team and COMMUNICATE, don't solo, it DOES NOT WORK for something of this scale. Either you do teamwork on darkweb or you solo on radicle but everything else is just weekend piracy, your scrapers will be broken way faster than you can fix them.
Don't be an asshole to your fellow devs. Listen to them, share ideas. If you're the sysop, ask the devs to keep you informed at all times and have your users and contributors do pentesting.
It might sound overwhelming but it really isn't. If you want something to last and not look like shit or break all the time you have to put some effort into it. Think of it like AI slop or deviantart/furaffinity crap, does all that mass-produced shit look creative and original to you? Well for a scraping and archival project it's the same.
Hope that helps answer your question anon. Good luck out there, let us know if you want to organize something and for the love of all that is unholy DO NOT USE GITHUB. Use codeberg, onedev, whatever, but not github.