Thankfully (1) there just aren't that many useful websites anymore. Most content lives in walled gardens. Negotiate a few deals to get around CloudFlare for those select few and you're set.
'Many a true word was said in jest' as the aphorism goes. IME useful information has become somewhat concentrated into nodes, instead of trawling blogs for info it's all on stackoverflow/Twitter/whatever (depending on subject).
ReCAPTCHA, hCaptcha, CloudFlare definitely don't make it easier to crawl the web than it was ten years ago