
IMO you shouldn't have to keep track of what kinds of crawler bots exist and maintain a deny list. It should be the opposite: only expressly allowed crawlers should be able to crawl content, by maintaining an allow list.


You have been able to do the opposite since the inception of robots.txt: User-agent: * / Disallow: /, and then whitelist Googlebot and whatnot. Most of the web is already configured this way; just check the robots.txt of any major website, e.g. https://twitter.com/robots.txt
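
A minimal sketch of that allowlist-style robots.txt (the bot name is just an example): well-behaved crawlers obey only the most specific User-agent group that matches them, so an empty Disallow for the whitelisted bot works even without the later Allow: extension.

  # Let Googlebot crawl everything (empty Disallow = no restriction)
  User-agent: Googlebot
  Disallow:

  # Block all other crawlers
  User-agent: *
  Disallow: /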


The Allow: directive was an extension to robots.txt added later.



