About
ByteDance's aggressive web crawler used to collect training data for their AI models including those behind TikTok and Doubao. Known for high crawl rates and ignoring rate limits.
Purpose
AI model training and content indexing
User Agent String
Mozilla/5.0 (Linux; Android 5.0) AppleWebKit/537.36 (KHTML, like Gecko; compatible; Bytespider; spider-feedback@bytedance.com)
How to Control in robots.txt
🚫 Block Bytespider
User-agent: Bytespider Disallow: /
✅ Allow Bytespider
User-agent: Bytespider Allow: /
⚠️ Bytespider has been observed ignoring robots.txt directives. You may need server-level blocking (e.g., firewall rules or user-agent filtering) to effectively prevent access.
Complete Guide: How to Block Bytespider
Server-level blocking, nginx configs, Cloudflare rules, Next.js middleware, and more →
Is Bytespider crawling your site?
Run a free scan to check if ByteDance's crawler is accessing your website.
Check if Bytespider is crawling YOUR site →