How to Block Google-NotebookLM: Stop Google's AI Research Tool from Crawling Your Site
NotebookLM went viral for its AI-generated podcasts. Behind it sits a crawler that reads your pages when users add your URLs as sources. Here's how to block it without touching your Google Search rankings.
Updated March 2026
What Is NotebookLM?
NotebookLM is Google's AI-powered research assistant. Users create "notebooks" by adding sources — URLs, PDFs, Google Docs, YouTube videos — and then interact with an AI that can answer questions, generate summaries, create study guides, and produce remarkably realistic two-person AI podcast episodes ("Audio Overviews") based on those sources.
It went viral in late 2024 primarily because of Audio Overviews — the feature generates podcast-style conversations that discuss and debate the content of uploaded sources with uncanny realism. Researchers, students, journalists, and knowledge workers adopted it rapidly.
When a user adds a URL as a source, Google-NotebookLM fetches and reads that page. The crawler is triggered by individual user actions — it doesn't bulk-crawl the web. Each fetch is tied to a specific notebook session.
Why Publishers Should Pay Attention
NotebookLM creates derivative works from your content: summaries, study guides, Q&A, and audio podcasts. Users interact with an AI version of your content inside Google's product — not your website. You lose:
How to Block Google-NotebookLM
Add this to your robots.txt:
User-agent: Google-NotebookLM Disallow: /
To block all Google AI crawlers while keeping search rankings:
# Keep search indexing User-agent: Googlebot Allow: / # Block AI training User-agent: Google-Extended Disallow: / # Block Deep Research User-agent: Gemini-Deep-Research Disallow: / # Block NotebookLM User-agent: Google-NotebookLM Disallow: /
Google Search is unaffected
Google-NotebookLM is completely separate from Googlebot. Blocking it has zero effect on your Google Search rankings, indexing, or organic traffic.
The Audio Overviews Concern
NotebookLM's Audio Overviews feature generates AI podcast episodes from your content. Two synthetic voices discuss, debate, and explain your material in a conversational format. These audio files can be shared and distributed.
For publishers, this is a new category of derivative work. Your original article or research becomes an AI-generated podcast that:
The generated podcast doesn't necessarily credit your original work or link to your site.
Users can share the generated audio with anyone — distributing a derivative of your content outside your site.
If you produce podcasts, research summaries, or educational content, AI-generated versions of your own content compete directly with you.
What Blocking Does (and Doesn't) Do
- • NotebookLM from reading your URLs when users add them
- • AI-generated summaries and podcasts from your content
- • Your content being used as a notebook source
- • Users uploading PDFs or copy-pasting your content manually
- • Google-Extended training crawls (block separately)
- • Deep Research crawls (block separately)
- • Google Search rankings (unaffected)
Frequently Asked Questions
Can users still add my content to NotebookLM if I block the crawler?
They can still upload PDFs, paste text, or add Google Docs containing your content. The robots.txt block only prevents the Google-NotebookLM crawler from fetching URLs. It doesn't prevent manual uploads — that's a user action, not a crawl.
Is NotebookLM content used for Gemini model training?
Google has stated that NotebookLM content is not used to train Gemini models. The Google-NotebookLM crawler retrieves content for the user's notebook session only. However, for training opt-out, you should also block Google-Extended.
How many people use NotebookLM?
NotebookLM saw rapid adoption after Audio Overviews went viral in late 2024. Google hasn't published exact user numbers, but it's widely used in education, journalism, research, and professional knowledge work. The user base is growing as Google integrates it deeper into its product ecosystem.
What happens when a user tries to add my blocked URL?
If you block Google-NotebookLM via robots.txt, the crawler cannot fetch your page. NotebookLM will likely show an error indicating the source could not be loaded. The user may then resort to manual methods (copy-paste, PDF upload) which the block cannot prevent.
Related Guides
Is your site protected from AI bots?
Run a free scan to check your robots.txt, meta tags, and overall AI readiness score.