A content audit used to mean pulling a spreadsheet of URLs, checking traffic and rankings, then deciding what to update or delete. That workflow still forms the foundation. But in 2026, it misses a layer that increasingly determines whether your content gets discovered at all.
AI search engines like ChatGPT, Perplexity, and Google’s own AI Overviews now pull from your pages to generate answers. Some of your content gets cited. Some gets ignored entirely. And the pages that AI engines favour don’t always match the ones that rank well in traditional search. A modern content audit for SEO needs to account for both surfaces, because optimizing for one while ignoring the other leaves gaps that compound over time.
This guide walks through a practical framework for auditing your content across Google Search, Google Analytics, and AI visibility data, so you can make clear decisions about what to keep, consolidate, update, or remove.
Why content audits need an AI layer now
Google has been explicit that traditional SEO best practices remain relevant for AI search features. According to Google’s AI optimization guide, “the best practices for SEO continue to be relevant because our generative AI features on Google Search are rooted in our core Search ranking and quality systems.” Those systems use retrieval-augmented generation (RAG) to pull content from Google’s search index and generate responses.
That means your existing content is already being evaluated by AI systems, whether you’re optimizing for them or not. Pages that rank well in traditional search have a better starting position for AI citation. But ranking alone isn’t enough. AI engines also evaluate content structure, topical authority, and how clearly your pages answer specific questions.
A content audit that only looks at traditional metrics (traffic, rankings, bounce rate) will tell you which pages perform in Google Search. Adding an AI visibility layer tells you which pages are actually being cited when someone asks ChatGPT or Perplexity a question in your space. The overlap between those two sets is often smaller than you’d expect.
Step 1: Build your page inventory
Before you can evaluate anything, you need a complete list of URLs. For small sites, you can pull this from your sitemap or CMS. For larger sites, a crawler gives you a more accurate picture because it catches pages that might not be in your sitemap.
Screaming Frog’s SEO Spider is a widely used option. It’s a website crawler that audits your site for over 300 SEO issues. You can crawl up to 500 URLs for free, or upgrade for £199 per year to remove the limit and access advanced features. Semrush’s Site Audit can also generate a crawled page list that you export as a CSV.
Ahrefs recommends following the 80/20 rule: audit the top 20% of pages with the most organic traffic or backlinks first. You’ll likely get 80% of the results from focusing there, especially if your site has hundreds or thousands of pages.
Your inventory should capture each URL along with its current status code, word count, last modified date, and any structured data present. This becomes your master sheet for the rest of the audit.
Step 2: Pull Google Search Console data
Google Search Console is where you get your ground-truth search performance data. Its tools and reports help you measure your site’s search traffic and performance, see which queries bring users to your site, and analyze impressions, clicks, and position on Google Search.
Export the Performance report filtered to the last 6 to 12 months. For each URL, you want:
- Clicks and impressions. Pages with high impressions but low clicks often have a title or meta description problem. Pages with declining impressions over time may be losing relevance.
- Average position. Pages ranking between positions 5 and 15 are your best candidates for updates. They’re close enough to page one that targeted improvements can move the needle.
- Query coverage. Which keywords is each page actually ranking for? Pages that rank for queries unrelated to their topic may need to be refocused or split.
Also check the Index Coverage report. Pages that Google has crawled but chosen not to index are signals worth investigating. They might be thin, duplicated, or simply not meeting Google’s quality bar.
Step 3: Layer in Google Analytics engagement data
Search Console tells you how pages perform in search results. Google Analytics tells you what happens after someone clicks through. You need both to make informed decisions.
In GA4, focus on these metrics for each URL in your inventory:
- Engaged sessions and engagement rate. A page with decent traffic but a very low engagement rate signals a content-quality or relevance problem.
- Conversions (key events). Which pages actually drive the outcomes your business cares about? Pages that generate traffic but no conversions may need a CTA overhaul rather than a content rewrite.
- Average engagement time. Long engagement times on informational content usually indicate the page is delivering value. Short engagement times on long-form content suggest readers aren’t finding what they need.
Cross-reference this with your GSC data. A page that ranks well and drives clicks but shows poor engagement metrics is a different problem than a page that doesn’t rank at all. The first needs content improvements. The second needs SEO work, or may be a consolidation candidate.
Step 4: Add AI visibility checks
This is the layer most content audits still skip. You need to know which of your pages are being cited by AI engines, and which topics in your space generate AI answers that don’t mention you at all.
Run your core topics and branded queries through ChatGPT, Perplexity, and Google AI Overviews. For each query, note:
- Whether any AI engine cites your content
- Which specific URLs get cited (they’re often not your homepage)
- Which competitors appear instead
- Whether the AI-generated answer aligns with what your content actually says
This produces three categories of pages:
- AI-visible pages. These are already being cited. Protect them by keeping the content fresh and factually accurate. Don’t make drastic changes to pages that AI engines trust.
- AI-invisible pages that rank well traditionally. These rank in Google but don’t get picked up by AI engines. They’re often missing the clear, structured answers that AI systems prefer. AI content optimization can help close this gap.
- Topic gaps. Queries where AI engines answer confidently but none of your pages appear. These are opportunities for new content, not audit actions, but the audit reveals them.
Step 5: Score and categorize every page
With data from all three sources (GSC, GA4, AI visibility), you can now score each page and assign it to one of four action buckets.
Keep as-is. Pages that perform well across traditional search, engagement, and AI citation. Don’t fix what isn’t broken. Review them quarterly to make sure performance holds.
Update. Pages that show potential (decent rankings, some traffic) but underperform on engagement or AI visibility. Common updates include refreshing outdated statistics, improving content structure with clearer headings, adding direct answers to common questions, and strengthening internal links.
Consolidate. Multiple pages targeting similar keywords that split your authority. Google’s page experience documentation notes that core ranking systems generally evaluate content on a page-specific basis. When you have three thin pages competing for the same query, none of them build enough depth to rank well or earn AI citations. Merge them into one comprehensive page and redirect the old URLs.
Remove. Pages with no traffic, no backlinks, no AI citations, and no strategic value. Outdated product pages, event recaps from two years ago, thin posts that were never updated. Removing these reduces crawl waste and helps search engines focus on your stronger content.
Step 6: Prioritize by impact
Not every action item deserves the same urgency. Prioritize based on a combination of traffic potential, business value, and effort required.
High priority updates are pages that rank on page two for valuable keywords, have existing backlinks, and cover topics where AI engines are actively generating answers. These pages are closest to producing measurable results with the least effort.
Medium priority items include consolidation projects and pages that need structural improvements for AI readability. These take more work but build long-term authority.
Low priority actions are removals and updates to pages in low-value topic areas. They’re worth doing eventually, but they shouldn’t block the work that moves numbers.
Map your priorities into a content calendar so the audit translates into a scheduled execution plan rather than a list that sits in a spreadsheet.
Step 7: Track the results
A content audit isn’t a one-time event. The decisions you make need to be measured, and the audit itself should be repeated on a regular cadence.
After implementing changes, monitor the same metrics you used to score pages in the first place. Track ranking movements in GSC, engagement shifts in GA4, and re-run your AI visibility checks on the topics you optimized for.
Set a cadence. Quarterly audits work for most sites. If you’re publishing frequently or operating in a fast-moving space, monthly spot checks on your highest-value pages keep things from drifting.
The goal isn’t a perfect spreadsheet. It’s a repeatable system that catches declining pages before they become dead weight, surfaces opportunities before competitors claim them, and ensures your content works across both traditional search and the AI engines that increasingly shape how people find answers.