Technical SEO in 2026: The 22-Fix Checklist for Indian E-commerce Sites
Crawl budget, faceted nav, canonical traps, schema, and Core Web Vitals — the exact technical SEO checklist we run on Indian Shopify, WooCommerce, and Next.js stores.

Most technical SEO guides stop at definitions. This one is the checklist we actually run when an Indian e-commerce store comes to us with thousands of indexed URLs, a crawl budget being eaten alive by filter pages, and product schema that Google quietly ignores. Twenty-two fixes, grouped by where they bite hardest on Shopify, WooCommerce, and Next.js.
What is technical SEO for an e-commerce site?
Technical SEO for e-commerce is the work of making sure search engines can crawl, render, index, and cite a store's pages without wasting budget on junk URLs. On a catalogue with thousands of products, it decides whether Google reaches your money pages at all — not whether your copy is clever. Infrastructure first, content second.
The difference between a 50-page brochure site and a 5,000-SKU store is scale. A small site can be sloppy and still get fully indexed. An e-commerce site cannot. Faceted navigation, session parameters, pagination, and JavaScript rendering all multiply your URL count, and Google will not crawl every variation. Technical SEO is how you point its limited attention at the URLs that earn revenue.
Why do Indian e-commerce sites fail technical SEO most often?
Indian stores fail most often on crawl budget and Core Web Vitals — not on keywords. We see catalogues where 80% of crawled URLs are filter combinations and internal search results, while the real product pages go weeks between crawls. Add mobile-first traffic on patchy 4G, and slow, render-blocked pages quietly lose rankings nobody is watching.
Three patterns repeat across nearly every audit we run in India: faceted navigation generating near-infinite URLs, heavy theme apps blocking the main thread on mobile, and canonical tags that point everywhere except the page that should rank. None of these show up in a content audit. All of them cap your ceiling.
How do you fix crawl budget on a large store?
Stop Google from crawling URLs that will never rank, then make the ones that should rank easy to reach. Crawl budget is finite: every filter page, sort parameter, and 404 you let Google chase is a product page it doesn't. The goal is a tight, honest URL set where almost everything crawled is something you'd want indexed.
Google's own guidance on managing crawl budget for large sites is blunt: remove duplicates, return clean status codes, and keep your most important URLs shallow. Here is how we apply it on real catalogues.
- Block parameter and internal-search URLs (?sort=, ?q=, ?ref=) in robots.txt so crawlers never request them.
- Fix 404 and 5xx chains — a store leaking thousands of soft 404s on out-of-stock SKUs bleeds budget every crawl.
- Flatten click depth: every product should sit within three clicks of the homepage through category links, not buried under endless pagination.
- Keep XML sitemaps to canonical, indexable, 200-status URLs only — no redirects, no noindex pages, split by type.
How do you stop faceted navigation from wrecking crawl budget?
Faceted navigation — colour, size, price, brand filters — is the single biggest crawl-budget sink in e-commerce. Left unmanaged, five filters across a category create thousands of crawlable, near-duplicate URLs. The fix is deciding which filter combinations deserve to be indexable pages and blocking the rest before Google ever sees them.
- Pick a small set of high-intent facets (e.g. 'running shoes for men') to expose as clean, indexable, canonical URLs targeting real search demand.
- Render every other filter combination behind parameters that are blocked in robots.txt or rendered client-side without a crawlable link.
- Never let two filters in different orders produce two URLs — normalise parameter order or canonicalise to one.
- On Shopify, audit the tag and collection URLs your theme auto-generates; on WooCommerce, the default layered-nav attribute URLs are the usual culprit.
How do you handle JavaScript rendering on Shopify, WooCommerce, and Next.js?
Make sure the content that matters is in the HTML before JavaScript runs. Google can render JS, but rendering is a second, delayed pass — and on large catalogues it is rationed. If your product title, price, and key copy only appear after client-side hydration, you are betting your rankings on a queue you don't control.
The fix depends on the stack. Shopify and WooCommerce are server-rendered by default — the risk there is app and theme scripts injecting content or lazy-loading it out of the initial HTML. Next.js gives you the most control and the most ways to get it wrong:
- Next.js: render product and category pages with Server Components or getStaticProps/ISR — never ship a client-only product page that fetches data in a useEffect.
- Shopify: keep critical content in the Liquid template, not in a third-party app widget that loads after paint. We cover store-specific tactics in our Shopify SEO guide.
- WooCommerce: disable plugins that defer or AJAX-load product descriptions, and confirm the rendered HTML matches what users see.
- Validate with the URL Inspection tool's rendered HTML — if the price or title is missing from the rendered DOM, fix it before anything else.
Need this done, not just documented?
This is the exact playbook we run for clients. If you'd rather have it implemented and verified against Search Console data than DIY it, that's our technical SEO service. We start with the audit below and ship fixes in priority order.
Which canonical traps cost Indian stores their rankings?
The canonical traps that cost the most are self-referencing tags pointing to the wrong URL, canonicals pointing to noindex or 404 pages, and product variants all canonicalising to a single URL that buries long-tail demand. Google treats canonical as a hint, not a command — so when your signals conflict, it picks for you, often wrong.
- Every indexable page should have a self-referencing canonical to its own clean, parameter-free URL.
- Don't canonicalise filtered or paginated pages to page one if those pages target distinct demand — canonicalise only true duplicates.
- Make sure canonical, hreflang, sitemap, and internal links all point to the same URL version (https, trailing slash, case) — mixed signals waste crawl budget.
- Never canonicalise to a URL that is itself noindexed, redirected, or returning a 404 — a trap we find on most Shopify stores running duplicate collection paths.
How do you validate product schema so it actually earns rich results?
Add valid Product schema with price, availability, and review data, then test it against Google's Rich Results Test before trusting it. Schema that throws errors or omits required fields simply doesn't render as a rich result — and most store themes ship structured data that is incomplete or duplicated across plugins.
- Include Product, Offer (price + priceCurrency in INR + availability), and AggregateRating where you have genuine reviews.
- Run every template through the Rich Results Test and Schema validator — fix every error, not just warnings.
- Kill duplicate schema: a theme and an SEO plugin both injecting Product markup creates conflicting data Google may ignore entirely.
- Add BreadcrumbList schema so category paths show in results and reinforce your site structure.
What Core Web Vitals targets matter for Indian e-commerce in 2026?
Hit LCP under 2.5 seconds, INP under 200 milliseconds, and CLS under 0.1 — measured on mid-range Android over mobile data, not on your office wifi. India's traffic is overwhelmingly mobile, so a store that feels fast on a MacBook can still fail real-user metrics that decide rankings and conversions.
Google's Core Web Vitals thresholds are the public scoreboard, but field data from real Indian devices is what you optimise against. We break the India-specific tuning down in our Core Web Vitals guide for Indian sites. The high-leverage fixes:
- Compress and serve hero and product images as WebP/AVIF with explicit width and height to kill layout shift.
- Defer non-critical third-party scripts — chat widgets, analytics stacks, and review apps are the usual INP killers.
- Preload the LCP image and the primary font; lazy-load everything below the fold.
- Audit Shopify apps and WooCommerce plugins for render-blocking JS — uninstall what you don't actively use.
What does a real technical SEO audit recover?
More indexed money pages and faster crawling — usually within weeks, not months. On a recent WooCommerce fashion catalogue, faceted navigation had Google crawling roughly 14,000 URLs while only about 1,900 were real products. The rest was filter noise. The store wondered why new arrivals took weeks to appear in search.
We blocked the parameter URLs, rebuilt the sitemaps to canonical products only, fixed a canonical loop on collection pages, and trimmed three render-blocking apps. Within four weeks, crawl requests shifted heavily toward product URLs, valid indexed pages climbed, and mobile LCP dropped from roughly 4.1s to 2.3s. Same content, same backlinks — the wins came purely from clearing the technical debris.
The 22-fix technical SEO checklist
Run these in order. The first group protects crawl budget, the middle group fixes indexing signals, and the last group covers speed and structured data.
- 1. Block parameter and internal-search URLs in robots.txt.
- 2. Eliminate soft 404s on out-of-stock and deleted products.
- 3. Fix redirect chains and loops to single 301 hops.
- 4. Keep every product within three clicks of the homepage.
- 5. Split XML sitemaps by type; include only canonical 200 URLs.
- 6. Expose only high-intent facet combinations as indexable pages.
- 7. Block or client-render all other filter URLs.
- 8. Normalise parameter order to prevent duplicate filter URLs.
- 9. Audit auto-generated tag and attribute URLs (Shopify/WooCommerce).
- 10. Server-render product and category content (SSR/ISR on Next.js).
- 11. Keep critical content out of post-paint app widgets.
- 12. Verify rendered HTML contains title, price, and copy.
- 13. Add self-referencing canonicals to clean URLs.
- 14. Stop canonicalising distinct-demand pages to page one.
- 15. Align canonical, hreflang, sitemap, and internal links to one URL version.
- 16. Never canonicalise to noindex, redirected, or 404 URLs.
- 17. Add valid Product + Offer schema with INR pricing.
- 18. Validate every template in the Rich Results Test.
- 19. Remove duplicate or conflicting schema from theme + plugins.
- 20. Add BreadcrumbList schema across category paths.
- 21. Hit LCP < 2.5s, INP < 200ms, CLS < 0.1 on mobile field data.
- 22. Strip render-blocking apps, scripts, and unused plugins.
Frequently Asked Questions
What is technical SEO?
Technical SEO is the practice of optimising a website's infrastructure — crawlability, indexing, rendering, site speed, and structured data — so search engines and AI systems can reach, understand, and rank your pages. It sits beneath content and links: without it, even great content struggles to get crawled and indexed reliably.
What are some examples of technical SEO?
Common examples include managing crawl budget with robots.txt, fixing redirect chains and 404s, controlling faceted navigation, setting correct canonical tags, adding Product and BreadcrumbList schema, building clean XML sitemaps, ensuring server-side rendering, and improving Core Web Vitals like LCP and INP. On e-commerce sites, crawl control and schema usually deliver the fastest gains.
What are the three types of technical SEO?
Broadly: crawlability and indexing (robots.txt, sitemaps, canonicals, faceted nav), rendering and performance (JavaScript rendering, Core Web Vitals, mobile usability), and structured data (Product, Offer, Breadcrumb schema). Most e-commerce technical problems fall into the first two — getting the right pages crawled, indexed, and loading fast on mobile.
How long does technical SEO take to show results?
Crawl and indexing fixes often show within two to six weeks as Google re-crawls and re-evaluates your URLs. Core Web Vitals improvements update on a rolling 28-day field-data window. Ranking gains follow once the right pages are indexed and fast — usually one to three months, faster than most content-led SEO work.
Is technical SEO different for Shopify versus WooCommerce?
The principles are identical, but the failure points differ. Shopify limits robots.txt control and auto-generates collection and tag URLs you must manage with canonicals and theme edits. WooCommerce gives full server control but ships layered-nav URLs and plugin bloat that hurt crawl budget and speed. Next.js stores need correct SSR/ISR setup to avoid rendering traps.
Fix the technical debt before chasing rankings
You can publish brilliant product copy and earn strong backlinks and still lose to a slower, thinner competitor whose site Google can actually crawl. On e-commerce, technical SEO is the multiplier on everything else you do. Work this checklist top to bottom, verify each fix against Search Console, and re-crawl.
If you want us to audit your store and ship the fixes in priority order, talk to our team. We'll show you exactly where your crawl budget is leaking and what it's costing you in indexed pages.

Founder and Director at Neogen Media. Writing field notes on AI automation, growth systems, and the integrated playbook we ship for Indian SMBs. Based in Kochi.
Follow on LinkedIn