How to fix duplicate content in Joomla?




What is duplicate content? Duplicate content means displaying the same page content across multiple URLs on your Joomla site. To explain it more clearly — when a page is accessible from multiple URLs.


Why is it important to fix duplicate content on your Joomla site? Since a page is visible under multiple URLs, search engine crawlers do not know which URL to display in search results so the incorrect URL may be displayed. As a result, this negatively impacts your Joomla SEO.


This guide covers the most common reasons for duplicate content in Joomla and how to fix them.




Content duplication due www and non-www


The most common problem among new web developers is displaying www and non-www pages of your Joomla site in the links.

https://domain.com/pagehttps://www.domain.com/page


If your site's pages are accessible with www and non-www, this means that you are guaranteed to have at least 2 copies of absolutely every page on the site.


To solve this problem, you need to add a redirect to your site in the .htaccess file from non-www to www.

RewriteEngine OnRewriteCond %{HTTP_HOST} ^yourdomain.com [NC]RewriteRule ^(.*)$ http://www.yourdomain.com/$1 [L,R=301]


See more details about redirect in the tutorial: 




Content duplication due https


After connecting an SSL certificate to a domain, be sure that your Joomla website is accessible only with https.

http://www.domain.com/pagehttps://www.domain.com/page


Just like with content duplication due to www and non-www, if your site is accessible from both http and https, you are guaranteed to have duplicates of every page on your Joomla site. The site must be accessible only with https. To solve the duplicate content issue due to https, you need to add this code to your .htaccess file to automatically redirect from http to https.

RewriteEngine On​RewriteCond %{SERVER_PORT} 80RewriteRule ^(.*)$ https://www.yourdomain.com/$1 [R,L]


For more details about https redirect, see the documentation:




Content duplication due link variations


Additional parameters in the URL generate multiple URLs of the page. UTM tags, Google Click ID parameters, paginations, product options, etc. All these things generate link variations.

https://www.domain.com/page?gclid=1https://www.domain.com/page?utm_source=bloghttps://www.domain.com/page?fbclid=1https://www.domain.com/page?size=m&color=black


Content duplication due to link variations can be solved using canonical URLs. What is a canonical URL? Canonical URL is an HTML element that helps search engine crawlers understand which version of the webpage is primary, should be indexed and should be displayed in the search results.

<link href="https://www.domain.com/page" rel="canonical">

 

How to add canonical URLs to pages of your Joomla site? If you are using the Joomla website builder Gridbox, you can enable the Canonical URLs App. You will find how to do this in the documentation:

 

For non-Gridbox Joomla sites, you can visit JED to find and download an extension for generating Canonical URLs in Joomla.

Related articles