
Image via CrunchBase
This is a guest post by Craig Smith of Trinity Insight.
In early February of this year, search engine representatives within Google, Yahoo, and MSN (before it was Bing) made an announcement into a uniform method of embracing a new html tag to reduce duplicate content for a webmaster. This “canonical” tag, which would be inserted within the HEAD portion of any HTML document, is a great way to reduce potential negative affects that can happen when you have the same page indexed multiple times under a variety of URL’s
The tag is written as this: <link rel=”canonical” href=”http://mysite.com/page1.html“/>
This is essentially saying to a search crawler, “Hey Googlebot, this isn’t the preferred page for this content, href=http://mysite.com/page1.html is.”
In working with a variety of ecommerce platforms and content management systems, this is a pretty widespread issue and this tag will go along way to helping webmasters properly structure a site for optimal SEO.
Think about this for a minute. You can have the following example variations for a fictional homepage:
You get the picture. Which is the primary page? All of these pages can be indexed by search engines, but which version should an engine render when users are searching?
Sure you can use 301 redirects to fix this issue, but sometimes these are tough to generate within varied system and server environments. You could try to eliminate varied parameters such as session ID’s and tracking codes, but then you are losing valuable data to help you understand the dynamics of your visitors and marketing campaigns.
In working with a new client in the eCommerce sector, we saw this issue first hand. A single product page had 27 different versions of the page (different url’s for each) indexed within Google. Each page had the same title and each page had exactly the same content. Because of the different paths that users could take to find the product, due to the parametric filtering capabilities on the site, these urls existed in the index for years.
How does this impact your SEO efforts when you have these duplicate url’s in search indexes? For starters, unlike a 301 which redirects all web traffic, the canonical tag is an indicator for only engines which allows you to keep your existing url parameters. It will help engines in concentrating link equity into one primary URL, for a specific piece of content, as well as essentially tell them which page you want to have as the “authority” page.
Looking for answers direct from Google relating to the canonical tag? Here are some Q&A answers that they provided in their webmaster central blog
Is rel=”canonical” a hint or a directive?
It’s a hint that we honor strongly. We’ll take your preference into account, in conjunction with other signals, when calculating the most relevant page to display in search results.
Can I use a relative path to specify the canonical, such as <link rel=”canonical” href=”product.php?item=swedish-fish” />?
Yes, relative paths are recognized as expected with the <link> tag. Also, if you include a <base> link in your document, relative paths will resolve according to the base URL.
Is it okay if the canonical is not an exact duplicate of the content?
We allow slight differences, e.g., in the sort order of a table of products. We also recognize that we may crawl the canonical and the duplicate pages at different points in time, so we may occasionally see different versions of your content. All of that is okay with us.
What if the rel=”canonical” returns a 404?
We’ll continue to index your content and use a heuristic to find a canonical, but we recommend that you specify existent URLs as canonicals.
What if the rel=”canonical” hasn’t yet been indexed?
Like all public content on the web, we strive to discover and crawl a designated canonical URL quickly. As soon as we index it, we’ll immediately reconsider the rel=”canonical” hint.
Can rel=”canonical” be a redirect?
Yes, you can specify a URL that redirects as a canonical URL. Google will then process the redirect as usual and try to index it.
What if I have contradictory rel=”canonical” designations?
Our algorithm is lenient: We can follow canonical chains, but we strongly recommend that you update links to point to a single canonical page to ensure optimal canonicalization results.
Can this link tag be used to suggest a canonical URL on a completely different domain?
No. To migrate to a completely different domain, permanent (301) redirects are more appropriate. Google currently will take canonicalization suggestions into account across subdomains (or within a domain), but not across domains. So site owners can suggest www.example.com vs. example.com vs. help.example.com, but not example.com vs. example-widgets.com.
So what’s your action plan? First thing is to evaluate your site paths and look for instances in which you have multiple url’s with the same content. Look for the duplicates and decide which version that you desire to have as your primary version.
Embed the tag on the duplicate pages as indicated above, potentially in an automated basis within an eCommerce platform, and help the engines more effectively index your site. The canonical tag is a major development within the SEO market that has flown a bit under the radar, but can really make a difference in your rankings. Best of luck in reducing your duplicate content and making your website more efficient for search engines to crawl and index!
————————
About Author:
Craig Smith is the founder of Trinity Insight, an eCommerce optimization firm that specializes in web analytics consulting and multivariate testing
Things like this scare me. I wonder how much of it is going to turn into the same type of nofollow debacle that erupted in late spring. What happens if the search engines decide that canonical isn’t the best way to go anymore, either? Where do you hit that sweet spot of diminishing marginal returns with SEO? When does an extra hour of effort NOT lead to more traffic?
.-= jlbraaten´s last blog ..The Top 8 Best Free Audio Podcast Downloads on the Web =-.
Twitter Comment
Testing the Stumbleupon su.pr WordPress plugin on my latest guest post: [link to post]
– Posted using Chat Catcher
wow… that’s a good news, but it scare me, why this update is failed? that many website will lost the traffic.
I hope does not happen to my website
.-= Reza´s last blog ..[$$] This Tech Move Is Real (at RealMoney by TheStreet.com) =-.
Many thanks for this post.
Again, this is symptomatic of the wider issues with Google; a complete lack of consistentcy in communication.
Thanks
Hi,
its nice to read a useful article for beginner like me.
Some of points from this article are very helpful for me as I haven’t
considered them yet.
I would like to say thank you for sharing this cool article.
Bookmarked and sharing for friends.
- Suresh
I don’t know the importance of canonical tag before, Thanks for sharing
.-= Lonting´s last blog ..Instant Term Life Insurance For Family =-.
i was really not aware of the canonical tags…however i agree with you completely that.. by removing the IDs there will be a huge loss of the data which is surely no one’s gonna do… but the step which we can take is to look ahead and to keep the futuristic project as per the guidelines
Hey thanks for the post, its really very informative. If such tag implemented on every page of each & every site than the issue of duplicate content would be solve soon.
Thanks for explaining about canonical, i understand about this issue right now.
Thank
David
I have been seeing canonical meta tag these days, what I have know that this tag will differentiate a “www” or a non www domain. Now I think I am in the know. Thanks for the article.
.-= Longest Drive´s last blog ..Three Keys to Long Drives and Distance =-.
Thanks for the heads up mate, will definitely give it a try. At least, it pays to keep abreast.
Still trying to understand it…:(
.-= make money online for dummies´s last blog ..A new thing about SEO you might want to know =-.
Hey, thanks for sharing these tags and information, am quite an SEO newbie myself.
Some great points – I guess now we just have to wait until the All-In-One-SEO author will come out with a fix that will allow us to specify it within blogs
.-= Alex Sysoef´s last blog ..The Keyword Minute Report by Brad Callen =-.
Gosh, Craig. You’ve explained the canonical tag principle in a very simple ways! I’ve read several other related articles before, and I thought yours the best yet simple explanation so far.
Thanks for sharing it with us! By the way, you should be glad to have a nice guest writer here, Stephan.
.-= william´s last blog ..Join the Affiliate Program and Turn Your Traffic into the Money Generator! =-.
wow.it is a good news.i think i want to try it!
thanks for the post.it help me a lot to optimize my website!
thanks for article. I m doing SEO job too.:D
.-= Biber Hap?´s last blog ..Bölgesel zay?flama yöntemleri =-.
Wow! Wow! I search about that! Thanks!
I am not use this, is this give big impact to seo?
Thanks
I think this is a great idea but I hope it will not mess up my rankings for the old sites. And yeah not only WP all in one SEO but also WP it self should have inbuilt function to set canonical tag.
thanks for sharing this, now i understand.
.-= itot54joni´s last blog ..Miss Universe 2009 Live Stream Online Webcast, Winners And Results =-.
I wonder how this would work as far as article marketing. Some people are saying that article marketing is not as effective, especially with hte whole bum marketing thing. I fly under the radar by using different article titles and and redirect links. its best just to use an article spinner to avoid the duplicate content issue.
Sara
http://www.gizellefashion.com
wow, keep sharing guys..
I am aware of the canonical tags…however i agree with you completely that.. by removing the IDs there will be a huge loss of the data which is surely no one’s gonna do. Thanks for sharing.
Removing the IDs there will be a huge loss of the data which is surely no one is going to do it.
You’ve explained the canonical tag principle in a very simple ways! I’ve read several other related articles before, but this very impressive.
Google is the boss of search engine.Most of the people uses google search.
Great Article, I learn something from this. Thanks a lot for sharing!
I’ve seen some websites using this tag but for the most part I think it has been ignored. I’ve also noticed that the new version of FireFox displays a little C icon in the URL bar when there is a canonical tag pointing to the page (such as this page).
thanks for sharing this, now i understand. Keep Sharing Guys! thank you
web 2.0 images|web 2.0 graphics
good explanation, and those examples make it even clearer. I’ve heard about this canonical thing from a few friends but no one can explain it clearly to me. Some issue indeed, especially for those whose working on their emarketing. Thanks for the post
.-= muxin´s last blog ..Wi-Fi® Internet Radio =-.
This is a nice tip that should help a lot of webmasters who are tired of the multiple indexing issue. The way you have explained it in detail has helped understand the entire concept well.
.-= R Kumar´s last blog ..Getting rid of Procrastination…My Guest blog =-.
Great post! this will be very useful!
Stephan,
I am sure that this tag would go a long way in avoiding the copying.
What would happen to existing content?
What about copy-paste people?
.-= Chris Peterson´s last blog ..3 Kontera Check worth $8621 – Arrived after 3 Months =-.
This all sounds really good. But any new feature that gets the ID removed tends to make me a little nervous. But anxious to test on some of my smaller sites to compare results
Thanks again for sharing
.-= Megan @ YourNetBiz´s last blog ..YourNetBiz Top Online Business Mentors One on One Training =-.
This Information very much helpful for the SEO persons. This will definitely help to reduce duplicate content of websites
The introduction of this Tag is a good news as it will help webmasters reduce the negative effects of duplicate content.
Things like this scare me. I wonder how much of it is going to turn into the same type of nofollow debacle that erupted in late spring. What happens if the search engines decide that canonical isn’t the best way to go anymore, either? Where do you hit that sweet spot of diminishing marginal returns with SEO?
Hey thanks for the post, its really very informative. If such tag implemented on every page of each & every site than the issue of duplicate content would be solve soon.
Yeah this new HTML tag is a real help, I hope that it helps SEO too.
Hmmm, i have just been trying out the new tag, the search engines do like it!!
I really wouldn’t bother with rel=”canonical” in most cases. If you built your site well in the beginning all alternative URLs would be 301d in your .htaccess or apache config so that all your link juice goes to one page (and people only ever link to that page!). If you haven’t done this, you should probably go back and do it, using canonical as a stopgap measure while you rework code.
I think it has really an impact on the traffic you get. Try checking your website value here > websitevaluecalculator.info, you will notice that it will give different result for e.g. http://www.mysite.com, and mysite.com.
I am using canonical URL in my eCommerce site. Looking on the ‘Internal links’ statistics in Google webmaster, I am not sure Google really takes canonical URL into account.
Cheers,
Sty
.-= Stylight´s last blog ..????? ???? ?????? =-.
Craig, are you sure about this?
… “Hey Googlebot, this isn’t the preferred page for this content, href=http://mysite.com/page1.html is.”
I thought it meant:
… “Hey Googlebot, this is the preferred page, however href=http://mysite.com/page1.html is a duplicate content of this page.”
Nice information.Thanks for sharing this information with us. It is going to be really helpful for me. Bundles of thanks.
Do you think about
.-= moreclick´s last blog ..MoreClick, Google Above & Beyond 2009 Konferans?nda Türkiye’yi Temsil Etti. =-.
Do you think about
.-= moreclick´s last blog ..MoreClick, Google Above & Beyond 2009 Konferans?nda Türkiye’yi Temsil Etti. =-.
Thanks for sharing, I love reading you should post more often…