Tealmermaid Designs

web design, coding, and SEO tips and tricks

How to remove duplicate content

Share:

How to remove duplicate content

What is duplicate content?

"Duplicate content" is any content within a particular article that is substantially similar to the content contained within another article. This can include sentence structure, words, and phrasing. It can also extend to duplicated images. While there are cases in which duplicate content is acceptable (printer-only pages, for example), what concerns most content writers is the deliberate duplication of articles on several sites. This can be done either by allowed syndication via feeds or by simply having one's article lifted in its entirety by a content thief.

Why is duplicate content bad?

There is no real value in having the same article posted in multiple places across the internet. If there are multiple copies of any article available, how can search engines such as Google decide which is the "best" copy? While most of us aren't privy to the subtle nuances of Google's algorithms, we know that SEO (Search-Engine Optimization) matters such as a site's Page Rank may come into play. This means that duplicated content may result in search traffic going to a copy of the article which is not of your choosing.

It is better to have one copy online with many valid backlinks to that one copy. In this way you can chose which copy of the article to show Bing, Yahoo, or Google, and you can decide which site will get the traffic to that particular article.

How to check for duplicate content

It is easy to locate duplicate content if it exists for your article. Copy a snippet of your article, in quotes, to check if matching phrases can be found in Google search, on Bing, or via your favourite search engine.

Feed aggregator sites will typically copy the first few sentences only, which means this is less of an issue than having another site steal your entire article. Be sure to do a line-by-line check the entire article in the plagiarism checker of your choice to be verify that you are in the clear before you move your content. If there are no matching search results, you are probably safe.

How to remove duplicate content

Once you have located the URL of the duplicate content, file a DMCA take-down notice against that URL to claim the article as being under your copyright. This will require:

  1. Locating the host of URL in question via a WHOIS search.
  2. Sending your take-down notice, which can be done by email if it includes a digital signature. Also include any appropriate screencaps needed to prove that the article is both yours and that it existed prior to the creation of the URL in question.

Find a plagiarist?

Sample letter for the plagiarist

Subject: DMCA Copyright Infringement Notification

I am the owner of [domain], and I am the copyright holder of the information which is currently duplicated at your website. Please find attached a copy of the official DMCA Takedown Request.

This letter is an official DMCA takedown notification under the provisions of the Digital Millennium Copyright Act (DMCA) to have the infringing content removed immediately from your website.

My content is located at:

[URL]

The infringing content is located at:

[URL]

Permission was not granted to reproduce this content.

I hereby confirm that the information in this DMCA notification is accurate.

Please reply promptly indicating the actions you have taken to resolve this matter.

Regards,
[Your name]

Sample letter for the plagiarist's web host

Once that is done, send a copy to host or ISP with the following changes:

Subject: DMCA Copyright Infringement Notification

I am the owner of [domain], and I am the copyright holder of the information which is currently duplicated at one of the websites hosted by your service. Please find attached a copy of the official DMCA Takedown Request.

This letter is an official DMCA takedown notification under the provisions of the Digital Millennium Copyright Act (DMCA) to have the infringing content removed immediately from the website hosted by your service.

My content is located at:

[URL]

The infringing content is located at:

[URL]

Permission was not granted to reproduce this content.

Please be advised that the law requires you, as a service provider, to “expeditiously remove or disable access to” the infringing content upon receipt of this notice. Non-compliance may result in a loss of immunity for liability under the DMCA.

I hereby confirm that the information in this DMCA notification is accurate.

Please reply promptly indicating the actions you have taken to resolve this matter.

Regards,
[Your name]

Before moving your content

If you are moving your content from one site to another, the most important thing to do is to remove it from Google's cache. So long as it is cached, there is the potential for it to be flagged as duplicate content. Take a few minutes to remove it before re-posting the content elsewhere. Google has a special form which will allow you to submit your URL for removal.

Once submitted, it will show as "pending" in the Removal requests list until it is removed. This generally takes about 24 hours, which is much faster than waiting for Google to determine on its own that your content is moved. Once the status is updated to "removed", you are free to post your content elsewhere -- another content farm or your own website.

It is however advisable to double-check that your content has in fact been removed from all caches before moving it. This can be done by re-visiting your favourite duplicate content checker to search for your phrases. If it comes up clear, you should be safe.

Google search de-cache form



Advertisement