NameSilo

Know when your site content is copied

Spaceship Spaceship
Watch
Hi,

I stumbled upon COPYSCAPE earlier today and found its service VERY impressive.

I also was interested enough to find some copied content. I've contacted the owner and he said he would remove it :)

They also have a thing where you can sign up and get plagiarism alerts when somebody copies your site. But that's $10 a month.

You can search whole site or just a single page. It's very cool. They even highlight the copied text when you click on the links.

Enjoy.
-Matt
(Not my site.)
 
Last edited:
9
•••
The views expressed on this page by users and staff are their own, not those of NamePros.
0
•••
Useful site thanx for posting it. *rep added*
 
0
•••
No problem... once again enjoy! ^_^
 
0
•••
nice link buddy ... will use it for sure from now on ...
 
0
•••
What if they copy your very unique javascript or css code only?

Will it detect that?
 
0
•••
0
•••
that is too cool thank you so much for sharing.
 
0
•••
tsj5j said:
What if they copy your very unique javascript or css code only?

Will it detect that?
It will only find visible content, not source code.
 
0
•••
thanks for sharing this link, is a great site very usefull.
i wanna to give u a reputation point, but i dont know how?
 
0
•••
Click the thing that looks like a snowflake beneath the green bars...
 
1
•••
0
•••
very nice link!! Rep added.
BTW, these are the same folks who had started the famous google alerts service (which later on became so popular that google started one of it's own).

If I'm not mistaken, this service uses the Google API. Since it is a very useful service, they must have negotiated with Google to raise the 1000 search queries per day limit that Google imposes for it's API service.
Very interesting!! I was thinking of something similar, but passed it .....

It's not a very difficult thing to implement. If anyone is interested in knowing how (via the Google API or the Yahoo API), I can tell. :)
 
0
•••
Thanks for the link! Rep added! Impressive site and should be very useful for web publishers.
 
0
•••
Good find! I have found my blog being copied and used for spam blogging. A bit back I found some jerk was using it as it is medical/health related. They were copying verbatim and just changing my links, originally to medical related sites, to porn. I contacted Google and they shut down every one that I sent them links to.
 
0
•••
0
•••
nice! But I will test it .
 
0
•••
I found only one person copying my content whom I gave permission, Good to see nobody else is stealing it :)
 
0
•••
Very interesting find. Thank you so much for sharing with us. :)
 
0
•••
0
•••
0
•••
Been using this tool for a while :) its pretty good.
 
0
•••
cache: If you really want you could write your OWN tool to do that or suggest it to copyscape themself. Don't ask me... lol I dunno...

cache: If you really want you could write your OWN tool to do that or suggest it to copyscape themself. Don't ask me... lol I dunno...
 
0
•••
there's also the system of % match.... something like what % of content on page A matches content on page B... because copywriters have become pretty smart nowadays... they may not copy entire pages of content.... just a para here and a sentence there ;)
for % matching, one would need to use several Google API queries... each query would randomly select a phrase or sentence, place it in a quoted string and then query google.
since te google api has a limit of 1000 automated queries / 24 hours, the google api is not the right tool for the job (unless you strike a special deal with google).
The yahoo api is better for this purpose because it has a much higher limit of 5000 queries per day per IP address... on a server with lets say 10 IPs that would mean around 50K queries per day... cool.
for google, even if you use diff IPs ypou are still restricted to 1K queries per day per developer key (and you cannot have more than one developer key per person, and you also cannot aggregate keys of diffferent folks for one app ! smart eh?).

I guess now that Alexa has opened up it's huge index to developers (at a nominal fee), it makes more sense to devise one's own algorithms for using the alexa index for the purpose of identifying plagiarized content (instead of using google/yahoo search results). The main bottleneck of building a huge index has ben removed, so now anyone with clever algorithms can leverage the alexa platform for building innovative search apps (even if on a small scale oe for solving a simple problem)
 
Last edited:
0
•••
  • The sidebar remains visible by scrolling at a speed relative to the page’s height.
Back