Splogs and scraper sites are places that I have too often found my articles and posts with intrusive links and ads. Today I found out what they were and what can be done about them. So below are several definitions of splogs, spam, scaper sites and some posts from bloggers that may help you keep your sanity.
Spam in blogs (also called simply blog spam or comment spam) is a form of spamdexing. It is done by automatically posting random comments or promoting commercial services to blogs, wikis, guestbooks, or other publicly accessible online discussion boards. Any web application that accepts and displays hyperlinks submitted by visitors may be a target.
Adding links that point to the spammer’s web site artificially increases the site’s search engine ranking. An increased ranking often results in the spammer’s commercial site being listed ahead of other sites for certain searches, increasing the number of potential visitors and paying customers
Splogs are blogs where the articles are fake, and are only created for search engine spamming. To spam in blogs, conversely, is to include random comments on the blogs of innocent bystanders, in which spammers take advantage of a site’s ability to allow visitors to post comments that may include links.
Sping is short for “spam ping”, and is related to fraudulent pings from blogs using trackbacks, called trackback spam. Pings are messages sent from blog and publishing tools to a centralized network service (a ping server) providing notification of newly published posts or content. Spings, or ping spam, are pings that are sent from spam blogs, or are sometimes multiple pings in a short interval from a legitimate source, often tens or hundreds per minute, due to misconfigured software, or a wish to make the content coming from the source appear fresh.
A scraper site is a website that copies all of its content from other websites using web scraping. No part of a scraper site is original. A search engine is not a scraper site: sites such as Yahoo and Google gather content from other websites and index it so that the index can be searched with keywords. Search engines then display snippets of the original site content in response to a user’s search.
In the last few years, and due to the advent of the Google Adsense web advertising program, scraper sites have proliferated at an amazing rate for spamming search engines. Open content sites such as Wikipedia are a common source of material for scraper sites.
Here is a great definition and article from Techtarget
Now you know when you are bothered by keying in those funny looking words when you go on some sites? Especially if you go to the RSS via Feedburner? Well, I finally appreciate it. It is called CAPTCHA It is one of the good guys because it attempts to catch those nasty splogs.
IPA: /?kæpt??/) is a type of challenge-response test used in computing to ensure that the response is not generated by a computer. The process usually involves one computer (a server) asking a user to complete a simple test which the computer is able to generate and grade. Because other computers are unable to solve the CAPTCHA, any user entering a correct solution is presumed to be human. (Wikipedia)
Some suggest that Google may benefit from sploggers….
A recent blog posted about the possibility of some political blogs that were splogs.
Finally, my sanity solution is to know that splogs can be reported. You know clicking on that flag in blogger or reporting to google. Perhaps more but something can be done. I did not know when I first started blogging and became upset when I saw it from a google alert on my articles but now I do. Read this post from lorelle.wordpress on helping to clean up splogs.
Can anyone suggests plug-ins that can help protect our blogs?
Oops, almost neglected to put a category on this post, when you see posts without categories they can smell like a splog.
Thanks for this. Unfortunately many of those sploggers with Blogger turn off the navbar so you can’t flag them and you have to know what address to go to, to report them. Few people will bother searching for that. But it is good to know more about this topic and thanks for the link to Lorrelle’s post as well.
Another timely and perfectly appropriate post! Everyone is so oversensitive about spam and spammers, but they may not know what they really are.
Thank you for researching and helping others to understand the big issues.
Thanks Sally, our objective for this site is to learn, and share with others in a non-technical and overwhelming way.
In addition it keeps the old brain cells young.
Kathie, any suggestions for how to search when that navigation bar is turned off?
Wow! It’s amazing to learn this stuff. I had never heard of a splog until it showed up in one of my google alerts. Thanks for this very valuable information, Rosie.
And you mean to tell me there are people who create whole websites from content on other websites. What kind of monster is the world-wide web really creating?
Bev, it appears that in the zeal to make money by any means this stuff happens
If only all this spammer ingenuity was put to better use! Those CAPTCHA blocks do help reduce spam entries. If you have an existing HTML form on a web site, there are scripts that can add CAPTCHA to it without too much effort. I teach stuff like this in my web site class (www.pcpowertips.com/webclass) because I’ve had to fight spammers myself.
Pingback: Are Your Permalinks Naked?
Pingback: Menu Bar-Blog Info Tab « Blogging Lessons