annotation

working definition: what are splogs?

motive: why do spammers create spam?

People can earn revenue by generating traffic for commercial websites and receiving a per-click commission in return. When a website owner joins an affiliate program, the owner places a range of banners or text links that point to affiliated websites. (In the case of Google AdSense, the links are dynamically delivered from Google search results.) When a user clicks on one of these links, the user's activity is tracked by the affiliate software. The owner gets paid when people click on the affiliated links (pay-per-click, or PPC) or when people make a purchase on the affiliated websites (commission), depending on the rules of the affiliate program. It is very easy and cheap to create a blog and join an affiliate program to generate revenue. Blogs created by spammers usually do not provide unique or useful content or services.

guide: how to recognize a splog?

In a typical splog, content is machine-generated in order to attract visitors through either search engines or individual blogs. We determine whether a blog is a splog based on how its author or authors use it. Therefore, a blog that merely contains spam in the form of comment spam or trackback spam is not considered a splog.
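The rule above can be sketched in code as a rough illustration. The class name, field names, and function name below are all hypothetical and exist only for this sketch; they are not part of the annotation tool. The point it shows is that only the author's own use of the blog decides the label, while spam left by visitors does not.

```python
from dataclasses import dataclass


@dataclass
class BlogObservation:
    # All field names are hypothetical, introduced only for this sketch.
    author_posts_machine_generated: bool  # author's own posts look auto-generated to attract traffic
    has_comment_spam: bool                # spam left by visitors in comments
    has_trackback_spam: bool              # spam left by others through trackbacks


def is_splog(obs: BlogObservation) -> bool:
    # Only the way the author uses the blog matters.
    # Comment spam and trackback spam are deliberately ignored.
    return obs.author_posts_machine_generated


# A legitimate blog that attracted comment and trackback spam is not a splog.
victim = BlogObservation(False, True, True)
# A blog whose own posts are machine-generated is a splog.
generated = BlogObservation(True, False, False)
print(is_splog(victim), is_splog(generated))
```

In practice the first field stands for a human judgment made by the annotator, not something the sketch computes.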

Splogs have typical observable characteristics:

annotation tool

The tool has three panels:

splog examples

how to label?

For each sampled blog, we assign one of the following labels:

Ask the following questions: