What is scraping? What can you do?
A common practice in the blogging world is for sites to “scrape” your content via your RSS feed, automatically republish all of your words on their own site, compete with you in Google, and get paid via Google Ads or affiliate ads in their sidebars and in each article. The options for recourse are few, and it is time-consuming to chase down scrapers: first you notify them directly (if they have a contact page or a domain registration that isn’t anonymous), then their web host, and so on. The outcome is rarely a win, as scrapers are anonymous and know what they’re doing.
How the world just changed
Last week, the world changed a bit with Roger Cleveland Golf Company, Inc. v. Prince, in which a golf club counterfeiter and their web host were sued not only for counterfeiting clubs but (more importantly) for infringement. The South Carolina jury and judge agreed with the charges and awarded $770,750 to the golf company.
How are golf clubs related to real estate?
Now what does a golf company have to do with a real estate blog? Well, for the first time in American history, a web hosting company has been found liable for contributory infringement without actual notice that a customer’s site lists fake products for sale.
Lead counsel for Cleveland Golf, Christopher Finnerty, said that a web hosting company’s obligation is comparable to a landlord’s. “A landlord doesn’t have the obligation to act as an investigator against his tenants to find out they are doing anything illegal, but once they knew or should have known, they have to act. How is that any different online?”
So let’s say you have a blog about Dallas real estate, and let’s say you’re so savvy that you only offer partial RSS feeds so that scrapers can’t take whole articles. There are now ways around that, with tools that can scrape sites directly. Let’s say you’ve emailed the website owner and even the web host but have not seen any response or action taken to protect your intellectual property. Although Google says it has changed its algorithms to punish junk websites like scrapers, that does not eliminate scrapers; it just ranks them lower.
Your real estate blog is leaking and you don’t feel like you have any recourse, but what if you are blogging about your listings? And what if your client calls you angry that photos of their home with your words are featured on a website full of porn ads? And can that site advertise your listing without your permission? Are the proper disclaimers and brokerage information included? Is your client protected in this scenario? Not likely.
Also in question is a widely accepted method called “reblogging,” in which content from other sites is used, sometimes without proper attribution. This is not scraping but a softer method: copying and pasting another blogger’s content onto a website that blogger does not own.
So how is this different from counterfeit golf clubs? Other sites are advertising your content as their own, counterfeiting and infringing. Maybe now web hosts will pay attention to alerts against scrapers (why not email a copy of this column with your next scraper complaint?).
