While making the trains run on time is a good thing, making them run early is not.
If you define success as getting closer and closer to a mythical perfection, an agreed upon standard, it's extremely difficult to become remarkable, particularly if the field is competitive. Can't get rounder than round.
In general, purple cows live in fields where it's possible to reinvent what people expect.
[You're getting this note because you subscribed to Seth Godin's blog.]
Don't want to get this email anymore? Click the link below to unsubscribe.
Your requested content delivery powered by FeedBlitz, LLC, 9 Thoreau Way, Sudbury, MA 01776, USA. +1.978.776.9498 |
SEOmoz Daily SEO Blog |
Post-Panda, Your Original Content is Being Outranked by Scrapers & Partners
Posted: 20 Apr 2011 02:39 PM PDT
Posted by BryanCrow
This post was originally in YOUmoz, and was promoted to the main blog because it provides great value and interest to our community. The author's views are entirely his or her own and may not reflect the views of SEOmoz, Inc.
A weird thing has happened as a result of Panda. Something you might have expected Google's Search Quality testers to catch before rolling the update out. Due to the domain-wide nature of the signal, high-quality, original content produced by the websites that were negatively impacted is now being ranked below the exact same content, republished by partners to whom they syndicate. Even more egregious, it is also being outranked by scrapers who effectively steal and republish the same content without permission or credit.
I have seen this briefly mentioned by observers, but I haven't seen this phenomenon transparently documented either in SEO press or in the Panda Google forum. The purpose of this post is to transparently share data from the site WonderHowTo.com (of which I am the CTO) and locate others experiencing a similar phenomenon.
Pre Panda
For three years, we at WonderHowTo organized the sprawling world of HowTo with taxonomical zeal and very human curation. By January, we had grown to more than 10 million monthly uniques. As our community formed, we began to shift our efforts towards the concept of covering timely news in the HowTo space (there is astounding innovation each day among the 427 subcategories we follow). Our journalistic cred grew, and at the beginning of the year, two fantastic syndication partners, Business Insider and Huffington Post, recognized our quality and eagerly published our articles in their sections (primarily Technology).
On occasion, we noticed that our articles were outranked by our partners, but over the course of a few days, Google always got it right, recognizing the source as WonderHowTo. For the record, pre-Panda, we cannot recall one instance when a scraper outranked us with our own content in Google. Never. There seemed to be order in the universe.
Post Panda
Our Google traffic fell by 40%. Among our 1 million indexed pages, we experienced plenty of displaced rankings. Before getting into the what, how, & why, one thing has stood out as alarmingly egregious: Original content created by us is no longer able to rise to the top above our partners or even scrapers who republish our content. Ever. Panda branded us the Rosa Parks of content, forcing us to the back of Google's ranking bus, along with all the other sites which fit its profiling.
Crediting the Original Source - Google vs Bing
I took a look at the articles we're promoting on our home page and syndicating to Business Insider and Huffington Post. As I mentioned earlier, our articles also tend to get scraped and republished on dozens of sites within minutes of being published. Post-Panda, it turns out Bing is doing a better (though still imperfect) job of ranking the original source (WonderHowTo) above the scrapers & syndication partners. Here are examples from a few recent posts (for simplicity, I searched for each article's exact title):
"How To Remove Your Name and Profile Picture from Facebook's Social Ads": Original Source is #9 on Google
"Transform Your Android Home Screen into a 3D Environment with the SPB Shell 3D Launcher App": Original Source is #7 on Google
"How to Add a Dislike Button to Your Facebook Page": Original Source is #14 on Google
The larger implication is that if Google cannot rank the source first when searching for the exact title, then the source will also lose out on traffic from any additional keyword variations that the very same content ends up receiving on scraper and partner sites.
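The exact-title check described above can be sketched in a few lines. This is only an illustration: the function name `source_rank` and the result URLs are hypothetical, and the list of results is assumed to have been collected by hand for one search, not fetched from any search engine API.

```python
from urllib.parse import urlparse


def source_rank(result_urls, source_domain):
    """Return the 1-based position of the first result hosted on
    source_domain (or one of its subdomains), or None if it is absent."""
    for position, url in enumerate(result_urls, start=1):
        host = (urlparse(url).hostname or "").lower()
        if host == source_domain or host.endswith("." + source_domain):
            return position
    return None


# Hypothetical, hand-collected results for one exact-title search.
results = [
    "http://scraper.example/how-to-add-a-dislike-button",
    "http://partner.example/tech/how-to-add-a-dislike-button",
    "http://www.wonderhowto.com/how-to-add-a-dislike-button",
]
print(source_rank(results, "wonderhowto.com"))
```

Running the same check for each syndicated title, once per search engine, gives a simple way to track whether the original source is ranked first.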
Deconstructing The Panda Damage
Because our process has always revolved around human curation with the goal of weeding out anything low quality, it seemed odd that the hit would be so large. We did a deep analysis on a variety of signals (article word count, title word count, how many links, embedded media, how many comments, how many favorites, bounce rate, etc.) to try to determine which individual pieces of content were getting hit the most. We separated the content that gained the most traffic to compare against the content that had lost the most traffic, comparing signals & looking for trends. The results seemed random. Very short video descriptions would rank quite well, while long, detailed original transcriptions and guides were suffering. Every time we thought we'd found an influencing signal, we'd go on to find enough exceptions to negate it.
It became abundantly clear that Panda does not work by filtering out individual low-quality content, as was originally implied. It works by punishing entire domain names if an undetermined percentage of the content on that site meets the undefined "low-quality" criteria. Soon after we came to this realization, Google confirmed it in a statement to Search Engine Land and an interview with WIRED.
This Site-Wide Approach Punishes High Quality Results
With this signal hitting an entire site instead of just its individual low-quality content, the results fundamentally oppose the stated goal of search quality and fairness in attribution. The collateral damage results in Google burying the original source of high-quality content, promoting those who steal, scrape, and republish above them. Furthermore, it ends up demoting other top-quality results simply because of the domain on which the content resides. It's counter-intuitive to think that prejudicially branding every piece of a particular site's content, past, present and future, is an effective way to promote top-quality results.
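The gainers-versus-losers comparison described under "Deconstructing The Panda Damage" can be sketched roughly as follows. The field names (`word_count`, `comments`, `traffic_change`) and the three sample pages are made up for illustration; they are not WonderHowTo's data, and the post does not describe how its analysis was actually implemented.

```python
from statistics import mean


def compare_signals(pages, signals):
    """For each signal, return (mean among pages that gained traffic,
    mean among pages that lost traffic). A side with no pages yields None."""
    gainers = [p for p in pages if p["traffic_change"] > 0]
    losers = [p for p in pages if p["traffic_change"] < 0]
    summary = {}
    for name in signals:
        summary[name] = (
            mean(p[name] for p in gainers) if gainers else None,
            mean(p[name] for p in losers) if losers else None,
        )
    return summary


# Made-up sample pages, for illustration only.
pages = [
    {"word_count": 100, "comments": 2, "traffic_change": 50},
    {"word_count": 300, "comments": 4, "traffic_change": 10},
    {"word_count": 1000, "comments": 10, "traffic_change": -40},
]
for signal, (gained, lost) in compare_signals(pages, ["word_count", "comments"]).items():
    print(signal, gained, lost)
```

A large, consistent gap between the two means would point to a page-level signal. Finding no such gap, as the post reports, is consistent with a site-wide signal.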
Trying To Resolve Your Site-Wide Demotion
Within a week, several search analysis reports started popping up with post-mortem breakdowns. Most were fundamentally flawed in that they only looked at the number of ranking places each site would lose without taking search quantity and click-through rate into account. The bottom line is that the difference between ranking 1st and ranking 2nd is mammoth. As such, if your site ranked #1 for a couple hundred popular queries and you got flagged by Panda, the bulk of your traffic loss would be from those #1 positions changing to #2 to #10 positions. Shifts between #4 and #8 don't make nearly as much of a difference. But I digress.
A consensus has been forming across the web stating that if you remove duplicate and otherwise low-quality content from your site, or do the work of telling Google not to index it, your classification as low-quality under Panda would be lifted. The idea that you can get out from under this cloud started to gain traction as a couple of standout examples started showing up.
Find Your "Problem Content"
The vast majority of content on WonderHowTo was written by our team of editors, researchers, and curators. It has always been our policy to write original descriptions for the videos our curators approve for our library so as to ensure authenticity, accuracy, and relevance. It is part of the added value we bring to the table when embedding how-to videos from YouTube, Vimeo, or any of the other 17,000 creators we've curated in our hunt for useful and excellent HowTos. (Talented video creators often produce an excellent tutorial with zero regard to title or description, rendering them invisible to search. To these compelling voices, we have sent a steady stream of deserved traffic.) Over the years we have also consummated one-off agreements with a handful of partners who requested that we use their own specific descriptions, word-for-word, when including their content on our site.
As was the Pre-Panda norm, Google would always rank the original source 1st, so there was no need for any one-off noindex tags to keep rankings in their correct place. With the growing consensus that such republishing could be a major signal in getting a domain flagged, it seemed apparent that our biggest problem might be this content from our partners. After auditing our library, we found that about 16% of our content had been republished word for word from one of these partners. We would have to noindex these to take them out of search visibility.
Enact Your Sweeping Changes to Remove Your Problem Content
Once you've identified all your problematic content, it's time to noindex it. Digital Inspiration made a number of similar changes and saw its rankings restored within two weeks. Here are the changes we made to WonderHowTo as of March 25, 2011:
1. Duplicate Content from Syndication Partnerships
2. Related Video Pages
3. Un-embedded Video Pages
4. Tag Pages
5. Page Link Count
Wait for your Changes to Take Effect
Within a week, Google had re-crawled enough of our content to start removing the noindexed pages from the index. We knew this would result in an additional drop in search traffic, but the hope was to rectify the side effect of Google ranking our high-quality content lower than the scrapers who republish it.
We are hopeful that the changes we've made will remove this site-wide flag, or that Google will tweak the algorithm to only target low-quality content as opposed to an entire site. But as of today (4/19/2011), the problem still exists. Google continues to drive people who search for our content to the republished versions on our partners' sites and the sites that scrape us without permission or attribution. Our search traffic has declined (now partially because of our noindexing changes), and our high-quality content continues to be outranked by less helpful results.
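The noindexing step described above might look something like the sketch below. The `robots` meta tag with `content="noindex"` is the standard way to ask crawlers to keep a page out of their index. The `republished` flag and the `robots_meta` helper are hypothetical, and the post does not say how WonderHowTo actually marks or templates its pages.

```python
def robots_meta(page):
    """Return a robots meta tag for pages flagged as republished
    word for word from a partner, or an empty string otherwise."""
    if page.get("republished"):
        # "noindex" asks crawlers not to include this page in search results.
        return '<meta name="robots" content="noindex">'
    return ""


# Hypothetical page records; "republished" marks partner-supplied text.
print(robots_meta({"title": "Partner video", "republished": True}))
print(repr(robots_meta({"title": "Original guide", "republished": False})))
```

The returned tag would go inside each flagged page's `<head>`. As the post notes, the pages only drop out of the index after they are re-crawled.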
If you have a site that is experiencing a similar phenomenon, let us know in the comments. This behavior seems contrary to the fundamentals of search quality, and Panda specifically. Without making some noise about it, it may never be corrected. |
You are subscribed to email updates from SEOmoz Daily SEO Blog To stop receiving these emails, you may unsubscribe now. | Email delivery powered by Google |
Google Inc., 20 West Kinzie, Chicago IL USA 60610 |
The hungry person at the all-you-can-eat buffet is happy to take one more item. She doesn't spend a lot of time comparing this to that, or saying 'no thank you' or avoiding certain items. If it's interesting, "Sure, I'll try a little bit. I can always come back."
The guarded person walking down the street avoids eye contact with the homeless person, doesn't answer a request from the petition-signer and certainly doesn't help a Boy Scout with that old lady.
And this is precisely the dichotomy every cause, every candidate and every marketer faces.
Either you're selling to people who are hungry for what you offer, who are open to hearing what you have to say, who are fans...
Or you're selling to people who are actively protecting themselves, guarding against interruption or a mistake or worse.
How can you possibly have a strategy about what you're going to do next until you determine which mindset you're marketing to?
Here's the key truth: in any given moment, in any given situation, a person is either hungry or guarded. You need to decide which sort of person you'll be telling your story to, because one approach won't work on the other type of person.
[PS the mindset can (and does) change as people go through their day. At the bookstore she might be hungry for a new idea, and just a few minutes later, at the bus stop, she wants to be alone...]
Damn Cool Pics |
Posted: 20 Apr 2011 02:53 PM PDT The Coachella Valley Music and Arts Festival kicks off this weekend in the hot desert of California, with an expected temperature of around 90 degrees. There are a bajillion people in the middle of the desert watching six stages with a thousand bands and all sorts of other distractions too. There are a lot of scantily clad white girls, including Paris Hilton, shaking their money makers for all the world to see at this outside concert. |
Lady Gaga is a Real Copy Paste Posted: 20 Apr 2011 01:36 PM PDT |
Beach Season Opened with Skiing Posted: 20 Apr 2011 01:30 PM PDT |
Posted: 20 Apr 2011 01:18 PM PDT |
You are subscribed to email updates from Damn Cool Pics To stop receiving these emails, you may unsubscribe now. | Email delivery powered by Google |