|
|
New polling out this week shows that Americans are frustrated with the world and pessimistic about the future. They're losing patience with the economy, with their prospects, with their leaders (of both parties).
What's actually happening is this: we're realizing that the industrial revolution is fading. The 80 year long run that brought ever-increasing productivity (and along with it, well-paying jobs for an ever-expanding middle class) is ending.
It's one thing to read about the changes the internet brought, it's another to experience them. People who thought they had a valuable skill or degree have discovered that being an anonymous middleman doesn't guarantee job security. Individuals who were trained to comply and follow instructions have discovered that the deal is over... and it isn't their fault, because they've always done what they were told.
This isn't fair of course. It's not fair to train for years, to pay your dues, to invest in a house or a career and then suddenly see it fade.
For a while, politicians and organizations promised that things would get back to normal. Those promises aren't enough, though, and it's clear to many that this might be the new normal. In fact, it is the new normal.
I regularly hear from people who say, "enough with this conceptual stuff, tell me how to get my factory moving, my day job replaced, my consistent paycheck restored..." There's an idea that somehow, if we just do things with more effort or skill, we can go back to the Brady Bunch and mass markets and mediocre products that pay off for years. It's not an idea, though, it's a myth.
Some people insist that if we focus on "business fundamentals" and get "back to basics," all will return. Not so. The promise that you can get paid really well to do precisely what your boss instructs you to do is now a dream, no longer a reality.
It takes a long time for a generation to come around to significant revolutionary change. The newspaper business, the steel business, law firms, the car business, the record business, even computers... one by one, our industries are being turned upside down, and so quickly that it requires us to change faster than we'd like.
It's unpleasant, it's not fair, but it's all we've got. The sooner we realize that the world has changed, the sooner we can accept it and make something of what we've got. Whining isn't a scalable solution.
Tomorrow: part II—the opportunity
[You're getting this note because you subscribed to Seth Godin's blog.]
Don't want to get this email anymore? Click the link below to unsubscribe.
Your requested content delivery powered by FeedBlitz, LLC, 9 Thoreau Way, Sudbury, MA 01776, USA. +1.978.776.9498 |
|
While making the trains run on time is a good thing, making them run early is not.
If you define success as getting closer and closer to a mythical perfection, an agreed upon standard, it's extremely difficult to become remarkable, particularly if the field is competitive. Can't get rounder than round.
In general, purple cows live in fields where it's possible to reinvent what people expect.
[You're getting this note because you subscribed to Seth Godin's blog.]
Don't want to get this email anymore? Click the link below to unsubscribe.
Your requested content delivery powered by FeedBlitz, LLC, 9 Thoreau Way, Sudbury, MA 01776, USA. +1.978.776.9498 |
SEOmoz Daily SEO Blog |
Post-Panda, Your Original Content is Being Outranked by Scrapers & Partners Posted: 20 Apr 2011 02:39 PM PDT Posted by BryanCrow This post was originally in YOUmoz, and was promoted to the main blog because it provides great value and interest to our community. The author's views are entirely his or her own and may not reflect the views of SEOmoz, Inc. A weird thing has happened as a result of panda. Something you might have expected Google's Search Quality testers to catch before rolling the update out. Due to the domain-wide nature of the signal, high-quality, original content produced by the websites who were negatively impacted are now being ranked below the exact same content, republished by partners to whom they syndicate. Even more egregious, they are also being outranked by scrapers who effectively steal and republish the same content without permission or credit. I have seen this briefly mentioned by observers, but I haven't seen this phenomenon transparently documented either in SEO press or in the Panda Google forum. The purpose of this post is to transparently share data from the site WonderHowTo.com (of which I am the CTO) and locate others experiencing a similar phenomenon. Pre Panda For three years, we at WonderHowTo organized the sprawling world of HowTo with taxonomical zeal and very human curation. By January, we had grown to more than 10mm monthly uniques. As our community formed, we began to shift our efforts towards the concept of covering timely news in the HowTo space (there is astounding innovation each day among the 427 subcategories we follow). Our journalistic cred grew, and at the beginning of the year, two fantastic syndication partners Business Insider, and Huffington Post recognized our quality and eagerly published our articles in their sections (primarily Technology). On occasion, we noticed that our articles were outranked by our partners, but over the course of a few days, Google always got it right, recognizing the source as WonderHowTo. For the record, pre-Panda, we cannot recall one instance when a scraper outranked us with our own content in Google. Never. There seemed to be order in the universe. Post Panda Our Google traffic fell by 40%. Among our 1 million indexed pages, we experienced plenty of displaced rankings. Before getting into the what, how, & why, one thing has stood out as alarmingly egregious: Original content created by us is no longer able to rise to the top above our partners or even scrapers who republish our content. Ever. Panda branded us the Rosa Parks of content, forcing us to the back of Google's ranking bus, along with all the other sites which fit its profiling. Crediting the Original Source - Google vs Bing I took a look at the articles we're promoting on our home page and syndicating to Business Insider and Huffington Post. As I mentioned earlier, our articles also tend to get scraped and republished on dozens of sites within minutes of them being published. Post panda, it turns out Bing is doing a better (though still imperfect) job of ranking the original source (WonderHowTo) above the scrapers & syndication partners. Here are examples from a few recent posts (For simplicity, I searched for each article's exact title): "How To Remove Your Name and Profile Picture from Facebook's Social Ads" Original Source is #9 on Google "Transform Your Android Home Screen into a 3D Environment with the SPB Shell 3D Launcher App" Original Source is #7 on Google "How to Add a Dislike Button to Your Facebook Page" Original Source is #14 on Google The larger implication is that if Google cannot rank the source first when searching for the exact title, then the source will also lose out on traffic from any additional keyword variations that the very same content ends up receiving on scraper and partner sites. Deconstructing The Panda Damage Our process has always revolved around human curation with the goal of weeding out anything low quality, it seemed odd that the hit would be so large. We did a deep analysis on a variety of signals (article word count, title word count, how many links, embedded media, how many comments, how many favorites, bounce rate, etc) to try to determine which individual pieces of content were getting hit the most. We separated the content that gained the most traffic to compare against the content that had lost the most traffic, comparing signals & looking for trends. The results seemed random. Very short video descriptions would rank quite well, while long, detailed original transcriptions and guides were suffering. Every time we thought we'd found an influencing signal, we'd go on to find enough exceptions to negate it. It became abundantly clear that Panda does not work by filtering out individual low quality content as was originally implied. It works by punishing entire domain names if an undetermined percentage of the content on that site meets the undefined "low-quality" criteria. Soon after we came to this realization, Google confirmed it in a statement to Search Engine Land, and an interview with WIRED. This Site-Wide Approach Punishes High Quality Results With this signal hitting an entire site instead of just its individual low quality content, the results fundamentally oppose the stated goal of search quality and fairness in attribution. The collateral damage results in Google burying the original source of high quality content, promoting those who steal, scrape, and republish above them. Furthermore, it ends up demoting other top quality results simply because of the domain on which the content resides. It's counter-intuitive to think that prejudicially branding every piece of a particular site's content, past, present and future is an effective way to promote top quality results. Trying To Resolve Your Site-Wide Demotion Within a week, several search analysis reports started popping up with post-mortem break-downs. Most were fundamentally flawed in that they only looked at the number of ranking places each site would loose without taking search quantity and click through rate into account. The bottom line is that the difference between ranking 1st and ranking 2nd is mammoth. As such if your site ranked #1 for a couple hundred popular queries and you got flagged by panda, the bulk of your traffic loss would be from those #1 positions changing to #2 to #10 positions. Shifts between #4-#8 don't make nearly as much of a difference. But I digress. A consensus has been forming across the web stating that if you remove duplicate and otherwise low-quality content from your site, or do the work of telling Google not to index it, your classification as low-quality under panda would be lifted. The idea that you can get out from under this cloud started to gain traction as a couple of stand out examples started showing up. Find Your "Problem Content" The vast majority of content on WonderHowTo was written by our team of editors, researchers, and curators. It has always been our policy to write original descriptions for the videos our curators approve for our library so as to ensure authenticity, accuracy, and relevance. It is part of the added value we bring to the table when embedding how-to videos from youtube, vimeo, or any of the other 17,000 creators we've curated in our hunt for useful and excellent HowTos (Talented video creators often produce an excellent tutorial with zero regard to title or description, rendering them invisible to search. To these compelling voices, we have sent a steady stream of deserved traffic). Over the years we have also consummated one-off agreements with a handful of partners who requested that we use their own specific descriptions, word-for-word, when including their content on our site. As was the Pre-Panda norm, Google would always rank the original source 1st, so there was no need for any one-off no-index tags to keep rankings in their correct place. With the growing consensus that such republishing could be a major signal in getting a domain flagged, it seemed apparent that our biggest problem might be this content from our partners. After auditing our library, we found that about 16% of our content had been republished word for word from one of these partners. We would have to noindex these to take them out of search visibility. Enact Your Sweeping Changes to Remove Your Problem Content Once you've identified all your problematic content, it's time to noindex it. Digital Inspiration made a number of similar changes and saw his rankings restored within two weeks. Here are the changes we made to WonderHowTo as of March 25, 2011: 1. Duplicate Content from Syndication Partnerships 2. Related Video Pages 3. Un-embeded Video Pages 4. Tag Pages 5. Page Link Count Wait for your Changes to Take Effect Within a week, Google had re-crawled enough of our content to start removing the no-indexed pages from the index. We knew this would result in an additional drop in search traffic, but the hope was to rectify the side effect of Google ranking our high-quality content lower than the scrapers who republish it. We are hopeful that the changes we've made will remove this site-wide flag, or that Google will tweak the algorithm to only target low quality content as opposed to an entire site. But as of today, (4/19/2011), the problem still exists. Google continues to drive people who search for our content to the republished versions on our partners sites and the sites who scrape us without permission or attribution. Our search traffic has declined (now partially because of our noindexing changes), and our high quality content continues to be outranked by less helpful results. If you have a site that is experiencing a similar phenomenon, let us know in the comments. This behavior seems contrary to the fundamentals of search quality, and Panda specifically. Without making some noise about it, it may never be corrected. |
You are subscribed to email updates from SEOmoz Daily SEO Blog To stop receiving these emails, you may unsubscribe now. | Email delivery powered by Google |
Google Inc., 20 West Kinzie, Chicago IL USA 60610 |
| ||||||||||||