Central Perk

marți, 14 iunie 2011

SEOmoz Daily SEO Blog

Does Google Use Facebook Shares to Influence Search Rankings?

Posted: 13 Jun 2011 02:37 PM PDT

Intro from Rand - this post comes from Dr. Matt Peters, SEOmoz's data scientist. He came on board in February of this year after stints at Harvard (working on climate science models), Washington Mutual, JP Morgan and Fannie Mae (analyzing mortgage securities) and more (including some research into Google Places rankings last November). Matt's particularly passionate about bringing the best practices of scientific and quantitative analysis to the world of inbound marketing, and I'm very excited to welcome him to the Moz community.

One of the most interesting findings from our 2011 Ranking Factors analysis was the high correlation between Facebook shares and Google US search position. In fact, Facebook shares was the highest correlated single factor among the 100+ factors we examined:

Facebook and other social media correlations with Google search position

This blog post presents some modeling results that look at whether Google may be using Facebook shares directly in it's relevance calculation, or whether the correlation between Facebook shares and search position is coincidental, aka the byproduct of other causal factors.

Correlation and Causation

As we have said time and time again on this blog, in our presentations and when speaking, correlation is not causation. However, this post will discuss issues of both correlation and causation, so for the purposes of this discussion it's important to understand the relationship between them on a deeper level. Correlation does not, in general, imply causation. However, two things that are causally related will often be well correlated. Correlation can therefore only be used to support or deny causation, but not to show it directly. Put another way, if we have some prior, a priori, reason to believe that two things are related, correlation can be used as a tool (with rigorous statistical underpinnings) to support the relationship. In turn, weak correlation can be used to weaken the argument that two things are related.

Before we started our work on the 2011 Ranking Factors, we had some reasons to believe that Facebook data may be used by Google. There was an interview with Google/Bing in December 2010 where they disclosed that they were using social media signals in to rank search results. We also began seeing Facebook share information in our search results, so we knew that Google had access to at least some Facebook data.

Facebook share data appears in Google web search results

Even having this public comment from Google and seeing the Facebook data in search results (you can also observe them in Google realtime, e.g. here), we were still surprised at the size of the correlation in our ranking factor result and we wondered whether it was causal or the result of other factors like links. As a simple check, we ran some hasty partial correlations to control for links and concluded that they accounted for some of the correlation, but not all. This appeared to be another data point to support causation, or so we thought...

SMX Advanced

At SMX Advanced in Seattle last week, Rand Fishkin presented the main results from the ranking factors work, including the partial correlations controlling for links. We were still not confident that Facebook shares were being used directly, so Rand was very careful to add several caveats saying that these might not be causally related. You can see his presentation embedded below:

Interesting Data from the 2011 Ranking Factors

View more presentations from Rand Fishkin

That evening, Matt Cutts, the head of Google's web spam team said that they do not crawl Facebook "wall" pages and implied that they don't use Facebook shares for ranking. His language was somewhat vague (leaving room open as to whether some forms of Facebook data are used or come via a special feed), however we and many others felt that Matt had implied that Facebook shares, specifically, are not part of their web ranking algorithm. Rand pointed out that Google does have some access to Facebook data overall and set up a small-scale test to determine if Google would index content that was solely shared on Facebook. To date, that page has not been indexed, despite having quite a few shares (64 according to the OpenGraph).

Both Rand and Matt's talks and the subsequent discussion with Danny Sullivan on Twitter was well covered over at Search Engine Land.

Sitting in the audience, I began to think about the implications of this new information. If Google wasn't using Facebook shares, then the high correlation must be explainable with things that they are using. I made a short list of different factors that Google might be using to determine relevance that would also be correlated with FB shares:

Links. Pages that are heavily shared on Facebook tend to also be heavily linked to.
Other social media signals. Pages that are shared on Facebook also tend to be tweeted about and shared through Google Buzz.
Quality content. People share pages that they find interesting and have high quality content. This drives positive usage signals (time spent on page, bounce rate, etc) that might be used.
Association with well known, quality brands. There is likely more interaction with well known brands then lesser known brands, and this might drive deeper engagement with them on Facebook.

Building a better model for Facebook shares

I thought back to the partial correlations I had run a few months prior. At the time, I was mainly interested in a first look that could be done in a few hours, so I chose partial correlations using a limited set of four metrics from Linkscape as the control variables. Partial correlations use a linear regression model to predict the correlation variables (in this case Facebook shares and search position), the simplest type of regression model. It has the benefit be being well established and easy to use, but falls short when the underlying relationship is more complicated or non-linear. In addition, I didn't try to control for other social media signals since at the time we were interested in links.

I began to wonder if the results would change if I tried a more complicated model using Twitter/Google Buzz and all the available link metrics from Linkscape so I set out to build the model. Before describing the model, it's important to write down our modeling assumptions. They are:

Google uses link metrics for ranking, similar to those available in our Linkscape API.
Google uses other social media data, in particular Tweets and Google Buzz shares to rank.

We are testing whether Facebook shares provide any increase in predictive power beyond these factors.

To build the model, I took a subset of the full dataset used in the ranking factors report (scrubbed for data quality, but otherwise unchanged). The baseline mean Spearman correlation between search position and Facebook shares in this data is 0.30. Then I took all 61 keyword agnostic link metrics used in the ranking factors and (1) ran them through a generic filter to transform them to something close to Gaussian and (2) did a principal component analysis and kept the first 19 principal components that explain 99% of the variability in the original data. This allows me to use a complicated non-linear model without worrying about collinearity issues. I augmented the 19 principal components with three social media metrics, namely the number of Tweets and Topsy Influential metrics from Topsy and the number of Google Buzz shares.

I used a two step process to fit the actual model. First, about 33% of the URLs don't have any Facebook shares and the rest have at least one share. Fitting a regression model to a distribution with a big spike at 0 is generally not a good idea, so I first fit a binary classifier to the cases for 0 / 1 or more shares. Then, I fit a non-linear regression model to the remaining data with at least one share. Over fitting was controlled through cross-validation. The total predicted number of shares can be computed from the output of these two pieces.

The final model performs moderately well, with the correlation coefficient between actual and predicted shares at 0.73. However, the mean Spearman correlation between the predicted number of shares and search position is 0.27, nearly as large as the baseline 0.30. This strongly suggests that our interpretation of Matt Cutts' statement is correct and Google is not using Facebook share data directly to rank.

Takeaways

Facebook shares, at least as related to Google searches, act as a sort of "super-metric", encompassing a variety of different factors (similar to SEOmoz's Page Authority and Domain Authority).
Don't stop sharing and generating brand engagement through Facebook! Driving deeper engagement through social media can only help your brand and likely has other positive benefits (by generating tweets or links, for example).
Earning Facebook shares (probably) will not directly boost your Google rankings (though it may have positive effects that indirectly promote links, tweets and other signals Google may use).
The process of doing this type of correlation work and sharing the results openly started a great dialogue in the search community, and through that we all learned more about how search works. We plan to do more work like this, and are planning a project for the Fall to compare Google and Bing results.

Finally, I'd like to thank Danny Sullivan and Matt Cutts for sparking this work through their discussion at SMX Advanced.

Do you like this post? Yes No

Seth's Blog : Summer reading, 2011

Summer reading, 2011

By request, here's a grab bag of books you might not have read yet.

PS congratulations to my friend Steve on the publication of The Profession.

• Email to a friend •

Anafore Customer Referral Program Review Graywolf's SEO Blog

Anafore Customer Referral Program Review

Posted: 13 Jun 2011 10:34 AM PDT

The following review is a sponsored post for Anafore Customer Referral Program.

Anafore customer referral program is an email add on that merchants or online stores can use to incentivize customers to share promotional offers with their friends in exchange for their own promotional offers. Basically, when a customer completes an order, it sends them an email asking them to forward this offer (like a 10% discount). For every person who converts, they get a a promotion (like 15% off their next order).

The system is really easy to set up. Basically you set up a custom subdomain to use for tracking, upload your logo, setup the discount message, and install a tracking script on your system. Screen shots of the setup below (click to enlarge)

When a customer completes an order, they give you a secret email to forward the customer info to them, and then either right away or on a delay you send them the email containing the referral offer. You have to program your shopping cart to take care of the discount; Anafore only handles the email and tracking. If you are using Amazon Webstore, Big Cartel, Etsy, Majento, Shopify or Volusion, they provide you with links during setup to get the offer set up properly.

Anafore Cart Options

Anafore tracks how many offers you sent, when you sent them, how many were opened and when, who are the biggest referrers, who are the biggest sharers and lots of other stats. I was running an imaginary campaign, so the numbers are low.

Anafore Tracking Dashboard

The system is really easy to set up and get information out of. If you are using one of the above mentioned platforms, integration is really easy. You can use your own ecommerce system. If you do, you will just need to set up the offers and take care of the data exchange, but it’s not too complicated. They do offer one month risk free; after that, it’s 65 cents per click. My suggestion would be not to trigger the campaign until the customer’s order is approved or ships–this way, you don’t end up paying for leads from deadbeats.

Creating a referral loyalty program is a really good idea. Your current customers are always your best leads, and giving them an incentive to share with their friends is a very good idea. The system is extremely easy to set up and, to be honest, I can’t see it taking more than an afternoon to implement. At 65 cents per click, it’s almost a no brainer unless you are working with extremely low-priced merchandise. You can customize your offers and presentation however you like, although my suggestion is to keep it as simple as possible. You can learn more about customization on the FAQ. With a one month risk free offer, it’s hard to go wrong, so give Anafore a try.

The preceding has been a sponsored post. Find out more information about sponsored posts.

photo credit: Shutterstock

5 Ways to Get More Customer Referrals Every business will probably agree that customer referrals are awesome! ...
Constant Contact Review – Email Program In an earlier post (see Bridging the Gap Between Email...
How To Get Customer Reviews on Your Website While UGC and Reviews can often be a mixed blessing,...
Google Adsense for Video Beta Program Perhaps you’ve heard of the Google Adsense for video ads....
New Link Advertising Program InLinks.com from Text Link Ads Yesterday, Text-Link-Ads launched a new service InLinks, that places ads...

Advertisers:

Text Link Ads - New customers can get $100 in free text links.
BOTW.org - Get a premier listing in the internet's oldest directory.
Ezilon.com Regional Directory - Check to see if your website is listed!
Need an SEO Audit for your website, look at my SEO Consulting Services
Link Building- Backlink Build offers 45 PR5+ Backlinks for $295
Directory Journal - Get permanent deep links in a search engine friendly directory
LinkWheel SEO - Get Web 2.0 Backlinks
Links From PR9 Sites - - Get In Top 3 Google ASAP
TigerTech - Great Web Hosting service at a great price.

This post originally came from Michael Gray who is an SEO Consultant. Be sure not to miss the Thesis Wordpress Theme review.

Anafore Customer Referral Program Review

Why is there a rock wall on the South Lawn?

Your Daily Snapshot for
Tuesday, June 14, 2011

Photo of the Day

Kids climb a rock wall during a Let's Move! South Lawn Series event on the South Lawn of the White House, June 13, 2011. The series, which invites students from Washington, D.C., area sports leagues and community programs to participate in a variety of physical activities, will last through the summer. (Official White House Photo by David Lienemann)

Seth's Blog : In praise of programming

In praise of programming

Not computer programming, which is important, but content programming.

Someone decides what to put on the radio after that song you just heard. Someone realizes that Conan needs to do more than just tell standup. Someone decides that if every tweet is just like the tweet you just sent, it's boring.

We're all programmers now. We all have to decide what to post next, what to point to next, what to launch next. Is there a skill in dreaming up Must-See Thursday nights, or in establishing a season of Shakespeare or even in deciding what's on the special list at the restaurant? I think there is.

Yes, you must do great work. You also need to figure out how to program for your audience, even if the audience is only one person.

• Email to a friend •

marți, 14 iunie 2011

SEOmoz Daily SEO Blog

SEOmoz Daily SEO Blog

Correlation and Causation

SMX Advanced

Building a better model for Facebook shares

Takeaways

Seth's Blog : Summer reading, 2011

Summer reading, 2011

More Recent Articles

Anafore Customer Referral Program Review Graywolf's SEO Blog

Anafore Customer Referral Program Review Graywolf's SEO Blog

Related posts:

Advertisers:

Why is there a rock wall on the South Lawn?

Seth's Blog : In praise of programming

In praise of programming

More Recent Articles

Pagini

Persoane interesate

Arhivă blog

marți, 14 iunie 2011

Correlation and Causation

SMX Advanced

Building a better model for Facebook shares

Takeaways

More Recent Articles

Related posts:

Advertisers:

More Recent Articles

Pagini

Persoane interesate

Arhivă blog

Subscribe to