marți, 25 martie 2014

Knowledge Graph 2.0: Now Featuring Your Knowledge

Knowledge Graph 2.0: Now Featuring Your Knowledge


Knowledge Graph 2.0: Now Featuring Your Knowledge

Posted: 24 Mar 2014 04:10 PM PDT

Posted by Dr-Pete

Sometime in January, Google quietly rolled out a change that I believe could revolutionize organic search. Currently, the impact is limited, and it may take months or years for the full effect to be felt, but the underlying shift is fundamental to the future of the Knowledge Graph and the delicate symbiosis between Google and webmasters.

Answer box 1.0

Let's start at the beginning. I've written a lot about the current generation of answer boxes (sometimes called "direct answers" or "one-box answers"). These display quick answers to what are usually concrete questions. For example, if I want to know when the Willis Tower here in Chicago is open, I can search for [Willis tower hours] and get:

Google's ability to understand questions has expanded significantly in the past couple of years, probably pushed forward even more by the Hummingbird update. For example, I can get the same answer box by querying [when is the Sears Tower open].

So, where is this data coming from? Typically, it's coming directly from the Knowledge Graph, and you can spot it pretty easily. Here's the Knowledge Panel for [Willis tower]:

I've added the red arrow â€" as you can see, the information in the answer box is taken directly from a property in the Knowledge Graph. You can easily reverse it, too, to create endless examples. Let's take the property "Construction started: 1970" and turn it into a query, like [when was the sears tower built]. You'll get another answer box:

Most of this information comes from a very limited number of sources, including Freebase, Wikipedia, and Google+. Freebase is structured in terms of entities and properties (think object-based, as opposed to article-based), which makes it a perfect fit for Knowledge Graph.

Google's dilemma

There's a problem, though. The main sources of data for the Knowledge Graph are curated by people. Ironically, Google is facing the same dilemma with Knowledge Graph in 2014 that led to the creation of internet search engines in the first place. Put simply, the scope of information is much too large, and growing too quickly, for any human-edited approach to scale. Google can't just hire Wikipedia editors â€" they need a new data source.

Google is hardly blind to this problem. In a research paper published just this year, Google outlines the basic issue (hat-tip to Andrew Isidoro):

The paper goes on to explain a method of extracting missing knowledge graph data on demand, using Google's existing search technology. Welcome to...

Answer box 2.0

Luckily (for them), Google already has one of the largest data sources on the planet â€" their index of the worldwide web. What if, instead of looking for answers in a limited set of encyclopedic sources, Google could generate answers directly from our websites?

That's exactly what they've done. For example, here's what you'll see at the top of a recent search for [social security tax rate]:

Unlike answer boxes based on the Knowledge Graph, this new format pulls its answer directly from third party websites, giving them attribution via the page title and link. In many ways, this is an additional organic result, and like all answer boxes in the left-hand column, it appears above "#1".

These longer answers look more like search snippets, but there's also a second version, triggered when Google can find a definitive answer on a third-party site. Here's the new answer box for the query [September birthstone]:

This example includes a longer snippet, but the direct answer â€" "Sapphire" â€" is highlighted, more in the style of a traditional answer box. Again, the source page's title and URL is shown below the snippet.

How do we know, beyond the third-party attribution, that this isn't coming from the traditional Knowledge Graph? Try a variation on the query, like [september's birthstone]. I get this result:

Here's the answer box for a longer query [what is september's birthstone]:

Interestingly, the short answer ("sapphire") is no longer capitalized, because that's how Google found it on the source page. In my personal testing, these variations weren't consistent, so Google may be using some kind of query refinement. Regardless of that, it's pretty clear that these answers are being generated on the fly.

The new number one

These answer boxes are essentially a new organic result, and clearly disrupt the traditional top results. So, where are these answers coming from, and how do you get one? We don't have a lot of data yet, but in every case I've seen, the URL used to create the answer box also appears on page one of Google results. So, you have to already be ranking well on the term.

In most of the cases that I've seen so far (again, the data set is small), the answer is coming from the #1 organic position. For example, here's the answer box and #1 result I get for [marine corps' birthday]:

So, military.com is essentially getting two listings on this SERP. In some cases, though, the answer is coming from a result lower on page 1. Here's the answer box and part of page 1 for [richest man in the world]:

In this case, Time Magazine gets credit for the answer box, even though it's all the way down in #8, and Forbes has all three of the top organic spots. What's even worse is that Time article directly cites Forbes as the source, even in the search snippet. So, what's going on here?

I suspect this comes down to fairly basic on-page factors. The main Forbes article is a bit design-heavy (it has limited crawlable text) and uses an "infinite" scroll approach. None of the Forbes pages directly mention the phrase "richest man in the world", especially in proximity to Bill Gates' name.

What if I change my query to something that Forbes targets better, like [world's richest people]? Here's the result I get (all of these searches are incognito, but I can't rule out some sort of query history effect):

It's interesting that Google seems to be inferring that I want to know the world's richest person (and is bolding "Bill Gates"), but doesn't feel that the answer is definitive enough to break it out as a short answer. Even since starting this post, Google has made refinements to the matching system, but currently it seems like on-page keyword targeting is fairly critical.

It's just the beginning

Google clearly has a long way to go. Some of the answer boxes are pretty ridiculous. Take, for example, a search for [hair color]:

This is a pretty ambiguous query, and it doesn't seem well suited for any kind of answer box (let alone one that's one step away from a salon advertisement). Expect Google to put a lot of time and money into improving this system over the next year.

While this post is focused on answer boxes, Google is using a similar approach to expand knowledge panels. For example, here's a search for [biology]:

Notice the "Related topics" section â€" only one of those results is coming from Wikipedia. Google is building a decent chunk of this knowledge panel on sites in their index. The attribution on these is much more subtle â€" only the small, gray text goes to the source site. The blue links (except for "Wikipedia" at the top) go directly to more Google searches.

Is the balance shifting?

It's easy to see how this progression is inevitable â€" Google has to expand the Knowledge Graph, and they can't rely on human editors and static data sources. While this data may be good for users, it represents a shift in the balance between Google and webmasters. There's always been an implied symbiosis â€" Google crawls our sites and extracts information, but they send us traffic in return. We may not always like how they do things, but the end result has benefitted millions of site owners.

What happens when a user can get a simple answer quickly, and that answer is extracted from a third party page and cannibalizes the organic clicks? What happens when third-party data is being used not to drive traffic to the source, but to more Google searches? It seems to me that the symbiosis is threatened.

For now, there's not much you can do. You can work to retune your on-page content to appear in these new entities, but you do so at the risk of harming your own organic traffic. It's probably better to be in the answer box than let your competitor be there, but it's hardly an ideal choice. The best I can say is to be aware of your money terms â€" not just how you're ranking, but how those SERPs actually look in context. At some point, we may all have to decide if giving away our data is worth what we get in return.


Sign up for The Moz Top 10, a semimonthly mailer updating you on the top ten hottest pieces of SEO news, tips, and rad links uncovered by the Moz team. Think of it as your exclusive digest of stuff you don't have time to hunt down but want to read!

AdSense Insider March 2014

Explore the latest updates and tips every month.
Your Publisher ID: ca-pub-1492172262972996
March Edition
In this issue of AdSense Insider, you’ll discover how to track your site’s mobile performance, how a local site grew into an online guide for tourists and how to adjust reporting to your local time zone.
Product News
Scoring your mobile site
If you’re interested in how mobile users navigate your site, check out your publisher scorecard. The PageSpeed Insights mobile analysis provides data on your site's performance on mobile devices.
Back by popular demand
We’ve brought back some popular features to the new AdSense homepage. You can now directly access your Google Analytics account and keep track of important information, including finalized earnings, all in one place.
Latest buzz
Online tourist guide stays free of charge
In the last ten years, SanFranciscoChinatown.com has become a go-to guide for tourists and locals visiting San Francisco’s Chinatown. See how AdSense and DFP helped finance founder Kevin Hsieh’s online guide to events, culture and history in the city he loves — and then share your AdSense story.
Help us help you with our semi-annual survey
We’ll be launching our semi-annual survey on April 23rd and would love to hear from you. If you’d like to participate, review your contact information and opt-in to receive 'occasional survey' messages.
We’ll keep you posted with more news in April.
Until then, see you online.
Did you know:
90%
The Google Display Network reaches 90% of global internet users.*
*source: comScore.
Tip:
Try time zone reporting to get the most relevant reporting based on your location.
Was this message useful? Share your feedback with us.

Seth's Blog : The debilitating myth of musical chairs

 

The debilitating myth of musical chairs

I was invited to a fancy gathering the other day. Thirty of us, chatting amiably over drinks, then invited to sit down to eat.

A little slow on the trigger, I was the last one over to lunch. To my horror, there were only 29 seats at the long table. All of my Jungian anxieties triggered in one moment. No room for you, you don't belong here, you probably shouldn't have come in the first place.

After a deep breath, I walked over, got a chair from along the wall and scooted myself in.

Epic disaster, averted.

It turns out that in the connection economy, where the network effect creates value and abundance in those connections, it's pretty unlikely that there are precisely one-too-few chairs at the table you hope to sit at. And if there are, it turns out that it's easier than ever to bring your own chair.

Even better, start your own table.

In school, we teach kids to try out, to work to make the cut, to suck it up and give up when they don't. We forget to teach them that the better approach (the adult, real world approach) is to just start your own team. One hyper-ironic example: A friend didn't make it past the final try-outs for the improv club at school. Bummed out, he moved on, never realizing that he could start his own improv club...

If you're spending a lot of time worrying about musical chairs, it's almost impossible to be generous and connected. If you've got one eye on the lookout for when the music will stop and which chair you're going to grab, it's inevitable that you're not really focusing on the amazing people you're with. On the other hand, once you stop playing that game, it seems as though new chairs just keep materializing.

       

 

More Recent Articles

[You're getting this note because you subscribed to Seth Godin's blog.]

Don't want to get this email anymore? Click the link below to unsubscribe.




Email subscriptions powered by FeedBlitz, LLC, 9 Thoreau Way, Sudbury, MA 01776, USA.