Google

Insights from Googlers into our products, technology, and the Google culture.

URL

XML feed
http://googleblog.blogspot.com/

Last update

18 hours 52 min ago

August 18, 2008

22:45
The global nature of our mission is reflected in the phrases the "world's information" and "universally accessible." To this end, you may have recently read about our 40-language initiative and the story of a community coming together to develop Google search in the Maori language.

Following on this theme, we'd like to highlight a few new products that enable a better online experience for Tamil speakers around the world.

First, we just released Google News in Tamil. Like other Google News editions, we gather stories from the various Tamil news sources on the web and present an automatically- generated summary with links to the most important stories in each section.

We recognize that it can sometimes be hard to enter Tamil text with existing keyboards. Our transliteration technology enables the conversion from English text to phonetically equivalent text in Indian languages. For example, using transliteration, you could type "vanakkam" and we would convert it to Tamil script as வணக்கம். We have embedded this technology in several Google products to make it easier to enter text in Tamil.

Google search in Tamil enables users to start typing in English and automatically get query suggestions in Tamil. If you wanted to enter the query "ponniyin selvan" in Tamil, just start typing it in English - e.g. "ponni" and we will show the Tamil suggestions:


Tamil transliteration in Blogger is designed for bloggers publishing content in Tamil when using the English keyboard for text entry. It's our hope that this will make Tamil content more popular and more easily available online.

Tamil transliteration in orkut makes it easier to communicate with friends and family by exchanging scraps in Tamil.

We hope that each of these products will help to bring the benefits of the Internet to the millions of Tamil speakers in India and elsewhere.

Posted by Vinodh Kumar R and Naren Manappa, Software Engineers
Categories: Tech Stuff
13:02
We're reading a lot about the candidates and the media this election season. But what are they reading? At google.com/powerreaders now you can track the news sites and blogs Barack Obama and John McCain read (from Drudge to The Daily Show) and follow articles catching the eyes of leading political journalists. Both the McCain and Obama presidential campaigns and leading political journalists are using Google Reader to keep up with their favorite new sites and blogs as well as share articles that interest them. You can follow shared articles and blog posts, or you can add participants' reading lists or shared news feeds to your own Reader account.

We're pleased to include the following contributors in our launch:
  • Obama and McCain campaigns
  • Mike Allen, POLITICO
  • Chuck DeFeo, Townhall
  • John Dickerson, Slate
  • Mark Halperin, TIME
  • Arianna Huffington, Huffington Post
  • Ruth Marcus, Washington Post
  • Jon Meacham, Newsweek
  • Patrick Ruffini, The Next Right
Visit google.com/powerreaders to stay up to date on what the political gurus are reading -- so you too can become one by November.

Posted by Robby Stein, Associate Product Marketing Manager
Categories: Tech Stuff
08:59
For quite some time we've been talking about the potential of the unused airwaves between broadcast TV channels ("white spaces") to provide affordable, high-speed wireless Internet connectivity nationwide. For this to happen, the Federal Communications Commission (FCC) must allow unlicensed use of this spectrum.

If you care about the future of the Internet, now is the time to take action. The FCC has completed its field testing and is expected to make a ruling in the coming months. With this in mind, today we're launching Free The Airwaves, a new effort to bring users together around this important issue.

To help you to learn more about the tremendous promise of these airwaves, people from around the country have filmed video testimonials. Matthew Rantanen of Tribal Digital Village explained how freeing the airwaves would bring new opportunities to the Southern California Native American community, currently underserved by today's broadband providers. Wally Bowen of the Mountain Area Information Network discussed the potential of these airwaves to bring broadband access to rural communities. Many others have also weighed in, and we hope you will too.

At its core, Free The Airwaves is a call to action for everyday users. You don't need to be a telecommunications expert to understand that freeing the "white spaces" has the potential to transform wireless Internet as we know it. When you visit the site, you'll be invited to film a video response explaining what increased Internet access could mean for you, to sign a petition to the FCC, to contact your elected officials, to spread the word, and more.

When it comes to opening these airwaves, we believe the public interest is clear. But we also want to be transparent about our involvement: Google has a clear business interest in expanding access to the web. There's no doubt that if these airwaves are opened up to unlicensed use, more people will be using the Internet. That's certainly good for Google (not to mention many of our industry peers) but we also think that it's good for consumers.

That said, we can't pretend to speak for you. To learn more about what's at stake and to get involved, check out FreeTheAirwaves.com. We hope that once you've explored the facts for yourself, you'll want to make your voice heard.

Posted by Minnie Ingersoll, Product Manager, Alternative Access Team
Categories: Tech Stuff

August 13, 2008

15:55
Back in February we told you about the 2008 Model Your Campus Competition, a call for students to submit 3D models of their college campuses created with Google SketchUp. We got submissions from campuses around the world, and Mexico stood out with submissions from 13 different campuses. At that time we also ran a parallel contest with a top Mexican school, ITESM (The Technology Institute of Monterrey), and offered a separate prize for the best models submitted by ITESM students. The students come from all over Mexico, so there is a truly national mix of competitors. In total, ITESM participants designed 111 buildings, representing 22 ITESM campuses. All of the submissions will live in a collection within the Google 3D Warehouse, an online storage space for all your 3D needs. From intergalactic space vehicles to cucumbers, the 3D Warehouse is flush with downloadable models made by the SketchUp community.

Last week we announced the winners of the contest: David Gómez-Urquiza Madero y Ricardo Pfeiffer Hurtado, both students of Mechatronics at ITESM's Santa Fe Campus. Since a digital Earth needs some digital buildings, we're thrilled that ITESM students have submitted their designs to create a more livable Google Earth-the winning models will be included in the 3D Buildings Layers of Google Earth. The school leadership plan on encouraging students to construct detailed 3D models of all 33 ITESM campuses, and the contest will return for another run next year. Here's to the winners!

one of the models

the winning team

Posted by Ana Paula Blanco, Head of Mexico Communications and Public Affairs
Categories: Tech Stuff

August 12, 2008

20:50
Cross-posted from the Google LatLong Blog.

The recent conflict in Georgia has raised some questions about how Google Maps has handled mapping in that part of the world. The most obvious question is, why doesn't Google Maps show any cities or roads for Georgia, or its neighbors Armenia and Azerbaijan? The answer is we never launched coverage in those countries because we simply weren't satisfied with the map data we had available. We're constantly searching for the best map data we can find, and sometimes will delay launching coverage in a country if we think we can get more comprehensive data. Some of our customers have asked if we removed map data from any of these countries in response to the recent hostilities in that region and I can assure you that is not the case. Data for these countries were never on Google Maps in the first place.

But this has generated a lot of feedback that we are listening to and learning from. We're hearing from our users that they would rather see even very basic coverage of a country than see nothing at all. That certainly makes sense, and so we have started preparing data for the handful of countries that are still blank on Google Maps. Georgia, Armenia, Azerbaijan, as well as other significant regions of the world will benefit from this effort.

In the meantime, much of this data, including cities in Georgia and other surrounding countries, can be found in Google Earth.

Posted by Dave Barth, Product Manager
Categories: Tech Stuff
16:15
The Google Apps Security & Compliance team, which provides email and web security for more than 40,000 companies, regularly tracks trends in spam, viruses, and other threats, and we almost always find something interesting. Check out some of our latest findings -- including details on some specific attacks that you should keep an eye out for -- on the Enterprise blog.

And if you're interested in learning more about what you can do to keep your business safe from web and email threats, be sure to tune into our webinar on Friday, August 15, at 10:00 am PT.

Posted by Amanda Kleha, Google Apps Security & Compliance team
Categories: Tech Stuff

August 11, 2008

12:52
Have you ever been traveling and suddenly realized that you didn't know how to ask the taxi driver to take you to your hotel? It's happened to us too, so the mobile team has put together an iPhone interface for Google Translate, our machine translation project. Read more about it on the Google Mobile blog.

Posted by Allen Hutchison and David Singleton, Software Engineers
Categories: Tech Stuff

August 8, 2008

01:14
As we mark the opening ceremonies of the 2008 Summer Games, I can’t help but remember eight years ago, when I competed on the U.S. cycling team. Even though I didn’t walk away with any medals then, training and competing involved a herculean effort - but that pales in comparison to what we’re unveiling today.

I’m happy to present the 2008 Summer Games on Google, a site that features a number of our products to help you stay updated on Summer Games happenings. And it's available in 66 countries and 31 languages, from Australia to Uruguay, and from Arabic to Vietnamese.

We collaborated with a data provider to make it easy for you to keep current on event schedules and get updates on results, as well as track medal counts with an iGoogle gadget. You can also get schedules and results on Google search results. (Check out the results for water polo.) We're also including the newest Summer Games highlights through Google News. The Summer Games Google Maps lets you view medal and event information based on your favorite regions and sports, and there's a 3D video of the various Games venues you can tour:



Also, be sure to check out this cool collection of 3D stadiums and venues in Beijing created with Google SketchUp. Read more about these efforts on the Lat Long Blog. Since we know many of you are on the go this summer, all this information is available for mobile devices, where Google Mobile is available.

We hope these tools make it easy and enjoyable for you to follow all the action at the Summer Games.

Posted by Dylan Casey, Product Manager
Categories: Tech Stuff

August 7, 2008

08:01
Today we're announcing some key enhancements on the Google content network (partner sites for which we provide advertising) that will offer a better experience for users and better value for advertisers and publishers. These enhancements are the latest result of our integration with DoubleClick and our commitment to making advertising on the Google content network more efficient and accountable. When we purchased DoubleClick, we talked about how we would empower agencies, advertisers and publishers to collaborate more efficiently and effectively, and provide a better experience for our users. We are happy that we have been able to deliver on this promise already, like support for third party vendors on the Google content network.

The new enhancements that we are announcing today and that will be available in the coming months are the next step in our integration and in enabling standard industry functionality on the Google content network:
  • Frequency Capping: Enables advertisers to control the number of times a user sees an ad. Users will have a better experience on Google content network sites because they will no longer see the same ad over and over again.
  • Frequency Reporting: Provides insight into the number of people who have seen an ad campaign, and how many times, on average, people are seeing these ads.
  • Improved Ads Quality: Brings performance improvements within the Google content network.
  • View-Through Conversions: Enables advertisers to gain insights on how many users visited their sites after seeing an ad. This helps advertisers determine the best places to advertise so users will see more relevant ads.
We are enabling this functionality by implementing a DoubleClick ad-serving cookie across the Google content network. Using the DoubleClick cookie means that DoubleClick advertisers and publishers don't have to make any changes on their websites as we continue our integration efforts and offer additional enhancements. This also means that with one click, users can opt out of a single cookie for both DoubleClick ad serving and the Google content network. (If a user has already opted out of the DoubleClick cookie, that opt-out will also automatically apply to the Google content network.)

To learn more, you can check out our updated main privacy policy and a new advertising-specific privacy policy that reflects our integration with the DoubleClick ad serving cookie, and you can visit a section in our Privacy Center devoted to advertising and privacy.

We're excited about our integration of DoubleClick and the improvements we're making to the Google content network. And I am personally excited about seeing more relevant ads, especially if I don't have to see the same ads over and over!

Posted by Rajas Moonka, Senior Business Product Manager
Categories: Tech Stuff

August 6, 2008

16:39
Imagine the last time you misplaced an important document right when you needed it most: your plane ticket the day of a flight; your driver's license before the morning commute; the warranty on your radio just as the speakers begin to crackle. A frustrating circumstance, perhaps, but a manageable one given the relatively limited landscape of one's personal items.

Now imagine you can't find a key statistic before a work presentation, a customer detail before a sales pitch, or product spec before a critical design meeting. To find these, you're navigating the opaque universe of your organization's file shares, content systems and databases.

Because we appreciate the universal quality of the first scenario and the high stakes of the second, we've released the latest Google Search Appliance, an enterprise search solution that can index all of an organization's content (up to 10 million documents) in a single box.

We think that searching for the myriad of business information that helps you do your job should be as easy as searching for information on Google.com -- regardless of how much content your organization has, or where it resides. And since the volume of documents, customer contacts, presentations and other data flowing into your office is probably not going to shrink any time soon, giving your IT organization access to a high-capacity single appliance (instead of the dozens that come with typical enterprise search implementations) might save your company expense and administrative hours while making it that much easier for you to find the exact piece of information you need to close that sales deal -- 10 million documents at a time.

As for finding your driver's license, you're on your own. Learn more about our search solutions for businesses.

Posted by Nitin Mangtani, Lead Product Manager, Enterprise Search
Categories: Tech Stuff

August 5, 2008

20:08
Louisiana is a place with heart and soul, a place where culture lives in the streets, in the rhythm of our music and in the flavors of our unique cuisine. I recently had the opportunity to visit the Googleplex and I expressed my interest in seeing Street View come to Louisiana, so I'm excited to see the launch of Street View imagery for Greater New Orleans, Baton Rouge and Shreveport. This remarkable tool allows us to share with the world life as we see it, here on the ground in my home state.

In this time of recovery and rebuilding, it is important that we share real images of life in Louisiana and on the Gulf Coast. As you explore the streets of New Orleans, you will discover a city marked by extremes. You will see some areas spared the worst of Katrina’s fury which have quickly recovered, and you will find other neighborhoods that remain flattened by the floodwaters that broke the levees. You will see that our residents call both FEMA trailers and antebellum mansions home.

What you might not see is the incredible spirit of those who have given themselves to this city. Those who were lost in the storm, and those who survived and have returned. The thousands who are still searching for a place to call home. The more than one million volunteers who have come from across the nation and the world to give their time, their sweat, and their hearts to rebuilding a great American city.

But rebuilding gives us another opportunity – one unprecedented in our lifetimes. Because we are starting from scratch in many cases, we can build back better than before. We can create new solutions to persistent social problems – solutions that can be put to the test in New Orleans. Whether we’re talking about designing new levees to hold back flood waters, schools to prepare our kids for a 21st century economy, or a justice system to keep our citizens safe, Louisiana is addressing all of these issues and more. We can find the answers for our nation’s ills on the streets of this city.

Street View for New Orleans will help you get to know the city, its streets and its neighborhoods in a way never before possible. And when you are ready to discover more, I invite you to come see and experience the streets, the soul and the spirit of New Orleans for yourself.

Posted by Lt. Governor Mitch Landrieu, State of Louisiana
Categories: Tech Stuff

August 1, 2008

18:11
A few weeks back Udi Manber introduced the search quality group, and the previous posts in this series talked about the ranking of documents. While the ranking of web documents forms the core of what makes search at Google work so well, your search experience consists of much more than that. In this post, I'll describe the principles that guide our development of the overall search experience and how they are applied to the key aspects of search. I will also describe how we make sure we are on the right track through rigorous experimentation. And the next post in this series will describe some of the experiments currently underway.

Let me introduce myself. I'm Ben Gomes, and I've been working on search at Google since 1999, mostly on search quality. I've had the good fortune to contribute to most aspects of the search engine, from crawling the web to ranking. More recently, I've been responsible for the engineering of the interface for search and search features.

A common reaction from friends when I say that I now work on Google's search user interface is "What do you do? It never changes." Then they look at me suspiciously and tell me not to mess with a good thing. Google is fine just the way it is -- a plain, fast, simple web page. That's great, but how hard can that be?"

To help answer that question, let me start with our main goal in web search: to get you to the web pages you want as quickly as possible. Search is not an end in itself; it is merely a conduit. This goal may seem obvious, but it makes a search engine radically different from most other sites on the web, which measure their success by how long their users stay. We measure our web search success partly by how quickly you leave (happily, we hope!). There are several principles we use in getting you to the information you need as quickly as possible:
  • A small page. A small page is quick to download and generally faster for your browser to display. This results in a minimalist design aesthetic; extra fanciness in the interface slows down the page without giving you much benefit.
  • Complex algorithms with a simple presentation. Many search features require a great deal of algorithmic complexity and a vast amount of data analysis to make them work well. The trick is to hide all that complexity behind a clean, intuitive user interface. Spelling correction, snippets, sitelinks and query refinements are examples of features that require sophisticated algorithms and are constantly improving. From the user's point of view search, almost invisibly, just works better.
  • Features that work everywhere. Features must be designed such that the algorithms and presentation can be adapted to work in all languages and countries. Consider the problem of spell correction in Chinese, where user queries are often not broken up into words or Hebrew/Arabic, where text is written right to left (interestingly, this is believed to be an example of first-mover disadvantage -- when chiseling on stone, it is easier to hold the hammer in your right hand!).
  • Data driven decisions - experiment, experiment, experiment. We try to verify that we've done the right thing by running experiments. Designs that may seem promising may end up testing poorly.
There are inherent tensions here. For instance, showing you more text (or images) for every result may enable you to better pick out the best result. But a result page that has too much information takes longer to download and longer to visually process. So every piece of information that we add to the result page has to be carefully considered to ensure that the benefit to the user outweighs the cost of dealing with that additional information. This is true of every part of the search experience, from typing in a query, to scanning results, to further exploration.

The start of your search is typing in a query. A common cause of frustration is if you don't know the correct spelling of a word! Spell correction -- which seems like a simple and obvious feature -- hides many technical challenges. No common English dictionaries would ever include the correct spelling of Britney Spears, for instance (who, probably completely unbeknownst to her, has become the poster child example for this feature). We do a huge amount of analysis of the billions of pages on the web and our query logs to determine what are "real words" on the web, and what are likely to be misspellings. The system that gives you the spell correction has to, in a fraction of a second, consider a huge number of possible words you might have meant (vastly greater than any dictionary ever manually constructed) and determine if there is a more likely query you meant to type. When we are confident that you actually meant to type something else, we take a rare liberty with our search results: we try to distract you from looking at the top result on the page. The spelling correction is in your line of sight and colored a bright must-see red. Furthermore, we now make sure that nothing else on the page is red, unless it is as important to you as spelling! (so far, nothing is). The algorithms involved in spell correction are constantly getting better. They now work in a large number of languages and are even better at detecting when you have made a spelling mistake. Getting the spelling of your query right is so important that we are considering showing you the results of the spell-corrected query in the middle of the page (just in case you missed our bright red text at the top and bottom!).

Having formulated your query correctly, the next task is to pick a page from the result list. For each result, we present the title and url, and a brief two line snippet. Pages that don't have a proper title are often ignored by users. One of the bigger recent changes has been to extract titles for pages that don't specify an HTML title -- yet a title on the page is clearly right there, staring at you. To "see" that title that the author of the page intended, we analyze the HTML of the page to determine the title that the author probably meant. This makes it far more likely that you will not ignore a page for want of a good title. Below the title comes the snippet, and a key early innovation was in what Google showed for the snippet. At the time, search engines showed you the first two lines of the web page; Google, instead, showed you parts of the page where your actual search keywords showed up (information retrieval experts call this "keywords-in-context"). Showing keywords-in-context is visually simple and virtually indistinguishable from the simpler style of snippets, but vastly more useful in helping you decide which page to visit. This simplicity belies underlying complexity: when we create a snippet we have to go through the actual text from each result to find the most relevant part (which contain your keywords) rather than just giving you the first few lines.

We have been making improvements to our snippets over time with algorithms for determining the relevance of portions of the page. The changes range from the subtle -- we highlight synonyms of your query terms in the results -- to more obvious. Here's an example screenshot where the user searched for "arod" and you can see that Alex and Rodriguez are bolded in the search result snippet, based on our analysis that you might plausibly be referring to him:

As a more obvious example, we now extract and show you the byline date from pages that have one. These byline dates are expressed in a myriad formats which we extract and present uniformly, so that you can scan them easily:

For one of the most common types of user needs, navigational queries -- where you type in the name of a web site you know -- we have introduced shortcuts (we refer to them as sitelinks). These sitelinks allow you to get to the key parts of the site and illustrate many of the same principles alluded to above; they are a simple addition to the top search result that adds a small amount of extra text to the page.

For instance, the home page of Hewlett-Packard has almost 60 links, in a two-level menu system. Our algorithms, using a combination of different signals, pick the top ones among these that we think you are most likely to want to visit.

What if you did not find what you were looking for among the top results? In that case, you probably need to try another query. We help you in this process by providing a set of query refinements at the bottom of the results page -- even if they don't give you the query that you need, they provide hints for different (likely more successful) directions in which you could refine your query. By placing the query refinements at the bottom of the page, the refinements don't distract users, but are there to help if the rest of the search results didn't serve a user's information need.

I've described several key aspects of the search experience, including where we have made many changes over time -- some subtle, some more obvious. In making these changes to the search experience, how do we know we've succeeded, that we've not messed it up? We constantly evaluate our changes by sharing them with you! We launch proposed changes to a tiny fraction of our users and evaluate whether it seems to be helping or hurting their search experience. There are many metrics we use to determine if we've succeeded or failed. The process of measuring these improvements is a science in itself, with many potential pitfalls. Our experimental methodology allows us to explore a range of possibilities and launch the ones that work the best. For every feature that we launch, we have frequently run a large number of experiments that did not see the light of day.

So let me answer the question I started with: We're actually constantly changing Google's result page and have been doing so for a long time. And no, we won't mess with a good thing. You won't let us.

In the next post in this series, I'll talk about some of the experiments we are running, and what we hope to learn from them.

Posted by Ben Gomes, Distinguished Engineer
Categories: Tech Stuff
07:59
You may have read a couple of weeks back about our 40-language initiative and our broader goal of making the world’s information accessible in as many languages as possible. For this reason we were extremely pleased last week to take part in an event in Rotorua, New Zealand for the launch of the Google homepage and search interface in the Maori language. I want to emphasize “take part in”, because much of the hard work that made this announcement possible came from a dedicated team of volunteer translators across New Zealand.

In conjunction with our active effort to make all of our products and services available in 40 languages, beginning in 2001 we began a program known as Google in Your Language, which is designed to give anyone the tools to translate Google services into languages in which they are fluent. Thanks to this program, as well as our other efforts to localize our products, the Google homepage itself now appears in more than 100 languages.

Around the time the Google in Your Language program began, I reached out to a former colleague at Waikato University, Dr. Te Taka Keegan, with the idea of translating Google into Maori. While working on his doctorate, Te Taka began the translation effort in his spare time. Over the course of the next six years, with the help of several other volunteers, he had covered 68% of the messages. It was at this point in 2007 that the husband-and-wife team of Potaua and Nikolasa Biasiny-Tule caught wind of the effort, and took it upon themselves to complete the project. Thanks to their passion for the Maori language and technical savvy, they were able to recruit the help of the Maori Language Commission and dozens of volunteers, leading ultimately to all translations being completed within a year—just in time for Maori Language Week 2008. By the end of it all, more than 1,600 phrases, totaling more that 8,500 words, had been translated.

Besides being a fantastic volunteer effort, the Google Maori project is a great example of how the Internet encourages user participation, especially in particular cultural and linguistic communities. I'd like to offer a tremendous thank you and congratulations to the Maori translation team in New Zealand, and to all those who helped make this possible.

Posted by Craig Nevill-Manning, Engineering Director
Categories: Tech Stuff

July 30, 2008

20:55
We set up shop in Ann Arbor, Mich. nearly two years ago. And we’ve been so busy, we’ve barely had time to say hi. But before we tell you about the interesting things we're doing in our new location, we figure you might want to know a little bit more about our state and our town.

Sandwiched between two Great Lakes, peppered with forestry, and teeming with kindhearted Midwesterners, Michigan is the kind of place you'd be lucky to visit and we get to live here. Not only that, but we’re located in Ann Arbor, a town with a great progressive story:
  • Popular Science magazine ranked Ann Arbor in the top 25 greenest cities in America.Some 50,000 trees grow along Ann Arbor streets, and city parks boast another 50,000. And while no trees actually grow in the Google office, our cheeks do seem to be turning a nice leafy shade of green — probably from walking and biking to work as part of Ann Arbor’s Commuter Challenge, swapping paper for reusable dishes in our cafeteria, and educating ourselves on composting and recycling.
  • On Oct. 14, 1960, President John F. Kennedy announced his proposal for the Peace Corps on the front steps of the Michigan Union, in downtown Ann Arbor. Nearly 50 years later, we "A2ooglers" feel a similar sense of urgency — but this time, it’s a desire to work with our very own state, from soup kitchens to river cleanups. We’re also connecting local schools and businesses with Google products.
  • In the first Rose Bowl Game in 1902, University of Michigan (located in Ann Arbor) defeated Stanford 49 - 0. Like our Wolverine neighbors, we're burning with competitive spirit — one that’s given birth to office teams for kickball, soccer, volleyball, tennis, basketball, skiing, ultimate Frisbee and trivia.
Forgive us our moment of boosterism, but there's more:
Inside our walls, you’ll find a team that's committed to our AdWords advertisers — from identifying potential advertisers, to assisting current ones with day-to-day challenges, to strategizing with others for the future. That’s who we are. We’d love to have you join us.

Posted by Eileen Duffy, AdWords Associate
Categories: Tech Stuff
19:35
Google Apps is rapidly gaining momentum in education. We now have more than a million people on campuses worldwide actively using Google's suite of email, calendar and docs to share information and study. This makes perfect sense. Schools have always been a proving ground for innovative ideas. And as we prepare for the new school year, we are happy to welcome more than a dozen universities across the U.S., joining the thousands of other schools that have already embraced cloud computing in education. Here are the new additions:
  • Collin County Community College District
  • Francis Marion University
  • George Washington University
  • Indiana University
  • Kean University
  • Kent State University
  • Kishwaukee College
  • Loyola Marymount University
  • Montgomery County Community College
  • New Jersey Institute of Technology
  • University of Florida
  • University of San Diego
  • University of Virginia
This is really just the beginning. As we continue working to make it easier to communicate and collaborate online, we are going to meet with some of the top technology experts -- the students themselves. For the entire month of September, we are heading "App to School" by embarking on a cross-country bus tour to visit campuses, listen to students and learn more about how cloud computing is helping education. Please check out our Enterprise blog for more info.

Posted by Jeff Keltner, Business Development Manager, Google Apps
Categories: Tech Stuff
16:32
As we continue to refine our search algorithms to deliver more relevant results, we strive to be as open as possible about how we use data to improve your search experience. Today, we're rolling out a new feature in Google Web Search that will help you better understand how your search results are already customized. Over the next few days, you may start to see messages like this in the upper right corner of your search results page (click on the image to view larger):

You can click the "More details" link to get to a page like this:

You'll see these new messages whenever your search results have been customized based on one or more of the following types of information:

  • Location. By default, we identify your approximate city location based on your computer's IP address and use it to customize your search results. If you'd like Google to use a different location, you can sign into or create a Google Account and provide a city or street address. Your specific location will be used not only for customizing search results, but also to improve your experience in Google Maps and other Google products.
  • Recent searches. We take into account whether a particular query followed on the heels of another query. Because recent search activity provides such valuable context for understanding the meaning behind your searches, we use it to customize your results whenever possible, regardless of whether you're signed in or signed out. In order to customize your results and show you the customization details, we keep the most recent query on your browser for a limited time. After that, the information is removed from your browser and disappears immediately if you close your browser.
  • Web History. If you're signed in and have Web History enabled, we customize your search results based on what you've searched for in the past on Google, and what web sites you've visited. One important note about Web History: it belongs to you and you have complete control over it. You can remove specific items or pause the service at any time. And if there's a particular search that you'd rather not have personalized based on your Web History, you can also just temporarily sign out of your Google Account.
This new feature doesn't change anything at all about how you search on Google and the results you get; it just gives you more of a behind-the-scenes look at how we customize your search experience. We consider this to be an important step in our commitment to transparency, and we hope you find it informative and useful.

Posted by Rachel Garb, Product Manager
Categories: Tech Stuff

July 26, 2008

13:31
Randy Pausch, a professor of computer science at Carnegie Mellon University and a good friend of Google, passed away last night. In addition to being recognized as a pioneer in virtual reality research, he became widely known as a gifted teacher and a mentor to many. Millions of people saw his inspiring "Last Lecture" on YouTube. Read more about Randy and his contributions on our Research Blog.

Posted by Kevin McCurley, Research Scientist
Categories: Tech Stuff

July 25, 2008

20:22
Last month, a group of Googlers traveled to Brazil, to conduct our first-ever project in the Amazon. Organized by our Google Earth Outreach team, we went at the special invitation of Amazon Chief Almir Naramayoga Surui, who'd invited us down to train his people on using Google Earth, YouTube, blogs and other Internet tools in order to preserve their history and culture, protect their rainforest, and create a sustainable future for their tribe.

This was an unusual request, especially because until recently, the Surui Indians used stone tools and hunted and fished with bows and arrows. But as we considered this request, we realized that it was very much within the mission of Google Earth Outreach, which helps people around the world learn how to use Google Earth and Maps for public benefit. We had previously collaborated with the U.S. Holocaust Memorial Museum to map destroyed villages in Darfur, with UNHCR to show
Categories: Tech Stuff
13:13
We've known it for a long time: the web is big. The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, we've seen a lot of big numbers about how much content is really out there. Recently, even our search engineers stopped in awe about just how big the web is these days -- when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!

How do we find all those pages? We start at a set of well-connected initial pages and follow each of their links to new pages. Then we follow the links on those new pages to even more pages and so on, until we have a huge list of links. In fact, we found even more than 1 trillion individual links, but not all of them lead to unique web pages. Many pages have multiple URLs with exactly the same content or URLs that are auto-generated copies of each other. Even after removing those exact duplicates, we saw a trillion unique URLs, and the number of individual web pages out there is growing by several billion pages per day.

So how many unique pages does the web really contain? We don't know; we don't have time to look at them all! :-) Strictly speaking, the number of pages out there is infinite -- for example, web calendars may have a "next day" link, and we could follow that link forever, each time finding a "new" page. We're not doing that, obviously, since there would be little benefit to you. But this example shows that the size of the web really depends on your definition of what's a useful page, and there is no exact answer.

We don't index every one of those trillion pages -- many of them are similar to each other, or represent auto-generated content similar to the calendar example that isn't very useful to searchers. But we're proud to have the most comprehensive index of any search engine, and our goal always has been to index all the world's data.

To keep up with this volume of information, our systems have come a long way since the first set of web data Google processed to answer queries. Back then, we did everything in batches: one workstation could compute the PageRank graph on 26 million pages in a couple of hours, and that set of pages would be used as Google's index for a fixed period of time. Today, Google downloads the web continuously, collecting updated page information and re-processing the entire web-link graph several times per day. This graph of one trillion URLs is similar to a map made up of one trillion intersections. So multiple times every day, we do the computational equivalent of fully exploring every intersection of every road in the United States. Except it'd be a map about 50,000 times as big as the U.S., with 50,000 times as many roads and intersections.

As you can see, our distributed infrastructure allows applications to efficiently traverse a link graph with many trillions of connections, or quickly sort petabytes of data, just to prepare to answer the most important question: your next Google search.

Posted by Jesse Alpert & Nissan Hajaj, Software Engineers, Web Search Infrastructure Team
Categories: Tech Stuff

July 23, 2008

13:32
A few months ago we announced that we were testing a new product called Knol. Knols are authoritative articles about specific topics, written by people who know about those subjects. Today, we're making Knol available to everyone.

The web contains vast amounts of information, but not everything worth knowing is on the web. An enormous amount of information resides in people's heads: millions of people know useful things and billions more could benefit from that knowledge. Knol will encourage these people to contribute their knowledge online and make it accessible to everyone.
The key principle behind Knol is authorship. Every knol will have an author (or group of authors) who put their name behind their content. It's their knol, their voice, their opinion. We expect that there will be multiple knols on the same subject, and we think that is good.

With Knol, we are introducing a new method for authors to work together that we call "moderated collaboration." With this feature, any reader can make suggested edits to a knol which the author may then choose to accept, reject, or modify before these contributions become visible to the public. This allows authors to accept suggestions from everyone in the world while remaining in control of their content. After all, their name is associated with it!

Knols include strong community tools which allow for many modes of interaction between readers and authors. People can submit comments, rate, or write a review of a knol. At the discretion of the author, a knol may include ads from our AdSense program. If an author chooses to include ads, Google will provide the author with a revenue share from the proceeds of those ad placements.

We are happy to announce an agreement with the New Yorker magazine which allows any author to add one cartoon per knol from the New Yorker's extensive cartoon repository. Cartoons are an effective (and fun) way to make your point, even on the most serious topics.

Everyone knows something. See what people are writing about, then tell the world what you know: knol.google.com

Posted by Cedric Dupont, Product Manager and Michael McNally, Software Engineer
Categories: Tech Stuff