sexta-feira, 30 de março de 2012

Help Desk Hangouts: Reaching the right customers with Google AdWords

Editor’s note: Each week on the Google+ Your Business page, we’re putting you in touch with Googlers and users who can help you as a business owner get the most out of our products and features.

In our latest Help Desk Hangout On Air, we discussed Google AdWords — how to get started, picking keywords, tracking clickthrough rates, and more! Dori Storbeck, Courtney Pannell, Chad Baranik and Gina Bucciere shared some of their tips and tricks for managing a successful AdWords campaign. If you missed it — don’t worry! — you can watch the full hour-long Hangout on the Google Business YouTube channel:


Earlier in the week and during the Hangout, we collected your questions. Here are a few of the things we addressed:

Is there a limit to the amount of AdGroups in one Campaign?

Each campaign can have up to 20,000 Ad Groups.

What is a good CTR? Does my spend affect my CTR?

In general, we consider a good CTR (click-through-rate) to be 1 percent or above on the Search Network. On the Display Network, users are generally at a different point in the buying cycle and, therefore, we expect a much lower CTR. To evaluate the performance of your Display ads, you may want to look at the Relative CTR metric.

What are the top 10 things one can do to improve their Quality Score?

Really, the thing to focus on here is ensuring that you have good account structure and that your keyword lists are tightly themed and highly relevant to your ad text and your landing page. Focusing on CTR, which is a large part of Quality Score, can help too.

I noticed my quality score changes from day to day. Should I only be reviewing my score weekly or monthly before making changes?

Quality Score is a dynamic metric that is actually calculated each time your ad is eligible to enter the ad auction. The score that you can view next to your individual keywords is basically a snapshot estimate of how relevant your ads, keywords, and landing page are to a person seeing your ad. While Quality Score is an important metric, we’d suggest focusing on some of the other metrics we also covered in the Hangout, like CTR, average position, and conversion tracking.

Do you suggest eliminating all keywords/ads that don't produce conversions over a six month period?

It totally depends on your goals. If, for example, your main advertising goal is to drive conversions and you notice some keywords have been in your account for a while but aren’t helping you achieve your goal, you might consider pausing or deleting those keywords - particularly if you’re paying a lot for those clicks.

Will your keywords with different match types "fight" against each other if they both qualify for the impression?

Essentially, yes. In determining which keyword enters the auction, the AdWords system is going to try to match the keyword that most closely matches the user’s query, but it will also factor in which keyword will be cheapest and get the highest ad position.

It’s not necessary to have all the match types for every keyword in your account. When choosing a match type, think about how users might conduct a search for your business or services, then choose which match type (broad, broad match modifier, phrase, exact) would allow for the most number of users to be able to find you for relevant searches. You can read more about the main keyword matching options here, and about the broad match modifier option here.

If I had drawn a custom shape before that feature was removed is it possible that I am still targeting that area (the account still shows I am)?

Location targeting by custom shape is no longer supported. If you didn’t select specific targeting areas other than the custom shape before we sunset the feature, the AdWords system would have used your custom shape to match your campaign to the targeting areas (cities, metro areas, states, countries) that best match the area within your previously selected shape.

How do I target just five states? I see most of our sales coming from just these five. I would like to see if that increases my sales.

Within your AdWords account, you’ll want to navigate to the Campaigns tab at the top and then select the specific campaign. Then, on the gray Settings tab for that campaign, under the “Locations and Languages” subtitle, there is a “Locations” section.  By clicking the blue Edit link next to “Locations” you can then select specific states (or cities) to target.

To learn more about how to get started with AdWords, visit our Help Center or check out the AdWords Community forum. And remember to tune in to the live stream of our next Hangout at 11 a.m. PDT Wednesday April 4, when we discuss how to get started on Google Places. We’ll be collecting your Places questions early next week on the Google+ Your Business page.

Posted by Dori Storbeck and Courtney Pannell, Global Online Advertising Associates

quinta-feira, 29 de março de 2012

Google App Engine Research Awards for scientific discovery



Since its launch four years ago, Google App Engine has been the platform for innovative and diverse applications. Today, Google’s University Relations team is inviting academic researchers to explore App Engine as a platform for their research activities through a new program: the Google App Engine Research Awards.

These research awards provide an opportunity for university faculty to experiment with App Engine, which provides services for building and hosting web applications on the same systems that power Google’s products and services. App Engine offers fast development and deployment, simple administration and built-in scalability -- it’s designed to adapt to large-scale data storage needs and sudden traffic spikes.

As part of Google’s ongoing commitment to support cutting-edge scientific research across the board, this call for applications welcomes university faculty’s proposals in all fields. Projects may focus on activities such as social or economic experiments, development of academic aids, analysis of gene sequence data, or using App Engine MapReduce to crunch large datasets, just to name a few.

This new award program will support up to 15 projects by providing App Engine credits in the amount of $60,000 to each project for one year. In its first year, the program is launched in a limited number of countries. Please see the RFP for details.

If your research has the potential to advance scientific discovery, generates heavy data loads, or needs a reliable platform for running large-scale apps, we encourage you to submit your proposal. Information on how to apply is available on the Google Research website. Applications will be accepted until 11:59 p.m. PST, May 11, 2012.

terça-feira, 27 de março de 2012

Impact of Organic Ranking on Ad Click Incrementality



 In 2011, Google released a Search Ads Pause research study which showed that 89% of the clicks from search ads are incremental, i.e., 89% of the visits to the advertiser’s site from ad-clicks are not replaced by organic clicks when the search ads are paused. In a follow up to the original study, we address two main questions: (1) how often is an ad impression accompanied by an associated organic result (i.e., organic result for the same advertiser)? and (2) how does the incrementality of the ad clicks vary with the rank of advertiser’s organic results?

 A meta-analysis of 390 Search Ads Pause studies highlighted the limited opportunity for clicks from organic search results to substitute for ad clicks when search ads are turned off. We found that on average, 81% of ad impressions and 66% of ad clicks occur in the absence of an associated organic result on the first page of search results. In addition, we found that on average, 50% of the ad clicks that occur with a top rank organic result are incremental. The estimate for average incrementality of the ad clicks increases when the rank is lower; 82% of the ad clicks are incremental when the associated organic search result is between ranks 2 and 4, and 96% of the ad clicks are incremental when the advertiser’s organic result ranked lower than 4 (i.e., 5 and below).

 While these findings provide guidance on overall trends, results for individual advertisers may vary. It’s also important to note that the study focuses on clicks rather than conversions. We recommend that advertisers employ randomized experiments (e.g., geo-based experiments) to better quantify the incremental traffic and lift in conversions from the search ad campaigns and that they use the value-per-click calculations in the original search ads pause study to determine the level of investment on their search ads.

 For more information, find the full study here.


Learn with Google: Get advice and information to help grow your business

Business owners want to understand how online search and marketing works, and make sure they’re getting the most out of the efforts they put in. With that in mind, we’re excited to introduce you to Learn with Google. Whether you’re just getting online, interested in marketing, or itching to do more, Learn with Google can help. On the site you’ll find videos, how-to guides, and worksheets, all of which will help you effectively promote your business online.

Here’s an example of the types of resources you can find at Learn with Google. This video shows you how to connect with local customers online.

Did you know that one out of five Google searches is related to location? This video gives you tips to:
  • Make your website more local-friendly
  • Connect with customers on Google+
  • Bring in customers with reviews


When you’re done with the video you can download the worksheet, which will walk you through the process to start reaching more customers in your area.

Posted by Sadie Stoumen, Product Marketing Manager

sexta-feira, 23 de março de 2012

Help Desk Hangouts: Learn how your business can use Hangouts to engage and collaborate

Editor’s note: Each week on the Google+ Your Business page, we’re putting you in touch with Googlers and users who can help you as a business owner get the most out of our products and features.

In our latest Help Desk Hangout On Air, we discussed Google+ Hangouts — a topic many of you wanted to know more about (how to get started, ideas for what to cover in your Hangouts, and so on). With special guests Teresa Wu of the Google Docs team, and Hangouts power users Seth David and Tom Rolfson, we talked about how you can use Hangouts to engage customers and collaborate with your teammates. If you missed it, you can watch the full hour-long Hangout on the Google Business YouTube channel (really great tips!):


We collected questions via the Google+ Your Business page and tried to answer as many as we could in the Hangout. Here are a few of the questions we addressed:

Yifat: If a business page doesn't have many followers and they open a public Hangout, where does it appear and can anyone join (even people who haven't circled them yet)?

A notification to join the Hangout appears only to those who have added the page to their circles (they’ll see it in the Stream), but anyone with the link can join the Hangout.

Thorben: Are you still planning on making Hangouts On Air available for everybody?

We’re definitely still hoping to make this more widely accessible. Thanks for being patient!

Nadra: I'm looking forward to using Hangouts for event promotion. I'm curious about the different nuances of launching Hangouts before, during and after an event.

Hangouts are a great way to give others insight into an event, especially during the live show (be sure test the sound quality ahead of time to make sure everything’s a-OK). Before an event, you could use the Hangout to start building interest by giving sneak previews to guests, and after the event, use a Hangout to recap the highlights and showcase follow-up interviews.

Eric: How can we embed Hangout info on our website? And help people to pre-register? Can we stream the live Hangout to another web property like our web site?

Embedding a Hangout and streaming on your own web property aren’t possible at the moment, but it’s a common feature request that we get from users and one the team is aware of. As for pre-registering, you can ask the followers of your page to leave a comment if they’d like to be invited to attend, or create a Google form to collect the names of participants.

Barbara: I've been using Hangouts quite a bit but even in smaller groups we've struggled with disturbing noise interferences we couldn't really explain. How can we overcome such seemingly mundane but important difficulties?

Make sure you have a dedicated quiet room for participants in the Hangout. Use microphones and headphones to improve sound and audio quality, and ask participants that when not speaking, hit the “Mute” button at the top right of the Hangout screen.

To learn more about how to get started with Google+ Hangouts, visit our Help Center. And remember to tune in to the live stream of our next Hangout at 11 a.m. PDT Wednesday March 28, as we discuss how you can help the right customers find your business with AdWords. We’ll be collecting your AdWords questions early next week on the Google+ Your Business page.

Posted by Vanessa Schneider, Google Places community manager

quinta-feira, 22 de março de 2012

Google+ page stories: The Pablove Foundation

Editor’s Note: This is the sixth in a series of posts about small businesses on Google+ and their tips and tricks for managing a great page. Visit our YouTube channel to see all the videos in this series and join the discussion on the Google+ Your Business page

Meet The Pablove Foundation, whose mission is to fund pediatric cancer research, educate and empower cancer families, and improve the quality of life for children living with cancer. Watch as Jo Ann Thrailkill, Pablove’s Executive Director, uses Google+ to contribute to the global Pablove movement to fight childhood cancer with love.


Google+ Ripples creates an interactive graphic of the public shares of any public post or URL on Google+ to show you how it has moved through the network. Watch as Jo Ann uses Ripples to discover new supporters of The Pablove Foundation:


Ripples shows you:
  • Who has publicly shared a post or URL and the comments they’ve made
  • How a post or URL was shared over time
  • Statistics on how a post or URL was shared
Want to learn more? Check out our Help Center for specific steps on how to use Ripples, and visit the Google+ Your Business site for more Google+ tips and tricks. You can watch all our small business stories on YouTube.

What interesting people and connections have you discovered through Ripples? Join the discussion on the Google+ Your Business page and tag your posts #mybusinessstory.

Posted by Evelyn Lee, Google+ Pages Associate Product Marketing Manager

Excellent Papers for 2011



UPDATE: Added Theo Vassilakis as an author for "Dremel: Interactive Analysis of Web-Scale Datasets"

Googlers across the company actively engage with the scientific community by publishing technical papers, contributing open-source packages, working on standards, introducing new APIs and tools, giving talks and presentations, participating in ongoing technical debates, and much more. Our publications offer technical and algorithmic advances, feature aspects we learn as we develop novel products and services, and shed light on some of the technical challenges we face at Google.

In an effort to highlight some of our work, we periodically select a number of publications to be featured on this blog. We first posted a set of papers on this blog in mid-2010 and subsequently discussed them in more detail in the following blog postings. In a second round, we highlighted new noteworthy papers from the later half of 2010. This time we honor the influential papers authored or co-authored by Googlers covering all of 2011 -- covering roughly 10% of our total publications.  It’s tough choosing, so we may have left out some important papers.  So, do see the publications list to review the complete group.

In the coming weeks we will be offering a more in-depth look at these publications, but here are some summaries:

Audio processing

Cascades of two-pole–two-zero asymmetric resonators are good models of peripheral auditory function”, Richard F. Lyon, Journal of the Acoustical Society of America, vol. 130 (2011), pp. 3893-3904.
Lyon's long title summarizes a result that he has been working toward over many years of modeling sound processing in the inner ear.  This nonlinear cochlear model is shown to be "good" with respect to psychophysical data on masking, physiological data on mechanical and neural response, and computational efficiency. These properties derive from the close connection between wave propagation and filter cascades. This filter-cascade model of the ear is used as an efficient sound processor for several machine hearing projects at Google.

Electronic Commerce and Algorithms

Online Vertex-Weighted Bipartite Matching and Single-bid Budgeted Allocations”, Gagan AggarwalGagan Goel, Chinmay Karande, Aranyak Mehta, SODA 2011.
The authors introduce an elegant and powerful algorithmic technique to the area of online ad allocation and matching: a hybrid of random perturbations and greedy choice to make decisions on the fly. Their technique sheds new light on classic matching algorithms, and can be used, for example, to pick one among a set of relevant ads, without knowing in advance the demand for ad slots on future web page views.

Milgram-routing in social networks”, Silvio Lattanzi, Alessandro Panconesi, D. Sivakumar, Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 725-734.
Milgram’s "six-degrees-of-separation experiment" and the fascinating small world hypothesis that follows from it, have generated a lot of interesting research in recent years. In this landmark experiment, Milgram showed that people unknown to each other are often connected by surprisingly short chains of acquaintances. In the paper we prove theoretically and experimentally how a recent model of social networks, "Affiliation Networks", offers an explanation to this phenomena and inspires interesting technique for local routing within social networks.

Non-Price Equilibria in Markets of Discrete Goods”, Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Noam Nisan, EC, 2011.
We present a correspondence between markets of indivisible items, and a family of auction based n player games. We show that a market has a price based (Walrasian) equilibrium if and only if the corresponding game has a pure Nash equilibrium. We then turn to markets which do not have a Walrasian equilibrium (which is the interesting case), and study properties of the mixed Nash equilibria of the corresponding games.

HCI

From Basecamp to Summit: Scaling Field Research Across 9 Locations”, Jens Riegelsberger, Audrey Yang, Konstantin Samoylov, Elizabeth Nunge, Molly Stevens, Patrick Larvie, CHI 2011 Extended Abstracts.
The paper reports on our experience with a basecamp research hub to coordinate logistics and ongoing real-time analysis with research teams in the field. We also reflect on the implications for the meaning of research in a corporate context, where much of the value may be less in a final report, but more in the curated impressions and memories our colleagues take away from the the research trip.

User-Defined Motion Gestures for Mobile Interaction”, Jaime Ruiz, Yang Li, Edward Lank, CHI 2011: ACM Conference on Human Factors in Computing Systems, pp. 197-206.
Modern smartphones contain sophisticated sensors that can detect rich motion gestures — deliberate movements of the device by end-users to invoke commands. However, little is known about best-practices in motion gesture design for the mobile computing paradigm. We systematically studied the design space of motion gestures via a guessability study that elicits end-user motion gestures to invoke commands on a smartphone device. The study revealed consensus among our participants on parameters of movement and on mappings of motion gestures onto commands, by which we developed a taxonomy for motion gestures and compiled an end-user inspired motion gesture set. The work lays the foundation of motion gesture design—a new dimension for mobile interaction.

Information Retrieval

Reputation Systems for Open Collaboration”, B.T. Adler, L. de Alfaro, A. Kulshreshtha , I. Pye, Communications of the ACM, vol. 54 No. 8 (2011), pp. 81-87.
This paper describes content based reputation algorithms, that rely on automated content analysis to derive user and content reputation, and their applications for Wikipedia and google Maps. The Wikipedia reputation system WikiTrust relies on a chronological analysis of user contributions to articles, metering positive or negative increments of reputation whenever new contributions are made. The Google Maps system Crowdsensus compares the information provided by users on map business listings and computes both a likely reconstruction of the correct listing and a reputation value for each user. Algorithmic-based user incentives ensure the trustworthiness of evaluations of Wikipedia entries and Google Maps business information.

Machine Learning and Data Mining

Domain adaptation in regression”, Corinna Cortes, Mehryar Mohri, Proceedings of The 22nd International Conference on Algorithmic Learning Theory, ALT 2011.
Domain adaptation is one of the most important and challenging problems in machine learning.  This paper presents a series of theoretical guarantees for domain adaptation in regression, gives an adaptation algorithm based on that theory that can be cast as a semi-definite programming problem, derives an efficient solution for that problem by using results from smooth optimization, shows that the solution can scale to relatively large data sets, and reports extensive empirical results demonstrating the benefits of this new adaptation algorithm.

On the necessity of irrelevant variables”, David P. Helmbold, Philip M. Long, ICML, 2011
Relevant variables sometimes do much more good than irrelevant variables do harm, so that it is possible to learn a very accurate classifier using predominantly irrelevant variables.  We show that this holds given an assumption that formalizes the intuitive idea that the variables are non-redundant.  For problems like this it can be advantageous to add many additional variables, even if only a small fraction of them are relevant.

Online Learning in the Manifold of Low-Rank Matrices”, Gal Chechik, Daphna Weinshall, Uri Shalit, Neural Information Processing Systems (NIPS 23), 2011, pp. 2128-2136.
Learning measures of similarity from examples of similar and dissimilar pairs is a problem that is hard to scale. LORETA uses retractions, an operator from matrix optimization, to learn low-rank similarity matrices efficiently. This allows to learn similarities between objects like images or texts when represented using many more features than possible before.

Machine Translation

Training a Parser for Machine Translation Reordering”, Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11).
Machine translation systems often need to understand the syntactic structure of a sentence to translate it correctly. Traditionally, syntactic parsers are evaluated as standalone systems against reference data created by linguists. Instead, we show how to train a parser to optimize reordering accuracy in a machine translation system, resulting in measurable improvements in translation quality over a more traditionally trained parser.

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation”, Ashish Venugopal, Jakob Uszkoreit, David Talbot, Franz Och, Juri Ganitkevitch, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP).
We propose a general method to watermark and probabilistically identify the structured results of machine learning algorithms with an application in statistical machine translation. Our approach does not rely on controlling or even knowing the inputs to the algorithm and provides probabilistic guarantees on the ability to identify collections of results from one’s own algorithm, while being robust to limited editing operations.

Inducing Sentence Structure from Parallel Corpora for Reordering”, John DeNero, Jakob UszkoreitProceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP).
Automatically discovering the full range of linguistic rules that govern the correct use of language is an appealing goal, but extremely challenging.  Our paper describes a targeted method for discovering only those aspects of linguistic syntax necessary to explain how two different languages differ in their word ordering.  By focusing on word order, we demonstrate an effective and practical application of unsupervised grammar induction that improves a Japanese to English machine translation system.

Multimedia and Computer Vision

Kernelized Structural SVM Learning for Supervised Object Segmentation”, Luca Bertelli, Tianli Yu, Diem Vu, Burak Gokturk,Proceedings of IEEE Conference on Computer Vision and Pattern Recognition 2011.
The paper proposes a principled way for computers to learn how to segment the foreground from the background of an image given a set of training examples. The technology is build upon a specially designed nonlinear segmentation kernel under the recently proposed structured SVM learning framework.

Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths”, Matthias Grundmann, Vivek Kwatra, Irfan Essa, IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011).
Casually shot videos captured by handheld or mobile cameras suffer from significant amount of shake. Existing in-camera stabilization methods dampen high-frequency jitter but do not suppress low-frequency movements and bounces, such as those observed in videos captured by a walking person. On the other hand, most professionally shot videos usually consist of carefully designed camera configurations, using specialized equipment such as tripods or camera dollies, and employ ease-in and ease-out for transitions. Our stabilization technique automatically converts casual shaky footage into more pleasant and professional looking videos by mimicking these cinematographic principles. The original, shaky camera path is divided into a set of segments, each approximated by either constant, linear or parabolic motion, using an algorithm based on robust L1 optimization. The stabilizer has been part of the YouTube Editor (youtube.com/editor) since March 2011.

The Power of Comparative Reasoning”, Jay Yagnik, Dennis Strelow, David Ross, Ruei-Sung Lin, International Conference on Computer Vision (2011).
The paper describes a theory derived vector space transform that converts vectors into sparse binary vectors such that Euclidean space operations on the sparse binary vectors imply rank space operations in the original vector space. The transform a) does not need any data-driven supervised/unsupervised learning b) can be computed from polynomial expansions of the input space in linear time (in the degree of the polynomial) and c) can be implemented in 10-lines of code. We show competitive results on similarity search and sparse coding (for classification) tasks.

NLP

Unsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections”, Dipanjan Das, Slav Petrov, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL '11), 2011, Best Paper Award.
We would like to have natural language processing systems for all languages, but obtaining labeled data for all languages and tasks is unrealistic and expensive. We present an approach which leverages existing resources in one language (for example English) to induce part-of-speech taggers for languages without any labeled training data. We use graph-based label propagation for cross-lingual knowledge transfer and use the projected labels as features in a hidden Markov model trained with the Expectation Maximization algorithm.

Networks

TCP Fast Open”, Sivasankar Radhakrishnan, Yuchung Cheng, Jerry Chu, Arvind Jain, Barath Raghavan, Proceedings of the 7th International Conference on emerging Networking EXperiments and Technologies (CoNEXT), 2011.
TCP Fast Open enables data exchange during TCP’s initial handshake. It decreases application network latency by one full round-trip time, a significant speedup for today's short Web transfers. Our experiments on popular websites show that Fast Open reduces the whole-page load time over 10% on average, and in some cases up to 40%.

Proportional Rate Reduction for TCP”, Nandita Dukkipati, Matt Mathis, Yuchung Cheng, Monia Ghobadi, Proceedings of the 11th ACM SIGCOMM Conference on Internet Measurement 2011, Berlin, Germany - November 2-4, 2011.
Packet losses increase latency of Web transfers and negatively impact user experience. Proportional rate reduction (PRR) is designed to recover from losses quickly, smoothly and accurately by pacing out retransmissions across received ACKs during TCP’s fast recovery. Experiments on Google Web and YouTube servers in U.S. and India demonstrate that PRR reduces the TCP latency of connections experiencing losses by 3-10% depending on response size.

Security and Privacy

Automated Analysis of Security-Critical JavaScript APIs”, Ankur Taly, Úlfar Erlingsson, John C. Mitchell, Mark S. Miller, Jasvir Nagra, IEEE Symposium on Security & Privacy (SP), 2011.
As software is increasingly written in high-level, type-safe languages, attackers have fewer means to subvert system fundamentals, and attacks are more likely to exploit errors and vulnerabilities in application-level logic.  This paper describes a generic, practical defense against such attacks, which can protect critical application resources even when those resources are partially exposed to attackers via software interfaces.  In the context of carefully-crafted fragments of JavaScript, the paper applies formal methods and semantics to prove that these defenses can provide complete, non-circumventable mediation of resource access; the paper also shows how an implementation of the techniques can establish the properties of widely-used software, and find previously-unknown bugs.

App Isolation: Get the Security of Multiple Browsers with Just One”, Eric Y. Chen, Jason Bau, Charles Reis, Adam Barth, Collin Jackson, 18th ACM Conference on Computer and Communications Security, 2011.
We find that anecdotal advice to use a separate web browser for sites like your bank is indeed effective at defeating most cross-origin web attacks.  We also prove that a single web browser can provide the same key properties, for sites that fit within the compatibility constraints.

Speech

Improving the speed of neural networks on CPUs”, Vincent Vanhoucke, Andrew Senior, Mark Z. Mao, Deep Learning and Unsupervised Feature Learning Workshop, NIPS 2011.
As deep neural networks become state-of-the-art in real-time machine learning applications such as speech recognition, computational complexity is fast becoming a limiting factor in their adoption. We show how to best leverage modern CPU architectures to significantly speed-up their inference.

Bayesian Language Model Interpolation for Mobile Speech Input”, Cyril Allauzen, Michael Riley, Interspeech 2011.
Voice recognition on the Android platform must contend with many possible target domains - e.g. search, maps, SMS. For each of these, a domain-specific language model was built by linearly interpolating several n-gram LMs from a common set of Google corpora. The current work has found a way to efficiently compute a single n-gram language model with accuracy very close to the domain-specific LMs but with considerably less complexity at recognition time.

Statistics

Large-Scale Parallel Statistical Forecasting Computations in R”, Murray Stokely, Farzan Rohani, Eric Tassone, JSM Proceedings, Section on Physical and Engineering Sciences, 2011.
This paper describes the implementation of a framework for utilizing distributed computational infrastructure from within the R interactive statistical computing environment, with applications to timeseries forecasting. This system is widely used by the statistical analyst community at Google for data analysis on very large data sets.

Structured Data

Dremel: Interactive Analysis of Web-Scale Datasets”, Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis, Communications of the ACM, vol. 54 (2011), pp. 114-123.
Dremel is a scalable, interactive ad-hoc query system. By combining multi-level execution trees and columnar data layout, it is capable of running aggregation queries over trillion-row tables in seconds. Besides continued growth internally to Google, Dremel now also backs an increasing number of external customers including BigQuery and UIs such as AdExchange front-end.

Representative Skylines using Threshold-based Preference Distributions”, Atish Das Sarma, Ashwin Lall, Danupon Nanongkai, Richard J. Lipton, Jim Xu, International Conference on Data Engineering (ICDE), 2011.
The paper adopts principled approach towards representative skylines and formalizes the problem of displaying k tuples such that the probability that a random user clicks on one of them is maximized. This requires mathematically modeling (a) the likelihood with which a user is interested in a tuple, as well as (b) how one negotiates the lack of knowledge of an explicit set of users. This work presents theoretical and experimental results showing that the suggested algorithm significantly outperforms previously suggested approaches.

Hyper-local, directions-based ranking of places”, Petros Venetis, Hector Gonzalez, Alon Y. Halevy, Christian S. Jensen, PVLDB, vol. 4(5) (2011), pp. 290-30.
Click through information is one of the strongest signals we have for ranking web pages. We propose an equivalent signal for raking real world places: The number of times that people ask for precise directions to the address of the place. We show that this signal is competitive in quality with human reviews while being much cheaper to collect, we also show that the signal can be incorporated efficiently into a location search system.

Systems

Power Management of Online Data-Intensive Services”, David Meisner, Christopher M. Sadler, Luiz André Barroso, Wolf-Dietrich Weber, Thomas F. Wenisch, Proceedings of the 38th ACM International Symposium on Computer Architecture, 2011.
Compute and data intensive Web services (such as Search) are a notoriously hard target for energy savings techniques. This article characterizes the statistical hardware activity behavior of servers running Web search and discusses the potential opportunities of existing and proposed energy savings techniques.

The Impact of Memory Subsystem Resource Sharing on Datacenter Applications”, Lingjia Tang, Jason Mars, Neil Vachharajani, Robert Hundt, Mary-Lou Soffa, ISCA, 2011.
In this work, the authors expose key characteristics of an emerging class of Google-style workloads and show how to enhance system software to take advantage of these characteristics to improve efficiency in data centers. The authors find that across datacenter applications, there is both a sizable benefit and a potential degradation from improperly sharing micro-architectural resources on a single machine (such as on-chip caches and bandwidth to memory). The impact of co-locating threads from multiple applications with diverse memory behavior changes the optimal mapping of thread to cores for each application. By employing an adaptive thread-to-core mapper, the authors improved the performance of the datacenter applications by up to 22% over status quo thread-to-core mapping, achieving performance within 3% of optimal.

Language-Independent Sandboxing of Just-In-Time Compilation and Self-Modifying Code”, Jason Ansel, Petr Marchenko, Úlfar Erlingsson, Elijah Taylor, Brad Chen, Derek Schuff, David Sehr, Cliff L. Biffle, Bennet S. Yee, ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2011.
Since its introduction in the early 90's, Software Fault Isolation, or SFI, has been a static code technique, commonly perceived as incompatible with dynamic libraries, runtime code generation, and other dynamic code.  This paper describes how to address this limitation and explains how the SFI techniques in Google Native Client were extended to support modern language implementations based on just-in-time code generation and runtime instrumentation. This work is already deployed in Google Chrome, benefitting millions of users, and was developed over a summer collaboration with three Ph.D. interns; it exemplifies how Research at Google is focused on rapidly bringing significant benefits to our users through groundbreaking technology and real-world products.

Thialfi: A Client Notification Service for Internet-Scale Applications”, Atul Adya, Gregory Cooper, Daniel Myers, Michael Piatek,Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP), 2011, pp. 129-142.
This paper describes a notification service that scales to hundreds of millions of users, provides sub-second latency in the common case, and guarantees delivery even in the presence of a wide variety of failures.  The service has been deployed in several popular Google applications including Chrome, Google Plus, and Contacts.












quarta-feira, 21 de março de 2012

Google at INFOCOM 2012



The computer networking community will get together in Orlando, Florida the week of March 25th for INFOCOM 2012, the Annual IEEE International Conference on Computer Communications.

At the conference, we will discuss topics such as traffic engineering, traffic anomaly detection, and random walk algorithms for topology-aware networks. We serve so much internet traffic to Google users and exchange so much data between our data centers that computer networking is naturally something we care about. As traffic grows with richer content (photos, video, ...), new modes of engagement (cloud computing, social networking, ...) and an increasing number of users, engineering and research efforts are necessary to help networks scale.

The following papers were co-authored by Googlers from offices around the world:

  • Near-optimal random walk sampling in distributed networks by Atish Das Sarma, Anisur Molla, and Gopal Pandurangan
  • How to split a flow by Tzvika Hartman, Avinatan Hassidim, Haim Kaplan, Danny Raz, and Michal Segalov
  • Upward max-min fairness by Emilie Danna, Avinatan Hassidim, Haim Kaplan, Alok Kumar, Yishay Mansour, Danny Raz, and Michal Segalov (runner up for best paper)
  • A practical algorithm for balancing the max-min fairness and throughput objectives in traffic engineering by Emilie Danna, Subhasree Mandal, and Arjun Singh
  • Traffic anomaly detection based on the IP size distribution by Fabio Soldo and Ahmed Metwally

If you are attending, stop by and say hi!

segunda-feira, 19 de março de 2012

Gamification for Improved Search Ranking for YouTube Topics



In earlier posts we discussed automatic ways to find the most talented emerging singers and the funniest videos using the YouTube Slam experiment. We created five “house” slams -- music, dance, comedy, bizarre, and cute -- which produce a weekly leaderboard not just of videos but also of YouTubers who are great at predicting what the masses will like. For example, last week’s cute slam winning video claims to be the cutest kitten in the world, beating out four other kittens, two puppies, three toddlers and an amazing duck who feeds the fish. With a whopping 620 slam points, YouTube user emoatali99 was our best connoisseur of cute this week. On the music side, it is no surprise that many of music slam’s top 10 videos were Adele covers. A Whitney Houston cover came out at the top this week, and music slam’s resident expert on talent had more than a thousand slam points. Well done! Check out the rest of the leaderboards for cute slam and music slam.

Can slam-style game mechanics incentivize our users to help improve the ranking of videos -- not just for these five house slams -- but for millions of other search queries and topics on YouTube? Gamification has previously been used to incentivize users to participate in non-game tasks such as image labeling and music tagging. How many votes and voters would we need for slam to do better than the existing ranking algorithm for topic search on YouTube?

As an experiment, we created new slams for a small number of YouTube topics (such as Latte Art Slam and Speed Painting Slam) using existing top 20 videos for these topics as the candidate pool. As we accumulated user votes, we evaluated the resulting YouTube Slam leaderboard for that topic vs the existing ranking on youtube.com/topics (baseline). Note that both the slam leaderboard and the baseline had the same set of videos, just in a different order.

What did we discover? It was no surprise that slam ranking performance had a high variance in the beginning and gradually improved as votes accumulated. We are happy to report that four of five topic slams converged within 1000 votes with a better leaderboard ranking than the existing YouTube topic search. In spite of small number of voters, Slam achieves better ranking partly because of gamification incentives and partly because it is based on machine learning, using:

  1. Preference judgement over a pair, not absolute judgement on a single video, and,

  2. Active solicitation of user opinion as opposed to passive observation. Due to what is called a “cold start” problem in data modeling, conventional (passive observation) techniques don’t work well on new items with little prior information. For any given topic, Slam’s improvement over the baseline in ranking of the “recent 20” set of videos was in fact better than the improvement in ranking of the “top 20” set.

Demographics and interests of the voters do affect slam leaderboard ranking, especially when the voter pool is small. An example is a Romantic Proposals Slam we featured on Valentine’s day last month. Men thought this proposal during a Kansas City Royals game was the most romantic, although this one where the man pretends to fall off a building came close. On the other hand, women rated this meme proposal in a restaurant as the best, followed by this movie theater proposal.

Encouraged by these results, we will soon be exploring slams for a few thousand topics to evaluate the utility of gamification techniques to YouTube topic search. Here are some of them: Chocolate BrowniePaper PlaneBush FlyingStealth TechnologyStencil GraffitiYosemite National Park, and Stealth Technology.

Have fun slamming!

Learn how Google can help your business with our new Help Desk Hangouts on Air series

You’re looking to grow your business, and we offer a ton of tools to help you do just that. But sometimes, you need a little help learning all the options and getting started. That’s why this week on the Google+ Your Business page, we’ve launched a new series of Help Desk Hangouts On Air to put you in touch with teams who can help you get the most out of our products and features.

What’s a Hangout On Air? Well, Hangouts are a video group chat with a limit of 10 participants. Hangouts On Air are special Hangouts that allow you to broadcast that 10-person Hangout to many people and record it for future viewing.

To kick things off, we asked Justin Cutroni of the Google Analytics team (and author of the blog Analytics Talk) to show us how business owners can use Analytics to track their advertising campaigns, website performance and see how users are getting to their site (e-mail, social media, referrals). If you missed it, you can watch the full hour-long Hangout on the Google Business YouTube channel:


Here’s how the video breaks down:
  • Intros - Hi, Mom! (2 mins)
  • The basics (25 mins)
    • What is Analytics?
    • Why, as a business owner, should I use Analytics?
    • How do I get started?
    • How do I read the reporting information? (Here Justin walks us through the features of an active account.)
  • Q&A (25 mins)
  • Wrapup (2 mins)
Earlier in the week, we asked you to share with us your Analytics questions — and you had plenty! Here are just some of the questions Justin addressed in the Hangout:

Kenneth: Is there a threat in respect to data privacy?
We take privacy very seriously at Google. The only person that has access to your Google Analytics data is you. You can also grant other people access to your Analytics data, but that’s up to you.

Martynas: Is there a plan to update the administration part of GA? We need more levels: creator, administrator, manager, reader.
Excellent feature request, and it relates to the question above. We get this question often and know that the current model is limiting. We are working hard to figure out the best user model for Analytics.

Lea: When oh when will export to PDF be available in the new version of Analytics?
We hear you! It’s coming back very soon. We’re sorry it’s taken so long to add this feature to the new version of Google Analytics.

Jeremy: What are the plans for integrating Google Analytics with Site Optimizer?
Another great question. For those of you that don’t know, Website Optimizer is a website testing tool. You can use it to test different variations of your website, like landing pages or the checkout process. We’ve heard our users loud and clear that Website Optimizer would be a lot more useful as part of Google Analytics. Stay tuned ...

Connie: Is there a good WordPress plug-in for adding GA code to a blog?
Plug-ins! Justin’s favorite, we learned yesterday in the Hangout. There are some great ones out there, especially for WordPress. Check out Google Analytics for WordPress.

Raphael: Can you tell us more about the benefits of using Analytics for tracking mobile apps?
You can absolutely track apps with Google Analytics. We have two SDKs, one for Android and one for iOS, that make it easy to  track how people use an app. If you’re going to use GA to track apps you should also understand Event Tracking and Custom Variables. These two features are very useful when tracking apps.


Justin shows us a feature that tells you how often you show up in Google’s organic search results and the number of click-throughs that you get.

To learn more about how to get started with Google Analytics, visit our Help Center. And remember to tune in to the live stream of our next Hangout at 11 a.m. PDT Wednesday, as we discuss how to use Hangouts (something a bunch of you guys have asked us to talk about!). We’ll be collecting your Hangout questions today on the Google+ Your Business page.

Posted by Vanessa Schneider, Google Places community manager

terça-feira, 13 de março de 2012

Google+ page stories: Birds Barbershop

Editor’s Note: This is the fifth in a series of posts about small businesses on Google+ and their tips and tricks for managing a great page. Visit our YouTube channel to see all the videos in this series and join the discussion on the Google+ Your Business page

Meet Birds Barbershop, a collection of five hair salons in Austin, Texas. Jayson Rapaport and Michael Portman dreamed of reinterpreting the old school barbershop for a new generation through affordable, quality cuts, free beer — and now Google+. Watch this pair of big thinking entrepreneurs as they use Google+ to speak directly with their regulars and create a virtual lookbook using Google+ photos.


Google+ Pages is a great tool for coordinating internal communications. In the video below, see how Michael and Jayson use their page to expand to a new location.


On Google+, users get unlimited photo and video uploads — a great opportunity to show off your business’s space, new merchandise, or a recent event you hosted. Here are some quick tips on how to create albums of beautiful photos to show off your business:
  • Edit your photos with basic features like rotating images or using auto-fix. You can use the Creative Kit for more advanced edits like adding effects and text to your pictures.
  • Manage your images quickly by using the photos icon at the top of your Google+ dashboard to change the share settings for all of your albums in one place.
  • Change the visibility of your albums so you can pick who sees what. For example, you might want to share photo of your products with your customers, and pictures from company events with just your employees.
Want to learn more? Visit the Google+ Your Business site, and stay tuned for more Google+ stories and tips from small businesses. Also check out our Help Center content for specific steps on how to use photos in your posts. You can also watch all our Google+ page stories on YouTube.

How do you use photos and videos on Google+ to better connect with your customers? Join the discussion on the Google+ Your Business page and tag your posts #mybusinessstory.

Posted by Evelyn Lee, Google+ Pages Associate Product Marketing Manager

segunda-feira, 12 de março de 2012

Search Ads Pause Studies Update



In July 2011, Google released a study called "Incremental Clicks Impact of Search Advertising" that showed the amount of search ad traffic that is incremental to traffic from an advertiser’s organic search results. In that study, we asked these questions: What happens when search ads are paused? How much does organic traffic make up for the loss in traffic from search ads?

We found that an average 89% of paid clicks are essentially lost and not recovered by an increase in organic clicks when a search campaign is paused. This number - what we call the Incremental Ad Clicks (IAC) - was consistent across all verticals.

In that initial study, we only examined cases where ads were completely paused. In this update, we looked at three additional change scenarios and included new cases up to August 2011, giving a total of more than 5,300 cases.


For the paused cases, the average IAC of 85% was a little lower than the previous value of 89%. We see there was some volatility in this estimate, month-to-month, driven purely by the mix of advertisers who choose to pause their ads in that month.

In the cases where spend was decreased (as opposed to paused), we found that the ads associated with the spend decrease drive on average 80% incremental traffic. This means that 80% of the traffic from those ads would not be made up for by organic traffic. This value is lower than the 85% value in the paused cases, possibly due to advertisers selectively turning down parts of their search advertising which they find less effective.

In cases where an advertiser was already spending on search ads and subsequently increased their ad spend, we also found that the associated ads drive, on average, 78% incremental traffic. In the last scenario, where advertisers were previously not advertising with search ads, and then turned on search ads, the incremental traffic was 79%.

Across the board, our findings are consistent: ads drive a very high proportion of incremental traffic - traffic that is not replaced by navigation from organic listings when the ads are turned off or turned down.

Click here for an infographic.

quinta-feira, 8 de março de 2012

Go inside with indoor maps on Google Maps for Android

Google Maps helps you orient yourself in the world around you, and as of a few months ago, began to help you do this indoors as well. Indoor maps, a Google Maps feature now available for Android mobile users, shows detailed building floor plans where available. This helps your customers using Android phones figure out where they are and what’s around them in your shop, and enables new customers to check out the layout of your location before they even visit.

Indoor maps were initially released with a limited set of partners (mainly large retailers, airports, and transit stations), and now we’re looking to bring these maps to more places where users might benefit from being able to quickly see floor plans labeled with ATMs, restrooms, departments, and more. You can upload your venue’s floor plan to the Google Maps floor plans tool (make sure you have the necessary permissions and follow our content guidelines). If accepted, we’ll format it to appear on Google Maps for Android. Your floor plan can be a blueprint, a digital image from your website or a brochure. If you only have a physical copy of the floor plan, you can scan or take a picture of it and use that image instead. Easy!




Sofia Italian Steakhouse, West Roxbury, MA

If your store is located within a larger indoor space, you’re still able to participate. Talk to your property manager or building owner about uploading a floor plan, since improved and more detailed information can help all the businesses in your establishment.

This feature is currently available in the U.S. and Japan — we’ll keep you posted as we expand. For additional questions or information, please email floorplans@google.com

Posted by Mac Smith, Senior User Researcher

quarta-feira, 7 de março de 2012

Google+ page stories: Loluma

Editor’s Note: This is the fourth in a series of posts about small businesses on Google+ and their tips and tricks for managing a great page. Visit our YouTube channel to see all the videos in this series and join the discussion on the Google+ Your Business page

Meet Loluma, a team of artists that designs one-of-a-kind events. Sha Sha Harnik leads the team, as they provide design direction and coordination for wedding and event clients. In the video below, see how she uses Google+ not only to create moments, but also to capture and share them with her wedding clients.


Google+ Hangouts allow you to collaborate through group video chat. Watch Sha Sha hangout with a family of stealthy birthday planners to help them create the ultimate surprise party for their grandma’s 80th birthday.


Try Hangouts with Extras for even more collaborative features. With Extras, you can:

  • Share your screen. With screen sharing, you can let other Hangout attendees see what’s on your screen. Want to share a website with your colleagues, or a design you’re working on with clients? Just select the “Share screen” button at the top.
  • Collaborate on documents in real time. You can share notes and even work on documents at the same time. Perfect for brainstorming sessions!

To get started, select the “Start a hangout” button on the navigation bar along the right. In the yellow bar at the bottom of the pop-up window, select “Hangouts with extras,” and get started!

Want to learn more? Visit the Google+ Your Business site, and stay tuned for more Google+ stories and tips from small businesses.

How do you use Hangouts to collaborate with your co-workers and customers? Join the discussion on the Google+ Your Business page and tag your posts #mybusinessstory.

Posted by Evelyn Lee, Google+ Pages Associate Product Marketing Manager

segunda-feira, 5 de março de 2012

Introducing the Learn with Google Webinar Program

(Cross-posted on the Inside AdWords blog.) 

At Google, one of our goals is to help make the web work for your business. Today weíre introducing the Learn with Google webinar program that does just that, by sharing best practices and tips across a variety of products, including search ads, mobile ads, display ads, YouTube and Google Analytics.

Weíre kicking off the program with eight live webinars in March:

  • March 13 at 10am PDT: 5 Tips to Start Marketing your Business with Video
  • March 14 at 10am PDT: Introduction to the Google Display Network
  • March 15 at 10am PDT: GoMo: Mobilize your Site and Maximize your Advertising
  • March 20 at 10am PDT: Understanding Mobile Ads Across Marketing Objectives
  • March 21 at 10am PDT: Reaching Your Goals with Google Analytics
  • March 22 at 10am PDT: GoMo for Publishers
  • March 27 at 10am PDT: Manage Large AdWords Campaigns with Less Effort
  • March 28 at 10am PDT: 3 Tips to Get More out of your Video Advertising Campaigns
Check out our new webinar page to register for any of the sessions or to access on-demand webinars. Weíll be adding new webinars as theyíre scheduled, so check back regularly for updates. You can also stay up-to-date on the schedule by downloading our Learn with Google Webinar calendar to automatically see upcoming webinars in your Google Calendar.

Whether your goal is to engage the right customers at the right time, make better decisions faster, or get the most from your marketing dollars, we hope that youíll use these tips and how-toís to maximize the impact of digital and grow your business. Weíre looking forward to having you join us!


Posted by Erica Tsai, Product Marketing Manager

Keeping an “OER mind” about shared resources for education



With ever-increasing demands being placed on our education system, including new skill sets that need to be taught to create a pipeline that can fill 21st century jobs, we must figure out how to make high-quality education more accessible to more people without overburdening our existing educational institutions. The Internet, and the platforms, tools and programs it enables, will surely be a part of the answer to this challenge.

Open Educational Resources (OER) are one piece of the solution. OER are teaching and learning resources that anyone can share, reuse and remix. As part of our ongoing commitment to increasing access to a cost-effective, high-quality education, we’re supporting the OpenCourseWare Consortium — a collaboration of higher education institutions and associated organizations from around the world creating OER — in organizing Open Education Week 2012, which begins today.

An example of OER in action is OpenStax, a recent non-profit initiative of Rice University and Connexions to offer students free, professional quality textbooks that meet scope and sequence requirements for several courses. They believe that these books could save students over $90 million in the next five years. Non-profit isn’t the only model for open education. Flat World Knowledge has built a business around OER by providing free online access to open textbooks, then selling print-on-demand copies and supplemental materials.

We’ll be acknowledging OER week through a panel event in Washington, DC, and over on our +Google in Education page, where we’ll be posting articles, sharing stories and interviews about the benefits of open education resources. Opening these resources to everyone can improve the quality of education while getting more out of our investments in educational resources. We hope you’ll join us in celebrating Open Education Week. Go to openeducationweek.org to learn more and get involved.