<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Gnip Blog</title>
	
	<link>http://blog.gnip.com</link>
	<description>Social media data tracking, updates from Twitter, Facebook, and other publishers, Gnip product updates, and more.</description>
	<lastBuildDate>Wed, 23 May 2012 17:48:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/gnip/JGtz" /><feedburner:info uri="gnip/jgtz" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Filtering for Tweets by User Bio</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/ipj_KrvWoC0/</link>
		<comments>http://blog.gnip.com/filter-twitter-bios/#comments</comments>
		<pubDate>Wed, 23 May 2012 16:11:56 +0000</pubDate>
		<dc:creator>Adam Tornes, Product</dc:creator>
				<category><![CDATA[Product]]></category>
		<category><![CDATA[Filter Twitter]]></category>
		<category><![CDATA[Filter Twitter Bios]]></category>
		<category><![CDATA[Filtering]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2770</guid>
		<description><![CDATA[One of the requests we often hear from customers is that they’d like to be able to filter for Tweets from users who match a specific demographic.  I’m excited to announce the addition of a new operator to our PowerTrack suite that enables you to do exactly that. The bio_contains operator enables you to filter [...]]]></description>
			<content:encoded><![CDATA[<p>One of the requests we often hear from customers is that they’d like to be able to filter for Tweets from users who match a specific demographic.  I’m excited to announce the addition of a new operator to our PowerTrack suite that enables you to do exactly that.</p>
<p>The bio_contains operator enables you to filter for Tweets from users whose freeform Twitter bio contains a specific keyword, phrase or string.  The operator does a substring match against the user bio, much like our url_contains operator matches against the contents of the URL string.  To use the bio_contains operator, simply add a bio_contains:keyword clause to any rule.</p>
<p><strong>Use Cases</strong><br />
One great use for this operator is to filter for Tweets based on target demographic.  For example, say you’re analyzing social media for Tide laundry detergent and want to see what moms are saying about the brand following a major marketing campaign.  Using the bio_contains operator, you could create a rule to receive Tweets from Twitter users who explicitly state in their bio that they are a mom and mentioned Tide in their Tweet.</p>
<p>Example:<br />
<strong>User’s Bio:</strong> “Loving Mom, Wife and Daughter”<br />
<strong>Tweet:</strong> “I love the new Tide!”<br />
<strong>Rule:</strong> Tide bio_contains:mom</p>
<p>Another use would be to see all Tweets from a competitor’s employees in hopes of gaining some competitive intelligence.  In this use case, I might want to receive ALL tweets from users whose bio mentions ABC Corp.</p>
<p>Example:<br />
<strong>User’s Bio:</strong> “Product Manager at ABC Corp”<br />
<strong>Rule:</strong> bio_contains:”ABC Corp”</p>
<p>These are only a few of the possible use cases and we’re sure our customers have many others that would put these to shame.  We’d love to hear about them!</p>
<p><strong>Important Details</strong><br />
The operator does have some intricacies that it is important to be aware of.</p>
<ul>
<li>Unless the bio_contains operator is combined with additional clauses and operators in a rule, the bio_contains operator will match EVERY tweet from a user whose bio contains the keyword or phrase.  Depending on the keyword or phrase, this could result in receiving A LOT of Tweets.</li>
<li>All keywords or phrases containing spaces or punctuation should be surrounded by quotes.</li>
<li>The operator performs a substring match against a user’s bio and ignores word boundaries.  As a result, if your keyword or phrase is part of another word or phrase, it will be considered a match.  For example, a keyword of “pants” would match a bio containing a term like “#TeamSpongeBobSquarePants”.  Should this be an issue, we would recommend one of two solutions:</li>
</ul>
<ol>
<li>Add a negation to exclude the matches you don’t want<br />
i.e. bio_contains:pants -bio_contains:”#TeamSpongeBobSquarePants”</li>
<li>Quote common word boundaries in conjunction with the OR operator<br />
i.e. bio_contains:” pants ”  OR bio_contains:”pants/” OR bio_contains:” pants.”</li>
</ol>
<p>As with most of our work, this new operator started with customer requests.  Thanks for the product feedback and keep it coming.  Additional documentation of this new operator and others can be found in our <a href="http://docs.gnip.com/">online documentation</a>.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=ipj_KrvWoC0:m_NzEtAJSOg:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/ipj_KrvWoC0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/filter-twitter-bios/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/filter-twitter-bios/</feedburner:origLink></item>
		<item>
		<title>Syncing Up: Product &amp; Engineering</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/R_t5Mrm2yy0/</link>
		<comments>http://blog.gnip.com/syncing-product-engineering/#comments</comments>
		<pubDate>Tue, 22 May 2012 21:55:05 +0000</pubDate>
		<dc:creator>Jud Valeski, Co-Founder and CEO</dc:creator>
				<category><![CDATA[Corporate]]></category>
		<category><![CDATA[Gnip Jobs]]></category>
		<category><![CDATA[Jobs]]></category>
		<category><![CDATA[Product Development]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2767</guid>
		<description><![CDATA[Gnip has a very strong foundation under our belts now and we’re pushing our product roadmap harder than we ever have before. I’m always in awe when I consider the business we’ve built to date with the resources we have. That said, we’re growing at a rate that demands more from the company; much more. [...]]]></description>
			<content:encoded><![CDATA[<p>Gnip has a very strong foundation under our belts now and we’re pushing our product roadmap harder than we ever have before. I’m always in awe when I consider the business we’ve built to date with the resources we have. That said, we’re growing at a rate that demands more from the company; much more. As I talked about previously in this “<a href="http://blog.gnip.com/big-software-big-jobs-big-impact/">Big Software, Big Jobs, Big Impact</a>” post, we have Big needs for great people to join our team.</p>
<p>We’ve been hiring in many areas for a while now, and we now need more <a href="http://gnip.com/careers">Product Management capacity</a>.</p>
<p>We challenge ourselves every day in order to strike the right balance between Engineering and Product. Too much on the Product requirements front and the Product team gets frustrated because what they design and build requirements for isn’t getting built. Too much on the Engineering side and you’re over paying for construction resources because a coherent Product story isn’t being defined and prioritized.</p>
<p>The Engineering team is steadily growing at Gnip, and we need more leadership on the Product Manager front as a result. We need to maintain that balance as we scale and feed the beast.</p>
<p>The tricky thing with scaling Product teams is getting the leadership balance right. Too many cooks in the kitchen isn’t healthy, but as the business and product scales, functional areas indeed require strong minds, strong direction, and a lot of prioritization skill.</p>
<p>In order to grow this thing further, we need even more strength in the Product arena. We need people who can help us crystallize the vision we have for this commercial data ecosystem. We need people who can help us build teams. We need people that can help us execute on the products we want our customers to have.</p>
<p>If you believe you have the skills to help us here, please check out our <a href="http://gnip.com/careers">job openings</a> and get in touch. We’re at a fascinating and unique time in our company as we build products that enable the next generation of what is possible with social data. If your imagination sparks when you think about what’s the things that could be built with access to the world’s public consciousness as represented by social conversation data, there’s no time like right now to get involved with what we’re doing at Gnip.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=R_t5Mrm2yy0:6CJrA5-pA4U:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/R_t5Mrm2yy0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/syncing-product-engineering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/syncing-product-engineering/</feedburner:origLink></item>
		<item>
		<title>Taming The Social Media Firehose, Part I</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/mlzOjILe0js/</link>
		<comments>http://blog.gnip.com/streaming-api/#comments</comments>
		<pubDate>Thu, 17 May 2012 17:08:13 +0000</pubDate>
		<dc:creator>Scott Hendrickson, Data Science</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Social Media Firehose]]></category>
		<category><![CDATA[Streaming API]]></category>
		<category><![CDATA[twitter firehose]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2756</guid>
		<description><![CDATA[This is the first post in our series on what a social media “firehose” (e.g. streaming api) is and what it takes to turn it into useful information for your organization.  Here I outline some of the high-level challenges and considerations when consuming the social media firehose; in Parts II and III, I will give [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is the first post in our series on what a social media “firehose” (e.g. streaming api) is and what it takes to turn it into useful information for your organization.  Here I outline some of the high-level challenges and considerations when consuming the social media firehose; in Parts II and III, I will give more practical examples.</em></p>
<p style="text-align: center;"><strong><img class="size-medium wp-image-2759 aligncenter" title="Taming the Firehose" src="http://blog.gnip.com/wp-content/uploads/2012/05/Screen-Shot-2012-05-17-at-11.16.25-AM-300x254.png" alt="Social Media Firehose" width="300" height="254" /></strong></p>
<p style="text-align: left;"><strong>Why consume the social media firehose?</strong></p>
<p>The idea of consuming large amounts of social data is to get small data&#8211;to gain insights and answer questions, to guide strategy and help with decision making. To accomplish these objectives, you are not only going to collect data from the firehose, but you are going to have to parse it, scrub and structure it based on the analysis you will pursue. (If you’re not familiar with the term “parse,” it means machines are working to understand the structure and contents of the social media activity data.) This might mean analyzing text for sentiment, looking at the time-series of the volume of mentions of your brand on Tumblr, following the trail of political reactions on the social network of commenters or any of thousands of other possibilities.</p>
<p><strong>What do we mean by a social media firehose?</strong></p>
<p>Gnip offers social media data from Twitter, Tumblr, Disqus and Automattic (WordPress blogs) in the form of “firehoses.”  In each case, the <em>firehose is a continuous stream of flexibly structured social media activities arriving in near-real time</em>. Consuming that sounds like it might be a little tricky. While the technology required to consume and analyze social media firehoses is not new, the synthesis of tools and ideas needed to successfully consume the firehose deserves some consideration.</p>
<p>It may help to start by contrasting firehoses with a more common way of looking at the API world&#8211;the plain vanilla HTTP request and response. The explosion of <a href="http://en.wikipedia.org/wiki/SOAP">SOAPy</a> (Simple Object Access Protocol) and <a href="http://en.wikipedia.org/wiki/Representational_state_transfer">RESTful APIs</a> has enabled the integration and functional ecosystem of nearly every application on the Web. At the core of web services is a pair of simple ideas: that we can leverage the simple infrastructure of HTTP requests (the biggest advantage may be that we can build on existing web server, load balancers, etc.), and that scaleable applications can be build on simple stateless request/response pairs exchanging bite-sized chunks of data in standard formats.</p>
<p>Firehoses are a little different in that, while we may choose to use HTTP for many of the reasons REST and SOAP did, we don’t plan to get responses in mere bite-sized chunks.  With a firehose, we intend to open a connect to the server once and stream data indefinitely.</p>
<p>Once you are consuming the firehose, and&#8211;even more importantly&#8211;with some analysis in mind, you will choose a structure that adequately supports approach. With any luck (more likely smart people and hard work), you will end up not with Big Data, but rather with simple insights&#8211;simple to understand and clearly prescriptive for improving products, building stronger customer relationships, preventing the spread of disease, or any other outcome you can imagine.</p>
<p><strong>The Elements Of a Firehose</strong></p>
<p>Now that we have a why, let’s zero in on consuming the firehose. Returning to the definition above, here is what we need to address:</p>
<p><strong>Continuous.</strong> For example, the Twitter full firehose delivers over 300M activities per day. That is an average of 3,500 activities/second or 1 activity every 290 microseconds. The WordPress firehose delivers nearly 400K activities day. While this is a much more leisurely 4.6 activities/second there still isn’t much time to sleep between the 1 activity every 0.22 s.  And if your system isn’t continuously pulling data out of the firehose, much can be lost in a short time.</p>
<p><strong>Streams.</strong> As mentioned above, the intention is to make a firehose connection and consume the stream of social media activities indefinitely. Gnip delivers the social media stream over HTTP. The consumer of data needs to build their HTTP client so that it can decompress and process the buffer without waiting for the end of the response. This isn’t your traditional request-response paradigm (that’s <a href="http://blog.gnip.com/gnip-the-story-behind-the-name/">why we’re not called Ping</a>&#8211;and also, that name was taken).</p>
<p><strong>Unstructured data</strong>. I prefer “flexibly structured” because there is plenty of structure in the <a href="http://en.wikipedia.org/wiki/JSON">JSON</a> or <a href="http://en.wikipedia.org/wiki/XML">XML</a> formatted activities contained in the firehose. While you can simply and quickly get to the data and metadata for the activity, you will need to parse and filter the activity. You will need to make choices about how to store activity data in the structure that best supports your modeling and analysis. It is not so much what tool is good or popular, but rather what question you want to answer with the data.</p>
<p>Time-ordered activities done by people. The primary structure of the firehose data is that it represents the individual activities of people rather than summaries or aggregations. The stream of data in the firehose describes activities such as:</p>
<ul>
<li>Tweets, micro-blogs</li>
<li>Blog/rich-media posts</li>
<li>Comments/threaded discussions</li>
<li>Rich media-sharing (urls, reposts)</li>
<li>Location data (place, long/lat)</li>
<li>Friend/follower relationships</li>
<li>Engagement (e.g. Likes, up- and down-votes, reputation)</li>
<li>Tagging</li>
</ul>
<p>Real-time. Activities can be delivered soon after they are created by the user (this is referred to as low latency). (<a href="http://www.siliconprairienews.com/2012/04/paul-kedrosky-touts-merits-of-waste-in-latest-google-fiber-talk-video">Paul Kedrosky points out</a> that a 70s station wagon full of DVDs has about the same bandwidth as the internet, but an inconvenient coast-to-coast latency of about 4 days.) Both bandwidth and latency are measures of speed. Many people know how to worry about bandwidth but latency issues can really mess up real-time communications even if you have plenty of bandwidth. When consuming the Twitter firehose, it is common to realize latency (measured as the time from Tweet creation to the parsing the tweet coming from the firehose) of ~1.6 s  and as low as 300 milliseconds. WordPress posts and comments arrive 2.5 seconds after they are created on average.</p>
<p>So there are a lot of activities and they are coming fast. And they never stop, so you never want to close your connection or stop processing activities.</p>
<p>However, in real life “indefinitely” is more of an ideal than a regular achievement. The stream of data may be interrupted by any number of variations in the network and server capabilities along the line between Justin Bieber tweeting and my analyzing what brand of hair gel teenaged girls are going to be talking their boyfriends into using next week.<br />
We need to work around practicalities such as high network latency, limited bandwidth, running out of disk space, service provider outages, etc. In the real world, we need connection monitoring, dynamic shaping of the firehose, redundant connections and historical replay to get at missed data.</p>
<p><em>In Part II we make this all more concrete. We will collect data from the firehose and analyze it. Along the way, we will address particular challenges of consuming the firehose and discuss some strategies for dealing with them.</em></p>
<p>&nbsp;</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=mlzOjILe0js:v0FOPw8bU4s:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/mlzOjILe0js" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/streaming-api/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/streaming-api/</feedburner:origLink></item>
		<item>
		<title>New Big Boulder Speakers Announced</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/X4Wk_J36oqE/</link>
		<comments>http://blog.gnip.com/big-boulder-speakers/#comments</comments>
		<pubDate>Wed, 16 May 2012 16:25:17 +0000</pubDate>
		<dc:creator>Elaine Ellis, Marketing</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Big Boulder]]></category>
		<category><![CDATA[get satisfaction]]></category>
		<category><![CDATA[Social Data Conference]]></category>
		<category><![CDATA[tumblr]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2751</guid>
		<description><![CDATA[Big Boulder is just over a month away, and we’re excited to announce seven incredible new speakers to the Big Boulder agenda. When we started planning the first social data conference, we wanted to put together a world class speaker list. We’ve been thrilled by the response and are excited to add speakers from companies [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://bigboulderconf.com/">Big Boulder</a> is just over a month away, and we’re excited to announce seven incredible new speakers to the <a href="http://bigboulderconf.com/agenda/">Big Boulder agenda</a>. When we started planning the first <a href="http://bigboulderconf.com/">social data conference</a>, we wanted to put together a world class speaker list. We’ve been thrilled by the response and are excited to add speakers from companies such as Tumblr and Get Satisfaction. We’re also working on some really interesting panels so keep your eye out for more to come!</p>
<p>Below is a list of our latest additions, and you can also see the <a href="http://bigboulderconf.com/speakers/">complete list of speakers</a>.</p>
<ul>
<li>Tumblr: <a href="http://derekg.org/">Derek Gottfrid</a>, VP of Product, and <a href="http://www.daniellestrle.com/#fe2/tumblr">Danielle Strle</a>, Community Team</li>
<li>Get Satisfaction: <a href="https://twitter.com/#!/wendyslea">Wendy Lea</a>, CEO</li>
<li>Altimeter: <a href="https://twitter.com/#!/setlinger">Susan Etlinger</a>, Industry Analyst</li>
<li>GetGlue: <a href="http://getglue.com/jmburros">Jesse Burros</a>, Business Development</li>
<li>Vanilla: <a href="https://twitter.com/#!/navvywavvy">Mark Sullivan</a>, CEO</li>
<li>The Net Savvy Executive: <a href="http://net-savvy.com/executive/">Nathan Gilliat,</a> Analyst</li>
</ul>
<p>If you’d like to attend, but aren’t a Gnip customer, we’re <a href="http://bigboulderconf.com/volunteer_application/">looking for volunteers</a> to help with photography and live blogging.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=X4Wk_J36oqE:uk_wbtZHrpg:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/X4Wk_J36oqE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/big-boulder-speakers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/big-boulder-speakers/</feedburner:origLink></item>
		<item>
		<title>Social Data: A Beacon in Natural Disasters</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/6OGWeRoadQ0/</link>
		<comments>http://blog.gnip.com/social-data-natural-disasters/#comments</comments>
		<pubDate>Thu, 03 May 2012 16:55:20 +0000</pubDate>
		<dc:creator>Elaine Ellis, Marketing</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Natural Disasters]]></category>
		<category><![CDATA[social data]]></category>
		<category><![CDATA[visionlink]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2741</guid>
		<description><![CDATA[During natural disasters and health epidemics, social data provides a valuable window into what is happening on the ground. Social data can help track diseases and serve as a powerful method of communication letting people know what is happening. Healthcare is just beginning to understand the value of big data. Google first gained attention for [...]]]></description>
			<content:encoded><![CDATA[<p>During natural disasters and health epidemics, social data provides a valuable window into what is happening on the ground. Social data can help track diseases and serve as a powerful method of communication letting people know what is happening. Healthcare is just beginning to understand the <a href="http://www.theatlantic.com/health/archive/2012/05/using-data-mining-to-predict-epidemics-before-they-spread/256605/">value of big data</a>.</p>
<p style="text-align: left;">Google first gained attention for helping to predict influenza outbreaks based on search volume but scientists from <a href="http://sociable.co/science/can-twitter-and-facebook-be-used-to-predict-flu-and-other-outbreaks/">Bristol University took it a step further with Twitter</a> &#8212; they analyzed 50 million geo-tagged tweets related to flu and their results nearly perfectly correlated with national health statistics from the CDC.</p>
<p>Social data went a step further when scientists <a href="http://chronicle.com/blogs/percolator/twitter-tracks-cholera-outbreaks-faster-than-health-authorities/28205">used Twitter to predict cholera outbreaks in Haiti</a>. In 2010, a 7.0 earthquake ravaged the small island of Haiti, a third-world country with meager infrastructure. The island began preparing for disease outbreaks, specifically cholera. During the height of the cholera outbreak, the island was losing people at the rate of 50 people a day and a total of more than 3,000 people were killed that year. Scientists began tracking cholera outbreaks via Twitter and found that social data could beat the official reports by up to two weeks. As one of the scientists noted, “Not all cholera patients go to hospitals to be counted officially.”</p>
<p>This isn’t the first time that social data has played an important role after an earthquake.  During the aftermath of Japan’s earthquake in 2011, social media played an important role in helping to<a href="http://blogs.scientificamerican.com/observations/2011/05/12/twitter-helped-doctors-tell-patients-where-to-get-meds-after-japan-earthquake/"> identify how people could obtain medicine for their chronic illnesses</a> that still needed treatment but so many of the traditional pharmaceutical supply chains were cut off.</p>
<p>Like in the Japanese earthquake, there is immense potential for social data to help emergency responders during a disaster. During the Fourmile Canyon Fire in Boulder in 2010, <a href="http://boulderfire.communityos.org/zf/googlemaps/index/disaster_id/107">VisionLink</a> provided fire crews and managers <a href="http://blog.gnip.com/social-media-in-natural-disasters/">a real-time view</a> into what was happening on the ground by layering geo-tagged Tweets and Flickr images they got from Gnip onto a Google map of the area. With this information, emergency workers were better able to see what was happening on the ground.</p>
<p>Studies have proven that people rely on information communicated via social media in emergencies. <a href="http://www.theaustralian.com.au/media/using-social-media-during-natural-disasters-is-comforting-empowering-study-finds/story-e6frg996-1226311497275">A study from the University of Western Sydney</a> found that people relied on official news channels and social media and shared and re-tweeted the most useful information. The authors of the study believe that social media helps people to not feel alone as well as to help getting the message out. Which also echoes<a href="http://www.theaustralian.com.au/media/using-social-media-during-natural-disasters-is-comforting-empowering-study-finds/story-e6frg996-1226311497275"> a study put together by the Red Cross </a>that shows one-third of the U.S. population said they would use social media to let their loved ones know they were safe. Which is exactly why Red Cross has created their own social media control room powered by Radian6.</p>
<p>If you’d like to learn more about social data in natural disasters, contact <a href="mailto:info@gnip.com">info@gnip.com.</a></p>
<p style="text-align: center;"><img class="size-large wp-image-2743 aligncenter" title="Social Data in Natural Disasters" src="http://blog.gnip.com/wp-content/uploads/2012/05/VisionLink-Fire-Example-898x1024.png" alt="VisionLink Map of Boulder Fires" width="360" height="410" /></p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=6OGWeRoadQ0:eJ8piPoLTh0:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/6OGWeRoadQ0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/social-data-natural-disasters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/social-data-natural-disasters/</feedburner:origLink></item>
		<item>
		<title>Tumblr Firehose Now Available Exclusively from Gnip</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/_imlcqKUP4s/</link>
		<comments>http://blog.gnip.com/tumblr-api/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 17:00:07 +0000</pubDate>
		<dc:creator>Chris Moody, President and COO</dc:creator>
				<category><![CDATA[Marketing]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[social data]]></category>
		<category><![CDATA[Tumblr API]]></category>
		<category><![CDATA[Tumblr Data]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2731</guid>
		<description><![CDATA[I’m thrilled to announce that the full firehose of public Tumblr posts is now available exclusively from Gnip. Tumblr is one of the fastest growing social networks in the world. Much of this growth is fueled by the enormous number of conversations that are unique to the Tumblr community. These conversations cover a huge range [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-16-at-5.50.27-PM.png"><img class="aligncenter size-full wp-image-2732" title="Tumblr Data " src="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-16-at-5.50.27-PM.png" alt="" width="562" height="90" /></a></p>
<p>I’m thrilled to announce that the full firehose of public Tumblr posts is now available exclusively from Gnip. Tumblr is one of the fastest growing social networks in the world. Much of this growth is fueled by the enormous number of conversations that are unique to the Tumblr community. These conversations cover a huge range of subjects, from <a href="http://capitolcouture.pn/">movies</a>, <a href="http://thedailyshow.tumblr.com/">TV shows</a> and <a href="http://targetstyle.tumblr.com/">fashion</a> to <a href="http://smartercities.tumblr.com/">business</a>, <a href="http://urbanoutfitters.tumblr.com/">apparel</a> and <a href="http://madebynike.tumblr.com/">consumer products</a>. Check out these stats to get a feel for the volume of discussion on Tumblr:</p>
<ul>
<li>50 million new posts every day</li>
<li>15 billion page views every month</li>
<li>20 billion total posts</li>
<li>300% traffic growth last year</li>
</ul>
<p>While some social platforms react quickly to news and other events, Tumblr conversations often spread around concepts and trends. Take the example of Urban Outfitters where a photographer <a href="http://felishatolentino.tumblr.com/post/19717855249/citronella-dreams-for-urban-outfitters-get-the">posted a picture</a> to her personal Tumblr of a piece from one of their new collections. That post received over 1,000 notes and almost no mention elsewhere. In the case of Land Rover, the company <a href="http://tumblr.landroverusa.com/post/20979015051/grillz-john-the-dog-shows-off-his-grill">posted a picture</a> of a dog riding in a Land Rover to their Tumblr that received more than 5,000 notes and very little mention on other networks.</p>
<p>It doesn’t take a large leap to see the impact this type of information can have on brand management and product development. The conversations on Tumblr are rich in images and discussion about brands and products, from simply sharing a picture about a favorite pair of shoes to reblogging news about favorite brand. And given the highly social nature of the Tumblr community, these discussions move quickly and broadly through the community. You often see posts that are shared tens of thousands of times. For brands, every conversation matters and access to the full firehose ensures they won’t miss a thing.</p>
<p>We’re excited to be able to offer Tumblr to our customers and can’t wait to see what other intriguing use cases they find for this data.</p>
<p>Drop us a line at <a href="mailto:sales@gnip.com">sales@gnip.com</a> to learn more.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=_imlcqKUP4s:2mD_xR7I2w4:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/_imlcqKUP4s" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/tumblr-api/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/tumblr-api/</feedburner:origLink></item>
		<item>
		<title>Big Software, Big Jobs, Big Impact</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/Nd7Mnogfvyc/</link>
		<comments>http://blog.gnip.com/big-software-big-jobs-big-impact/#comments</comments>
		<pubDate>Thu, 12 Apr 2012 22:26:02 +0000</pubDate>
		<dc:creator>Jud Valeski, Co-Founder and CEO</dc:creator>
				<category><![CDATA[Corporate]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2715</guid>
		<description><![CDATA[Gnip moves billions of real-time public social activities to its customers everyday. Doing that efficiently, accurately, and reliably is an awesome challenge. Gnip also has a lot of product roadmap to build-out. We’re hiring across the board, but I wanted to provide some insight into the software side of things to give any software construction [...]]]></description>
			<content:encoded><![CDATA[<p>Gnip moves billions of real-time public social activities to its customers everyday. Doing that efficiently, accurately, and reliably is an awesome challenge. Gnip also has a lot of product roadmap to build-out. <a href="http://gnip.com/careers">We’re hiring across the board</a>, but I wanted to provide some insight into the software side of things to give any software construction minded readers of this post a sense of the interesting technical challenges you’d get to work on if you were on our development team.</p>
<p>
<strong>Big Streams: Long-lived, Stateful, Variable Throughput TCP Connections</strong><br />
While our web-app (Rails) looks/feels much like a typical web-app, looks can be deceiving. Our core system (Java) has uncommon TCP (HTTP for the most part) connection challenges that we’re constantly applying operational and business logic creativity to. We often describe the connectivity scenario as akin to a video streaming system (Netflix for example). The load balancing, tuning, connection handling logic (often in code (Java and/or C)), restart, buffering, flushing games we get to play keep one’s brain thoroughly engaged. Imagine getting to write code that runs in a pipeline moving at several hundred Mbps (sustained).</p>
<p>
<strong>Big CPU: Filtering</strong><br />
We’ve built a <a href="http://docs.gnip.com/w/page/35663947/Power%20Track#Operators">real-time equivalent</a> parallel to age-old SQL. Something powerful and efficient, yet simple and intuitive to use. The world has grown up with SQL on their minds, yet it generally doesn’t apply to the world of high-volume real-time nature of public social data streams. We spend a lot of time and energy crafting the language, as well as building the infrastructure underneath to ensure it can operate on a message (from small Tweets, to large blog posts) in ones of milliseconds. Very powerful. Very fun. We’d love your help in evolving this part of the system with us.</p>
<p>
<strong>Big Blending: Enrichments</strong><br />
As data moves through Gnip’s infrastructure we do a variety of things to it. As a couple examples, we will enrich it with Klout scores and typed language classification scores. We’ll also unwind all those opaque URLs (and allow you to filter against the result) for our customers, so they don’t have to stand up horizontally scaled/parallelized infrastructure to do so on their end. We have a long list of enrichments we’re in the process of adding, and we’d like to do so with as minimal latency impact as technically possible. Speed matters. There’s amazing opportunity in the industry to blend other datasets into “the stream.” Help us do this.</p>
<p>
<strong>Big Data: Historical</strong><br />
There are three organizations on Earth that have the complete public Twitter corpus on their servers. Gnip, Twitter &amp; the <a href="http://blog.gnip.com/gnip-partners-libraryofcongress/">Library of Congress</a>. Gnip has dipped our toes into the “historical” offering with our 30-day reply product. Doing so has allowed us to meet many of the business critical backfill requirements of our customers. That said, there is still a huge opportunity in the historical space. We leverage a variety of parallelizable data access, query, and filter technologies at Gnip, but want more horsepower here. We’d love to have more people on the team with practical experience around map-reduce based data access models. Gnip personifies “Big Data” challenges and solutions. Show us what you’ve got (beyond textbook and academic understanding of the latest trends in this space)!</p>
<p>
<strong>Big Customers: Impact</strong><br />
We’ve been delivering public social data for four years and serve the biggest, most demanding customers in the space. From 8 of the 9 largest social media monitoring companies to some of the largest hedge funds in the world, Gnip is expected to provide a bulletproof solution. Through our customers, Gnip serves more than 90% of the Fortune 500. We’re not just about “Big Ideas,” we’re about “Big Impact.” Want to work in a place where what you do matters? This is it.</p>
<p>
<strong>Our Approach</strong><br />
We take an “every message matters” approach to our products. As a result, our development team is tightly coupled (one-and-the-same for the most part) with the production operation of the system. We don’t have a wall between the software being built and that software being run. Gnippers who write code also operate that code throughout the local, review, staging, and production environments. We believe this approach yields a better service for our customers; higher quality, better reliability, better consciousness. There is a strong sense of collective code ownership, as opposed to folks siloing into “their area.” This results in everyone’s creative and talented mind having an opportunity to impact the entire system, and subsequently yields better, more consistent, product.</p>
<p><a href="mailto:jobs@gnip.com">jobs@gnip.com</a></p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=Nd7Mnogfvyc:wasRUlrPC0c:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/Nd7Mnogfvyc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/big-software-big-jobs-big-impact/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/big-software-big-jobs-big-impact/</feedburner:origLink></item>
		<item>
		<title>Gnip Adds Sina Weibo, 300-million member Chinese Microblogging Service</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/Hv9UkJ0FftQ/</link>
		<comments>http://blog.gnip.com/gnip-adds-sina-weibo-300-million-member-chinese-microblogging-service/#comments</comments>
		<pubDate>Tue, 10 Apr 2012 16:06:24 +0000</pubDate>
		<dc:creator>Chris Moody, President and COO</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Partners]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[Microblogging]]></category>
		<category><![CDATA[microblogs]]></category>
		<category><![CDATA[Sina Weibo]]></category>
		<category><![CDATA[social data]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2692</guid>
		<description><![CDATA[If you’re a large retail chain and people are talking about your brand, you want to know that immediately. But what if your brand is being talked about in Chinese on the largest microblogging service in China with 300 million members? For large Western brands with a presence in China, understanding what is happening on [...]]]></description>
			<content:encoded><![CDATA[<p>If you’re a large retail chain and people are talking about your brand, you want to know that immediately. But what if your brand is being talked about in Chinese on the largest microblogging service in China with 300 million members?</p>
<p>For large Western brands with a presence in China, understanding what is happening on Sina Weibo is just as important as understanding any social media channel in English speaking locales. It’s a legitimate problem as evidenced by <a href="http://www.china.org.cn/business/2012-02/07/content_24569503.htm">KFC recently squelching rumors about its delivery service</a> or the <a href="http://articles.cnn.com/2011-07-06/world/china.redcross_1_netizens-china-red-cross-society-posts?_s=PM:WORLD">Red Cross having to address misuse of funds</a> by a supposed employee on Sina Weibo. Ogilvy even did a <a href="http://www.china.org.cn/business/2012-02/10/content_24601460.htm">study on the importance of responding to crises</a> due to how quickly conversations spread on Weibos, as it’s extremely viral. For KFC, one Weibo post was forwarded 41,000 times and picked up 5,000 comments.</p>
<p>Monitoring plays an important role in crisis communications and brand awareness, and Gnip is proud to announce that we are now offering Sina Weibo as part of our Enterprise Data Collector offering. This makes it seamless for Gnip’s social media clients to begin offering public Sina Weibo activities to their customers.</p>
<p>With 500 million Internet users, China has the largest population of Internet users on the planet and there are 300 million people using Sina Weibo that are potential customers for Western brands having conversations that shouldn’t be missed. The user base of Sina Weibo is extremely diverse and includes celebrities, executives, politicians and advertisers. According to a joint study with Sina Weibo and social media intelligence firm CIC estimate that some <a href="http://www.ciccorporate.com/index.php?option=com_content&amp;view=article&amp;id=748%3Acic-and-sina-release-joint-white-paper-qmicroblog-revolutionizing-chinas-social-business-developmentq&amp;catid=52%3Aarchives-2011&amp;Itemid=158&amp;lang=en">50,000 companies have signed up on Sina Weibo </a>to date. Why? Because 95% of surveyed Chinese believe that brands with a presence on Sina Weibo are more trustworthy.</p>
<p>More Facts About Sina Weibo:</p>
<ul>
<li>Brands on Sina Weibo include Ikea, L’Oreal, Louis Vuitton, Dell, Coca Cola, Nike, Adidas, Burberry, Starbucks, Remy Martin and ZZ.</li>
<li>The most Weibos ever sent was an astounding 32,312 messages per second were sent during the Chinese New Year period.</li>
<li>Sixty-one percent of social networking users in China have made a purchase because of a digital campaign.</li>
<li>Sina Weibo has 57% of the Weibo market based on active users and 87% based on browsing time.</li>
<li>The most popular Weibo personality, Chinese actress Chen Yao has more than 17 million followers.</li>
<li>Notable personalities using Sina Weibo include Emma Watson, Tom Cruise, Bill Gates, Paris Hilton, San Francisco Mayor Ed Lee and Radiohead.</li>
<li>Nearly half a million of Sina Weibo users are from the United States.</li>
</ul>
<p><strong id="internal-source-marker_0.9369974923320115">If you’re interested in looking at understanding and analyzing the public Chinese social media market, please email <a href="mailto:sales@gnip.com">sales@gnip.com</a> for more information.</strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"> </strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"> </strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"></p>
<p style="text-align: center;"><a href="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-10-at-9.35.41-AM.png"></a><a href="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-10-at-9.43.11-AM.png"></a></p>
<p style="text-align: center;"><a href="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-10-at-10.04.56-AM.png"><img class="aligncenter size-full wp-image-2708" title="Sina Weibo Ikea" src="http://blog.gnip.com/wp-content/uploads/2012/04/Screen-Shot-2012-04-10-at-10.04.56-AM.png" alt="Ikea on Sina Weibo" width="653" height="365" /></a></p>
<p></strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"> </strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"> </strong></p>
<p><strong id="internal-source-marker_0.9369974923320115"> </strong></p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=Hv9UkJ0FftQ:5EMyowrHvOw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/Hv9UkJ0FftQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/gnip-adds-sina-weibo-300-million-member-chinese-microblogging-service/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/gnip-adds-sina-weibo-300-million-member-chinese-microblogging-service/</feedburner:origLink></item>
		<item>
		<title>Big Boulder: The World’s First Social Data Conference</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/qe-a8yFpwm4/</link>
		<comments>http://blog.gnip.com/big-boulder-the-world%e2%80%99s-first-social-data-conference/#comments</comments>
		<pubDate>Thu, 05 Apr 2012 19:10:08 +0000</pubDate>
		<dc:creator>Chris Moody, President and COO</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2685</guid>
		<description><![CDATA[Social data is an emerging new industry and enterprises are applying social data to solve challenges in both the business world and the public sector. And while there are a wide range of conferences to discuss big data or social media marketing, there hasn’t been a place for the people and organizations working with social [...]]]></description>
			<content:encoded><![CDATA[<p>Social data is an emerging new industry and enterprises are applying social data to solve challenges in both the business world and the public sector. And while there are a wide range of conferences to discuss big data or social media marketing, there hasn’t been a place for the people and organizations working with social data to discuss best practices, challenges, trends and developments.</p>
<p>We decided to change that. Gnip is excited to launch <a href="http://bigboulderconf.com/">Big Boulder</a>, the first conference dedicated to social data and the myriad use cases surrounding it.  Big Boulder will take place on June 21-22 in beautiful Boulder, Colorado.</p>
<p>Our speaker lineup is incredible and we’re proud of the caliber of talent we’re attracting to our inaugural conference. Confirmed speakers include <a href="https://twitter.com/#!/davemorin">Dave Morin</a> of Path, <a href="https://twitter.com/#!/rsarver">Ryan Sarver</a> and <a href="https://twitter.com/#!/dougw">Doug Williams</a> of Twitter, <a href="http://www.linkedin.com/in/seanbruich">Sean Bruich</a> of Facebook, <a href="https://twitter.com/#!/danielha">Daniel Ha</a> and <a href="https://twitter.com/#!/ro_gupta">Ro Gupta</a> of Disqus, <a href="https://twitter.com/#!/howardlindzon">Howard Lindzon</a> of StockTwits, <a href="https://twitter.com/#!/joefernandez">Joe Fernandez</a> and <a href="https://twitter.com/#!/daddymention">Matt Thomson</a> of Klout, <a href="https://twitter.com/#!/znh">Zach Hofer-Shall</a> of Forrester and <a href="http://www.linkedin.com/pub/brian-hyndman/4/6a3/5">Bryan Hyndman</a> of NASDAQ. And this is just the start.</p>
<p>In fact, I’m excited to announce two new additions to our speaking roster, <a href="http://www.linkedin.com/in/scottnicholsonphd">Scott Nicholson</a>, data scientist at LinkedIn, and <a href="http://www.linkedin.com/in/wendylea">Wendy Lea</a>, CEO of Get Satisfaction. We’re thrilled to add them to the agenda and look forward to announcing more new speakers in the coming weeks.</p>
<p>Social data is in early stages but it’s already being used to power a range of <a href="http://www.businessinsider.com/9-ways-social-media-data-is-being-used-2012-3?mkt_tok=3RkMMJWWfF9wsRonuKTNZKXonjHpfsX56%2BQvUKC%2BlMI%2F0ER3fOvrPUfGjI4ASMNmI%2FqLAzICFpZo2FFfFOmF">incredible use cases</a> including <a href="http://online.wsj.com/article/SB10001424052970204778604577239574141848142.html">predicting stock markets</a>, <a href="http://online.wsj.com/article/SB10001424052970204138204576598942105167646.html">breaking the news of earthquakes</a>, and helping <a href="http://blogs.scientificamerican.com/observations/2011/05/12/twitter-helped-doctors-tell-patients-where-to-get-meds-after-japan-earthquake/">emergency responders in Japan find medicine after last year’s earthquake</a>. Big Boulder will have both fireside chats with the leading social publishers and panels discussing the use of social data in the public sector and financial markets, geo-spatial applications of social media data and social data analytics.</p>
<p>Being located in Boulder has been amazing for Gnip, and we’re thrilled to show off our hometown. Boulder has been <a href="http://www.nytimes.com/2010/05/14/business/14boulder.html?scp=1&amp;sq=boulder%20brad%20feld&amp;st=cse">frequently recognized for our thriving high tech community</a>. Not to mention, Boulder has been named the foodiest, healthiest, happiest and the best town for startups.</p>
<p>Big Boulder will be a deliberately intimate conversation, so attendance will be by invitation only for Gnip’s partners and customers. Check out <a href="http://www.bigboulder.com/">www.bigboulderconf.com</a> to learn more.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=qe-a8yFpwm4:JnuWs_SXyP0:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/qe-a8yFpwm4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/big-boulder-the-world%e2%80%99s-first-social-data-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/big-boulder-the-world%e2%80%99s-first-social-data-conference/</feedburner:origLink></item>
		<item>
		<title>For the Times When Every Tweet is Too Many</title>
		<link>http://feedproxy.google.com/~r/gnip/JGtz/~3/orME9whdrBQ/</link>
		<comments>http://blog.gnip.com/for-the-times-when-every-tweet-is-too-many/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 15:38:26 +0000</pubDate>
		<dc:creator>Adam Tornes, Product</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2679</guid>
		<description><![CDATA[Our customers tell us that getting every single Tweet that matters is one of the key reasons they work with Gnip. And sometimes getting every Tweet that matters means filtering out the Tweets you don’t want. With this in mind, I’m happy to announce the introduction of two new operators to our Power Track filtering [...]]]></description>
			<content:encoded><![CDATA[<p>Our customers tell us that getting every single Tweet that matters is one of the key reasons they work with Gnip. And sometimes getting every Tweet that matters means filtering out the Tweets you don’t want. With this in mind, I’m happy to announce the introduction of two new operators to our Power Track filtering suite.</p>
<p><strong>Retweet Operator</strong></p>
<p>The Retweet operator allows a customer to ensure <em>only</em> Retweets that match a rule are delivered or excluded.</p>
<p>To use the Retweet operator, simply add <em>is:retweet</em> or <em>–is:retweet</em> to any rule.</p>
<p>Examples Include:</p>
<ul>
<li>Receive only Retweets mentioning Apple using a rule like: <em>apple is:retweet</em> as a way to measure engagement of the brand’s fan base</li>
</ul>
<p>or<strong> </strong></p>
<ul>
<li>Get only Tweets with unique content about Apple using a rule like: <em>apple -is:retweet </em> to monitor conversation about the brand and ignore the tremendous volume of retweets generated by the brand</li>
</ul>
<p><strong>Sampling Operator</strong></p>
<p>The Sampling operator allows a customer to receive a random sample of Tweets that match a rule rather than the entire set of Tweets.</p>
<p>There are several use cases where the Sampling operator is useful.  Say you want to stay within a budgeted number of Tweets each month, but you’re trending higher than that budget halfway through the month.  With the Sampling operator, you can scale back your consumption without fully eliminating rules.  In another use case you might want to monitor a very high-volume rule or user, but your internal systems can’t handle this volume.  Sampling makes this more manageable.  Finally, there are times when you simply need to know the directional volumes for things, and don&#8217;t need every tweet.</p>
<p>To use the Sampling operator, add <em>sample:##</em> to any rule with an integer value between 1 to 100. The Sampling operator applies to the entire rule and requires any “OR’d” terms be grouped.</p>
<p>Examples Include:</p>
<ul>
<li>Receive a sampling of 10% of all Tweets that contain “apple” using a rule like:</li>
</ul>
<p><em> apple sample:10</em></p>
<p><em>or</em></p>
<ul>
<li>Receive a sampling of 50% of all Tweets that contain “iPad” or “iPhone” using a rule like:</li>
</ul>
<p><em> (ipad OR iphone) sample:50</em></p>
<p><em> </em></p>
<p>As always, thank you for the product feedback and keep it coming.  Additional documentation of these new operators and others can be found in our <a href="http://docs.gnip.com">online documentation</a>.</p>
<p>&nbsp;</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/gnip/JGtz?a=orME9whdrBQ:7iLLGlIzQZY:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/gnip/JGtz?d=yIl2AUoC8zA" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/gnip/JGtz/~4/orME9whdrBQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/for-the-times-when-every-tweet-is-too-many/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.gnip.com/for-the-times-when-every-tweet-is-too-many/</feedburner:origLink></item>
	</channel>
</rss>

