<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>iRevolution</title>
	<atom:link href="http://irevolution.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://irevolution.net</link>
	<description>From innovation to Revolution</description>
	<lastBuildDate>Tue, 18 Jun 2013 17:36:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='irevolution.net' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/a91b9f81df7e7c4bf1f96afc0312c864?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>iRevolution</title>
		<link>http://irevolution.net</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://irevolution.net/osd.xml" title="iRevolution" />
	<atom:link rel='hub' href='http://irevolution.net/?pushpress=hub'/>
		<item>
		<title>Using Crowdring for Disaster Response?</title>
		<link>http://irevolution.net/2013/06/18/crowdring-disaster-response/</link>
		<comments>http://irevolution.net/2013/06/18/crowdring-disaster-response/#comments</comments>
		<pubDate>Tue, 18 Jun 2013 13:24:41 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Digital Activism]]></category>
		<category><![CDATA[Humanitarian Tech]]></category>
		<category><![CDATA[Beeping]]></category>
		<category><![CDATA[Crowdring]]></category>
		<category><![CDATA[Disaster]]></category>
		<category><![CDATA[Flashing]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Response]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=12004</guid>
		<description><![CDATA[35 million missed calls. That’s the number of calls that 75-year old social justice leader Anna Hazare received from people across India who supported his efforts to fight corruption. Two weeks earlier, he had invited India to join his movement by &#8230; <a href="http://irevolution.net/2013/06/18/crowdring-disaster-response/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12004&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;"><strong>35 million missed calls.</strong></p>
<p style="text-align:justify;">That’s the number of calls that 75-year old social justice leader <span style="color:#444444;line-height:1.7;">Anna Hazare received from people across India who supported his efforts to fight corruption. Two weeks earlier, he had invited India to join his movement by making &#8220;missed calls&#8221; to a local number. Missed calls, known as beeping or flashing, are calls that are intentionally dropped after ringing. The advantage of making missed call is that neither the caller or recipient is charged. This tactic is particularly common in emerging economies to avoid paying for air time or SMS. </span><span style="color:#444444;line-height:1.7;">To build on this pioneering work, Anna and his team are developing a mobile petition tool called <a href="http://personaldemocracy.com/media/organizing-majority-world">Crowdring</a>, which turns a free &#8220;missed call&#8221; into a signature on a petition.</span></p>
<p style="text-align:justify;"><a href="https://s3.amazonaws.com/ksr/assets/000/363/468/27e38d5a470fec89b90aa1acfc1a6627_large.jpg?1359589096"><img class="aligncenter size-large wp-image-12005" alt="crowdring_pic" src="http://irevolution.files.wordpress.com/2013/06/crowdring_pic.jpg?w=500&#038;h=267" width="500" height="267" /></a></p>
<p style="text-align:justify;"><span style="color:#444444;line-height:1.7;">Communicating with disaster-affected communities is key for effective disaster response. Crowdring could be used to poll disaster affected communities. The service could also be used in combination with local community radio stations. The latter would broadcast a series of yes or no questions; ringing once would signify yes, twice would mean no. Some questions that come to mind:</span></p>
<ol>
<li><span style="line-height:20px;">Do you have enough drinking water? </span></li>
<li>Are humanitarian organizations doing a good job?</li>
<li>Is someone in your household displaying symptoms of cholera?</li>
</ol>
<p style="text-align:justify;">By receiving these calls, humanitarians would automatically be able to create a database of phone numbers with associated poll results. This means they could text them right back for more information or to arrange an in person meeting. You can learn more about Crowdring in this short video below.</p>
<p><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='500' height='312' src='http://www.youtube.com/embed/Uzj-hvc9k5g?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<p><b><b><a href="http://www.irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#038;h=59&#038;h=59" width="108" height="59" /></a></b></b></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/12004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/12004/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12004&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/18/crowdring-disaster-response/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/crowdring_pic.jpg?w=500" medium="image">
			<media:title type="html">crowdring_pic</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#38;h=59&#38;h=59" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>How ReCAPTCHA Can Be Used for Disaster Response</title>
		<link>http://irevolution.net/2013/06/17/recaptcha-for-disaster-response/</link>
		<comments>http://irevolution.net/2013/06/17/recaptcha-for-disaster-response/#comments</comments>
		<pubDate>Mon, 17 Jun 2013 16:19:16 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Social Computing]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Disaster]]></category>
		<category><![CDATA[ReCAPTCHA]]></category>
		<category><![CDATA[Response]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=12027</guid>
		<description><![CDATA[We&#8217;ve all seen prompts like this: More than 100 million of these ReCAPTCHAs get filled out every day on sites like Facebook, Twitter and CNN. Google uses them to simultaneously filter out spam and digitize Google Books and archives of &#8230; <a href="http://irevolution.net/2013/06/17/recaptcha-for-disaster-response/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12027&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;"><strong>We&#8217;ve all seen prompts like this:</strong></p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic.png"><img class="aligncenter size-large wp-image-12029" alt="recaptcha_pic" src="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic.png?w=500&#038;h=208" width="500" height="208" /></a></p>
<p style="text-align:justify;">More than 100 million of these <a href="https://en.wikipedia.org/wiki/ReCAPTCHA">ReCAPTCHAs</a> get filled out every day on sites like Facebook, Twitter and CNN. Google uses them to simultaneously filter out spam and digitize Google Books and archives of the New York Times. For example:</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic2.jpg"><img class="aligncenter size-large wp-image-12030" alt="recaptcha_pic2" src="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic2.jpg?w=500&#038;h=406" width="500" height="406" /></a></p>
<p style="text-align:justify;">So what&#8217;s the connection to disaster response? In early 2010, I blogged about using <a href="http://irevolution.net/2010/03/24/games-to-turksource/">massive multiplayer games to tag crisis information</a> and asked: What is the game equivalent of reCAPTCHA for tagging crisis information? (Big thanks to friend and colleague <a href="http://www.nationalgeographic.com/explorers/bios/albert-lin/">Albert Lin</a> for reminding me of this recently). Well, the game equivalent is perhaps the Internet Response League (<a href="http://irevolution.net/2013/05/29/gamers-to-the-rescue/">IRL</a>). But what if we simply used ReCPATCHA itself for disaster response?</p>
<p style="text-align:justify;">Humanitarian organizations like the American Red Cross regularly monitor Twitter for disaster-related information. But they are often overwhelmed with millions of tweets during major events. While my team and I at <a href="http://qcri.com/our-research/social-innovation/social-innovation-projects">QCRI</a> are developing <a href="http://irevolution.net/2013/04/01/auto-extracting-disaster-info/">automated solutions</a> to manage this Big (Crisis) Data, we could also  use the ReCAPTCHA methodology. For example, our <a href="http://irevolution.net/2013/04/01/auto-extracting-disaster-info/">automated classifiers</a> can tell us with a certain level of accuracy whether a tweet is disaster-related, whether it refers to infrastructure damage, urgent needs, etc. If the classifier is not sure—say the tweet is scored as having a 50% chance of being related to infrastructure damage—then we could automatically post it to our version of ReCAPCHA (see below). Perhaps a list of 3 tweets could be posted with the user prompted to tag which one of the 3 is damage-related. (The other two tweets could come from a separate database of random tweets).</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic3.png"><img class="aligncenter size-large wp-image-12031" alt="ReCaptcha_pic3" src="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic3.png?w=500&#038;h=208" width="500" height="208" /></a></p>
<p style="text-align:justify;">There are reportedly <a href="https://careers.un.org/lbw/home.aspx?viewtype=VD">44,000 United Nations employees</a> around the globe. World Vision also employs over 40,000, the International Committee of the Red Cross (ICRC) has more than 12,000 employees while Oxfam has about 7,000. That&#8217;s 100,000 people right there who probably log onto their work emails at least once a day. Why not insert a ReCaptcha when they log in? We could also add  ReCAPTCHAs to these organizations&#8217; Intranets &amp; portals like <a href="http://irevolution.net/2008/04/09/from-intellipedia-to-virtual-osocc-to-wikiwarning/">Virtual OSOCC</a>. On a related note, <span style="color:#444444;line-height:1.7;">Google recently </span><a href="http://techcrunch.com/2012/03/29/google-now-using-recaptcha-to-decode-street-view-addresses/"><span style="line-height:1.7;">added images</span></a><span style="color:#444444;line-height:1.7;"> from Google Street View to ReCAPTCHAS. So we could automatically collect images shared on social media during disasters and post them to our own disaster response ReCAPTCHAs:</span></p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-17-at-12-13-27-pm.png"><img class="aligncenter size-large wp-image-12050" alt="Image ReCAPTCHA" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-17-at-12-13-27-pm.png?w=500&#038;h=217" width="500" height="217" /></a></p>
<p style="text-align:justify;"><span style="color:#444444;line-height:1.7;">In sum, as humanitarians log into their emails multiple times a day, they&#8217;d be asked to tag which tweets and/or pictures relate to on ongoing disaster. Last year, w</span><span style="line-height:1.7;">e </span><a style="color:#444444;line-height:1.7;" href="http://irevolution.net/2012/12/08/digital-response-typhoon-pablo/">tagged tweets and images</a><span style="line-height:1.7;"> in support of the UN&#8217;s disaster response efforts in the Philippines following Typhoon Pablo. Adding a customized ReCAPTCHA for disaster response would help us tap a much wider audience of &#8220;volunteers&#8221;, which would mean an even more rapid turn around time for damage assessments following major disasters.</span><span style="line-height:1.7;"><br />
</span></p>
<p><b id="docs-internal-guid-1949a2bc-cc43-e561-e691-1fd26d813043"><a href="http://irevolution.net/bio/"><img class="aligncenter" alt="Bio" src="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#038;h=71&#038;h=71" width="104" height="71" /></a></b></p>
<p style="text-align:justify;">
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/12027/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/12027/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12027&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/17/recaptcha-for-disaster-response/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic.png?w=500" medium="image">
			<media:title type="html">recaptcha_pic</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic2.jpg?w=500" medium="image">
			<media:title type="html">recaptcha_pic2</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/recaptcha_pic3.png?w=500" medium="image">
			<media:title type="html">ReCaptcha_pic3</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-17-at-12-13-27-pm.png?w=500" medium="image">
			<media:title type="html">Image ReCAPTCHA</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#38;h=71&#38;h=71" medium="image">
			<media:title type="html">Bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Using Waze, Uber, AirBnB and SeeClickFix for Disaster Response</title>
		<link>http://irevolution.net/2013/06/11/uber-waze-airbnb-seeclickfix-for-disaster-response/</link>
		<comments>http://irevolution.net/2013/06/11/uber-waze-airbnb-seeclickfix-for-disaster-response/#comments</comments>
		<pubDate>Tue, 11 Jun 2013 04:03:45 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Humanitarian Tech]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[AirBnB]]></category>
		<category><![CDATA[Disaster]]></category>
		<category><![CDATA[Response]]></category>
		<category><![CDATA[SeeClickFix]]></category>
		<category><![CDATA[Uber]]></category>
		<category><![CDATA[Waze]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=12008</guid>
		<description><![CDATA[After the Category 5 Tornado in Oklahoma, map editors at Waze used the service to route drivers around the damage. While Uber increased their car service fares during Hurricane Sandy, they could have modified their App to encourage the shared &#8230; <a href="http://irevolution.net/2013/06/11/uber-waze-airbnb-seeclickfix-for-disaster-response/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12008&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">After the Category 5 Tornado in Oklahoma, map editors at Waze used the service to <a href="http://www.waze.com/blog/waze-map-editors-help-drivers-get-around-collapsed-bridge-and-damage-from-oklahoma-tornadoes/">route drivers around the damage</a>. While Uber <a href="http://news.cnet.com/8301-1035_3-57543776-94/storm-surge-uber-just-doubled-car-service-pricing-in-nyc/">increased their car service fares</a> during Hurricane Sandy, they could have modified their App to encourage the shared use of Uber cars to <a href="https://twitter.com/cbracy/status/343142783105777664">fill unused seats</a>. This would have taken some work,  but AirBnB did modify their platform overnight to let over 1,400 kindhearted New Yorkers offer <a href="http://www.crowdsourcing.org/editorial/airbnb-unveils-disaster-response-service/26574">free housing</a> to victims of the hurricane. SeeClick fix was used also to report over <a href="http://opensource.com/government/12/10/crowdsourced-reports-hurricane-sandy">800 issues in just 24 hours</a> after Sandy made landfall. These included reports on the precise location of power outages, flooding, downed trees, downed electric lines, and other storm damage. Following the Boston Marathon Bombing, SeeClick fix was used to quickly find <a href="http://seeclickfix.blogspot.com/2013/04/emergency-lodging-for-those-in-boston.html">emergency housing</a> for those affected by the tragedy.</p>
<p style="text-align:justify;"><a href="http://www.brainpickings.org/wp-content/uploads/2010/05/pie2.png"><img class="aligncenter size-large wp-image-12010" alt="" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-11-30-54-pm.png?w=500&#038;h=250" width="500" height="250" /></a></p>
<p style="text-align:justify;">Disaster-affected populations have always been the real first responders. Paid emergency response professionals cannot be everywhere at the same time, but <a href="http://irevolution.net/2010/08/14/crowd-is-always-there/">the crowd is always there</a>. Disasters are collective experiences; and today, disaster-affected crowds are increasingly &#8220;digital crowds&#8221; as well—that is, both a source and consumer of that digital information. <span style="color:#444444;line-height:1.7;">In other words, they are also the first <em>digital</em> <em>responders</em>. Thanks to <a href="http://irevolution.net/2010/11/01/digital-disruption/">connection technologies</a> like Waze, Uber, AirBnB and SeeClickFix, disaster affected communities can self-organize more quickly than ever before since these new technologies drastically reduce the cost and time necessary to self-organize. And because resilience is a function of a <a href="http://irevolution.net/2013/01/11/disaster-resilience-2-0/">community&#8217;s ability to self-organize</a>, these new technologies can also render disaster-prone populations more resilient by <a href="http://irevolution.net/2012/12/18/social-media-social-capital-disaster-resilience/">fostering social capital</a>, thus enabling them to </span><a style="line-height:1.7;" href="http://irevolution.net/2013/01/11/disaster-resilience-2-0/">bounce back more quickly</a><span style="color:#444444;line-height:1.7;"> after a crisis.</span></p>
<p style="text-align:justify;">When we&#8217;re affected by disasters, we tend to use the tools that we are most familiar with, i.e. those we use on a daily basis when there is no disaster. That&#8217;s why we often see so many Facebook updates, Instagram pictures, tweets, YouTube videos, etc., posted during a disaster. The same holds true for services like Waze and AirBnB, for example. So I&#8217;m thrilled to see more examples of these platforms used as humanitarian technologies and equally heartened to know that the companies behind these tools are starting to play a more active role during disasters, thus <a href="http://irevolution.net/2013/04/24/jointly-app/">helping people help themselves</a>. Each of these platforms have the potential to become hyper-local <a href="http://irevolution.net/2013/02/27/matchapp-disaster-response-app/">match.com&#8217;s for disaster response</a>. Facilitating this kind of mutual-aid not only builds social capital, <a href="http://irevolution.net/2012/12/12/social-capital-disaster-resilience/">which is critical to resilience</a>, it also shifts the burden and pressure off the shoulders of paid responders who are often overwhelmed during major disasters.</p>
<p style="text-align:justify;">In sum, these useful everyday technologies also serve to crowdsource and democratize disaster response. Do you know of other examples? Other everyday smartphone apps and web-based apps that get used for disaster response? If so, I&#8217;d love to know. Feel free to post your examples in the comments section below. Thanks!</p>
<p><a href="http://irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#038;h=70&#038;h=70" width="108" height="70" /></a></p>
<p style="text-align:justify;">
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/12008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/12008/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=12008&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/11/uber-waze-airbnb-seeclickfix-for-disaster-response/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-11-30-54-pm.png?w=500" medium="image" />

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#38;h=70&#38;h=70" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Big Data for Disaster Response: A List of Wrong Assumptions</title>
		<link>http://irevolution.net/2013/06/10/wrong-assumptions-big-data/</link>
		<comments>http://irevolution.net/2013/06/10/wrong-assumptions-big-data/#comments</comments>
		<pubDate>Mon, 10 Jun 2013 04:03:15 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Social Computing]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Assumptions]]></category>
		<category><![CDATA[Bias]]></category>
		<category><![CDATA[False]]></category>
		<category><![CDATA[Misleading]]></category>
		<category><![CDATA[Random]]></category>
		<category><![CDATA[Representative]]></category>
		<category><![CDATA[Sampling]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11883</guid>
		<description><![CDATA[Derrick Herris puts it best: &#8220;It might be provocative to call into question one of the hottest tech movements in generations, but it’s not really fair. That’s because how companies and people benefit from Big Data, Data Science or whatever &#8230; <a href="http://irevolution.net/2013/06/10/wrong-assumptions-big-data/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11883&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-1-24-56-pm.png"><img class="aligncenter size-large wp-image-12001" alt="Screen Shot 2013-06-09 at 1.24.56 PM" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-1-24-56-pm.png?w=500&#038;h=180" width="500" height="180" /></a></p>
<p><strong>Derrick Herris <a href="http://gigaom.com/2013/05/28/if-youre-disappointed-with-big-data-youre-not-paying-attention/">puts it best</a>:</strong></p>
<p style="text-align:justify;padding-left:30px;">&#8220;It might be provocative to call into question one of the hottest tech movements in generations, but it’s not really fair. That’s because how companies and people benefit from Big Data, Data Science or whatever else they choose to call the movement toward a data-centric world is directly related to what they expect going in. Arguing that big data isn’t all it’s cracked up to be is a strawman, pure and simple—because no one should think it’s magic to begin with.&#8221;</p>
<p style="text-align:justify;">So here is a list of misplaced assumptions about the relevance of Big Data for disaster response and emergency management:</p>
<p>•  <strong>&#8220;Big Data will improve decision-making for disaster response&#8221;</strong></p>
<p style="text-align:justify;">This recent <a href="http://irevolution.net/2013/04/09/humanitarianism-network-age/">groundbreaking study</a> by the UN confirms that many decisions made by humanitarian professionals during disasters are not based on any kind of empirical data—regardless of how large or small a dataset may be and even when the data is fully trustworthy. In fact, humanitarians often use anecdotal information or mainstream news to inform their decision-making. So no, Big Data will not magically fix these decision-making deficiencies in humanitarian organizations, all of which pre-date the era of Big (Crisis) Data.</p>
<p style="text-align:justify;"><strong>•  <strong>&#8220;</strong>Big Data suffers from extreme sample bias.&#8221;</strong></p>
<p style="text-align:justify;"><strong></strong>This is often true of any dataset collected using non-random sampling methods. The statement also seems to suggest that representative sampling methods can actually be carried out just as easily, quickly and cheaply. This is very rarely the case, hence the use of non-random sampling. In other words, sample bias is not some strange disease that only affects Big Data or social media. And even though Big Data is biased and not necessarily objective, Big Data such as social media still represents a &#8220;new, large, and arguably unfiltered insights into attitudes and behaviors that were previously difficult to track in the wild.&#8221;</p>
<p style="text-align:justify;"><a href="http://images.quanjing.com/agent/ibrm014/ibldig00251556.jpg"><img class="aligncenter size-large wp-image-11996" alt="digital prints" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-12-57-27-pm.png?w=500&#038;h=288" width="500" height="288" /></a></p>
<p style="text-align:justify;">Statistical correlations in Big Data do not imply causation; they simply suggest that there may be something worth exploring further. Moreover, data that is collected via non-random, non-representative sampling does not invalidate or devalue the data collected. <span style="color:#444444;line-height:1.7;">Much of the data used for medical research, digital disease detection and police work is the product of convenience sampling. Should they dismiss or ignore the resulting data because it is not representative? Of course not.</span></p>
<p style="text-align:justify;"><span style="color:#444444;line-height:1.7;">While the 911 system was set up in 1968, the service and number were not widely known until the 1970s and some municipalities did not have the crowdsourcing service until the 1980s. So it was hardly a representative way to collect emergency calls. Does this mean that the millions of 911 calls made before the more widespread adoption of the service in the 1990s were all invalid or useless? Of course not, even despite the tens of <a href="http://irevolution.net/2013/01/08/disaster-tweets-versus-911-calls/">millions of false 911 calls</a> and hoaxes that are made ever year.</span><span style="color:#444444;line-height:1.7;"> Point is, there has never been a moment in history in which everyone has had access to the same communication technology at the same time. This is unlikely to change for a while even though mobile phones are by far the most rapidly distributed and widespread communication technology in the history of our species.</span></p>
<p style="text-align:justify;">There were over 20 million tweets posted during Hurricane Sandy last year. <span style="color:#444444;line-height:1.7;">While &#8220;only&#8221; 16% of Americans are on Twitter and while this demographic is younger, more urban and affluent than the norm, as <a href="http://bits.blogs.nytimes.com/2013/06/01/why-big-data-is-not-truth/">Kate Crawford rightly notes</a>, this does not render the informative and actionable tweets shared during the Hurricane useless to emergency managers. After Typhoon Pablo devastated the Philippines last year, the UN used images and videos shared on social media as a preliminary way to assess the disaster damage. According to one Senior UN Official I recently spoke with, their relief efforts would have overlooked certain disaster-affected areas had it not been <a href="http://irevolution.net/2012/12/08/digital-response-typhoon-pablo/">for this map</a>.</span></p>
<p style="text-align:justify;"><a href="http://cdn.theatlantic.com/static/infocus/bopha121012/s_b12_RTR3BAV1.jpg"><img class="aligncenter size-large wp-image-11931" alt="PHILIPPINES-TYPHOON" src="http://irevolution.files.wordpress.com/2013/06/pablo_destroy.jpg?w=500&#038;h=337" width="500" height="337" /></a></p>
<p style="text-align:justify;"><span style="color:#444444;line-height:1.7;">Was the data representative? No. Were the underlying images and videos objective? No, they captured the perspective of those taking the pictures. Note that &#8220;only&#8221; <a href="http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/">3% of the world&#8217;s population</a> are active Twitter users and fewer still post images and videos online. But the damage captured by this data was not virtual, it was  real damage. And it only takes one person to take a picture of a washed-out bridge to reveal the infrastructure damage caused by a Typhoon, even if all other onlookers have never heard of social media. Moreover, this recent <a href="http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/">statistical study</a> reveals that tweets are <em>evenly</em> geographically distributed according to the availability of electricity. This is striking given that Twitter has only been around for 7 years compared to the light bulb, which was invited 134 years ago.</span></p>
<p style="text-align:justify;"><strong>•  <strong>&#8220;</strong>Big Data enthusiasts suggest doing away with traditional sources of information for disaster response.&#8221;</strong></p>
<p style="text-align:justify;">I have yet to meet anyone who earnestly believes this. As Derrick writes, &#8220;social media shouldn&#8217;t usurp traditional customer service or market research data that&#8217;s still useful, nor should the Centers for Disease Control start relying on Google Flu Trends at the expense of traditional flu-tracking methodologies. Web and social data are just one more source of data to factor into decisions, albeit a potentially voluminous and high-velocity one.&#8221; In other words, the situation is not either/or, but rather a both/and. Big (Crisis) Data from social media can complement rather than replace traditional information sources and methods.</p>
<p style="text-align:justify;"><strong style="line-height:1.7;">•  <strong>&#8220;</strong>Big Data will make us forget the human faces behind the data.&#8221;</strong></p>
<p style="text-align:justify;"><strong></strong>Big (Crisis) Data typically refers to user-generated content shared on social media, such as Twitter, Instagram, Youtube, etc. Anyone who follows social media during a disaster would be hard-pressed to forget where this data is coming from, in my opinion. Social media, after all, is social and increasingly visually social as witnessed by the tremendous popularity of <a href="http://irevolution.net/2013/06/01/multimedia-tornado-analysis/">Instagram and Youtube during disasters</a>. These help us capture, connect and feel real emotions.</p>
<p style="text-align:justify;"><a href="http://media.commercialappeal.com/media/img/photos/2013/05/23/APTOPIX_Oklahoma_Tornado.JPEG-08af6_t607.JPG"><img class="aligncenter size-large wp-image-11933" alt="OkeTorn" src="http://irevolution.files.wordpress.com/2013/06/oketorn.jpg?w=500&#038;h=324" width="500" height="324" /></a></p>
<p><b><b> </b></b></p>
<p><b><b><a href="http://www.irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#038;h=59&#038;h=59" width="108" height="59" /></a></b></b></p>
<p><strong>See also: </strong></p>
<ul>
<li>&#8220;No Data is Better than Bad Data&#8230;&#8221; Really? [<a href="http://irevolution.net/2011/06/22/no-data-bad-data/">Link</a>]</li>
<li>Crowdsourcing and the Veil of Ignorance [<a href="http://irevolution.net/2010/04/25/veil-ignorance/">Link</a>]</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11883/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11883/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11883&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/10/wrong-assumptions-big-data/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-1-24-56-pm.png?w=500" medium="image">
			<media:title type="html">Screen Shot 2013-06-09 at 1.24.56 PM</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-12-57-27-pm.png?w=500" medium="image">
			<media:title type="html">digital prints</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/pablo_destroy.jpg?w=500" medium="image">
			<media:title type="html">PHILIPPINES-TYPHOON</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/oketorn.jpg?w=500" medium="image">
			<media:title type="html">OkeTorn</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#38;h=59&#38;h=59" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>The Geography of Twitter: Mapping the Global Heartbeat</title>
		<link>http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/</link>
		<comments>http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/#comments</comments>
		<pubDate>Sun, 09 Jun 2013 04:36:31 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crisis Mapping]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Early Warning]]></category>
		<category><![CDATA[Social Computing]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Geography]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11911</guid>
		<description><![CDATA[My colleague Kalev Leetaru recently co-authored this comprehensive study on the various sources and accuracies of geographic information on Twitter. This is the first detailed study of its kind. The detailed analysis, which runs some 50-pages long, has important implications &#8230; <a href="http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11911&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">My colleague <a href="http://www.kalevleetaru.com/">Kalev Leetaru</a> recently co-authored this <a href="http://firstmonday.org/ojs/index.php/fm/article/view/4366/3654">comprehensive study</a> on the various sources and accuracies of geographic information on Twitter. This is the first detailed study of its kind. The detailed analysis, which runs some 50-pages long, has important implications vis-a-vis the use of social media in emergency management and humanitarian response. Should you not have the time to analyze the comprehensive study, this blog post highlights the most<strong> important and relevant findings</strong>.</p>
<p style="text-align:justify;">Kalev <em>et al. </em>analyzed 1.5 billion tweets (collected from the Twitter Decahose via GNIP) between October 23 and November 30th, 2012. This came to 14.3 billion words posted by 35% of all active users at the time. Note that <span style="line-height:20px;">2.9% of the world&#8217;s population are active Twitter users and that <strong>87% of all tweets ever posted</strong> since the launch of Twitter in 2006 were posted in the </span>past 24 months alone. On average, Kalev and company found that the lowest number of tweets posted per hour is one million; the highest is 2 million. In addition, almost 50% of all tweets are posted by 5% of users. (Click on images to enlarge).</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-55-22-pm.png"><img class="aligncenter size-large wp-image-11913" alt="Tweets" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-55-22-pm.png?w=500&#038;h=227" width="500" height="227" /></a></p>
<p style="text-align:justify;">In terms of geography, there are two ways to easily capture geographic data from Twitter. The first is from the location information specified by a user when registering for a Twitter account (selected from a drop down menu of place names). The second, which is automatically generated, is from the coordinates of the Twitter user&#8217;s location when tweeting, which is typically provided via GPS or cellular triangulation. On a typical day, about <strong>2.7% of Tweets contain GPS</strong> or cellular data while 2.02% of users list a place name when registering (1.4% have both). The figure above displays all GPS/cellular coordinates captured from tweets during the 39 days of study. In contrast, the figure below combines all Twitter locations, adding registered place names and GPS/cellular data (both in red), and overlays this with the <strong>location of electric lights</strong> (blue) based on satellite imagery obtained from NASA.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-59-32-pm.png"><img class="aligncenter size-large wp-image-11914" alt="Tweets / Electricity" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-59-32-pm.png?w=500&#038;h=261" width="500" height="261" /></a></p>
<p style="text-align:justify;">White areas depict locations with an equal balance of tweets and electricity. Red areas reveal a higher density of tweets than night lights while blue areas have more night lights than tweets.&#8221; <strong>Iran and China</strong> show substantially fewer tweets than their electricity levels would suggest, reflecting their bans on Twitter, while India shows strong clustering of Twitter usage along the coast and its northern border, even as electricity use is far more balanced throughout the country. Russia shows more electricity usage in its eastern half than Twitter usage, while most countries show far more Twitter usage than electricity would suggest.&#8221;</p>
<p style="text-align:justify;">The Pearson <strong>correlation between tweets and lights</strong> is 0.79, indicating very high similarity. That is, wherever in the world electricity exists, the chances of there also being Twitter users is very high indeed. That is, tweets are evenly distributed geographically according to the availability of electricity. And so, event though &#8220;less than<span style="color:#444444;line-height:1.7;"> three percent of all tweets having geolocation information, this suggests they could be used as a dynamic reference baseline to evaluate the accuracy of other methods of geographic recovery.&#8221; Keep in mind that the light bulb was invented 134 years ago in contrast to Twitter&#8217;s short 7-year history. And yet, the correlation is already very strong. This is why they call it an information revolution. </span><span style="color:#444444;line-height:1.7;">Still, <strong>just 1% of all Twitter users</strong> accounted for 66% of all georeferenced tweets during the period of study, which means that relying purely on these tweets may provide</span> a skewed view of the Twitterverse, particularly over short periods of time. But whether this poses a problem ultimately depends on the research question or task at hand.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-26-46-pm.png"><img class="aligncenter size-full wp-image-11915" alt="Twitter table" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-26-46-pm.png?w=500"   /></a></p>
<p style="text-align:justify;">The <strong>linguistic geography of Twitter</strong> is critical: &#8220;If English is rarely used outside of the United States, or if English tweets have a fundamentally different geographic profile than other languages outside of the United States, this will significantly skew geocoding results.&#8221; As the table below reveals, georeferenced tweets with English content constitute <strong>41.57% of all geo-tagged tweets</strong>.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-28-47-pm.png"><img class="aligncenter size-full wp-image-11916" alt="Geo Tweets Language" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-28-47-pm.png?w=500"   /></a></p>
<p style="text-align:justify;">The data from the above table is displayed geographically below for the European region. See the <a href="http://www.sgi.com/go/twitter/images/hires/figure7.png">global map here</a>. &#8220;In cases where <strong>multiple languages</strong> are present at the same coordinate, the point is assigned to the most prevalent language at that point and colored accordingly.&#8221; Statistical analyses of geo-tagged English tweets compared to all other languages suggests that &#8220;<span style="color:#444444;line-height:1.7;">English offers a <strong>spatial proxy for all languages</strong> and that a geocoding algorithm which processes only English will still have strong penetration into areas dominated by other languages (though English tweets may discuss different topics or perspectives).&#8221;</span></p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-36-36-pm.png"><img class="aligncenter size-large wp-image-11918" alt="Twitter Languages Europe" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-36-36-pm.png?w=500&#038;h=328" width="500" height="328" /></a></p>
<p style="text-align:justify;">Another important source of geographic information is a Twitter user&#8217;s bio. This public location information was available for 71% of all tweets studied by Kalev and company. Interestingly, &#8220;Approximately <strong>78.4 percent of tweets</strong> include the user’s time zone in textual format, which offers an approximation of longitude [...].&#8221; As Kalev <em>et al. </em>note, &#8220;Nearly one third of all locations on earth share their name with another location somewhere else on the planet, meaning that a reference to &#8216;Urbana&#8217; must be <strong>disambiguated by a geocoding system</strong> to determine which of the 12 cities in the world it might refer to, including 11 cities in the United States with that name.&#8221;</p>
<p style="text-align:justify;">There are several ways to get around this challenging, ranging from developing a Full Text Geocoder to using gazetteers such a Wikipedia Gazetteer and MaxFind which machine translation. Applying the latter has revealed that the &#8220;textual geographic density of Twitter changes by more than 53 percent over the course of each day. This has <strong>enormous ramifications</strong> for the use of Twitter as a global monitoring system, as it suggests that the representativeness of geographic tweets changes considerably depending on time of day.&#8221; That said, the success of a monitoring system is solely dependent on spatial data. Temporal factors and deviations from a baseline also enable early detection.  In any event, &#8220;The small volume of georeferenced tweets can be <strong>dramatically enhanced</strong> by applying geocoding algorithms to the textual content and metadata of each tweet.&#8221;</p>
<p style="text-align:justify;">Kalet <em>et al</em>. also carried out a comprehensive analysis of <strong>geo-tagged retweets</strong>. They find that &#8220;geography plays little role in the location of influential users, with the volume of retweets instead simply being a factor of the total population of tweets originating from that city.&#8221; They also calculated that the average geographical distance between two Twitter users &#8220;connected&#8221; by retweets (RTs) and who geotag their tweets is about <strong>750 miles or 1,200 kilometers</strong>. When a Twitter user references another (@), the average geographical distance between the two is 744 miles. This means that RTs and @&#8217;s cannot be used for geo-referencing Twitter data, even when coupling this information with time zone data. The figure below depicts the location of users retweeting other users. The geodata for this comes from the geotagged tweets (rather than account information or profile data).</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-3-34-06-pm.png"><img class="aligncenter size-large wp-image-11919" alt="Map of Retweets" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-3-34-06-pm.png?w=500&#038;h=287" width="500" height="287" /></a></p>
<p style="text-align:justify;">On average, about <strong>15.85% of geo-tagged tweets contain links</strong>. The most popular links for these include Foursquare, Instagram, Twitter and Facebook. See my <a href="http://irevolution.net/2013/06/01/multimedia-tornado-analysis/">previous blog post</a> on the analysis &amp; value of such content for disaster response. In terms of Twitter geography versus that of mainstream news, Kalev <em>et al</em>. analyzed all news items available <strong>via Google News</strong> during the same period as the tweets they collected. This came to over 3.3 million articles pointing to just under 165,000 locations. The latter are color-coded red in the data ziv below, while Tweets are blue and white areas denote equal balance of both.</p>
<p style="text-align:justify;"><a href="http://www.sgi.com/go/twitter/images/hires/figure17.png"><img class="aligncenter size-large wp-image-11986" alt="Twitter vs News" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-12-15-17-am.png?w=500&#038;h=275" width="500" height="275" /></a></p>
<p style="text-align:justify;">&#8220;Mainstream media appears to have significantly less coverage of Latin America and vastly better greater of Africa. It also covers <strong>China and Iran</strong> much more strongly, given their bans on Twitter, as well as having enhanced coverage of India and the Western half of the United States. Overall, mainstream media appears to have <strong>more even coverage, with less clustering</strong> around major cities.&#8221; This suggests &#8221;there is a strong difference in the geographic profiles of Twitter and mainstream media and that the intensity of discourse mentioning a country does not necessarily match the intensity of discourse emanating from that country in social media. It also suggests that Twitter is not simply a mirror of mainstream media, but rather has a distinct geographic profile [...].&#8221;</p>
<p style="text-align:justify;">In terms of <strong>future growth</strong>, &#8220;the Middle East and Eastern Europe account for some of Twitter’s largest new growth areas, while Indonesia, Western Europe, Africa, and Central America have high proportions of the world’s most influential Twitter users.&#8221;</p>
<p><b id="docs-internal-guid-1949a2bc-cc43-e561-e691-1fd26d813043"><a href="http://irevolution.net/bio/"><img class="aligncenter" alt="Bio" src="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#038;h=71&#038;h=71" width="104" height="71" /></a></b></p>
<p style="text-align:justify;"><em>See also:</em></p>
<ul>
<li><strong>Social Media &#8211; Pulse of the Planet?</strong> [<a href="http://irevolution.net/2013/02/02/pulse-of-the-planet/">Link</a>]</li>
<li><strong>Big Data for Disaster Response &#8211; A list of Wrong Assumptions</strong> [<a href="http://irevolution.net/2013/06/10/wrong-assumptions-big-data/">Link</a>]</li>
<li><strong>A Multi-Indicator Approach for Geolocalization of Tweets </strong>[<a href="http://crowdresearch.org/blog/?p=6792&amp;utm_source=feedburner&amp;utm_medium=email&amp;utm_campaign=Feed%3A+FollowTheCrowd+%28Follow+the+Crowd%29">Link</a>]</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11911/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11911/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11911&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/09/mapping-global-twitter-heartbeat/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-55-22-pm.png?w=500" medium="image">
			<media:title type="html">Tweets</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-1-59-32-pm.png?w=500" medium="image">
			<media:title type="html">Tweets / Electricity</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-26-46-pm.png" medium="image">
			<media:title type="html">Twitter table</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-28-47-pm.png" medium="image">
			<media:title type="html">Geo Tweets Language</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-2-36-36-pm.png?w=500" medium="image">
			<media:title type="html">Twitter Languages Europe</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-03-at-3-34-06-pm.png?w=500" medium="image">
			<media:title type="html">Map of Retweets</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-09-at-12-15-17-am.png?w=500" medium="image">
			<media:title type="html">Twitter vs News</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#38;h=71&#38;h=71" medium="image">
			<media:title type="html">Bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Could CrowdOptic Be Used For Disaster Response?</title>
		<link>http://irevolution.net/2013/06/05/crowdoptic-for-disaster-response/</link>
		<comments>http://irevolution.net/2013/06/05/crowdoptic-for-disaster-response/#comments</comments>
		<pubDate>Wed, 05 Jun 2013 04:28:49 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Humanitarian Tech]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[CrowdOptic]]></category>
		<category><![CDATA[Disaster]]></category>
		<category><![CDATA[Photosynth]]></category>
		<category><![CDATA[Response]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11944</guid>
		<description><![CDATA[Crowds—rather than sole individuals—are increasingly bearing witness to disasters large and small. Instagram users, for example, snapped 800,000 #Sandy pictures during the hurricane last year. One way to make sense of this vast volume and velocity of multimedia content—Big Data—during &#8230; <a href="http://irevolution.net/2013/06/05/crowdoptic-for-disaster-response/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11944&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">Crowds—rather than sole individuals—are increasingly bearing witness to disasters large and small. Instagram users, for example, snapped 800,000 #Sandy pictures during the hurricane last year. One way to make sense of this vast volume and velocity of multimedia content—Big Data—during disasters is with PhotoSynth, as <a href="http://irevolution.net/2013/06/01/multimedia-tornado-analysis/">blogged here</a>. Another perhaps more sophisticated approach would be to use <a href="http://www.crowdoptic.com/">CrowdOptic</a>, which automatically zeros in on the specific location that eyewitnesses are looking at when using their smartphones to take pictures or recording videos.</p>
<p><a href="http://www.digitaltrends.com/wp-content/uploads/2012/10/Instagram-Hurricane-Sandy-blog.png"><img class="aligncenter size-large wp-image-11945" alt="Instagram-Hurricane-Sandy" src="http://irevolution.files.wordpress.com/2013/06/instagram-hurricane-sandy.png?w=500&#038;h=260" width="500" height="260" /></a></p>
<p style="text-align:justify;">How does it work? CrowdOptic simply triangulates line-of-sight intersections using sensory metadata from pictures and videos taken using a smartphone. The basic approach is depicted in the figure below. The areas of intersection is called a focal cluster. CrowdOptic automatically identifies the location of these clusters.</p>
<p><a href="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-04-at-8-36-48-pm.png"><img class="aligncenter size-large wp-image-11946" alt="Cluster" src="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-04-at-8-36-48-pm.png?w=500&#038;h=218" width="500" height="218" /></a></p>
<p style="text-align:justify;">&#8220;Once a crowd’s point of focus is determined, any content generated by that point of focus is automatically authenticated, and a relative significance is assigned based on CrowdOptic’s focal data attributes [...].&#8221; These include: (1) Number of Viewers; (2) Location of Focus; (3) Distance to Epicenter; (4) Cluster Timestamp, Duration; and (5) Cluster Creation, Dissipation Speed.&#8221; CrowdOptic can also be used on live streams and archival images &amp; videos. Once a cluster is identified, the best images/videos pointing to this cluster are automatically selected.</p>
<p style="text-align:justify;">Clearly, all this could have important applications for disaster response and <a href="http://irevolution.net/category/information-forensics/">information forensics</a>. My colleagues and I <a href="http://irevolution.net/2013/06/01/multimedia-tornado-analysis/">recently collected</a> over 12,000 Instagram pictures and more than 5,000 YouTube videos posted to Twitter during the first 48 hours of the Tornado in Oklahoma. These could be uploaded to CrowdOptic for cluster identification. Any focal cluster with several viewers would almost certainly be authentic, particularly if the time-stamps are similar. These clusters could then be tagged by digital humanitarian volunteers based on whether they depict evidence of disaster damage. Indeed, we could have tested out CrowdOptic during in the disaster response efforts <a href="http://irevolution.net/2012/12/08/digital-response-typhoon-pablo/">we carried out for the United Nations</a> following the devastating Philippines Typhoon. Perhaps CrowdOptic could facilitate rapid damage assessments in the future. Of course, the value of CrowdOptic ultimately depends on the volume of geotagged images and videos shared on social media and the Web.</p>
<p><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='500' height='312' src='http://www.youtube.com/embed/2gt4lgq4ZW8?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<p style="text-align:justify;">I once <a href="http://irevolution.net/2010/04/08/wag-the-dog/">wrote a blog post</a> entitled, &#8220;Wag the Dog, or How Falsifying Crowdsourced Data Can Be a Pain.&#8221; While an image or video could certainly be falsified, trying to fake several focal clusters of multimedia content with dozens of viewers each would probably require the equivalent organization capacity of a small movie-production or commercial. So I&#8217;m in touch with the CrowdOptic team to explore the possibility of carrying out a proof of concept based on the multimedia data we&#8217;ve collected following the Oklahoma Tornados. Stay tuned!</p>
<p><a href="http://irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#038;h=70&#038;h=70" width="108" height="70" /></a></p>
<p style="text-align:justify;">
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11944/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11944/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11944&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/05/crowdoptic-for-disaster-response/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/instagram-hurricane-sandy.png?w=500" medium="image">
			<media:title type="html">Instagram-Hurricane-Sandy</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/screen-shot-2013-06-04-at-8-36-48-pm.png?w=500" medium="image">
			<media:title type="html">Cluster</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#38;h=70&#38;h=70" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Data Mining Wikipedia in Real Time for Disaster Response</title>
		<link>http://irevolution.net/2013/06/04/wikipedia-for-disaster-response/</link>
		<comments>http://irevolution.net/2013/06/04/wikipedia-for-disaster-response/#comments</comments>
		<pubDate>Tue, 04 Jun 2013 04:50:45 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Humanitarian Tech]]></category>
		<category><![CDATA[Disaster]]></category>
		<category><![CDATA[Response]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11828</guid>
		<description><![CDATA[My colleague Fernando Diaz has continued working on an interesting Wikipedia project since he first discussed the idea with me last year. Since Wikipedia is increasingly used to crowdsource live reports on breaking news such as sudden-onset humanitarian crisis and &#8230; <a href="http://irevolution.net/2013/06/04/wikipedia-for-disaster-response/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11828&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">My colleague <a href="http://www.linkedin.com/pub/fernando-diaz/3/902/b6b">Fernando Diaz</a> has continued working on an interesting Wikipedia project since he first discussed the idea with me last year. Since Wikipedia is increasingly used to crowdsource live reports on breaking news such as sudden-onset humanitarian crisis and disasters, why not mine these pages for structured information relevant to humanitarian response professionals?</p>
<p style="text-align:justify;"><a href="http://www.funny-potato.com/blog/wp-content/uploads/2008/06/wikipedia-logo.jpg"><img class="aligncenter size-large wp-image-11831" alt="wikipedia-logo" src="http://irevolution.files.wordpress.com/2013/05/wikipedia-logo.jpg?w=500&#038;h=200" width="500" height="200" /></a></p>
<p style="text-align:justify;">In computing-speak, <em>Sequential Update Summarization</em> is a task that generates useful, new and timely sentence-length updates about a developing event such as a disaster. In contrast, <em>Value Tracking</em> tracks the value of important event-related attributes such as fatalities and financial impact. Fernando and his colleagues will be using both approaches to mine and analyze Wikipedia pages in real time. Other attributes worth tracking include injuries, number of displaced individuals, infrastructure damage and perhaps disease outbreaks. Pictures of the disaster uploaded to a given Wikipedia page may also be of interest to humanitarians, along with meta-data such as the number of edits made to a page per minute or hour and the number of unique editors.</p>
<p style="text-align:justify;">Fernando and his colleagues have recently launched <a href="http://www.trec-ts.org/">this tech challenge</a> to apply these two advanced computing techniques to disaster response based on crowdsourced Wikipedia articles. The challenge is part of the <em>Text Retrieval Conference</em> (<a href="http://trec.nist.gov/">TREC</a>), which is being held in Maryland this November. As part of this applied research and prototyping challenge, Fernando <em>et al. </em>plan to use the resulting summarization and value tracking from Wikipedia to verify related  crisis information shared on social media. Needless to say, I&#8217;m really excited about the potential. <span style="color:#444444;line-height:1.7;">So Fernando and I are exploring ways to ensure that the results of this challenge are appropriately transferred to the humanitarian community. Stay tuned for updates. </span></p>
<p><b><b><a href="http://www.irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#038;h=59&#038;h=59" width="108" height="59" /></a></b></b></p>
<p>&nbsp;</p>
<p style="text-align:justify;"><strong>See also: </strong>Web App Tracks Breaking News Using Wikipedia Edits [<a href="http://irevolution.net/2013/04/23/breaking-news-using-wikipedia-edits/">Link</a>]</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11828/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11828&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/04/wikipedia-for-disaster-response/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/wikipedia-logo.jpg?w=500" medium="image">
			<media:title type="html">wikipedia-logo</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#38;h=59&#38;h=59" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Could Lonely Planet Render World Bank Projects More Transparent?</title>
		<link>http://irevolution.net/2013/06/03/lonely-planet-world-bank/</link>
		<comments>http://irevolution.net/2013/06/03/lonely-planet-world-bank/#comments</comments>
		<pubDate>Mon, 03 Jun 2013 04:35:03 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Accountability]]></category>
		<category><![CDATA[development]]></category>
		<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Guides]]></category>
		<category><![CDATA[Impact]]></category>
		<category><![CDATA[Lonely]]></category>
		<category><![CDATA[Planet]]></category>
		<category><![CDATA[Tourism]]></category>
		<category><![CDATA[Transparency]]></category>
		<category><![CDATA[WorldBank]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11812</guid>
		<description><![CDATA[That was the unexpected question that my World Bank colleague Johannes Kiess asked me the other day. I was immediately intrigued. So I did some preliminary research and offered to write up a blog post on the idea to solicit &#8230; <a href="http://irevolution.net/2013/06/03/lonely-planet-world-bank/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11812&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p dir="ltr" style="text-align:justify;">That was the unexpected question that my World Bank colleague <a href="http://www.linkedin.com/pub/johannes-kiess/6/438/940">Johannes Kiess</a> asked me the other day. I was immediately intrigued. So I did some preliminary research and offered to write up a blog post on the idea to solicit some early feedback. According to <a href="https://en.wikipedia.org/wiki/Tourism#World_tourism_statistics_and_rankings">recent statistics</a>, international tourist arrivals numbered over 1 billion in 2012 alone. Of this population, the demographic that Johannes is interested in comprises those intrepid and socially-conscious backpackers who travel beyond the capitals of developing countries. Perhaps the time is ripe for a new form of tourism: <strong>Tourism for Social Good</strong>.</p>
<p dir="ltr" style="text-align:center;"><a href="http://www.webseoanalytics.com/blog/wp-content/uploads/2012/01/social_media_tourism_1.jpg"><img class="aligncenter  wp-image-11904" alt="tourism_socialmedia" src="http://irevolution.files.wordpress.com/2013/06/tourism_socialmedia.jpg?w=400&#038;h=386" width="400" height="386" /></a></p>
<p dir="ltr" style="text-align:justify;">There may be a real opportunity to engage a large crowd because travelers—and in particular the backpacker type—are smartphone savvy, have time on their hands, want to do something meaningful, are eager to get off the beaten track and explore new spaces where others do not typically trek. Johannes believes this approach could be used to <strong>map critical social infrastructure</strong> and/or to monitor development projects. Consider a simple smartphone app, perhaps integrated with existing travel guide apps or Tripadvisor. The app would ask travelers to record the quality of the roads they take (with the GPS of their smartphone) and provide feedback on the condition, e.g.,  bumpy, even, etc., every 50 miles or so.</p>
<p dir="ltr" style="text-align:justify;">They could be asked to find the nearest hospital and take a geotagged picture—a <strong>scavenger hunt for development</strong> (as Johannes calls it); <a href="https://en.wikipedia.org/wiki/Geocaching">Geocaching</a> for Good? Note that governments often do not know exactly where schools, hospitals and roads are located. The app could automatically alert travelers of a nearby development project or road financed by the World Bank or other international donor. Travelers could be prompted to take (automatically geo-tagged) pictures that would then be forwarded to development organizations for subsequent visual analysis (which could easily be carried out using microtasking). Perhaps a very simple, 30-second, multiple-choice survey could even be presented to travelers who pass by certain donor-funded development projects. For quality control purposes, these pictures and surveys could easily be triangulated. Simple gamification features could also be added to the app; travelers could <strong>gain points for social good tourism</strong>—collect 100 points and get your next Lonely Planet guide for free? Perhaps if you&#8217;re the first person to record a road within the app, then it could be named after you (of course with a notation of the official name). Even <a href="https://en.wikipedia.org/wiki/Photosynth">Photosynth</a> could be used to create panoramas of visual evidence.</p>
<p dir="ltr" style="text-align:justify;">The obvious advantage of using travelers against the now <em>en vogue</em> stakeholder monitoring approach is that they said bagpackers are <em>already</em> traveling there anyway and have their phones on them to begin with. Plus, they&#8217;d be independent third parties and would not need to be trained. This obviously doesn&#8217;t mean that the stakeholder approach is not useful. The travelers strategy would simply be complementary. Furthermore, this tourism strategy comes with several key challenges, such as the safety of backpackers who choose to take on this task, for example. But <strong>appropriate legal disclaimers</strong> could be put in place, so this challenge seems surmountable. In any event, Johannes, together with his colleagues at the World Bank (and I), hope to explore this idea of Tourism for Social Good further in the coming months.</p>
<p style="text-align:justify;">In the meantime, we would be very <strong>grateful for feedback</strong>. What might we be overlooking? Would you use such an app if it were available? Where can we find reliable statistics on top backpacker destinations and flows?</p>
<p><b id="docs-internal-guid-1949a2bc-cc43-e561-e691-1fd26d813043"><a href="http://irevolution.net/bio/"><img class="aligncenter" alt="Bio" src="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#038;h=71&#038;h=71" width="104" height="71" /></a></b></p>
<p style="text-align:justify;"><strong>See also: </strong></p>
<ul>
<li>What United Airlines can Teach the World Bank about Mobile Accountability [<a href="http://irevolution.net/2012/06/10/united-airlines-world-bank/">Link</a>]</li>
</ul>
<h2></h2>
<p style="text-align:justify;"><strong></strong></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11812/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11812/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11812&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/03/lonely-planet-world-bank/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/06/tourism_socialmedia.jpg?w=500" medium="image">
			<media:title type="html">tourism_socialmedia</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/screen-shot-2013-02-16-at-7-55-33-am.png?w=104&#38;h=71&#38;h=71" medium="image">
			<media:title type="html">Bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Analysis of Multimedia Shared in Millions of Tweets After Tornado (Updated)</title>
		<link>http://irevolution.net/2013/06/01/multimedia-tornado-analysis/</link>
		<comments>http://irevolution.net/2013/06/01/multimedia-tornado-analysis/#comments</comments>
		<pubDate>Sat, 01 Jun 2013 15:20:29 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Social Computing]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Instagram]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Oklahoma]]></category>
		<category><![CDATA[Tornado]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Youtube]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11864</guid>
		<description><![CDATA[Humanitarian organizations and emergency management offices are increasingly interested in capturing multimedia content shared on social media during crises. Last year, the UN Office for the Coordination of Humanitarian Affairs (OCHA) activated the Digital Humanitarian Network (DHN) to identify and geotag &#8230; <a href="http://irevolution.net/2013/06/01/multimedia-tornado-analysis/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11864&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">Humanitarian organizations and emergency management offices are increasingly interested in capturing multimedia content shared on social media during crises. Last year, the UN Office for the Coordination of Humanitarian Affairs (OCHA) <a href="http://irevolution.net/2012/12/08/digital-response-typhoon-pablo/">activated the Digital Humanitarian Network</a> (DHN) to identify and geotag pictures and videos shared on Twitter that captured the damage caused by Typhoon Pablo, for example. So I&#8217;m collaborating with my colleague <a href="http://knoesis.org/researchers/hemant">Hemant Purohit</a> to analyze the multimedia content shared in the millions of tweets posted after the <a href="https://en.wikipedia.org/wiki/May_18%E2%80%9321,_2013_tornado_outbreak">Category 5 Tornado</a> devastated the city of Moore, Oklahoma on May 20th. The results are shared below along with details of a project I am spearheading at QCRI to provide disaster responders with relevant multimedia content in real time during future disasters.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/multimedia_tornado.png"><img class="aligncenter size-large wp-image-11868" alt="Multimedia_Tornado" src="http://irevolution.files.wordpress.com/2013/05/multimedia_tornado.png?w=500&#038;h=323" width="500" height="323" /></a></p>
<p style="text-align:justify;">For this preliminary multimedia analysis, we focused on the first 48 hours after the Tornado and specifically on the following multimedia sources/types: Twitpic, Instagram, Flickr, JPGs, YouTube and Vimeo. JPGs refers to URLs shared on Twitter that include &#8220;.jpg&#8221;. Only ~1% of tweets posted during the 2-day period included URLs to multimedia content. We filtered out duplicate URLs to produce the following <em>unique</em> counts depicted above and listed below.</p>
<ul>
<li><strong><span style="line-height:20px;">Twitpic </span></strong><span style="line-height:20px;">=</span><strong><span style="line-height:20px;"> 784</span></strong></li>
<li><strong></strong><strong>Instagram </strong>=<strong> <strong>11,822</strong></strong></li>
<li><strong>Flickr </strong>=<strong> 33</strong></li>
<li><strong>JPGs </strong>=<strong> 347 </strong></li>
<li><strong style="line-height:1.7;">YouTube</strong><span style="color:#444444;line-height:1.7;"> = </span><strong style="line-height:1.7;">5,474</strong></li>
<li><strong style="line-height:1.7;">Vimeo</strong><span style="color:#444444;line-height:1.7;"> = </span><strong style="line-height:1.7;">88</strong></li>
</ul>
<p style="text-align:justify;">Clearly, Instagram and Youtube are important sources of multimedia content during disasters. The graphs below (click to enlarge) depict the frequency of individual multimedia types by hour during the first 48 hours after the Tornado. Note that we were only able to collect about 2 million tweets during this period using the Twitter Streaming API but expect that millions more were posted, which is why access to the <a href="http://irevolution.net/2013/05/30/twitter-api-vs-firehose/">Twitter Firehose is important</a> and why I&#8217;m a strong advocate of <a href="http://irevolution.net/2012/06/04/big-data-philanthropy-for-humanitarian-response/">Big Data Philanthropy</a> for Humanitarian Response.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/twitpic_tornado.png"><img class="aligncenter size-large wp-image-11869" alt="Twitpic_Tornado" src="http://irevolution.files.wordpress.com/2013/05/twitpic_tornado.png?w=500&#038;h=325" width="500" height="325" /></a></p>
<p style="text-align:justify;">A comparison of the above Twitpic graph with the Instagram one below suggests very little to no time lag between the two unique streams.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/instagram_tornado.png"><img class="aligncenter size-large wp-image-11871" alt="Instagram_Tornado" src="http://irevolution.files.wordpress.com/2013/05/instagram_tornado.png?w=500&#038;h=325" width="500" height="325" /></a></p>
<p style="text-align:justify;">Clearly Flickr pictures are not widely shared on Twitter during disasters. Only 53 links to Flickr were tweeted compared to 11,822 <em>unique</em> Instagram pictures.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/flickr_tornado.png"><img alt="Flickr_Tornado" src="http://irevolution.files.wordpress.com/2013/05/flickr_tornado.png?w=500&#038;h=323" width="500" height="323" /></a></p>
<p style="text-align:justify;">The sharing of JPG images is more popular than links to Flickr but the total number of uniques still pales in comparison to the number of Instagram pictures.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/jpgs_tornado.png"><img class="aligncenter size-large wp-image-11872" alt="JPGs_Tornado" src="http://irevolution.files.wordpress.com/2013/05/jpgs_tornado.png?w=500&#038;h=320" width="500" height="320" /></a></p>
<p style="text-align:justify;">The frequency of tweets sharing unique links to Youtube videos does not vary considerably over time.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/youtube_tornado.png"><img class="aligncenter size-large wp-image-11873" alt="Youtube_Tornado" src="http://irevolution.files.wordpress.com/2013/05/youtube_tornado.png?w=500&#038;h=322" width="500" height="322" /></a></p>
<p style="text-align:justify;">In contrast to the large volume of Youtube links shared on twitter, only 88 unique links to Vimeo were shared.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/vimeo_tornado.png"><img class="aligncenter size-large wp-image-11874" alt="Vimeo_Tornado" src="http://irevolution.files.wordpress.com/2013/05/vimeo_tornado.png?w=500&#038;h=325" width="500" height="325" /></a></p>
<p style="text-align:justify;">Geographic information is of course imperative for disaster response. We collected about 2.7 million tweets during the 10-day period after Tornado and found that <strong>51.23% had geographic data</strong>—either the tweet was geo-tagged or the Twitter user&#8217;s bio included a location. During the first 48 hours, about 45% of Tweets with links to Twitpic had geographic data; 40% for Flickr and 38% for Instagram . Most digital pictures include embedded geographic information (i.e., the GPS coordinates of the phone or camera, for example). So we&#8217;re working on automatically  extracting this information as well.</p>
<p style="text-align:justify;">An important question that arises is which Instagram pictures &amp; Youtube videos actually captured <strong>evidence of the damage</strong> caused of the Tornado? Of these, which are already geotagged and which could be quickly geotagged manually? The Digital Humanitarian Network was able to answer these questions within 12 hours following the devastating Typhoon that ravaged the Philippines last year (see map below). The reason it took that long is because we spent most of the time customizing the <a href="http://irevolution.net/2012/12/05/help-microtask-pablo/">microtasking apps</a> to tag the tweets/links. Moreover, we were looking at every single link shared on twitter, i.e., not just those that linked directly to Instagram, Youtube, etc. We need to do better, and we can.</p>
<p style="text-align:justify;"><img class="alignnone" alt="" src="http://irevolution.files.wordpress.com/2012/12/typhon-pablo_social_media_mapping-ocha_a4_portrait_6dec2012.jpg?w=500&#038;h=400" width="500" height="400" /></p>
<p style="text-align:justify;">This is why we&#8217;re launching <a href="http://irevolution.net/2013/04/13/micromappers-for-digital-disaster-response/">MicroMappers</a> in partnership with the United Nations. MicroMappers are very user-friendly microtasking apps that allows anyone to support humanitarian response efforts with a simple click of the mouse. This means <strong>anyone can be a Digital Humanitarian Volunteer</strong>. In the case of the Tornado, volunteers could easily have tagged the Instagram pictures posted on Twitter. During Hurricane Sandy, about half-a-million Instagram pictures were shared. This is certainly a large number but other microtasking communities like my friends at Zooniverse <a href="http://irevolution.net/2013/03/26/zooniverse-big-crisis-data/">tagged millions of pictures</a> in a matter of days. So it is possible.</p>
<p style="text-align:justify;">Incidentally, hundreds of the geo-tagged Instagram pictures posted during the Hurricane captured the <em>same</em> damaged infrastructure across New York, like the same fallen crane, blocked road or a flooded neighborhood. These pictures, taken by <strong>multiple eyewitnesses from different angles</strong> can easily be &#8220;stitched&#8221; together to create a 2D or even 3D tableau of the damage. <a href="https://photosynth.net/">Photosynth</a> (below) already does this stitching automatically for free. Think of Photosynth as Google Street View but using crowdsourced pictures instead. One simply needs to a collection of related pictures, which is what MicroMappers will provide.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-31-at-3-22-46-pm.png"><img class="aligncenter size-large wp-image-11879" alt="Photosynth" src="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-31-at-3-22-46-pm.png?w=500&#038;h=373" width="500" height="373" /></a></p>
<p style="text-align:justify;">Disasters don&#8217;t wait. Another <a href="http://www.cnn.com/2013/06/01/us/midwest-weather/index.html?hpt=hp_t1">major Tornado</a> caused havoc in Oklahoma just yesterday. So we are developing MicroMappers as we speak and plan to test the apps soon. Stay tuned for future blog post updates!</p>
<p><a href="http://irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#038;h=70&#038;h=70" width="108" height="70" /></a></p>
<p><strong>See also:</strong> Analyzing 2 Million Disaster Tweets from Oklahoma Tornado [<a href="http://irevolution.net/2013/05/29/analyzing-tweets-tornado/">Link</a>]</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11864/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11864/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11864&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/06/01/multimedia-tornado-analysis/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/multimedia_tornado.png?w=500" medium="image">
			<media:title type="html">Multimedia_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/twitpic_tornado.png?w=500" medium="image">
			<media:title type="html">Twitpic_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/instagram_tornado.png?w=500" medium="image">
			<media:title type="html">Instagram_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/flickr_tornado.png?w=500" medium="image">
			<media:title type="html">Flickr_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/jpgs_tornado.png?w=500" medium="image">
			<media:title type="html">JPGs_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/youtube_tornado.png?w=500" medium="image">
			<media:title type="html">Youtube_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/vimeo_tornado.png?w=500" medium="image">
			<media:title type="html">Vimeo_Tornado</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2012/12/typhon-pablo_social_media_mapping-ocha_a4_portrait_6dec2012.jpg" medium="image" />

		<media:content url="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-31-at-3-22-46-pm.png?w=500" medium="image">
			<media:title type="html">Photosynth</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat4.png?w=108&#38;h=70&#38;h=70" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
		<item>
		<title>Crowdsourcing Crisis Information from Syria: Twitter Firehose vs API</title>
		<link>http://irevolution.net/2013/05/30/twitter-api-vs-firehose/</link>
		<comments>http://irevolution.net/2013/05/30/twitter-api-vs-firehose/#comments</comments>
		<pubDate>Thu, 30 May 2013 14:27:36 +0000</pubDate>
		<dc:creator>Patrick Meier</dc:creator>
				<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Social Computing]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[Syria]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[Firehose]]></category>
		<category><![CDATA[Geo]]></category>

		<guid isPermaLink="false">http://irevolution.net/?p=11847</guid>
		<description><![CDATA[Over 400 million tweets are posted every day. But accessing 100% of these tweets (say for disaster response purposes) requires access to Twitter&#8217;s &#8220;Firehose&#8221;. The latter, however, can be prohibitively expensive and also requires serious infrastructure to manage. This explains &#8230; <a href="http://irevolution.net/2013/05/30/twitter-api-vs-firehose/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11847&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:justify;">Over 400 million tweets are posted every day. But accessing 100% of these tweets (say for disaster response purposes) requires access to Twitter&#8217;s &#8220;Firehose&#8221;. The latter, however, can be prohibitively expensive and also requires serious infrastructure to manage. This explains why many (all?) of us in the Crisis Computing &amp; Humanitarian Technology space use Twitter&#8217;s &#8220;Streaming API&#8221; instead. But how representative are tweets sampled through the API vis-a-vis overall activity on Twitter? This is important question is posed and answered in <a href="http://crowdresearch.org/blog/?p=6596&amp;utm_source=feedburner&amp;utm_medium=email&amp;utm_campaign=Feed%3A+FollowTheCrowd+%28Follow+the+Crowd%29">this new study</a> using Syria as a case study.</p>
<p style="text-align:justify;"><a href="http://crowdresearch.org/blog/?p=6596&amp;utm_source=feedburner&amp;utm_medium=email&amp;utm_campaign=Feed%3A+FollowTheCrowd+%28Follow+the+Crowd%29"><img class="aligncenter size-large wp-image-11848" alt="Tweets Syria" src="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-29-20-am.png?w=500&#038;h=178" width="500" height="178" /></a></p>
<p style="text-align:justify;">The analysis focused on &#8220;Tweets collected in the region around Syria during the period from December 14, 2011 to January 10, 2012.&#8221; The first dataset was collected using Firehose access while the second was sampled from the API. The <strong>tag clouds</strong> <strong>above (click to enlarge)</strong> displays the most frequent top terms found in each dataset. The hashtags and geoboxes used for the data collection are listed in the table below.</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-46-15-am.png"><img class="aligncenter size-large wp-image-11850" alt="Syria List" src="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-46-15-am.png?w=500&#038;h=184" width="500" height="184" /></a></p>
<p style="text-align:justify;">The graph below shows the number of tweets collected between December 14th, 2011 and January 10th, 2012. This amounted 528,592 tweets from the API and 1,280,344 tweets from the Firehose. On average, the <strong>API captures 43.5%</strong> of tweets available on the Firehose. &#8220;One of the more interesting results in this dataset is that as the data in the Firehose spikes, the Streaming API coverage is reduced. One possible explanation for this phenomenon could be that due to the Western holidays observed at this time, activity on Twitter may have reduced causing the 1% threshold to go down.&#8221;</p>
<p style="text-align:justify;"><a href="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-42-12-am.png"><img class="aligncenter size-large wp-image-11851" alt="Syria Graph" src="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-42-12-am.png?w=500&#038;h=407" width="500" height="407" /></a></p>
<p style="text-align:justify;">The authors, Fred Morstatter, Jürgen Pfeffer, Huan Liu and Kathleen Carley, also carry out <strong>hashtag analysis</strong> using each dataset. &#8220;Here we see mixed results at small values of <em>n </em>[top hashtags], indicating that the Streaming data may not be good for ﬁnding the top hashtags. At larger values of n, we see that the Streaming API does a better job of estimating the top hashtags in the Firehose data.&#8221; In addition, the analysis reveals that the &#8220;Streaming API data does not consistently ﬁnd the top hashtags, in some cases revealing reverse correlation with the Firehose data [...]. This could be indicative of a ﬁltering process in Twitter’s Streaming API which causes a misrepresentation of top hashtags in the data.&#8221;</p>
<p style="text-align:justify;">In terms of <strong>social network analysis</strong>, the the authors were able to show that &#8220;50% to 60% of the top 100 key-players [can be identified] when creating the networks based on one day of Streaming API data.&#8221; Aggregating more days&#8217; worth of data &#8220;can increase the accuracy substantially. For network level measures, ﬁrst in-depth analysis revealed interesting correlation between network centralization indexes and the proportion of data covered by the Streaming API.&#8221;</p>
<p style="text-align:justify;"><span style="text-align:justify;color:#444444;line-height:1.7;">Finally, study also compares the <strong>geolocation of tweets</strong>. More specifically, the authors assess how the &#8220;geographic distribution </span><span style="text-align:justify;color:#444444;line-height:1.7;">of the geolocated tweets is </span><span style="text-align:justify;color:#444444;line-height:1.7;">affected by the sampling performed by the Streaming API. </span><span style="text-align:justify;color:#444444;line-height:1.7;">The number of geotagged tweets is low, with only 16,739 </span><span style="text-align:justify;color:#444444;line-height:1.7;">geotagged tweets in the Streaming data (3.17%) and 18,579 </span><span style="text-align:justify;color:#444444;line-height:1.7;">in the Firehose data (1.45%).&#8221; Still, the authors find that &#8220;despite the difference in tweets collected on the whole we get 90.10% </span><span style="text-align:justify;color:#444444;line-height:1.7;">coverage of geotagged tweets.&#8221;</span></p>
<p style="text-align:justify;">In sum, the study finds that &#8220;the results of using the Streaming API depend strongly on the coverage and the type of analysis that the researcher wishes to perform. This leads to the next question concerning the estimation of how much data we actually get in a certain time period.&#8221; This is critical if researchers want to place their results into context and potentially apply statistical methods to <strong>account (and correct) for bias</strong>. The authors suggest that in some cases the Streaming API coverage can be estimated. In future research, they hope to &#8220;ﬁnd methods to compensate for the biases in the Streaming API to provide a more accurate picture of Twitter activity to researchers.&#8221; In particularly they want to &#8220;determine whether the methodology presented here will yield similar results for Twitter data collected from other domains, such as natural, protest &amp; elections.&#8221;</p>
<p style="text-align:justify;"><span style="text-align:justify;color:#444444;line-height:1.7;">The authors will present their paper at this year&#8217;s International Conference on Weblogs and Social Media </span><a style="text-align:justify;line-height:1.7;" href="http://www.icwsm.org/2013/index.php">(ICWSM</a><span style="text-align:justify;color:#444444;line-height:1.7;">). So I look forward to meeting them there to discuss related research we are carrying out at <a href="http://qcri.com/our-research/social-innovation/social-innovation-projects">QCRI</a>.</span></p>
<p><b><b><a href="http://www.irevolution.net/bio"><img class="aligncenter" alt="bio" src="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#038;h=59&#038;h=59" width="108" height="59" /></a></b></b></p>
<p><b style="line-height:23px;color:#333333;font-size:14px;background-color:#ffffff;"> See also:</b></p>
<ul>
<li><strong></strong><a href="http://irevolution.net/2012/03/25/crisis-mapping-syria/" rel="bookmark">Crisis Mapping Syria: Automated Data Mining and Crowdsourcing </a></li>
<li><a href="http://irevolution.net/2013/05/29/analyzing-tweets-tornado/">Using Twitter API to Analyze 2 Million+ Disaster Tweets</a></li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/irevolution.wordpress.com/11847/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/irevolution.wordpress.com/11847/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=irevolution.net&#038;blog=3385318&#038;post=11847&#038;subd=irevolution&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://irevolution.net/2013/05/30/twitter-api-vs-firehose/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/dbb4cc8a291903fda7c9fc254172645d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Patrick Philippe Meier</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-29-20-am.png?w=500" medium="image">
			<media:title type="html">Tweets Syria</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-46-15-am.png?w=500" medium="image">
			<media:title type="html">Syria List</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/05/screen-shot-2013-05-30-at-9-42-12-am.png?w=500" medium="image">
			<media:title type="html">Syria Graph</media:title>
		</media:content>

		<media:content url="http://irevolution.files.wordpress.com/2013/02/hat3.png?w=108&#38;h=59&#38;h=59" medium="image">
			<media:title type="html">bio</media:title>
		</media:content>
	</item>
	</channel>
</rss>
