<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Neoformix</title>
<copyright>Copyright (c) 2007 Jeff Clark. All rights reserved.</copyright>
<link>http://www.neoformix.com</link>
<description>Discovering and Illustrating Patterns in Data</description>
<language>en-us</language>
<lastBuildDate>Mon, 25 Aug 2008 16:12:31 GMT</lastBuildDate>

<item>
 <title>Some Recent Attention</title>
 <link>http://www.neoformix.com/2008/SomeRecentAttention.html</link>
 <guid>http://www.neoformix.com/2008/SomeRecentAttention.html</guid>
 <pubDate>Mon, 25 Aug 2008 17:10:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			<a href="http://www.neoformix.com/Projects/TwitterStreamGraphs/view.php">Twitter StreamGraphs</a> has triggered some attention
			for <a href="http://www.neoformix.com">Neoformix</a> over the last month. Some of the major sources were:
		</p>
		<ol>
		<li>A tweet about <a href="http://twitter.com/timoreilly/statuses/879006202">StreamGraphs</a> from <a href="http://radar.oreilly.com/">Tim O'Reilly</a>
		 on August 5th to his 8,680 followers. Tim is the founder and CEO of <a href="http://oreilly.com/">O'Reilly Media</a> - a publisher of excellent technical books.</li>
		 <li>An article written by Scott Gilbertson entitled <a href="http://www.webmonkey.com/blog/StreamGraph_Maps_Twitter_Word_by_Word">StreamGraph Maps Twitter Word by Word</a> for <a href="http://www.webmonkey.com/">Webmonkey</a>.
		 A link to this showed up on the front page of <a href="http://www.wired.com/">Wired</a> and drove some significant traffic.</li>
		 <li>A tweet about <a href="http://twitter.com/ev/statuses/895254199">using StreamGraphs to follow references to the Olympics</a> from <a href="http://twitter.com/ev">Evan Williams</a>. Ev is the Co-founder and CPO of Twitter and currently has 17,535 followers on the service.</li>
		 <li>An article by <a href="http://www.timshowers.com/web-design-blog/">Tim Showers</a> called <a href="http://www.timshowers.com/2008/08/visualization-strategies-text-documents/">Visualization Strategies: Text &amp; Documents</a>
			that highlighted several of my little tools. His article attracted some attention on <a href="http://reddit.com">Reddit</a>. </li>
		</ol>
		<p>
			Thanks very much to these people and the many others who have let people know about <a href="http://www.neoformix.com">Neoformix</a> !
		</p>

 ]]></description>
</item>
<item>
 <title>Another Box Office Graphic</title>
 <link>http://www.neoformix.com/2008/AnotherBoxOfficeGraphic.html</link>
 <guid>http://www.neoformix.com/2008/AnotherBoxOfficeGraphic.html</guid>
 <pubDate>Fri, 01 Aug 2008 15:20:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			<a href="http://www.xach.com/">Zach Beane</a> has created another variation on a graphic to illustrate
			movie box office data. See <a href="http://www.xach.com/moviecharts/">Movie box office charts</a> for the
			original but here are a few interesting bits:
		</p>
		<center>
			<a href="http://www.xach.com/moviecharts/"><img  src="http://www.neoformix.com/2008/boxOffice0.png" border="1"></a><br>&nbsp;<br>
			<a href="http://www.xach.com/moviecharts/"><img  src="http://www.neoformix.com/2008/boxOffice1.png" border="1"></a><br>
		</center>
		<p>
			In Zach's words:
			<blockquote>
				Each page displays trends in the top 25 movies at the box office
				for each weekend in a year. The color is based on the movie's debut
				week. Because of that, long-running movies will gradually start to
				stand out from newer movies with different colors.
			</blockquote>
			There is an interactive version as well.
		</p>
		<p>
			Related posts: <a href="http://www.neoformix.com/2008/MoviesEbbAndFlow.html">Movies Ebb and Flow</a>
		</p>

 ]]></description>
</item>
<item>
 <title>Twitter StreamGraphs</title>
 <link>http://www.neoformix.com/2008/TwitterStreamGraphs.html</link>
 <guid>http://www.neoformix.com/2008/TwitterStreamGraphs.html</guid>
 <pubDate>Tue, 15 Jul 2008 12:15:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			I just posted a new application in my projects section called <a href="http://www.neoformix.com/Projects/TwitterStreamGraphs/view.php">Twitter StreamGraphs</a>.
			It is an interactive tool to let you create StreamGraphs from the latest tweets containing a given word	or from a particular user. A few
			examples are shown below.
		</p>
		<center>
			<img  src="http://www.neoformix.com/2008/tsg_coffee.jpg" border="1" alt="Twitter StreamGraph for coffee"><br>&nbsp;<br>
			<img  src="http://www.neoformix.com/2008/tsg_obama.jpg" border="1" alt="Twitter StreamGraph for Obama"><br>&nbsp;<br>
			<img  src="http://www.neoformix.com/2008/tsg_scoble.jpg" border="1" alt="Twitter StreamGraph for @scobleizer"><br>
		</center>
		<p>
			The application shows a StreamGraph for the latest 200 tweets which contain the search word. The default search word is 'interesting'
			but a new one can be typed into the text box at the top of the application. You can also enter a <a href="http://twitter.com">Twitter</a> ID preceded
			by the '@' symbol to see the latest tweets from that user. A parameter to the URL can be used to specify the initial search word.
			For example, use <a href="http://www.neoformix.com/Projects/TwitterStreamGraphs/view.php?q=coffee">http://www.neoformix.com/Projects/TwitterStreamGraphs/view.php?q=coffee</a>
			to see the latest tweets about coffee. This makes it possible to link to a StreamGraph for your own tweets from your blog or within a twitter
			update. 
		</p>
		<p>
			The StreamGraph shows the usage over time for the words most highly associated with the search word. One of these series
			together with a time period are in a selected state and coloured red. The tweets that contain this word in the given
			time period are shown below the graph. You can click on another word series or time period to see different matches.
			In the match list you click on any word to create a different graph with tweets containing that word. You can also
			click on the user or comment icons and any URL to see the appropriate content in another window. If you see a large
			spike in one time period that hides the detail in all the other periods it will be useful to click in the area
			to the left of the y-axis in order to change the vertical scale.
		</p>
		<p>
			Credits go to <a href="http://www.leebyron.com/what/lastfm/">Lee Byron</a>
			for the visual ideas behind the StreamGraph (although I'm using a simpler symmetrical form), to <a href="http://processing.org">Processing</a> for the development tools,
			to Martin Porter for the <a href="http://tartarus.org/~martin/PorterStemmer/">Porter Stemming Algorithm</a>, 
			to <a href="http://www.tenbytwenty.com/products/icon-sets/vaga">Vaga</a> for the two small icons, and
			to <a href="http://summize.com/">Summize</a> for building a great API into the <a href="http://twitter.com">Twitter</a> data.
		</p>
		<p>
			Related posts are <a href="http://www.neoformix.com/Projects/TwitArcs/TwitArcs.html">TwitArcs</a> and
			<a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html">Twitter Spectrum</a>.
		</p>

 ]]></description>
</item>
<item>
 <title>Canada Day</title>
 <link>http://www.neoformix.com/2008/CanadaDay.html</link>
 <guid>http://www.neoformix.com/2008/CanadaDay.html</guid>
 <pubDate>Tue, 01 Jul 2008 09:45:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
		<a href="http://www.neoformix.com/2008/Flag_Canada2.png"><img  src="http://www.neoformix.com/2008/Flag_Canada2_s.png" border="1"></a>
		</center>
		<p>
			Happy Canada Day ! I've created a simple flag graphic using a few words that come to my mind when I think of Canada.
		</p>

 ]]></description>
</item>
<item>
 <title>TechCrunch Analysis Part II</title>
 <link>http://www.neoformix.com/2008/TechCrunchAnalysis2.html</link>
 <guid>http://www.neoformix.com/2008/TechCrunchAnalysis2.html</guid>
 <pubDate>Fri, 27 Jun 2008 13:35:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			My last <a href="http://www.neoformix.com/2008/TechCrunchAnalysis.html">post</a> explored the company and product names discussed on <a href="http://www.techcrunch.com">TechCrunch</a>
			and how they varied over time. The number of posts written by the various authors and how it varied over time was also illustrated. An obvious follow-up
			analysis is to look at the interaction between author and company/product names. Do certain TechCrunch authors specialize in writing about particular companies or products ?
			Or do some authors avoid specific domains ?
		</p>
		<p>
			I've done this analysis and presented the results below. For each of the top 6 authors and top 60 names the number of times each author used each name was determined.
			The first graph shows the breakdown for the top 10 names. The second has the same form and shows numbers 11-60 but I've broken it into a separate graph because
			it uses a different scale. This lets us see more details for these names. I have also colored the bars to show proportional use of the names.
			A deep blue color means that the name was used proportionally much more often for that author and a deep red shows that it was used proportionally
			much less often. Paler colors indicate a lesser degree of high(blue) or low(red) usage.
		</p>
		<center>
			<i>Click on the images to see larger views</i>
			<a href="http://www.neoformix.com/2008/ag_techcrunch1_10.png"><img  src="http://www.neoformix.com/2008/ag_techcrunch1_10_s.png" border="1" alt="Top 10 TechCrunch Names Author Breakdown"></a><br>&nbsp;<br>
			<a href="http://www.neoformix.com/2008/ag_techcrunch11_60.png"><img  src="http://www.neoformix.com/2008/ag_techcrunch11_60_s.png" border="1" alt="11-60 TechCrunch Names Author Breakdown"></a>
		</center>
		<p>
			Some things that I spotted quickly from the larger version of the top 10 graphic include:
			<ol>
			<li>There aren't too many names that were discussed a lot more by a particular author - no deep blues</li>
			<li>Perhaps the deepest blue in the graphic shows that Hendrickson discussed FaceBook proportionally more than the others</li>
			<li>Unusually low (deeper red) combinations are Schonfeld-Digg, Kirkpatrick-FaceBook, and to a lesser degree Gonzalez-Microsoft and Hendrickson-Microsoft</li>
			</ol>
			Some of the notable features in the 11-60 graphic are:
			<ol>
			<li>Arrington discusses Life (as in Second Life, Yahoo Life, Online Life, various others), VOIP, Adsense, Silverlight, and P2P proportionally less than average</li>
			<li>High proportional names for Riley include Twitter, Life, Windows, and Silverlight</li>
			<li>Low proportional names for Riley include RSS, Flash, Zoho, AJAX, Salesforce, NetVibes, and Wikia</li>
			<li>High proportional names for Schonfeld include Comscore, Bebo, Salesforce, and especially OpenSocial</li>
			<li>Low proportional names for Schonfeld include RSS, Life, API, URL, Ning, photobucket, and a few more</li>
			<li>Other high proportional pairings are Kirkpatrick-RSS and IM, Gonzalez-VOIP and Zoho, Hendrickson-Bebo,Ning,Hulu and OpenSocial</li>
			</ol>
			Some of these differences in proportional frequency of references are likely due to the fact that certain companies and products
			were discussed a lot in particular periods of time and the number of articles posted by the various authors varied a lot over the
			time period. For example, Twitter wasn't really mentioned on TechCrunch until around Feb 2007 which was around the time Marshall Kirkpatrick
			stopped posting so it isn't suprising that he didn't mention Twitter hardly at all.
		</p>
		<p>
			The data for this analysis was kindly provided by Yuvi from <a href="http://thestatbot.com">The StatBot</a>.
		</p>

 ]]></description>
</item>
<item>
 <title>TechCrunch Analysis</title>
 <link>http://www.neoformix.com/2008/TechCrunchAnalysis.html</link>
 <guid>http://www.neoformix.com/2008/TechCrunchAnalysis.html</guid>
 <pubDate>Wed, 25 Jun 2008 13:45:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			<a href="http://www.techcrunch.com">TechCrunch</a> is a weblog that reviews products or companies 
			that are having an impact on the internet. Who do they write about and how do references to these companies or products vary over time ?
			I've analyzed the proper names referenced in their posts in the time frame May 1st, 2006 until April 30th, 2008 - 2 years of data. I discarded
			place names and people and plotted the top 8 names with the most references in a <a href="http://www.neoformix.com/2008/TwitterClientUsageStream.html">StreamGraph</a> below.
		</p>
		<center>
			<i>Click on the image for a larger view</i>
			<a href="http://www.neoformix.com/2008/sg_techcrunchNames8.png"><img  src="http://www.neoformix.com/2008/sg_techcrunchNames8_s.png" border="1" alt="Top 8 TechCrunch Names StreamGraph"></a>
		</center>
		<p>
			The graph clearly shows the companies that have dominated TechCrunch coverage over the last 2 years. Google looks biggest with
			FaceBook, Yahoo, and Microsoft being quite large as well. You can spot the increase in coverage for Microsoft and Yahoo in Feb of this year
			due to the merger talks. Notice also that MySpace and FaceBook were fairly even until July 2007 when FaceBook began dominating.
			If you look closely you can also tell that Twitter has become important lately with the number of references in April 2008 very similar to
			both Microsoft and FaceBook.
		</p>
		<center>
			<i>Click on the image for a larger view</i>
			<a href="http://www.neoformix.com/2008/lg_techcrunchNames8.png"><img  src="http://www.neoformix.com/2008/lg_techcrunchNames8_s.png" border="1" alt="Top 8 TechCrunch Names Line Graph"></a>
		</center>
		<p>
			<a href="http://www.neoformix.com/2008/lg_techcrunchNames20.png"><img  src="http://www.neoformix.com/2008/lg_techcrunchNames20_t.png" border="0" align="right"></a>
			The standard line graph for the same data lets you see some details more clearly. Google was indeed the most referenced company in all but 
			a few months where it was barely exceeded by Yahoo (Sep 2006, Feb 2008) and FaceBook (Aug and Oct 2007). And references to Twitter did
			barely exceed Microsoft and Facebook in Apr 2008.
			<br>&nbsp;<br>
			The standard line graph cannot usefully show the top 20 names
			because so many of the series overlap each other and can't be distinguished.
			The StreamGraph version for 20 names is much more useful at full size.
		</p>
		<center>
			<a href="http://www.neoformix.com/2008/sg_techcrunchNames20.png"><img  src="http://www.neoformix.com/2008/sg_techcrunchNames20_s.png" border="1" alt="Top 20 TechCrunch Names StreamGraph"></a>
		</center>
<br><a href="http://www.neoformix.com/2008/TechCrunchAnalysis.html#SummaryEnd">(More...)</a>

 ]]></description>
</item>
<item>
 <title>Obama/McCain Economic Statement StreamGraphs</title>
 <link>http://www.neoformix.com/2008/ObamaMcCainEconomic_sg.html</link>
 <guid>http://www.neoformix.com/2008/ObamaMcCainEconomic_sg.html</guid>
 <pubDate>Thu, 19 Jun 2008 08:25:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/sg_Obama_RenewingAmericanComp.jpg"><img  src="http://www.neoformix.com/2008/sg_Obama_RenewingAmericanComp_s.png" border="1"></a><br>&nbsp;<br>
			<a href="http://www.neoformix.com/2008/sg_McCain_LeadershipGlobalEconomy.jpg"><img  src="http://www.neoformix.com/2008/sg_McCain_LeadershipGlobalEconomy_s.png" border="1"></a>
		</center>
		<p>
			The above StreamGraphs show the texts delivered by Obama and McCain recently on the American economy. Click on either one to see more detail.
			Obama's remarks, given the title
			<a href="http://www.realclearpolitics.com/articles/2008/06/renewing_american_competitiven.html">Renewing American Competitiveness</a>,
			were delivered at Kettering University in Flint, Michigan on June 16th, 2008. John McCain delivered his remarks concerning
			<a href="http://www.johnmccain.com/Informing/News/Speeches/f083c0bb-3470-437a-88c7-dabf2079f140.htm">America's Leadership in the Global Economy</a>
			to the National Restaurant Association, in Chicago, Illinois, on May 19, 2008. Of course it's more informative to actually <i>read the texts</i>
			but these things do jump out from the graphics:
			<ul>
			<li>McCain mentions 'tax' a lot more than Obama</li>
			<li>McCain mentions 'tax' a lot more towards the beginning of his speech than the end</li>
			<li>McCain mentions the large numbers 'million', 'billion' and 'trillion' a lot, they aren't prominent in Obama's remarks </li>
			<li>Obama mentions 'teachers', 'schools', and 'education' a lot, not McCain</li>
			<li>They both discuss 'jobs' and 'trade' although trade is a bit more prominent towards the end of Obama's speech</li>
			<li>Obama mentions 'oil' and 'energy'</li>
			<li>McCain mentions 'farmers' and 'subsidies'</li>
			<li>Obama mentions 'Bush', McCain doesn't mention him with any prominence</li>
			</ul>
		</p>

 ]]></description>
</item>
<item>
 <title>My Twitter ID</title>
 <link>http://www.neoformix.com/2008/TwitterID.html</link>
 <guid>http://www.neoformix.com/2008/TwitterID.html</guid>
 <pubDate>Wed, 18 Jun 2008 19:35:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			I've been having fun playing with <a href="http://twitter.com">Twitter</a> data lately. It's a wonderful playground for those interested
			in analyzing text data.	I'm also starting to actually use the service a bit more for early announcements of projects I'm working on.
			Feel free to follow <a href="http://twitter.com/JeffClark">Jeff Clark</a> to see my updates. I try and keep my signal to noise ratio pretty good :-)
		</p>

 ]]></description>
</item>
<item>
 <title>Little Brother StreamGraph</title>
 <link>http://www.neoformix.com/2008/LittleBrotherStreamGraph.html</link>
 <guid>http://www.neoformix.com/2008/LittleBrotherStreamGraph.html</guid>
 <pubDate>Tue, 17 Jun 2008 15:25:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/sg_LittleBrother.jpg"><img  src="http://www.neoformix.com/2008/sg_LittleBrother_s.png" border="1"></a>
		</center>
		<p>
			I have created another StreamGraph, this one for the book <a href="http://craphound.com/littlebrother/download/">Little Brother</a>, by <a href="http://craphound.com/">Cory Doctorow</a>.
			Click on it to see a larger version. It shows the distribution of proper noun references across the text. Here are a few things you can pick out
			from the graph:
			<ul>
			<li>major people referenced seem to be Darryl, Ange, Van</li>
			<li>lots of secondary characters like Charles, Marcus, Mom, Dad, Jolu</li>
			<li>Jolu appears primarily around the middle of the text</li>
			<li>Ange referenced much more often in the second half of text</li>
			<li>About 1/3 through the text there are lots of references to Booger, Zit, and Pigspleen none of which seem to reappear afterwards to any great degree</li>
			<li>Booger, Zit, and Pigspleen seem associated with Internet, Xnet</li>
			<li>Masha figures prominently in the end of the text but not beforehand</li>
			</ul>
		</p>
		<p>
			See the post <a href="http://www.neoformix.com/2008/TomSawyer.html">Tom Sawyer Character StreamGraph</a> for
			a very brief description of how it was constructed. The design of the graph is based loosely on those created by <a href="http://www.leebyron.com/what/lastfm/">Lee Byron</a>. 
		</p>
		<p>
			Here is another for a different work by Cory, <a href="http://craphound.com/down/download.php">Down and Out in the Magic Kingdom</a>.
		</p>
		<center>
			<a href="http://www.neoformix.com/2008/sg_DownAndOut.jpg"><img  src="http://www.neoformix.com/2008/sg_DownAndOut_s.png" border="1"></a>
		</center>

 ]]></description>
</item>
<item>
 <title>Tom Sawyer Character StreamGraph</title>
 <link>http://www.neoformix.com/2008/TomSawyer.html</link>
 <guid>http://www.neoformix.com/2008/TomSawyer.html</guid>
 <pubDate>Tue, 17 Jun 2008 13:45:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/sg_TomSawyer.jpg"><img  src="http://www.neoformix.com/2008/sg_TomSawyer_s.png" border="1"></a>
		</center>
		<p>
			The above image is a StreamGraph for the book <a href="http://www.gutenberg.org/etext/74">The Adventures of Tom Sawyer</a>, by Mark Twain.
			Click on it to see a larger version. It seems to do a pretty good job of communicating the ebb and flow of the various characters throughout
			the book. The Mississippi River figures prominently in the book so a stream-like representation of the text seems appropriate.
		</p>
		<p>
			I have adapted the StreamGraph code used to create the various <a href="http://www.neoformix.com/2008/TwitterTopicStream.html">Twitter Topic Streams</a>
			so I can create StreamGraphs from arbitrary text documents. The document is split up into 25 equal sized segments and the word counts
			are done within each segment. These segments are used in place of time along the horizontal axis of the StreamGraph. This document StreamGraph
			again focuses on capitolized words but ignores a few common ones like 'Mr' and 'Mrs'. I'm also using a longer format for the graph
			and showing two labels for each word series - one on the left half of the graph, and one on the right. The difference in label size for the
			same word can show whether it was used more frequently in the first or second half of the document. In the 'Tom Sawyer' graphic above you can
			clearly see that both 'Ben' and 'Mary' are more prominent in the first half of the text but that 'Huck' is more common in the second half.
		</p>

 ]]></description>
</item>
<item>
 <title>Twitter Topic Streams for some Top Users</title>
 <link>http://www.neoformix.com/2008/TwitterTopicStreamsTopUsers.html</link>
 <guid>http://www.neoformix.com/2008/TwitterTopicStreamsTopUsers.html</guid>
 <pubDate>Mon, 16 Jun 2008 23:30:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			Many people seemed to enjoy the Topic StreamGraph I made a few days back for Robert Scoble so
			I have created Topic StreamGraphs for some of the other top <a href="http://www.twitter.com">Twitter</a> users.
			If you missed my post from last week on <a href="http://www.neoformix.com/2008/TwitterTopicStream.html">Twitter Topic Streams</a> a quick explanation is that
			they illustrate the most interesting capitolized words used in the tweets for these people. I removed many common terms from consideration including most of the
			placenames although a few managed to squeak through.
		</p>
		<center>
			<a href="http://www.neoformix.com/2008/sg_broganTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_broganTwitterTopics_s.png" border="1"></a>
			<a href="http://www.neoformix.com/2008/sg_jimLongTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_jimLongTwitterTopics_s.png" border="1"></a>
			<a href="http://www.neoformix.com/2008/sg_scobleTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_scobleTwitterTopics_s.png" border="1"></a>
			<a href="http://www.neoformix.com/2008/sg_winerTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_winerTwitterTopics_s.png" border="1"></a>
			<a href="http://www.neoformix.com/2008/sg_SteveRubelTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_SteveRubelTwitterTopics_s.png" border="1"></a>
			<a href="http://www.neoformix.com/2008/sg_evTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_evTwitterTopics_s.png" border="1"></a>
		</center>

 ]]></description>
</item>
<item>
 <title>Wordle</title>
 <link>http://www.neoformix.com/2008/Wordle.html</link>
 <guid>http://www.neoformix.com/2008/Wordle.html</guid>
 <pubDate>Wed, 11 Jun 2008 21:30:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			<a href="http://mrfeinberg.com/">Jonathan Feinberg</a> has created an interesting toy for building excellent looking word clouds from
			submitted text. You can adjust the font, color scheme, and choose from a variety of layouts. It's similar in many ways to what I did with
			<a href="http://www.neoformix.com/2008/WordHearts.html">Word Hearts</a> a couple of months ago. A few samples are shown below.
			Great work Jonathan !
		</p>
		<center>
			<a href="http://wordle.net/gallery/GANGSTER_COMPUTER_GOD"  title="Wordle: GANGSTER COMPUTER GOD"><img  src="http://wordle.net/thumb/GANGSTER_COMPUTER_GOD"  style="padding:4px;border:1px solid #ddd"></a>&nbsp;&nbsp;
			<a href="http://wordle.net/gallery/United_States_Constitution" title="Wordle: United States Constitution"><img  src="http://wordle.net/thumb/United_States_Constitution" style="padding:4px;border:1px solid #ddd"></a>&nbsp;&nbsp;
			<a href="http://wordle.net/gallery/Untitled_5" title="Wordle: Untitled"><img  src="http://wordle.net/thumb/Untitled_5" style="padding:4px;border:1px solid #ddd"></a>
		</center>

 ]]></description>
</item>
<item>
 <title>Twitter Topic Stream</title>
 <link>http://www.neoformix.com/2008/TwitterTopicStream.html</link>
 <guid>http://www.neoformix.com/2008/TwitterTopicStream.html</guid>
 <pubDate>Wed, 11 Jun 2008 14:00:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/sg_TopUsersTwitterTopics.png"><img  src="http://www.neoformix.com/2008/sg_TopUsersTwitterTopics_s.png" border="1"></a>
		</center>
		<p>
			The above StreamGraph illustrates the distribution of the most interesting capitolized words in the <a href="http://thestatbot.com">StatBot</a>
			dataset of all the updates for the top 100 twitter users. I removed most place names (NY, Paris, Boston etc) and several common words like
			'twitter', 'lol', 'company', 'web', and 'internet'. The interestingness of a word was quantified by a function of the total references as well
			as the burstiness of the word distribution.
		</p>
		<p>
			The most 'interesting' words in this data are primarily product, technology, or technology event names with the exceptions of 'Scoble' and 'Obama'.
			This isn't surprising since the top twitter	users are early-adopters interested in technology.
			I was a bit surprised at the large volume for <a href="http://seesmic.com/">Seesmic</a>
			but discovered that it is a company founded by <a href="http://www.loiclemeur.com/">Loic Le Meur</a>, the 6th top twitter user.
		</p>
		<p>
			I also created the twitter topic stream for <a href="http://twitter.com/scobleizer">Robert Scoble</a> shown below. The graphic
			does a pretty good job of highlighting the primary technologies Scoble focused on over the last year or so.
		</p>
<br><a href="http://www.neoformix.com/2008/TwitterTopicStream.html#SummaryEnd">(More...)</a>

 ]]></description>
</item>
<item>
 <title>Top Twitter Users StreamGraph</title>
 <link>http://www.neoformix.com/2008/TopTwitterUsersStreamGraph.html</link>
 <guid>http://www.neoformix.com/2008/TopTwitterUsersStreamGraph.html</guid>
 <pubDate>Wed, 11 Jun 2008 00:00:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/sg_TopTwitterUsers.png"><img  src="http://www.neoformix.com/2008/sg_TopTwitterUsers_s.png" border="1"></a>
		</center>
		<p>
			This StreamGraph shows the top twitter users based on the number of tweets sent during the period December 2006 until April 2008.
			Click the image to see a larger version with more of the labels legible.
		</p>

 ]]></description>
</item>
<item>
 <title>Twitter Client Usage StreamGraph</title>
 <link>http://www.neoformix.com/2008/TwitterClientUsageStream.html</link>
 <guid>http://www.neoformix.com/2008/TwitterClientUsageStream.html</guid>
 <pubDate>Tue, 10 Jun 2008 19:21:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<a href="http://www.neoformix.com/2008/TwitterClientStream.png"><img  src="http://www.neoformix.com/2008/TwitterClientStream_s.png" border="1"></a>
		</center>
		<p>
			I have mentioned before the wonderful <a href="http://www.neoformix.com/2008/StreamGraph.html">stream-like visualizations</a>
			created by <a href="http://leebyron.com">Lee Byron</a>.	I've written some code so I can create my own using whatever data I
			want. The one above was constructed using the twitter data from <a href="http://thestatbot.com">The StatBot</a>.
			You can click on it to see a larger version of the image. I left out the first few months which had a very low volume of data so
			this one runs from Dec 2006 to Apr 2008.
		</p>
		<p>
			For a small number of series a simple line graph would be superior because you can directly see which values are larger at each point
			in time. These StreamGraphs do a better job of emphasizing the sum at each point and the breakdown into the various series.
			I think StreamGraphs are also better at showing lots of series that dominate for short parts of the timespan of interest. For example see
			the image below that shows movie revenues. There are a great many movies illustrated and each one is only present in a fairly small
			part of the overall range of time.
		</p>
		<center><img  src="http://www.neoformix.com/2008/MoviesEbbTrends.png" border="1"></center>
		<p>
			<a href="http://leebyron.com">Lee Byron</a> and <a href="http://www.bewitched.com/">Martin Wattenberg</a> have written a short paper describing the design decisions and algorithms behind these types of graphics.
			Have a look at <a href="http://leebyron.com/else/streamgraph/download.php?file=stackedgraphs_byron_wattenberg.pdf">Stacked Graphs - Geometry &amp; Aesthetics</a> (pdf) if you are interested in the details.
		</p>

 ]]></description>
</item>
<item>
 <title>Top Twitter Users Over Time</title>
 <link>http://www.neoformix.com/2008/TopTwitterUsersOverTime.html</link>
 <guid>http://www.neoformix.com/2008/TopTwitterUsersOverTime.html</guid>
 <pubDate>Fri, 06 Jun 2008 14:31:00 GMT</pubDate>
 <description><![CDATA[
 		<p>Using data from <a href="http://thestatbot.com">The StatBot</a> again I've built some graphs detailing usage of the top twitter users
		over the May 2006- May 2008 period. A line graph with this data is too crowded to interpret properly unless we restrict it to only a few
		top users so I decided to create a set of bar graphs instead. The pink bars are the highest for that month and the highest month of all
		is the last scobleizer month - 2005 tweets for April 2008. Here are a few observations:
		<ol>
		<li>chrisbrogan has the most tweets overall but his totals were eclipsed in most months by newmediajim</li>
		<li>scobleizer jumped in quickly in Mar 2007 and had the most tweets of anyone in his first month of use</li>
		<li>down at #21 'ev' seems to have the highest use during the first few months. This isn't surprising since 'ev' is Evan Williams, Co-founder of Twitter</li>
		<li>guykawasaki and problogger are high in the overall usage rankings despite starting later than many of the others</li>
		</ol>
		</p>
		<center>
			<img  src="http://www.neoformix.com/2008/TopTwitterUsersOverTime.png" border="1">
		</center>


 ]]></description>
</item>
<item>
 <title>Twitter Client Usage Over Time</title>
 <link>http://www.neoformix.com/2008/TwitterClientUsage.html</link>
 <guid>http://www.neoformix.com/2008/TwitterClientUsage.html</guid>
 <pubDate>Fri, 06 Jun 2008 10:01:00 GMT</pubDate>
 <description><![CDATA[
 		<center>
			<img  src="http://www.neoformix.com/2008/tcu_everyone.png" border="1">
		</center>
		<p>
			I've constructed a graph showing how use of <a href="http://twitter.com">Twitter</a> clients by power users has changed over time. I used a dataset
			containing all the tweets from the power users in the <a href="http://twitterific.com">Twitterific Top 100 list</a> which
			was graciously provided by Yuvi, over at <a href="http://thestatbot.com">The StatBot</a>.
			Two full years of data,	from May 1st, 2006 until April 30th, 2008 were used for the analysis. The main things that jump out for me are:
			<ol>
			<li>Web client use dominates</li>
			<li>txt client use seems to have plateaued between Mar 2006 and Mar 2008</li>
			<li>very rapid recent growth for Twhirl</li>
			<li>decent growth for im</li>
			<li>all the curves are fairly spiky</li>
			</ol>
		</p>
		<p>
			There have been some other recent posts giving statistics on the clients used most often to post updates to <a href="http://twitter.com">Twitter</a>.
			One, from <a href="http://www.readwriteweb.com">ReadWriteWeb</a>, was called <a href="http://www.readwriteweb.com/archives/top_twitter_clients_definitive_list.php">How We Tweet: The Definitive List of the Top Twitter Clients</a>
			and was based on a random sample of over 37,000 tweets from the public timeline. The results were posted April 2, 2008 so I presume the data was
			collected shortly before then. The top 3 clients from their survey were:
		</p>
		<p>
			<ol>
			<li><a href="http://www.twitter.com/">Web</a> 56% (20734)</li>
			<li>IM 8% (2975)</li>
			<li><a href="http://www.twhirl.org/">Twhirl</a> 7% (2754)</li>
			</ol>
			Visit the <a href="http://www.readwriteweb.com/archives/top_twitter_clients_definitive_list.php">original post</a> for full results.
		</p>
		<p>
			Yuvi, more recently, did a similar
			<a href="http://thestatbot.com/2008/05/28/twitter-clients-used-by-twitter-power-users/">analysis</a> based on the
			data he provided to me. He listed a number of findings by constrasting the 
			two datasets including that the power users make use of both SMS txt messages (10% vs 5%) and Mobile Twitter (6% vs negligible)
			much more often than the typical user.
			He also claimed that the power users are using <a href="http://www.twhirl.org/">Twhirl</a> less than the typical user (5% vs 7%).
			I believe this claim is incorrect.
		</p>
		<p>
			The two studies mentioned above
			show an average client usage over <i>very different time periods.</i> The <a href="http://www.readwriteweb.com/archives/top_twitter_clients_definitive_list.php">ReadWriteWeb study</a>
			uses data from a 24 hour period around April 2, 2008 but the <a href="http://thestatbot.com/2008/05/28/twitter-clients-used-by-twitter-power-users/">StatBot analysis</a>
			uses a complete list of tweets that span a timeframe from March 21st, 2006 until May 25th, 2008. Drawing comparisons between two datasets
			based on such vastly different time periods should be done very cautiously. <a href="http://www.twhirl.org/">Twhirl</a> is relatively
			new and Yuvi's analysis used lots of historical data before Twhirl was available.
		</p>
		<p>
			The StatBot analysis showed that on average it was the 6th most popular client. In fact, if you look at the graph above at the point between Mar and Apr 2008, which corresponds to when the 'typical user' study
			was done by <a href="http://www.readwriteweb.com">ReadWriteWeb</a>, you can easily see that Twhirl was actually the second
			most popular client for power users. I've looked at the power user data for all tweets between Mar 31st and Apr 2nd, 2008
			and there were 241 for Twhirl out of 1833 total - 13% , which is much higher than the ReadWriteWeb result of 7% for typical users.
			This makes sense to me - power users have more of an incentive to install a specialized client than an average user who doesn't
			use twitter very often.
		</p>
		<center>
		<img  src="http://www.neoformix.com/2008/tcu_scobleizer.png" border="1">
		</center>
		<p>
			Just for fun, (well, and to try and get them to link to me ! ), I have generated graphs for two of the top power users: Robert Scoble and
			Chris Brogan. Note that the clients are coloured differently in the three graphs. Ideally, for easy comparison, they should be consistent.
			Here are a few observations concerning their patterns of use:
			<ol>
			<li>Scoble has a huge peak for use of the web client around Mar 2007 and a lull in overall usage in June 2007</li>
			<li>Scoble switched to be a heavy user of twitterific for the last half of 2007 and again had a peak in usage</li>
			<li>Starting in Nov 2007 Scoble switched from twitterific to im as his primary client - web usage still common as well</li>
			<li>Scoble dabbles in many other twitter clients as well</li>
			<li>Brogan had a large drop in web client use at the same time as Scoble - Mar 2007, but didn't drop as far</li>
			<li>Brogan has used the mobile client a great deal since Jun 2007</li>
			</ol>
		</p>
		<center>
		<img  src="http://www.neoformix.com/2008/tcu_chrisbrogan.png" border="1">
		</center>
		<p align="center">
				<script type="text/javascript">
				digg_url = 'http://www.neoformix.com/2008/TwitterClientUsage.html';
				digg_bgcolor = '#FFFFFF';
				digg_skin = 'compact';
				</script>
				<script src="http://digg.com/tools/diggthis.js" type="text/javascript"></script>
		</p>


 ]]></description>
</item>
<item>
 <title>TwitArcs</title>
 <link>http://www.neoformix.com/2008/TwitArcs.html</link>
 <guid>http://www.neoformix.com/2008/TwitArcs.html</guid>
 <pubDate>Thu, 29 May 2008 10:40:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			I've combined some visuals from a side project related to linguistics with twitter data to create <a href="http://www.neoformix.com/Projects/TwitArcs/TwitArcs.html">TwitArcs</a>.
			It takes the latest 100 tweets for a twitter ID or term of interest and creates a list representation
			that has arcs connecting messages sent to the same users or that use the same primary term. You can click on the left side to load the
			tweets for a new user, on the right side to load the tweets for a specific term, and in the middle to visit the actual tweet.
		</p>
		<p>
			Thanks to <a href="http://www.twitter.com">Twitter</a> and <a href="http://summize.com/">Summize</a> for the data and <a href="http://www.processing.org">Processing.org</a> for
			the tools. Give <a href="http://www.neoformix.com/Projects/TwitArcs/TwitArcs.html">TwitArcs</a> a try !
		</p>
		<center>
		<p><i>TwitArcs (static image)</i></p>
		<a href="http://www.neoformix.com/Projects/TwitArcs/TwitArcs.html"><img  src="http://www.neoformix.com/2008/TwitArcs.png"></a>
		</center>

 ]]></description>
</item>
<item>
 <title>Twitter Spectrum Changes</title>
 <link>http://www.neoformix.com/2008/TwitterSpectrum2.html</link>
 <guid>http://www.neoformix.com/2008/TwitterSpectrum2.html</guid>
 <pubDate>Tue, 20 May 2008 15:30:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			I've slightly improved the <a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html">Twitter Spectrum</a> application
			so that clicking on words used in conjunction with both terms properly use OR in the query. I also changed the default search terms
			to 'from:jasoncalacanis' and 'from:scobleizer' to show how you can compare the tweets from two users rather than related to two terms.
		</p>
		<center>
		<p><i>Twitter Spectrum (static image)</i></p>
		<a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html"><img  src="http://www.neoformix.com/2008/TwitterSpectrum2.png"></a>
		</center>

 ]]></description>
</item>
<item>
 <title>Twitter Spectrum</title>
 <link>http://www.neoformix.com/2008/TwitterSpectrum.html</link>
 <guid>http://www.neoformix.com/2008/TwitterSpectrum.html</guid>
 <pubDate>Thu, 15 May 2008 14:40:00 GMT</pubDate>
 <description><![CDATA[
 		<p>
			Just for fun, I've modified my <a href="http://www.neoformix.com/Projects/NewsSpectrum/NewsSpectrum.html">News Spectrum</a> application
			to take data from <a href="http://www.twitter.com">Twitter</a> instead. Its called <a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html">Twitter Spectrum</a> of course !
			It uses the wonderful <a href="http://summize.com/">Summize</a> which
			provides excellent search capability for <a href="http://www.twitter.com">Twitter</a> data.
		</p>
		<p>
			As before, one topic is coloured blue, the other red, and the associated words are coloured and positioned based on how highly
			they are associated with the two topics. Click on any word to see the related tweets.
			Give <a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html">Twitter Spectrum</a> a try !
			As always, <a href="mailto:web1@neoformix.com">feedback</a> is welcome.
		</p>
		<p>
			Thanks to <a href="http://www.twitter.com">Twitter</a> and <a href="http://summize.com/">Summize</a> for the data, <a href="http://www.processing.org">Processing.org</a> for
			the tools, and <a href="http://www.chrisharrison.net/projects/wordspectrum/index.html">Chris Harrison</a> for the inspiration
			behind the design.
		</p>
		<center>
		<p><i>Twitter Spectrum (static image)</i></p>
		<a href="http://www.neoformix.com/Projects/TwitterSpectrum/TwitterSpectrum.html"><img  src="http://www.neoformix.com/2008/TwitterSpectrum.png"></a>
		</center>

 ]]></description>
</item>

</channel>
</rss>
