Boing Boing Analysis - Part 5

By: Jeff Clark    Date: Sun, 30 Jul 2006

We've looked at a number of aspects of the weblog Boing Boing over the last little while. The topics discussed included posts over time by author, day-of-the-week analysis, images per post by author, outbound links, and acronym use. This part continues the analysis by examining the contents of the actual posts in more detail. What are they writing about?

The Radial Treemap shown below illustrates which topics from my simple topic hierarchy get more emphasis. Each topic is scaled by the number of words written on it. Posts that didn't match any of the topics very well were grouped under None.

Here are the first 3 high-level topics shown by themselves so more details are clear.

These diagrams seem to give reasonable weights to the topics that Boing Boing emphasizes, although before I did the measurement I expected Technology to be larger than the Arts and Society topics.
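The word-count scaling behind these diagrams can be illustrated with a small sketch. This is not the code used for the analysis, just one plausible way to roll word counts up a topic hierarchy. The function name `topic_weights`, the intermediate topic names, and the sample posts are all hypothetical.

```python
from collections import defaultdict

def topic_weights(posts):
    """Sum word counts for every topic path and each of its ancestors.

    posts: iterable of (topic_path, text) pairs, where topic_path is a
    tuple from the top-level topic down to the most specific one, or
    ("None",) for a post that matched no topic well.
    """
    weights = defaultdict(int)
    for path, text in posts:
        n_words = len(text.split())
        # Credit the post's words to every level of its path, so a
        # parent topic's weight is the sum of its children's weights.
        for depth in range(1, len(path) + 1):
            weights[path[:depth]] += n_words
    return dict(weights)

# Hypothetical sample data; the middle-level topic names are invented.
sample_posts = [
    (("Arts", "Visual Arts", "Photography"), "ten tips for digital photography"),
    (("Technology", "Engineering", "Aerospace Engineering"), "private space program"),
    (("None",), "hello"),
]
weights = topic_weights(sample_posts)
```

Each key in `weights` is a path prefix, so the top-level entries such as `("Arts",)` give the sizes of the high-level wedges and the longer paths give the sizes of the nested ones.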

How well is the categorizer working? Let's look at the posts that most closely match some of the 3rd-level topics.

For Photography:

  1. Top Ten Digital Photography Tips (2003/03/26)
  2. Infrared portrait photography (2005/11/10)
  3. Digital Sensor Is Said to Match Quality of Film (2002/02/11)

For Military:

  1. The US Military is planning (2002/01/03)
  2. WiFi companies and military agree on noise-limits (2003/01/31)
  3. The US military takes out (2001/08/17)

For Aerospace Engineering:

  1. Bezos's private space-program (2003/04/27)
  2. Space is cool to look (2001/12/21)
  3. founder buys ranch for his aerospace company (2005/01/13)

These examples seem to match well, but I know this is a pretty simplistic categorizer. I expect the labels for posts farther down the lists to be more questionable. The post labeled as Aerospace Engineering with the lowest score is this:

  1. Got $4.5 million? Buy this used aircraft carrier (2003/05/28)

It seems pretty good too!
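For readers curious how a simplistic categorizer might produce ranked lists like the ones above, here is a minimal sketch. It is an assumption, not the method actually used: it scores a text by the fraction of its words found in a topic's keyword set, then sorts by that score. The keyword set is hypothetical, and the sketch scores titles only to keep the example short.

```python
import re

def score(text, keywords):
    """Fraction of the words in text that appear in the keyword set."""
    words = re.findall(r"[a-z]+", text.lower())
    if not words:
        return 0.0
    return sum(w in keywords for w in words) / len(words)

def rank_posts(titles, keywords):
    """Return (score, title) pairs sorted from best match to worst."""
    scored = [(score(t, keywords), t) for t in titles]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

# Hypothetical keyword set for an Aerospace Engineering topic.
aerospace_keywords = {"space", "aerospace", "aircraft", "rocket"}

titles = [
    "Top Ten Digital Photography Tips",
    "Got $4.5 million? Buy this used aircraft carrier",
    "Space is cool to look",
]
ranked = rank_posts(titles, aerospace_keywords)
```

With a scheme like this, a post that mentions only one topic word in a longer text lands near the bottom of the list while still being on topic, which is consistent with the aircraft carrier post being a reasonable lowest-scoring match.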

