Anyway.. I was exploring the site looking at some of the very cool ways that folks had created to crunch and display data.. Many of the visualizations are dynamic.. that is you can explore them interactively.. many of them are beautiful, too. While most of the analysis and stuff was for processing numerical data, I found a few visualization for text analytics. When I saw that, I did what any red blooded geek would do at 12:30 in the morning, .. I thought of sources of large amounts of text data that I could analyze.. Of course !.. My blog.. I decided to run these analytics on the a years worth of my log entries starting from the first day, 3 days after Sam died. . I found one of my blog archives which had each of the first years 365 entries in web page format (HTML).then I figured out some processing on how to remove the the pictures, the headers and the rest of the HTML formatting.. Then I concatenated it all into a singe dataset and uploaded it to ManyEyes.
I then started looking at the data.. what would it tell me ? I’m not even sure what I was looking for.. I felt like one of those kabalah mystics who looked through old religious texts looking for secrets in the patterns of letters and words. The first thing I learned was that In that first year, I had written exactly 708,576 words (including punctuation).. Now.. that alone was pretty interesting . I then started looking at word frequency… One tool shows you a ’tag cloud’ of word frequency. The larger the word, the more it occurred.. ’Sam’ was the word I typed most frequently last year… I wrote Sam’s name in my blog 3,394 times in 365 days.. and said his name to myself 1000 times that as I wrote . That makes sense… and is somehow .. i don’t know.. comforting ? What else did I say frequently ?
Diane. Max, Gabe,
Love, Great Friends,
House, Cool. Work,
Night, Today…
Sounds like Haiku.
Gotta run.. love to you all (the word ’love’ appears 1396 times !) Gnite Sam
-me ( which occurs 3688 times)
ps.. The word ’I’ occurs 16,710 times !!!!!!! .. whoa.. what an ego !)