Underestimating Digg, Or 100 Most Popular Words on Digg

One day (two days ago, as far as I remember), I asked myself a question: «What are the most popular topics on digg.com? Sex? War? $$$ per oil barrel?».

I downloaded all 571 pages of popular stories for last year with cUrl (there were more than 8000 stories), parsed them to extract titles and diggs, and processed the titles with a simple PHP script.

As you see, it is very simple. It uses Porter stemming algorithm to normalize words (so, eg. apple and apples could be recognized as the same word), but it doesn't remove prepositions, articles and other such words with no “useful meaning”. The score of each word is calculated as story's diggs multiplied by the word's relative frequency in the story title. These scores don't have any useful meaning, they are used for ordering only.

So, the top 100 digg words:

  1. How 
  2. New 
  3. Digg
  4. Picture
  5. Apple
  6. Google
  7. Free
  8. Top 
  9. 10 //(remarkable, as lots of people name their articles as “10 reasons”, “10 steps”, etc...)
  10. Game
  11. Ever
  12. Video
  13. Use/using/user
  14. Windows
  15. Get 
  16. Mac 
  17. Amazing
  18. World
  19. Firefox
  20. Best
  21. Most
  22. Pic 
  23. Year
  24. iPod
  25. Vista
  26. Release
  27. Microsoft
  28. About
  29. Time
  30. Image
  31. RIAA
  32. iPhone
  33. Internet
  34. Man 
  35. Computer
  36. 3
  37. Can 
  38. Hack
  39. Nintendo
  40. 5
  41. Out 
  42. Thing
  43. Design
  44. Now 
  45. Photoshop
  46. Like
  47. TV 
  48. Have
  49. Look
  50. See 
  51. Over
  52. Real
  53. Website
  54. Pirate
  55. CSS 
  56. PS3
  57. Build
  58. Linux
  59. Work
  60. Day/Daily
  61. Bush
  62. Wait
  63. iTunes
  64. Old 
  65. Awesome
  66. Show
  67. First
  68. Movie
  69. More
  70. Want
  71. Download
  72. Ad 
  73. Online
  74. Say 
  75. Site
  76. User
  77. OS 
  78. Drive/Driver
  79. Cool
  80. PC 
  81. Tutorial
  82. Take
  83. Flash
  84. Should
  85. Back
  86. Announce
  87. Final
  88. Live
  89. Gmail
  90. Own 
  91. Bill
  92. 7 (another “magic:" number, like 10, 3 and 5)
  93. vs 
  94. Job 
  95. School
  96. Worst
  97. XP 
  98. Know
  99. People
100. Car

You understood already, what was the previous post, didn't you? I took some words from the top of the list, and pretended to be too lazy to write a good article. I was interested in how may people would click the link despite it's obvious absurdness (very few) and how many people would digg the story without following the link (none).

I thought, the social networks work much worse. Should stop underestimating them and pay more attention to them.

Top Top  AddThis Social Bookmark Button

Filed under: seo Tags: seo, digg.com, popular words

Comments (0)
[Comment deleted]