book mapping
I love maps, and Goodreads has the data to generate some interesting visualizations of reading habits tied to location, so I figured I'd do a little map-making. The simplest form of geographic data we store are US postal codes, so I focused on making regional maps of US metropolitan areas, starting, of course, with the San Francisco Bay Area, where I live. First of all, here's a map of users in mind-numbing pink:
Woah! I think I'm blind. Note that the intensity of the color is on a log scale, so smaller values are more exaggerated than larger ones (please forgive my cartographic irresponsibility in not including legend, just didn't have time to figure one out). Not sure what the deal with the numbers in the South Bay are all about. I actually had to remove Palo Alto all together b/c a lot of users who only use the Facebook app get logged as being from there. Oops.
Now, here's one for reviews of books by Michael Pollan, slightly calmer color scheme:
Here the intensity of the color correlates with the number of reviews by people in the zip code divided by the total number of people in that zip code. This isn't perfect because people can review multiple editions of a book or read multiple books by an author, but it's a decent proxy metric. You can see Pollan's greater popularity in wealthy, urban areas like SF and Berkeley. These values are also log-normalized, and I just didn't show data from zips with less than 50 people, since I figured there was too much skew there (I know, not exactly science). Areas with less than 50 users or no data are shown in grey.
Now check out Stephanie Meyer:
Meyer is so popular that it really isn't necessary to log-normalize the data. Here's the same data on a linear scale and zoomed out a bit:
See, now that's interesting (to me at least). What are we seeing here? Urban / suburban? Liberal / conservative? Adults / kids? Let's repeat the Meyer experiment in some other cities.
Boston
NYC
Houston (what's up with those empty areas?!)
Chicago
Seattle
I'll leave it to you guys to draw your own conclusions.
Sources & Tools

Woah! I think I'm blind. Note that the intensity of the color is on a log scale, so smaller values are more exaggerated than larger ones (please forgive my cartographic irresponsibility in not including legend, just didn't have time to figure one out). Not sure what the deal with the numbers in the South Bay are all about. I actually had to remove Palo Alto all together b/c a lot of users who only use the Facebook app get logged as being from there. Oops.
Now, here's one for reviews of books by Michael Pollan, slightly calmer color scheme:

Here the intensity of the color correlates with the number of reviews by people in the zip code divided by the total number of people in that zip code. This isn't perfect because people can review multiple editions of a book or read multiple books by an author, but it's a decent proxy metric. You can see Pollan's greater popularity in wealthy, urban areas like SF and Berkeley. These values are also log-normalized, and I just didn't show data from zips with less than 50 people, since I figured there was too much skew there (I know, not exactly science). Areas with less than 50 users or no data are shown in grey.
Now check out Stephanie Meyer:

Meyer is so popular that it really isn't necessary to log-normalize the data. Here's the same data on a linear scale and zoomed out a bit:

See, now that's interesting (to me at least). What are we seeing here? Urban / suburban? Liberal / conservative? Adults / kids? Let's repeat the Meyer experiment in some other cities.
Boston

NYC

Houston (what's up with those empty areas?!)

Chicago

Seattle

I'll leave it to you guys to draw your own conclusions.
Sources & Tools
Comments Showing 1-24 of 24 (24 new)
date
newest »

message 1:
by
Athira
(new)
Feb 26, 2011 09:02AM

flag


Do you plan to map user reading habits to all countries? I'll surely be curious about that, though I have once seen a map showing proportion of goodreads' members vs other book cataloging websites.


I'd love to see how or what Austin reads.. :)

That mid-pink area on Galveston Island at the end of the freeway at the south of the Houston map? I'm pretty sure that may be just me. ;) (Hmm... some of the empty spots are "Asia-town," where they probably don't read a lot of English. And I don't think West Galveston Bay got marked as water... there's several miles of water between the island and the mainland, so that explains at least one empty spot.)
This is actually kind of cool. :)

This is a visualization I did once using Dewey classification numbers:
http://sepans.com/vis/goodreadsFocus.htm

And have you ever read How to Lie with Maps? You would love it.




Laughs, I used to live in Houston and they didn't read, that's for sure.



Meredith: totally. These maps are based on the books people choose to review, not necessarily the books they actually read.

I think that Goodreads should remove the above from thier privacy policy and replace it with something that says something like: "We will never give your private book reading info to anyone for any reason including future potential buyers of our company."
The above blog post (and others like it that I've seen on this site) are exactly why I think the only eventual outcome for this site is a sale to a huge company that will use my book reading habits for marketing purposes. It's astonishingly valuable and one of the reasons why I'm so hestitant to put my own book data here.
Having said all that - this site is cool as hell. It's going to come back to bite me if I use it but I'm going to use it anyway.