Goodreads Developers discussion

questions > Best way to extract data from Good Reads for Analysis


message 1: by San (new)

San To

Hello,

I am trying to build a dataset of work_ids, ratings, and user_ids, similar to the dataset found here:

https://github.com/zygmuntz/goodbooks...

Except I would like to do it for the top 30k books. I was wondering how to go about this.

I was thinking of looking up the top 30k books based on some Goodreads or Amazon metric, then extracting the ratings and user_id info for each book from the API, but there doesn't seem to be a specific API endpoint for that.
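To make the first idea concrete, here is a rough sketch of just the filtering step. It assumes the (user_id, work_id, rating) records have already been collected somehow, and it uses "number of ratings" as the popularity metric, which is only my placeholder for "some metric". The function names are made up for illustration and are not part of any Goodreads API.

```python
from collections import Counter
import csv
import io


def top_n_ratings(records, n):
    """Keep only the ratings that belong to the n most-rated works.

    records: iterable of (user_id, work_id, rating) tuples.
    Popularity here means "number of ratings", an assumed metric.
    """
    records = list(records)
    counts = Counter(work_id for _, work_id, _ in records)
    top_works = {work_id for work_id, _ in counts.most_common(n)}
    return [r for r in records if r[1] in top_works]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["user_id", "work_id", "rating"])
    writer.writerows(rows)
    return buf.getvalue()


# Tiny made-up sample: works 10 and 20 have two ratings each, work 30 has one.
sample = [(1, 10, 5), (2, 10, 4), (1, 20, 3), (3, 30, 2), (2, 20, 5)]
filtered = top_n_ratings(sample, 2)
print(to_csv(filtered))
```

For the real dataset, n would be 30000 and the records would come from whatever source ends up being workable.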

The next option is to use an API call to pull the ratings directly from each user: https://www.goodreads.com/api/index#r...

But I'm not sure that's the most efficient way, as there are over 25 million users. I'm not even sure the API is the best way. I'm a bit new to data analysis.
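A quick back-of-the-envelope estimate of why the per-user route worries me. It assumes one request per user and a limit of one request per second; both numbers are my assumptions for the sake of the arithmetic, and users with many ratings would likely need several paginated requests each.

```python
def crawl_days(num_requests, requests_per_second=1.0):
    """Days needed to send num_requests at a steady request rate."""
    seconds = num_requests / requests_per_second
    return seconds / 86_400  # seconds in a day


users = 25_000_000  # "over 25 million users", one request each (assumed)
print(round(crawl_days(users)))  # about 289 days at 1 request/second
```

So even in the best case this is most of a year of continuous requests, before any retries or pagination.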

I would love any suggestions on how to go about doing this.

