Goodreads Librarians Group discussion
note: This topic has been closed to new comments.
Updating some of the Unknown Book/Authors

Is this going to be another weekend job?

I meant after the full run. If Evan is correct in his suggestion that "there are bound to be some mistakes" then I for one would be willing to browse through the resulting books and see if I can spot anything obvious.

It would be good to be able to review the changes without just stumbling along.


I'd be happy to share links to the updated books, and definitely appreciate any feedback you have! I'm not sure it's feasible to post all 25,000 links, but I'll start by posting a random sampling of 100.
I will be starting the update in just a few minutes, and will post the links once the update is done.
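
A random sample like the one described here could be drawn roughly as follows. This is only a sketch: the script behind the update is not shown in this thread, so the `sample_book_links` helper, the placeholder ID range, and the seed are all hypothetical.

```python
import random

# URL prefix as it appears in the links posted in this thread.
BOOK_URL = "http://www.goodreads.com/book/show/{}"

def sample_book_links(updated_ids, k=100, seed=None):
    """Return up to k distinct book links chosen uniformly at random."""
    rng = random.Random(seed)  # a seed only makes a run repeatable
    ids = list(updated_ids)
    # random.sample draws without replacement, so no book is listed twice.
    chosen = rng.sample(ids, min(k, len(ids)))
    return [BOOK_URL.format(book_id) for book_id in sorted(chosen)]

# Placeholder IDs standing in for the ~25,000 updated books (assumption).
links = sample_book_links(range(1, 25001), k=100, seed=1)
```

Sorting the chosen IDs just makes the posted list easier to scan; it does not affect which books are picked.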

Here's a random sampling of the books that were updated. Let me know if you have any feedback!
Thanks!
http://www.goodreads.com/book/show/30...
http://www.goodreads.com/book/show/50...
http://www.goodreads.com/book/show/56...
http://www.goodreads.com/book/show/60...
http://www.goodreads.com/book/show/62...
http://www.goodreads.com/book/show/63...
http://www.goodreads.com/book/show/64...
http://www.goodreads.com/book/show/65...
http://www.goodreads.com/book/show/66...
http://www.goodreads.com/book/show/66...
http://www.goodreads.com/book/show/71...
http://www.goodreads.com/book/show/81...
http://www.goodreads.com/book/show/81...
http://www.goodreads.com/book/show/81...
http://www.goodreads.com/book/show/82...
http://www.goodreads.com/book/show/82...
http://www.goodreads.com/book/show/82...
http://www.goodreads.com/book/show/82...
http://www.goodreads.com/book/show/83...
http://www.goodreads.com/book/show/83...
http://www.goodreads.com/book/show/84...
http://www.goodreads.com/book/show/84...
http://www.goodreads.com/book/show/85...
http://www.goodreads.com/book/show/86...
http://www.goodreads.com/book/show/86...
http://www.goodreads.com/book/show/87...
http://www.goodreads.com/book/show/87...
http://www.goodreads.com/book/show/88...
http://www.goodreads.com/book/show/88...
http://www.goodreads.com/book/show/89...
http://www.goodreads.com/book/show/90...
http://www.goodreads.com/book/show/90...
http://www.goodreads.com/book/show/91...
http://www.goodreads.com/book/show/92...
http://www.goodreads.com/book/show/93...
http://www.goodreads.com/book/show/93...
http://www.goodreads.com/book/show/94...
http://www.goodreads.com/book/show/94...
http://www.goodreads.com/book/show/95...
http://www.goodreads.com/book/show/96...
http://www.goodreads.com/book/show/97...
http://www.goodreads.com/book/show/98...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...

http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/10...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/11...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/12...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...
http://www.goodreads.com/book/show/13...

I also noticed that this does not automatically combine works. So, someone will need to go back and see which ones need combining and do that.
Those are just my initial thoughts.

https://www.goodreads.com/book/show/1...
shows on Amazon as having two authors
http://www.amazon.com/Puppy-Tricks-St...
the second author is a dog (must avoid making joke!) but would you expect both authors to be added, or do you have a dog detector?
https://www.goodreads.com/book/show/1...
has an illegal author name "Dr. Blaise T. Ryan". If the import/fixer is just going to bring in authors then this is the sort of thing that needs to be fixed manually.
https://www.goodreads.com/book/show/1...
is still listed with an unknown author but the Amazon page
http://www.amazon.co.uk/Walking-God-Q...
does have "Harvest House Publishers" listed (and another bunch of real people inside)
https://www.goodreads.com/book/show/9...
this first author is still unknown but Amazon.co.uk lists Teri Fritz in that place
http://www.amazon.co.uk/Would-Like-Fr...
Back later

still listed as unknown author but Amazon has Steve Russell for that ASIN
http://www.amazon.co.uk/BLOOD-OF-INNO...
or on amazon.com
http://www.amazon.com/BLOOD-OF-INNOCE...
This one shows "Jason Vey" as primary author with "Gary A. Shilling" as secondary
https://www.goodreads.com/book/show/9...
Primary author should be "A. Gary Shilling" (different name) and I cannot see any sign of "Jason Vey" in connection with the book. Jason Vey on Goodreads does not appear to be into financial books
https://www.goodreads.com/author/show...
Looks like an Unknown Author has been combined with Jason Vey. Yuk.
Back later

Another thing of note is that the period appears to have ended up in the wrong place...

https://www.goodreads.com/book/show/1...
If your script could scan for a list of not-approved sequences (Dr, LLD, etc.), then you could post a list for fixing.
Still listed as unknown author https://www.goodreads.com/book/show/1...
but Amazon has "Colin Haskin"
http://www.amazon.com/Ferret-Girl-ebo...
this one has 2 authors listed "Isumi" and "Ted Smith"
https://www.goodreads.com/book/show/1...
neither Amazon, nor anywhere else, has a mention of Isumi and the proper author is "T.A. Smith" as is shown on the cover.
oops, also wrong on Amazon
https://www.goodreads.com/book/show/1...
Back later

https://www.goodreads.com/book/show/1...

https://www.goodreads.com/book/show/1...
http://www.amazon.co.uk/Nils-Holgerse...
logically, any author name which includes a plus sign is due for review.
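
The two review rules suggested in this thread (not-approved sequences such as Dr or LLD, and any plus sign in an author name) could be checked with something like the sketch below. The `NOT_APPROVED` set and the `review_reasons` function are hypothetical names; the list holds only the two sequences named in the thread and would need extending.

```python
import re

# Assumption: a hand-maintained list, seeded with the examples from this thread.
NOT_APPROVED = {"dr", "lld"}

def review_reasons(author_name: str) -> list[str]:
    """Return the reasons, if any, why an author name should be reviewed by hand."""
    reasons = []
    # Split into runs of letters so that "Dr." matches but "Drake" does not.
    tokens = {t.lower() for t in re.findall(r"[^\W\d_]+", author_name)}
    for hit in sorted(tokens & NOT_APPROVED):
        reasons.append(f"not-approved sequence: {hit}")
    if "+" in author_name:
        reasons.append("plus sign")
    return reasons

flagged = review_reasons("Dr. Blaise T. Ryan")
```

This only produces a list for manual fixing; it does not try to rewrite the names itself.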
still shows as unknown
https://www.goodreads.com/book/show/8...
amazon has "Tammy Valentine"
http://www.amazon.com/Fresh-To-Your-D...
this one would need the author combining with the proper version
https://www.goodreads.com/book/show/8...
https://www.goodreads.com/author/show...
this naughty book is still showing unknown
https://www.goodreads.com/book/show/1...
Amazon has it listed as by "Blake Cross" (the 'look inside' is NSFW!!)
http://www.amazon.com/Beauty-Undresse...
Back later

https://www.goodreads.com/book/show/1...
but the logs show "amazon_sable updated the book Dividing Worlds by Jan Ögren " so the logs know who the book is by.
Amazon agree,
http://www.amazon.com/Dividing-Worlds...
We have an author "Jan Ögren MFT"
https://www.goodreads.com/author/show...
with an edition of the same title but when I tried to remove the MFT from his name I got the prompt to merge author profiles with the "Unknown Author 30" that shows on the book above. Something appears to be a bit confused.
EDIT: if I go to the Unknown Author 30 page there is no sign of the book

Thanks for all the feedback! It looks like the biggest problem is with the author names - either not being updated, bad characters sneaking in, or being combined incorrectly with a different author.
We're going to see if we can address some of these concerns with the script. For example, this run has uncovered that some books are storing multiple authors in a non-standard format (at least when one of the authors is a dog!). We may also be looking into using a different data source.
I don't have an ETA for when the second pass of the script will happen, but hopefully it will be soon. I'll update this thread once I know more.

https://www.goodreads.com/book/show/1...
!ruby/hash:ActionController::Parameters text: Harvest House Publishers language: ''

I've reverted the changes you just made. This thread is for debugging developer problems, not for editing.

https://www.goodreads.com/book/show/1...
illegal author name
https://www.goodreads.com/book/show/1...


Possibly that should have been made clear to us non-techies in the thread title, or at least near the top. I was tempted to start "helping" until I read all the way down to post 22. :-)

I would have thought post 1 makes it fairly clear what is going on. I expect the problem is that many people seem to read the end of a thread first. We'll blame it on the staff for not making it clearer.


Um, not to clueless me anyway.
Abcdarian <---has some bugs of her own to work out


I have some bio stuff for him as well if needed. Let me know in a reply to this comment. Thanks in advance!

In order to be able to use an author's photo it needs to be one that is free to use. This means that either:
- You own the rights to the image (usually meaning that you created the image yourself).
- You can prove that the copyright holder has licensed the image under a free license.
- You have explicit permission from the author or the copyright holder of the photo to use it here.
Does the photo on your profile meet with one of these conditions?
The bio stuff is always welcome, just post it in this group (preferably in a new thread) and a librarian will add it for you. :)


We're experimenting right now with what kinds of data we can programmatically fix. The upper-case is something we can partially fix, but the extra words in the title and the language will unfortunately be harder.

BOOK: https://www.goodreads.com/book/show/6...
CORRECT AUTHOR: https://www.goodreads.com/author/show...


I've noticed the same problem. Additionally, if you search for a book's ASIN, it will bring up the Kindle edition, yet when you look at that edition's page, all that is displayed is the ISBN. (If you go through to the edit page, the ASIN is there, but if you click between ASIN and ISBN both wind up disappearing.)
I'm not opposed to an edition having both an ASIN and an ISBN, as the ISBN is often found inside the book but the ASIN is how readers can find the book at Amazon (or in their own collection), but if it is possible to assign both, the edition page should display both. If the edition page can't display both, or editions shouldn't have both, then whatever is causing the Amazon importer to include both should be fixed to include only one or the other.

Unfortunately, I fixed all the ones I've seen so far before I thought to post in here, so I don't have any examples to include.

Here is an example:
https://www.goodreads.com/book/show/1...
Are there any with more recent import dates? (That one's import date is 10/12.) I ask because I believe the issue that was causing "Last, First" imports has been fixed, although the effect is not retroactive.

https://www.goodreads.com/book/show/1...
Yeah, any with that date (or very close, maybe up to 2 days later) are from the batch before we fixed that issue. Feel free to fix them.
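
For anyone curious what fixing a "Last, First" name involves, here is a minimal sketch. It is not the importer's actual fix, which is not shown in this thread; `flip_last_first` and the `SUFFIXES` set are hypothetical, and anything ambiguous is deliberately left alone for a librarian to check.

```python
# Assumption: name parts that follow a comma without being a first name.
SUFFIXES = {"jr", "sr", "ii", "iii"}

def flip_last_first(name: str) -> str:
    """Turn 'Last, First' into 'First Last'; return anything else unchanged."""
    parts = [p.strip() for p in name.split(",")]
    if len(parts) != 2 or not all(parts):
        return name  # no comma, several commas, or an empty half
    last, first = parts
    if first.lower().rstrip(".") in SUFFIXES:
        return name  # e.g. "Smith, Jr." is not a reversed name
    return f"{first} {last}"

fixed = flip_last_first("Vollmann, Vanessa")
```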

This one has two authors with commas,
https://www.goodreads.com/book/show/1...
The second one was added a month ago.
amazon_kcw updated the book München Manhattan #1 (German Edition) by Vollmann, Vanessa
additional author added: Reiter, Annette
Oct 12, 2013 09:38AM (#57641576)

This was another case (now corrected by me) of a Kindle edition with an ISBN that turns into an ASIN when you go to the edit page. Also the title had the usual "(Italian Edition)", the language was wrong and the description had stuff that's not allowed on Goodreads (reviews, critical praise, text in all caps, etc.)

Is 'onix ingram' amazon? Anyways, seems like an easy script to write, just changing all 'audio' to 'audiobook', methinks.
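
The 'audio' to 'audiobook' change suggested here might look like the sketch below. The dict-shaped record and its `format` key are assumptions made for illustration, not the real Goodreads schema. Only a bare "audio" value is renamed, so other values such as "Audio CD" stay as they are.

```python
def normalize_format(book: dict) -> dict:
    """Return the record, with a bare 'audio' format renamed to 'audiobook'."""
    fmt = book.get("format")
    if isinstance(fmt, str) and fmt.strip().lower() == "audio":
        # Build a new dict rather than editing the record in place.
        return {**book, "format": "audiobook"}
    return book

updated = normalize_format({"title": "Example", "format": "Audio"})
```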

and this https://www.goodreads.com/book/show/1...

https://www.goodreads.com/book/show/1...

https://www.goodreads.com/book/show/1...
I haven't updated it as of yet.
I wanted to let people know that we are going to start updating data for books currently labeled as 'Unknown Book xxxxx'. This will be an automated process - while we've done a lot of testing, there are bound to be some mistakes.
In the first round, we will be focusing primarily on fixing titles and authors. We will unfortunately not have publisher, format or language information. We also know that some books will be under an ASIN when they should have an ISBN instead. Subsequent passes will hopefully update these missing fields.
The first pass will consist of ~25,000 books.
Please let me know of any questions or concerns!
Thanks,
- Evan