Goodreads Librarians Group discussion
note: This topic has been closed to new comments.
[Closed] Added Books/Editions > Large Book Data Import

https://www.goodreads.com/book/edits/...
Oh, thanks.

https://www.goodreads.com/book/show/2...
Most of the time these subject-tag authors are given a role, too, like 'editor' or 'illustrator'. Weird.

I was intrigued that Baby Animals wrote the preface, but not so sure about Bedtime Stories taking the photographs...
This may have been an attempt to gain more prominent search results by including key words in more places, though I'm not sure. Right now we don't have any blacklisted terms for author fields, but it may be worth investigating.
I'm not sure if we'll be able to get someone to work on that right at the moment, though.

That would be very useful. I think when/if you decide to do it you should consider the terms in this comment to be amongst the first banned.
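For what it's worth, a blocklist check on author fields could look something like the sketch below. This is only an illustration: the function names and the record shape (name, role pairs) are made up here, not anything Goodreads actually uses, and the two blocked terms are just the examples mentioned above.

```python
# Hypothetical blocklist; the two terms are the examples from this thread.
BLOCKED_AUTHOR_TERMS = {"baby animals", "bedtime stories"}


def is_suspect_author(name: str) -> bool:
    """Return True if the whole author name is a blocked subject tag."""
    # Lowercase and collapse runs of whitespace before comparing.
    normalized = " ".join(name.lower().split())
    return normalized in BLOCKED_AUTHOR_TERMS


def filter_authors(authors):
    """Split (name, role) pairs into kept and flagged lists."""
    kept, flagged = [], []
    for name, role in authors:
        target = flagged if is_suspect_author(name) else kept
        target.append((name, role))
    return kept, flagged
```

Matching on the whole normalized name, rather than on substrings, is a deliberate choice in this sketch so that a real author whose name merely contains a blocked word is not flagged.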

I thought policy was not to identify different editions like this.
Standardization would be to have the publisher field say "Smashwords"
https://www.goodreads.com/book/show/2...

https://www.goodreads.com/book/edits/...
The book was added in August by a user, with the ASIN, and then in December amazon_kcw replaced it with the ISBN for the ebook? (if I'm reading it right)
The edit is a bit old, so maybe this behavior has been fixed in the meantime.
Moloch wrote: "Is the Amazon import assigning ISBNs to Kindle editions?"
It was assigning both ISBNs and ASINs, but that was fixed a month or two back.

https://www.goodreads.com/author/show...

Here is an example: https://www.goodreads.com/book/edits/...
70021481 shows a change not made by the user. He had changed the language, and the ISBN change was added to his log.
Followed by my attempt to revert. The revert failed, so I fixed it manually.
I then reverted my own change, since I came to the conclusion it might be wrong.
The log for all editions doesn't show a script making changes at that time.
Another failed attempt to revert to the ISBN leaves the book page showing the ISBN and the edit page showing the ASIN. I haven't reverted my attempted revert yet, so that you can see it. I realize my revert may be wrong, but there is no way I would know the change wasn't made by the user.
Ellie, please don't revert those. They are from when Kindle editions were accidentally being imported with both ASINs and ISBNs; any edits removed the ISBN and kept the ASIN. Which is what we want for Kindle editions.
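To spell out the intended end state, here is a minimal sketch of the rule as described: a Kindle edition that carries both identifiers keeps the ASIN and loses the ISBN. The `Edition` class, its field names, and the "Kindle Edition" format string are assumptions made for illustration, not the actual Goodreads schema or import code.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Edition:
    # Hypothetical record shape; these field names are assumptions.
    format: str
    isbn: Optional[str] = None
    asin: Optional[str] = None


def normalize_kindle_identifiers(edition: Edition) -> Edition:
    """For a Kindle edition with both identifiers, keep the ASIN and drop the ISBN."""
    is_kindle = edition.format.strip().lower() == "kindle edition"
    if is_kindle and edition.asin and edition.isbn:
        return replace(edition, isbn=None)
    # Non-Kindle editions, or Kindle editions with one identifier, are left alone.
    return edition
```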

Well I couldn't and then I figured I shouldn't! I just left the one edition as an example. Will fix it.

I have been opening the edit page and saving it again with a note saying it was saved to show the ASIN instead of the imported ISBN.
Bookworm R wrote: "I thought they had run a script to fix those that came in with both ASIN and ISBN to just show the ASIN"
I know we stopped importing ISBNs for Kindle editions. Not sure if a script has been run to fix the ones that had already imported.

That is what happened when I tried to revert. Is it possible someone else tried to revert those editions?

I don't know. I have checked ones that I did the save with and they still show the ASIN.
Although I know sometimes things will show as my edit that I don't recall making, so it is hard to say.

https://www.goodreads.com/author/show...

This thread is for questions and discussions about the ongoing data imports. To request a book be added to the Goodreads database, please start a new thread.

Because I noticed a book that had just been imported today - without ISBN - where the author's name was all mangled, like Harriet Goldher Ph.D. Lerner or something. I deleted it.

Well, my question is about just that, the data import. To put it more simply: are you only importing American books from the American Amazon, or international ones as well?

rivka wrote: "To request a book be added to the Goodreads database, please start a new thread."
The person she is replying to had deleted their post which was just after yours and was a request to add a book to the database.
I can't really answer your question, but I don't think they are importing from amazon fr, gr, jp, etc. There are non-English books added to the database, but usually the records fit the data at either com or uk. Of course, I don't edit as many foreign editions as English ones.
And IMO I'm not sure the librarians could handle such a big import, considering all the problems that come up with it.

I checked two of the books that appeared and they both came in by auto import in April, one from amazon_kcw and the other from ingram-onix. (Not all of these are recent imports, but they still need to be cleaned up, and we need some way to stop this in future.)
The titles need help on some as well.
I'm mobile so did not make any changes, just noticed when looking at The Eighth Guardian edition.

Author = "pseud Mary Renault."
I deleted it.
Maybe filter out "pseud" from author field?
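A filter like that might be as small as the sketch below. It is only a guess at what such a cleanup could look like: `clean_author_name` is a made-up name, and the pattern handles just the leading "pseud" / "pseud." marker from the example above, not every catalogue-style annotation.

```python
import re

# Matches a leading "pseud" or "pseud." token followed by whitespace.
# The required whitespace keeps names like "Pseudonymous ..." untouched.
_PSEUD_PREFIX = re.compile(r"^pseud\.?\s+", re.IGNORECASE)


def clean_author_name(raw: str) -> str:
    """Strip a leading "pseud" marker from an imported author string."""
    name = raw.strip()
    cleaned = _PSEUD_PREFIX.sub("", name)
    if cleaned != name:
        # In the thread's example the marker came with a catalogue-style
        # trailing period, so drop that too, but only in this case.
        cleaned = cleaned.rstrip(".")
    return cleaned.strip()
```

Names without the marker pass through unchanged, so a legitimate trailing period (as in "Jr.") is kept.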

Among other issues, certain amazon_kcw imports have -publisher- or -publisher format- in author field. See, e.g.:
28-Mar-2014 import @ www.goodreads.com/book/show/21796828-...
11-Apr-2014 import @ www.goodreads.com/book/show/21897526-...

Have checked them all, one by one.
No matter how weird those titles sounded, 99% of them were correct - apparently they work with those strange abbreviations in these Skyscape editions.
I have filtered out the ones I could find authors for, but what is left is Skyscape. Whether that's a publisher or not, it seems they got the rights, and having checked their website, Worldcat, Amazon etc., I couldn't find anything else (when it comes to authors).
While I was at it, I've also added publication dates and covers, if there were any to be found on Amazon, Worldcat or the Skyscape website.

My not fixing it was deliberate. Pls don't fix until a GR tech person has a chance to see exactly what the problem is.

Yes, I figured that already. But how does that solve the issue of preventing messed up imports in future?

"My not fixing it was deliberate. Pls don't fix until a GR tech person has a chance to see exactly what the problem is."
I'm fairly certain that Rivka said earlier in this thread that things can be fixed. Just mentioning it here is enough for them to track.

Yes, but wasn't there also mention of evaluating some uniform import errors so as to create scripts to auto-clean them?

That was weird! Never seen any as bad as the first link. Must have been stuck in a blender ;) All fixed now & combined.
As for the second link: although somebody had worked on it, the author's name was still incorrect. Changed & combined with the existing editions.

Truthfully, hard to say. It seems like this thread has been left to Rivka and the development team has dropped off reporting back in.
I thought the auto-scripts were meant to correct past errors, but I still find the ISBN/ASIN import issue. So it may have been fixed for future imports, but it doesn't seem like anything went back and fixed the old ones?

https://www.goodreads.com/author/show...
The George R.R. Martin book was imported as recently as May 13th. As you can see, there are a LOT of books here. I suspect some (most, even) pre-date the latest large data imports, but some of them are recent.

As you can see, not much info there. Seems imported about ten days ago.

https://www.goodreads.com/book/show/1...

"As you can see, not much info there. Seems imported about ten days ago."
Isn't/shouldn't there be a rule about Not importing an edition, from Any Source, if there is no ISBN or ASIN? In other words, no automatic import Unless there is either an ISBN or ASIN.
I have seen imports with no numbers from sources other than Amazon.
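As a rough sketch of what such a rule might look like on the import side (purely hypothetical: the dict keys `isbn`, `isbn13` and `asin` and both function names are assumptions, not the real import pipeline):

```python
def has_identifier(record: dict) -> bool:
    """True if the record carries a non-blank ISBN or ASIN (string values assumed)."""
    return any((record.get(key) or "").strip() for key in ("isbn", "isbn13", "asin"))


def partition_import(records):
    """Split incoming records into those safe to import and those to hold back."""
    accepted = [r for r in records if has_identifier(r)]
    rejected = [r for r in records if not has_identifier(r)]
    return accepted, rejected
```

Holding the rejected records back for review, instead of silently dropping them, would at least let someone see which sources keep sending editions without numbers.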

I know, especially the ones listing the condition of the used book for sale as the synopsis...

I think such a rule was created for Amazon. That is why I am pointing it out. But I think the staff are no longer looking at this topic.

Another 3: https://www.goodreads.com/book/edits/...
Please make amazon stop :(

https://www.goodreads.com/book/edits/...
I like the reset to "Unknown Binding" best.
I undid everything. This is an ideal pastime for masochists...

"As you can see, not much info there. Seems imported about ten days ago."
After I deleted all BUT ONE of those duplicates I noticed amazon created ANOTHER copy two days ago!
I sent an e-mail, as no one is paying attention to this topic.
Michael wrote: "I like the reset to "Unknown Binding" best."
Frustrating right?