Goodreads Librarians Group discussion
note: This topic has been closed to new comments.
Archived
>
Amazon is going away as a data source

http://www.goodreads.com/rescue_books/at..."
I'll take a look and see what I can do. No guarantees, though -- and if someone in France has any good resources, they'd be unquestionably better positioned than me!

Please differentiate--as I have been doing-- between a notice posted on the librarian group page for active group-checkers, and the notice sent to librarians' personal email accounts, and you will perhaps better understand why I believe twice as many records could potentially have been saved had the email notice been sent in simultaneity with the posting of the group notice. I maintain that given the scale of the problem, all librarians could and should have been rallied by every possible communication channel as soon as was prudent-- and according to when this change was made known on the group page, that seems to have been on the 20th, not the 25th. I do not anywhere say we should have been notified before goodreads knew what we should do to prevent loss of records, as you suggest in your previous response. I hope this clarifies for you what I'm saying and what I haven't said, as I shall not contribute any more to the lengthening of this thread by repeating this point again. My concern has been expressed; I have not asked anyone to agree with it, but understand it correctly at least.
Thank you.
Brixton, between when the notice was posted in this group and the email was sent, over 14 million book records were imported from various sources. The email was sent out shortly after the major imports were completed.
Brixton wrote: "The first rescue I decided to look back at revealed something I do not think should be happening, and I'm guessing that if I've discovered this on the first random selection, it's happening a lot (..."
Ok, the devs figured this one out. It was ONLY affecting the publisher field, FYI -- no other data should have bee affected. And as of the next release, it won't do that anymore either.
Thanks for reporting the bug!
Ok, the devs figured this one out. It was ONLY affecting the publisher field, FYI -- no other data should have bee affected. And as of the next release, it won't do that anymore either.
Thanks for reporting the bug!

Hmm - you might be able to do it yourself, actually, by looking at his 'My Books.'

Thanks Cheryl. I don't really see it as my job to export my friend's books for him: it's been tough enough looking after my own.
I've posted on Facebook so that, hopefully, he'll become aware of this predicament, given that even after the Ingrams import he has hundreds of unmatched ISBNs.
But my main point is that given the scale of what he has been left with, and as I don't know that he's even been informed by Goodreads that he has an issue, he'll need more time to save and match up his books.

Reclam Verlag
http://www.reclam.de/
Deutscher Taschenbuch Verlag
http://www.dtv.de/
Might be worth looking into for future goodreads imports as well where german books are concerned. =)

Thank you!"
Done -- and no problem. Mind that I didn't upload any images of book covers, though, so unless somebody else finds the time and resources to do something about this, you may want to warn her that she'll find about 40+ books without cover images on her shelves come Jan. 31 ... My focus at the moment is on saving books! :(

Awesome! I wrote to let him know to export his books just in case, I'm sure he'll be relieved to hear his books are saved at least, thank you so much!
I'm having the feeling that on Tuesday I'm going to check the site and it will look like it did 4 years ago when I joined. A lot more "No Image Available"s and no descriptions. I think it's appropriate to (maybe this language is too strong, but how else to put it?) grieve the loss of so much of our work. Diligently did I once pull all the books off my shelves and enter or fix page counts, original publication dates, and everything else, and all the library books I fixed which I most certainly will not get again for this purpose; so much of that work has been goofed up by amazon, now simply disappearing before our eyes. Sorry, I'm having a Sisyphus moment. :o\ But I also think of this as an opportunity to start clean(-ish), since I've always felt amazon has been more of a wrecker than a support of good, hand-entered data. I'm hoping this means all the calendars, floor displays, posters and everything else not a book will disappear from the combine pages? :o)))

*cringe*
Tiffany wrote: "i hate to ask this (and i haven't seen it asked anywhere else, but you know... 200 pages of comments...), but is there a way to find out if there are any books in our groups that are at risk for be..."
Not really. You could go to each one's book page. How annoying that would be depends on how many your group(s) have shelved, of course.
Not really. You could go to each one's book page. How annoying that would be depends on how many your group(s) have shelved, of course.

My thoughts exactly -- and wouldn't the loss of all those other-than-book-paper-products be a nice byproduct of the whole process?! ;)
Sorry for the inadvertent gender bender on your friend, btw :~ (seems I've been doing too much of my joint resuing thing with women up to now; besides, in my defense I' just mention that it's already rather late where I am ...)
/s/ Sysypha

I was afraid of that :) Okay, thanks rivka!

I just can't wrap my mind around this. Maybe this is a product of granting corporations personhood, but there's something I'm missing fundamentally in understanding where amazon gets off forcing us to pull data which has *become* ours through use over time.
For instance (and I'm just making this example up for illustrative purposes-- or maybe this actually happened? after 4 years, who can remember such things?): Let's say once upon a time I was looking at bookswap and I saw someone listed a book which said this: "last picture show" by "larry mcmurty" isbn1234, and this data was sourced from some sloppy amazon used-book seller. I say to myself, oh that's not right, and like a good little librarian, I edit the record to "The Last Picture Show" and "Larry McMurtry". Where did I get this information? Technically, I first got it from having seen the film. Sony Pictures Entertainment is not going to come after goodreads and say "That's our data!" (or will they? <-kidding). But amazon can come and say, "Hey, that isbn1234, 'The Last Picture Show' by 'Larry McMurtry'? That's ours-- yeah thanks for improving our data, but suck it."
And I can't rescue that book now if that's not an edition I own, it was probably an old mass market paperback I'll never find a url for, but I can't say "I recognise that copy from when I fixed it from bookswap, and I know 'The Last Picture Show' and 'Larry McMurtry' go together" without a weblink.
This is just bizarro-world to me that what's in our brains has effectively become through legal tricks and turns property of a corporation.

It's not so much that the data belongs to Amazon.
What it comes down to is that Amazon and Goodreads had a contract that permitted Goodreads to automatically import and display data from Amazon. The contract is no longer valid and Goodreads can no longer use that data.
Once upon a time a boyfriend of mine let me have his old video card for my computer, after I dumped him he called and wanted the video card back. Amazon is doing the same thing - the only difference is that I said no and kept the video card, Goodreads can't do that.
Well - I suppose they could, but only if they want to spend the next few months going bankrupt as they attempt to fight off the Amazon Super-Giant in court... which doesn't sound like a great idea to me. ;)

http://www.goodreads.com/book/show/91...

Actually, if it's a non-Kindle ASIN edition, it should get deleted (and only imported due to a bug).

That what I thought, which is why I suggested she switch editions.


The problem I'm having has to do with well-known works. For example, I have a book I'm trying to rescue which is a rather obscure omnibus of a very well-known SFF series. I can only find a handful of sites through a Google search which recognize it, and they are all pay sites. The only piece of information which is needed for rescuing is the author name. If I went up to 100 SFF fans and asked who wrote TITLE (of the omnibus), 99 would probably know the answer without thinking about it. However, we cannot rescue this book unless we find a source for the author. There's just something fundamentally wrong about this.
Every book on my shelves that still needs to be rescued is an obscure omnibus collection of a well known author or series. In no case is the title in question; every single one requires an author, with about half looking for publisher, # pages etc. Since the author and title are the most critical pieces of information, the fact that we can't rescue books whose (not-needed-to-be-rescued) titles are known and which many people could easily name the author from the title without every having seen a physical copy of the book, there just seems to be something fundamentally problematic with the requirements.
It's not as bad as the person who asked why we needed a source for every individual copy of Moby Dick to confirm that it was Herman Melville, but the principle is the same.
Legally this may be the situation we're stuck with, but it really doesn't make much sense.
Sarah wrote: "Or to create a new edition without an ISBN and then combine them?"
This is better. It's cleaner -- there's no question of where the record came from.
This is better. It's cleaner -- there's no question of where the record came from.

Michael, a personal record of book information can be used to rescue books too. I had an old (pre-GR) spreadsheet with some info from library books, for example.

I think the answer to what that something is might be found here: http://en.wikipedia.org/wiki/Feist_v....
Some quick pull-quotes:
"information alone without a minimum of original creativity cannot be protected by copyright"
"Information (that is, facts, discoveries, etc.), from any source, is fair game"
"a copyright holder in a compilation of public domain data cannot use that copyright to prevent others from using the underlying public domain data"
Btw, I'm the one who said that about Melville ;o)


I understand, Brooke, but that service being pulled should not require us to have a physical copy at hand or otherly-sourced url to prove we know ISBN 9780141439723 = Bleak House = Charles Dickens. Those three pieces of information are public domain; we can't use amazon to automatically plug in the numbers/titles/authors, but in my understanding of the case cited above we can use our own hands to plug those numbers/titles/authors in from anywhere, including our own eyeballs, memories, and even cutting and pasting from the existing goodreads page or amazon's (as probably a super lot of us have always done during regular every-day editing).
Please note I am not advocating doing anything other than what goodreads has asked us to do, I have merely provided the information above in an effort to help myself and others get at something many of us are having trouble putting our fingers on, specifically: how is it that amazon "owns" the ISBN/title/author association, and that continuing to use this information would be a legal infringement of some kind. Seems to me come Jan 30, goodreads would have to stop importing from amazon, and only creative content (descriptions, reviews, covers, etc) would have to be deleted. Right now, it's the needing to have a physical copy/record or source url that is perplexing, and preventing folks from rescuing certain books.
Since I apparently suck at finding good source urls, I personally am going to wait til Tuesday to see what the fallout looks like-- and return to normal librarian editing in the normal way, which is what I'm good at, even if that means we're having to re-build from the ground upwards again. We've done it once, it can be done again.


But if that's the case we should be allowed to google the book we're looking for, click on the Amazon result (because let's face it that's often the first one) and then manually input that data into goodreads, but we can't. For a description of the book created by Amazon that would be fair enough, but we're not supposed to get any of our facts from there.
Like Brixton I'm not suggesting anyone ignore goodreads' rule to not use Amazon as I'm sure they put it there for a reason. Just saying that this change hasn't just stopped the importing of data from Amazon but also librarians using it simply to check *facts* that Amazon has no copyright over and enter them manually.

This is better. It's cleaner -- there's no question of where the record came from."
Okay, that's what I'm doing. It ends up creating essentially a duplicate book. I'm combining editions so that reviews show up in the right place. My hope then is that when January 30 comes around, the editions should be merged so there won't be any more duplicates.


I would understand that database to be a "compilation of public domain data", as in: "a copyright holder in a compilation of public domain data cannot use that copyright to prevent others from using the underlying public domain data" (emphasis mine).


I have a copy (physical, ebook, or audio) of this book or a personal record of the book data.
Do you think the Amazon police are coming to your house to inspect your personal records of book data?

Is this really a serious question? Well if you insist on asking, I shall provide an answer: Certainly I do not, but I also am not capable of being purposely dishonest (if this is what your question is rhetorically designed to suggest). You know they added "personal record of" only after I asked about friends providing me with needed information to save their books, so I am not sure why you are pointing this out. Doing so does not answer the lingering question of why we must provide otherly-sourced urls for books we do not own or have record of, or why we are strongly advised to not get even a page count from a bookseller.
I for one will not ever defend amazon's supposed right to require goodreads to self-destroy records when there seems to be legal precedent stating goodreads should not have to do so.


At this point, my understanding is that we're just trying to disassociate the data from Amazon and make sure we're replacing (or verifying) the data via independent sources that won't cause us to have future issue of this same nature.


- weird diacritical marks that don't match the ones in the GR database, which will not match up with GR author entries.
- they don't have format data (hardcover, paperback, etc.)

* I open the existing GR book page in one window
* I open the rescue page in another window (side-by-side)
* I check WorldCat to make sure the author/title is what I'm expecting
* I copy/paste the URL
* I type in correct data according our GR way of doing things (if there are differences)
Of course, there are going to be issues with author ambiguity again... At least, I haven't been paying attention to spaces! (I just thought of it, actually!) But at least the spellings will be correct...
Patrick wrote: "We are importing directly from WorldCat, so you can feel fine about using data from there (though it should get imported automatically, eventually)."
My wrist is tired -- so I hope when I wake up tomorrow the imports will be completed! :)
My wrist is tired -- so I hope when I wake up tomorrow the imports will be completed! :)

We have just opened up the rescue books page to all users, and later tonight (Friday) or early tomorrow (Saturday), we will be sending an email to all authors with a book at risk, as well as to all users with books at risk. We are continuing to import books from WorldCat and several other sources, which will help a bit.
Thank you for all your efforts in helping us with this transition. We’ll continue to keep you updated and please don’t hesitate to let us know if you have additional questions.

I ask because I sorted out all my other books so my list is currently saying: "Great news, all of your books have been rescued!" But I went and found some kindle books I have shelved and they're still in need of rescuing according to their book pages. So it would be great to see those on the rescue list if that's possible.
This topic has been frozen by the moderator. No new comments can be posted.
Books mentioned in this topic
The Tough Guide to Fantasyland (other topics)The Tough Guide to Fantasyland (other topics)
The Tough Guide to Fantasyland (other topics)
The Newbery Companion: Booktalk and Related Materials for Newbery Medal and Honor Books (other topics)
Der Weiße Wolf (other topics)
More...
Authors mentioned in this topic
Norbert Elias (other topics)Catherine Gaskin (other topics)
An excellent point. There are millions of books out there. Even 10,000 librarians can't make that much difference in a week. I hope someone is making an assessment of how many books in the database remain unrescued after the big import.