Goodreads Librarians Group discussion
note: This topic has been closed to new comments.
Archived
>
Amazon is going away as a data source

Dely, it is only 7:33 a.m. on Monday in California, where GoodReads is based. Give the staff a chance to get to their desks--I'm sure we'll get some definitive answers later today.

When you are on the "rescue the book" page, simply click the check box next to...
I have a physical or ebook copy of this book present.
No URL is required when you have the book in hand. Enter the data and hit the submit button. That is all that is need to save a book.


Elizabeth (Alaska) wrote: "They have answered many questions, and they may not (and I don't know one way or the other) know the answers to many questions. I think they're probably up to their a$$es in alligators. They don't ..."
Yes, of course. I will wait for the answers and I know they don't have a good time with all these problems.

My understanding (and I could be totally wrong, so don't quote me on this) is that no they can't.
The particular title and author being associated with that ISBN was the service that Amazon was providing. Now the relationship is dissolving and so Goodreads can't use that information to make that association anymore in any way.

James wrote: "It appears we're to..."
Insane? I don't think so. Stressed? Probably!
Amazon is pulling there data out of Goodreads. It is my understanding that if the import doesn't cover a particular book and no one is around to "rescue" it, then when Amazon pulls their info it will leave Goodreads with blank data fields for that book. I don't believe this is an end result that Goodreads really wants to happen. From the previous comments from staff, it seems like the upcoming import of data from other sources and the librarian rescue tool is an attempt to keep this from happening to as few books as possible.

Only 10 days' notice to fix something that required many hours of data entry and will take even more time to correct and seems likely to disproportionately impact non-Americans, many of whom seem to have been already feeling less than appreciated is understandable as a source of frustration.


Then why not just cut the references to Amazon? Anyone can go to Amazon and look up this information and they don't need license or permission to do it. Just wipe any summaries that may have come from Amazon writers, and if we must, any imported images. To say that we can't use it to find out how many pages a book has, or who published it and when, is absurd. This is throwing out the baby with the bathwater.

2. While ..."
I've had the same problem. It's infuriating!!

I know this won't be changed during the current mess, but it's silly and now proving very expensive in time and aggravation.
And frankly, not that it matters now, it seems unlikely that Amazon would pull out of what must have been a fairly lucrative deal with GR, given both whatever they get paid for the data base and the books bought thru the site or because of the site and the large group of potential buyers they are now pissing off with no reason and/or notice. Just saying I don't believe it for a minute, though I understand spin.
The rescue form cannot update the work fields, only the edition-specific ones. That means that if the work field has a first-pub date that is incorrect, the pub date may not be able to be entered on the rescue page. Since pub date is optional, just leave it blank.
I understand that this is frustrating, but it's definitely better than when it didn't object to the pub date and just didn't fix the book. (It took some doing to figure out why, when we were testing it.)
I understand that this is frustrating, but it's definitely better than when it didn't object to the pub date and just didn't fix the book. (It took some doing to figure out why, when we were testing it.)


But is partially keeping that information allowed? We don't know for sure.
Not that it matters. They could still change it to "RESCUE-ME-{ISBN}"or just "RESCUE-ME" and it would mean we could fix the book titles later. If the titles are blank we can't search for the books that need to be fixed.

It's not hate - it's stress. We love Goodreads. :)

"
Well Amazon are the owners of Shelfari and if they have raised the price beyond what Goodreads can afford for use of their database, they may also consider pissed off GR users may flock to their rival site rather than blame their greed for what is after all information in the public domain. I think this is why the GR staff are urging the use of non-e-commerce sites to verify info including our own eyes.


I'm glad I'm not the only one who feels this way! I totally sympathize with the people who have hundreds of thousands of books at stake, but I don't think taking it out on Goodreads is the right approach.
If Amazon says that Goodreads has to remove all the data received from them by January 30th, then it has to happen. None of us know when Goodreads was given the deadline, but I wouldn't be surprised if it was January 1st. Before they could tell us, they had to develop the tools to properly attribute data to the librarians that made the change, the tools to rescue a book and the partnerships to enable them to import data from other databases.
If they had told this was happening without giving us the means to see what books were at risk, people would have been complaining that we had no way to see what books were at risk and would have been worrying for weeks longer than necessary.
I'm quite sure that Goodreads is taking this seriously and working as fast as they can to get new information. There is a lot of information we don't know about the situation from the legal and contractual points of view, and I have faith that Goodreads made the best decision they could within the amount of time they had.
No it's not optimal, yes there may be users who lose lots of books in the process (back them up!), but for Goodreads sake as a company and as a tool for users - I would rather have a handful (even a very large handful) of users loose books than have Goodreads go bankrupt and shut down when Amazon sues them for illegally using their information.
Angela wrote: "Couldn't the titles be renamed to "RESCUE-ME-{OLD-TITLE}" and "RESCUE-ME-{OLD-AUTHOR}"? Or "RESCUE-ME-{ISBN}"?"
They can't. Changing the title to "RESCUE-ME-{OLD-TITLE-STILL-FROM-AMAZON}" is the same as keeping the title from Amazon. Also, I'm sure the ISBN field in the database is a numerical field and would not support such a prefix.
I don't think Goodreads wants to remove the titles, authors, isbns - but it's not their choice. If it comes down to January 30th (which is a deadline imposed on Goodreads, not one they've chosen) and the title/author/isbn is still sourced by Amazon, the data has to be deleted. If they keep the data, they will face legal action from Amazon, which is something I suspect Goodreads is in no [financial] position to counter.
If the title/author/isbn are sourced by Amazon, they get deleted. We are then left with a ton of books with x number of pages, published in some month of some year by some publishing company - who's going to sleuth those books out?


I'm glad I'm not the only one who feels this way! I totally sympathiz..."
I'm in the same boat with you, and hope this is actually a teaching opportunity for any solution architect out there preaching "cloud", as well as anyone who has all their data (email, documents, photos, etc) in say.... Google.
Lindea wrote: "On the rescue page you should add a "information is accurate* button, I've just saved three books which had the exact same details as it was stated on their page already"
Lindea, that would just be confirming the Amazon data, and that's exactly what we can't do.
Lindea, that would just be confirming the Amazon data, and that's exactly what we can't do.

Most probably. I suspect if Amazon had given Goodreads the opportunity to grandfather in the data, we wouldn't be dealing with this now.
I don't know a lot about the Terms & Conditions for either B&N or Amazon, but I do know that they allow websites to use their APIs primarily as an end game to get them more money. Both B&N and Amazon required that books listed on Goodreads using their information were linked primarily to them. Which is why Barnes & Noble and Amazon have their own buttons separate from the "online stores" button on a book page.
Amazon is all about making money and Goodreads gives people any number of stores to compare prices. This is purely speculation, but I wouldn't be surprised if Amazon wanted to limit the purchase options for books sourced by them to Amazon and only Amazon.



Wikipedia says the current year on the Persian calendar is 1390. So an edition published anytime in the past 90 years or so . . .

And you aren't even paying for this service!

Completely agree. No-one's said you must fix the database. It's a free service and one that does a great job. But of course where would the self-important be without their outrage?

Librarians, who have been asked to do an unknown amount of work to help fix this, also have lives and families.
And no, we don't pay to use the service though some of us would if we could we are the ones who add a great deal of value to it. (By participating, we enable GR to sell ads and those of us who are librarians have made GR more useful by trying to make the database more comprehensive and accurate, a little bit at a time.)
I maintain that frustration is understandable, even if it seems excessive at times. Many people are looking at the possibility of losing the data entry work they've put in. Unhappiness is kinda natural. Even if no one held a gun to their head and made them do that data entry in the first place, or is making them fix it now.

No, they weren't, which, I guess is the part that has completely gone missing from people's attention spans. They were asked to test the system. Test. Try a couple of books to see if it works. That's it.


Link to WorldCat Blog about this, with link to their scanner app:
http://worldcat.org/wcblog_config/mt-...
Sorry if this has already proven to be stupid and/or unworkable. I know nothing about scanner apps.

The fact is, they don't know how many will still be endangered after the import. Would you have them import and then test a system? Or would you rather they put out this test first? I don't pretend to know how complicated this is, I just think all this animosity directed at Goodreads is misplaced.

You are again making assumptions and if you were a programmer you would know that you always test and retest before you ever put anything live or into production. Because the librarians have spent so much time on working on the data, they've asked us to run a few tests. They also asked us to wait until they imported the data from the new source. Which quite frankly has the potential to blow up so they are most likely putting in on a test server to analyze the data before they pull it into the main database. Do you think this all happens by itself? It takes hours and hours to comb through code to make sure a script you've written in one area doesn't break something in another area. It's tedious, mind number work and I'm so thankful that I don't have to do it and that I'm not the one who's going to hit the "live" button and pray I caught everything.



Thanks to Michael for providing a partial update here.

You can save the cover pics who are on GR and then add them again in the tab of the book.

Ok Rant over!
Lynne - The Book Squirrel wrote: "why when the ISBN number is there can't the title and author be there too to save time?"
Because that is exactly the information we cannot use unless and until it comes from another source.
Because that is exactly the information we cannot use unless and until it comes from another source.

We don't know where GR will take these new data and we ("we" means non-english readers) don't know if our editions will be again there. We have asked but nobody said us something. Only Otis said that GR will refer to Mondadori. This is not enough, we have a lot of other editions who aren't Mondadori. If GR needs more names of Italian publishers we are ready to help, they must only tell us and we are glad to be usefull.
I haven't tried a couple of books, I have tried and I will try to save as many foreing editions (above all Italian) as possible. For us it is a big risk to wait and see what will happen.
I don't blame GR and I am sure they will do the possible for every member but if they could only say that there will be a huge quantity of non-english editions we would be more relaxed.

Are we still allowed to use Amazon cover images? I have read earlier that this is okay under "fair use" but I'm not sure if the Jan 30 changes affect this.
I tried to look through the hundreds of comments, but didn't find a direct answer to this question.
This topic has been frozen by the moderator. No new comments can be posted.
Books mentioned in this topic
The Tough Guide to Fantasyland (other topics)The Tough Guide to Fantasyland (other topics)
The Tough Guide to Fantasyland (other topics)
The Newbery Companion: Booktalk and Related Materials for Newbery Medal and Honor Books (other topics)
Der Weiße Wolf (other topics)
More...
Authors mentioned in this topic
Norbert Elias (other topics)Catherine Gaskin (other topics)
Of course not, I am confident.
But it would be appreciated if somebody could answer the questions we make. This would surely calm members.