Goodreads Developers discussion
Encoding Errors


Thanks for bringing it to my attention.




Let me know if this actually makes a difference.


http://www.goodreads.com/book/show/68...
Thanks

Someone got an End of Medium character in their review. I'm not sure what that is, or why it matters, but I think it makes the XML invalid, so I edited it out.
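For anyone hitting the same thing: End of Medium is the control character U+0019, and XML 1.0 does not allow it (or most other C0 control characters) anywhere in a document, so a strict parser rejects the whole response. Here is a minimal Python sketch of one way to strip such characters on the client side before parsing. The function names are made up for illustration, and this is not necessarily how Goodreads handles it.

```python
import re
import xml.etree.ElementTree as ET

# Everything outside the XML 1.0 "Char" production:
# tab, LF, CR, U+0020-U+D7FF, U+E000-U+FFFD, U+10000-U+10FFFF are legal.
_ILLEGAL_XML_CHARS = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_illegal_xml_chars(text: str) -> str:
    """Remove characters that XML 1.0 forbids (hypothetical helper)."""
    return _ILLEGAL_XML_CHARS.sub("", text)


def is_well_formed(xml_text: str) -> bool:
    """Return True if the standard-library parser accepts the document."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return True


# A review body containing End of Medium (U+0019), as described above.
dirty = "<review>Loved it\x19 so much</review>"
clean = strip_illegal_xml_chars(dirty)
```

With this, `is_well_formed(dirty)` is false and `is_well_formed(clean)` is true, because the only change is that the U+0019 character is gone.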

http://www.goodreads.com/book/isbn?ke...

We've removed the bad data. We'll continue to look for more examples. Please keep us posted if you find any more.
Thanks!


So yeah, we've got a list of 100 or so ISBNs that are giving back invalid XML, and we're working through them. Hopefully we'll push out another change tomorrow to address a large number of those.

Casper.

That's pretty much exactly the solution we're working on today. :)

I am using the forum because it might help someone else out and the responses have been pretty quick. :)

Are there still books out there causing non-UTF-8 data to be generated in the feeds?

There are probably some, but if you find any examples, let us know and we'll try to scrub them. We wrote a little method to clean out non-UTF-8 stuff.
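The method itself isn't shown in this thread. As a rough illustration only, a scrubber along those lines could look like this in Python. `scrub_non_utf8` is a hypothetical name, and the real Goodreads code (which runs on Rails) may work quite differently.

```python
def scrub_non_utf8(raw: bytes, drop: bool = True) -> str:
    """Decode bytes as UTF-8, handling any invalid byte sequences.

    drop=True  silently removes invalid bytes.
    drop=False replaces each invalid sequence with U+FFFD so the
               damage stays visible in the output.
    """
    return raw.decode("utf-8", errors="ignore" if drop else "replace")


# b"\xe9" is "é" in Latin-1, but on its own it is not valid UTF-8.
sample = b"Caf\xe9 society"
dropped = scrub_non_utf8(sample)               # invalid byte removed
marked = scrub_non_utf8(sample, drop=False)    # invalid byte -> U+FFFD
```

Dropping bytes keeps the feed parseable but loses the character. If the bad data is really Latin-1 or Windows-1252, decoding it with the right codec would preserve it, though that needs a guess about the original encoding.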
1. The Short Second Life of Bree Tanner - ENCODING ERROR at line 959 (Book ID:7937462)
2. The Girl Who Kicked the Hornet's Nest - ENCODING ERROR at line 384 (Book ID:6892870)
3. Mockingjay - OK (Book ID:7260188)
4. The Help - OK (Book ID:4667024)
5. The Girl with the Dragon Tattoo - OK (Book ID:2429135)
6. Delivering Happiness - OK (Book ID:6828896)
7. Dead in the Family - OK (Book ID:7091488)
8. Game Change - OK (Book ID:6694937)
9. Women, Food, and God - OK (Book ID:6758423)
10. The Girl Who Played with Fire - ENCODING ERROR at line 95 (Book ID:5060378)
The book.show call is not the only one that returns results with encoding errors, either; they have appeared in any call that includes user-generated content. Assuming this sample is representative of the entire library (3 of the 10 books above failed), users accessing Goodreads content via the API have roughly a 30% chance of running into an encoding error.
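In case it helps anyone reproduce a check like the one above, here is a small Python sketch that finds the line of the first invalid UTF-8 byte in a response body and works out the failure rate over a sample. Fetching the responses is left out, and the function names and the report format are assumptions for illustration, not the script actually used for the list above.

```python
from typing import Dict, Optional


def first_encoding_error_line(raw: bytes) -> Optional[int]:
    """Return the 1-based line of the first invalid UTF-8 byte, or None."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the byte offset of the bad sequence; count the
        # newlines before it to turn the offset into a line number.
        return raw.count(b"\n", 0, err.start) + 1
    return None


def summarize(responses: Dict[str, bytes]) -> float:
    """Print one status line per response and return the failure rate."""
    failures = 0
    for title, raw in responses.items():
        line = first_encoding_error_line(raw)
        if line is None:
            print(f"{title} - OK")
        else:
            failures += 1
            print(f"{title} - ENCODING ERROR at line {line}")
    return failures / len(responses) if responses else 0.0
```

Passing in a dict that maps each title to the raw bytes returned by the API gives one status line per book plus the overall rate, which for 3 failures out of 10 is 0.3.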
MICHAEL mentioned in another thread that this error is due to a bug in the version of Rails currently in use by the Goodreads site, and that upgrading to the latest version should eliminate the issue. Is there a date for that switchover? Is there any chance of resolving this issue in some other way in the meantime?
Thanks for your help, and a great site!