Goodreads Developers discussion
Broken XML in some responses


xmllint --noout "http://www.goodreads.com/book/isbn?is..."
http://www.goodreads.com/book/isbn?is... parser error : CData section not finished
Wein
^
http://www.goodreads.com/book/isbn?is... parser error : PCDATA invalid Char value 25
I've had to work around this problem for descriptions too. It would be nice if the XML feeds only ever contained valid UTF-8 characters.
Casper.
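
A minimal sketch of one such workaround, assuming the raw response text is available as a string before it is parsed: drop every code point outside the XML 1.0 Char production, which is the rule behind xmllint's "invalid Char value 25" (U+0019). The class and method names are hypothetical and not part of any Goodreads API.

```java
// Hypothetical helper, not part of any Goodreads client library.
final class XmlText {
    private XmlText() {}

    /** Returns s with every code point that XML 1.0 forbids removed. */
    static String stripInvalidXmlChars(String s) {
        StringBuilder out = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (isXmlChar(cp)) {
                out.appendCodePoint(cp);
            }
            // Advance by one or two chars, depending on the code point.
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    /** The Char production from the XML 1.0 specification. */
    static boolean isXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }
}
```

Tabs, line feeds and carriage returns are kept; other control characters and unpaired surrogates are dropped.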

xmllint --noout "http://www.goodreads.com/book/isbn?i..."
What are you using to parse the response on Android?

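// 'is' is the InputStream of the HTTP response; 'parser' is presumably a javax.xml.parsers.SAXParser
InputStreamReader reader = new InputStreamReader(is, "UTF-8");
InputSource source = new InputSource(reader);
parser.parse(source, handler);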
Casper.
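
The snippet above can be hardened without switching parsers. The following is a sketch, not a confirmed fix for these feeds: a hypothetical XmlSanitizingReader wraps the reader and drops characters that XML 1.0 forbids before the SAX parser sees them. It assumes the parser is a javax.xml.parsers.SAXParser and, as a simplification, passes surrogate halves through without checking that they are paired.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.Reader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

/** Hypothetical wrapper that drops characters XML 1.0 does not allow. */
final class XmlSanitizingReader extends FilterReader {
    XmlSanitizingReader(Reader in) {
        super(in);
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        if (len == 0) return 0;
        while (true) {
            int n = super.read(buf, off, len);
            if (n <= 0) return n;              // end of stream (or nothing read)
            int kept = 0;
            for (int i = 0; i < n; i++) {      // compact allowed chars in place
                char c = buf[off + i];
                if (isAllowed(c)) buf[off + kept++] = c;
            }
            if (kept > 0) return kept;         // otherwise read the next chunk
        }
    }

    @Override
    public int read() throws IOException {
        int c;
        do {
            c = super.read();
        } while (c != -1 && !isAllowed((char) c));
        return c;
    }

    // Simplification: surrogates (U+D800..U+DFFF) are assumed to be correctly paired.
    private static boolean isAllowed(char c) {
        return c == 0x9 || c == 0xA || c == 0xD || (c >= 0x20 && c <= 0xFFFD);
    }
}

/** Illustrative use: collects all character data from a sanitized document. */
final class SanitizedParseDemo {
    static String parseText(Reader raw) {
        final StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        try {
            SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
            parser.parse(new InputSource(new XmlSanitizingReader(raw)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException(e);
        }
        return text.toString();
    }
}
```

In the snippet above this would amount to building the source with new InputSource(new XmlSanitizingReader(reader)).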
For example, http://www.goodreads.com/book/isbn?is... gives this error:
"org.apache.harmony.xml.ExpatParser$ParseException: At line 270, column 210: not well-formed (invalid token)"
Opening the feed in Chrome produces this error:
"error on line 275 at column 20: Encoding error"
This corresponds to this line:
<body><![CDATA[Medical examiner Maura Isles and Boston homicide detective Jane Rizzoli are back.
><br/>
The extra > before the <br/> seems to be what's causing it. Any chance you can fix this?
Here are some more books that contain the extra >:
http://www.goodreads.com/book/isbn?is...
http://www.goodreads.com/book/isbn?is...
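
A lone > is legal inside a CDATA section (only ]]> ends it), so the extra > may be a symptom rather than the cause. One way to check, sketched here with the JDK's SAX parser and a hypothetical helper isWellFormed, is to parse the quoted fragment once as posted and once with an invalid control character in it (U+0019, the value xmllint reported earlier in the thread). The test strings are illustrative and are not the actual feed contents.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

/** Hypothetical diagnostic helper. */
final class CdataCheck {
    private CdataCheck() {}

    /** Returns true if the SAX parser accepts xml, false on a parse error. */
    static boolean isWellFormed(String xml) {
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), new DefaultHandler());
            return true;
        } catch (SAXException e) {
            return false;                       // not well-formed
        } catch (IOException | ParserConfigurationException e) {
            throw new IllegalStateException(e); // environment problem, not bad XML
        }
    }

    /** Fragment shaped like the quoted line, with the stray > kept. */
    static String withStrayGt() {
        return "<body><![CDATA[are back.\n><br/>]]></body>";
    }

    /** Same fragment with a U+0019 control character inserted. */
    static String withControlChar() {
        return "<body><![CDATA[are back." + (char) 0x19 + "\n><br/>]]></body>";
    }
}
```

If the first string parses and the second does not, the control character (or a similar bad byte) is the more likely culprit in the real feed.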