Mr. Penumbra's 24-Hour Bookstore
discussion
resolution - decoding mystery
date
newest »

message 1:
by
Wendybird
(new)
-
rated it 2 stars
Oct 15, 2012 11:23PM

reply
|
flag

_______________________
Because the code was in tiny details in the Font, and in the digitization process the font was removed from the analysis. The person decoding had to see the message in the original typeface in order to decode it. When the book was scanned and then made searchable using Optical Character Recognition (OCR), the final digital document was in a generic font, not the original, so the message was stripped.

_______________________
Because the code was in tiny details in the Font, and in the digitization process the font was removed from the analysis. The person decoding had to see t..."
Since always the same letter has the same "tiny details" this is still a simple substitution. It would only be different if you had e.g. 24 different copies of the letter "a" each with different "tiny details". But this does not seem to be the case.
So I was very disappointed as well by this resolution.
Also the "making money from font copyright" does not work. The first copyright law in the USA is from 1790 ( http://en.wikipedia.org/wiki/US_copyr... ). And "it did not prohibit copying the works of foreign authors".
And: Fonts cannot be copyrighted: http://en.wikipedia.org/wiki/Typeface
"In Eltra Corp. v. Ringer,[16] the Court of Appeals for the Fourth Circuit held that typeface designs are not subject to copyright. However, novel and non-obvious typeface designs are subject to protection by design patents.[17] Digital fonts that embody a particular design are often subject to copyright as computer programs.[18][19] The names of the typefaces can become trademarked. As a result of these various means of legal protection, sometimes the same typeface exists in multiple names and implementations."
And: Copyright expires after a few years.
So while I found the story a nice read, the book really needed a lot more research :(

Also, USA laws would not even matter considering the original font was made in Italy (or somewhere in that area of the world, I can't remember exactly now).
The OCR with a new font for the original Gerritszoom makes perfect sense, you had to see the original font in order to see the code placed onto the actual letters not the words themselves. My thinking on this matter was Sloan is ingenious to come up with an interesting code only caught by a human being. Here we are again with computers vs. humans when a human wins over anything (even Google) a computer could come up with. Like it says, a computer can only do what a human tells it to do.


The type was more than just an A or a B, etc. Not only do you have uppercase and lowercase differences, but he mentions other special items such as a specific block for ff (which would then be considered as its own character). So you have to differentiate between two single f or the ff.
In order to properly print the book in the old method, they would need to have the blocks for all the punctuation, numbers, etc.
But, as Casey said, I wouldn't take this so seriously. I mean, do you really think that if all the servers at Google were busy it would bring the entire Internet to a screeching halt? Or that Google is researching life extension?

The type was more than just an A or a B, etc. Not only do you have..."
That's funny Random - they might be researching life extension. More than anything else, this was an enjoyable read with a wonderful twist in the end. I see it as the human mind reigning over technology. I really enjoyed that Sloan added in about the smell of books, because that is one of my favorite things about using the library or getting a new book from BAM or Amazon - that paper-y smell of cracking up a book, whether it's new or old!

The Unbroken Spine should have been able to solve it pretty easily, even without noticing the tiny notches. They surely would have considered the difference between uppercase and lowercase, the ff ligature, and the alternate glyphs (three P's, two C's, a truly epic Q, etc.). After that, they only would have had to do some frequency analysis, which has been used since Manutius's time.


Its in reference to a conversation he had with Kat earlier in the book.

I reached out to the author to ask him about this issue, and he responded! Here's his response:
"Let me know if this makes it any clearer (or more believable):
https://github.com/mattmills/penumbra...
The difficulty isn't in the subtlety of the code; it's in the information loss that occurs when you mis-transcribe the encoded text. Clay's insight was to see the encoded text differently. There's more detail & an example on the other side of that link above.
Of course, if this wasn't clear to most readers—and I don't think it was—that's my fault!"
I'm still not 100% convinced that either Google or the Unbroken Spine wouldn't have been able to pick up on this, but it's at least a better explanation. Your thoughts?

I was truly puzzled after reading the conclusion of this story. I'm not a math or digital whiz, so maybe there's something I've missed. I'd never even thought of the punctuation symbols as you suggested.
Why rely on OCR knowing the possibility of sampling error? I never considered that as a method. Analyzing thousands or millions of samples of manually-inserted symbols with even a simple program could figure this out quickly. Less quickly with frequency analysis as Dan suggested. It's still a simple substitution with an unchanging key, no?



Regarding the code: The notches on the different fonts were used to decode the codex vitae using substitution. The "final words" or secret message was decoded using the words "hidden in the colophon."
http://www.merriam-webster.com/dictio...
Does this help clear things up?
I loved this book.

It's still a bad piece of writing. At one point they're talking about measuring the levels of ink as a possible clue to the cipher, but then we're supposed..."
I'll have to go back and check, but I thought the notches were on the top of the punches, not on the "business end."

Sherri has a marvelous idea!


The type was more than just an A or a B, etc. Not only do you have..."
Google researching life extension? Absolutely.
It's called a moonshot and Project Calico was featured in Time Magazine.
So yes, they are.

Clark Moffat was a member and he did solve it.


I get that but the font itself was the same .. all throughout the world including all the supposed code in it ..
_______________________
Because the code was in tiny details in the Font, and in the digitization process the font was removed from the analysis. The person decoding had to see t..."

Random wrote: "The circumference of the earth.
Its in reference to a conversation he had with Kat earlier in the book."
Would you mind to help me? Which conversation? I figured out the circumference thing... but I do not know what he is hinting at


Thanks anyhow =)


Random wrote: "The circumference of the earth.
Its in reference to a conversation he had with Kat earlier in the book."
Would you mind to help me? Which conversation? I..."
Before the Google decoding attempt, Kat was speculating about the great secrets that might be revealed and Clay responded that medieval people were not very advanced technically. Kat replied that people could calculate the circumference of the world 1000 years before printing and she challenged Clay to do that. Kat is probably referring to Eratosthenes whom you can Google to understand the method.

I think the message means that immortality comes from thoughts of you remaining in the minds of people with whom friendships were developed. Hidden in plain sight means that when the original Gerrittszoon type was digitized the notches weer copied to and everyone missed it except Moffat.


You are absolutely correct if the Gerritzoon type set was small. But as the number of glyphs in the typeset go up there are multiple ways the the substitution cipher becomes more difficult to crack. As an extreme example, I read that a complete Times Roman typeset has 1674 glyphs. I think I told you that I found a resource that indicated there were 1674 glyphs for Times Roman. Assuming the entire type set were notched to produce the 26 letters of the alphabet that means there would be some 64 ways to represent to each letter (1674/26 = 64). Thus for the four letters of the name Dave there would be 64^4 (over sixteen million) ways to encode it.

all discussions on this book
|
post a new topic