Voynich Reconsidered: the {8am} strategy revisited

In my new book Voynich Reconsidered (Schiffer Books, August 2024), one of my objectives was to engage the reader in a quest for the underlying meaning of the inscrutable Voynich manuscript; and to provide him or her with the tools to do so.

In an earlier article on this platform, I set out my idea of an efficient strategy for identifying the presumed precursor languages of the Voynich manuscript. The strategy is based on testing whether any given transliteration of the manuscript, and any potential precursor language, together yield a mapping from Voynich “words” to real words. I proposed to focus on the v101 “word” {8am}, the most common “word” in the manuscript. For that reason, I called this approach the {8am} strategy.

My subsequent work on what I call “leaf words” has instilled a doubt in my mind as to whether {8am} is a “word” at all. I found some evidence that {8am} might be an arbitrary, and possibly meaningless, filler for gaps between intervening illustrations, like the plants in the “herbal” section. However, {8am} occurs hundreds of times outside the “herbal” section, and in locations which are not adjacent to illustrations. I thought it worthwhile, therefore, to persevere with the {8am} strategy.
The Voynich "word" {8am}
The Voynich “word” {8am}, rendered in the Voynich v101 font. Image credit: Rebecca G Bettencourt / KreativeKorp.

The {8am} strategy is, in essence, to try multiple transliterations of the Voynich manuscript, in permutations with multiple precursor languages; for each permutation, to map {8am} to a text string in the target language; and to search a suitable corpus of the target language for that string as a real word.

My transliterations, all derived from v101, are currently numbered from v101④ to v226. Some of my transliterations allow for {8am} to be not three glyphs, as it is in v101; but four glyphs, which I write as [8aîń], or five, which I write as [8aiiń].

Following Mary D’Imperio and Massimiliano Zattera, I allowed for the possibility of re-ordering the letters in the text string: even of reversing the order, to allow for target languages which are written from right to left (as is the case with Arabic, Hebrew, Persian and Ottoman Turkish)

In cases where {8am} could be mapped to a real word, it might be a common word, or a rare word. It seemed to me that, the more common the word, the more encouragement we might feel for our permutation of transliteration and language.

The first mappings

As reported earlier: in my first implementation of the {8am} strategy, I started with three potential precursor languages and four corpora, as follows:
• Finnish as written around 1548, represented by Uusi Testamentti (The New Testament)
• Early New High German as written around 1401, represented by von Tepl's Der Ackerman aus Böhmen (The Ploughman from Bohemia)
• vernacular Italian (perhaps I should say Tuscan-Italian) as written around 1308-21, represented by Dante's La Divina Commedia (The Divine Comedy)
• Italian as written prior to 1400, represented by the OVI corpus, or Opera del Vocabolario Italiano,
and with thirty-seven alternative transliterations of the Voynich manuscript, which I had numbered from v101④ to v202.

The most encouraging result came from the mappings to medieval Italian using the letter frequencies in the OVI corpus. Allowing for re-ordering or reversing the text string, it seemed possible to map {8am} to either of the following words:
• “con”, which occurs 135,186 times in OVI and translates to English as “with”
• “dio”, which occurs 55,550 times in OVI and translates to English as “God”.
The second stage of mapping

In the next stage of mapping, I tested the following potential precursor languages:
• Albanian as written in 1555, as represented by Gjon Buzuku’s Meshari (Missals)
• Arabic as written in the fourteenth century, represented mainly by Ibn Kathir’s البداية والنهاية (The Beginning and the End), by courtesy of Dr Mohsen Madi
• Arabic as written in the eighth through fifteenth centuries, represented by three corpora: Grammarians, Medieval Philosophy and Science, and The Thousand and One Nights, by courtesy of Dr Dilworth Parkinson
• Bohemian as written prior to 1314, represented by Dalimilova Kronika (The Chronicles of Dalimil)
• English as written in the 1330s, represented by the Auchinleck manuscript, lines 1 to 2330
• English as written between 1488 and 1500, represented by the Ashmole 61 manuscript, parts 1 to 27
• French as written around 1440, represented by La Farce de Maistre Pathelin
• Galician-Portuguese as written between 1220 and 1300, represented by Cantigas d'Amigo (Songs of a Friend)
• Latin as written in the seventh through fourteenth centuries, represented by the LatinISE medieval subcorpus, by courtesy of Barbara McGillivray
• Latin as written in 1312-13, represented by Dante’s Monarchia
• Persian, represented by a predominantly medieval corpus of the works of forty-eight poets, by courtesy of https://github.com/amnghd/Persian_poe...
• Ottoman Turkish as written prior to 1631, represented by Ismail Ankaravi’s كتاب منهاج الفقراء Kitabi Minhac ul-Fukara
• Welsh as written in the mid-fourteenth century, represented by The White Book of Mabinogion.
To cut to the chase: in addition to the previously identified mappings to medieval Italian, these tests yielded just two languages in which {8am} could be mapped to real words:
* Arabic as written in the fourteenth century: the words منع (“prevention” or “it prevents”), أمي (“my mother” or “maternal”), أيم (“widow” or “widowed”)
* English as written in the 1330s: the word “and”.
{8am} mappings to words
Selected mappings of the "word" [8am} to words in some medieval European languages. Author's analysis. Higher resolution at https://flic.kr/p/2q4ow8z

In each case, there were multiple transliterations which yielded the same mapping. I was inclined to prioritise the transliterations according to what I call the AFD metric (the average absolute frequency difference), as outlined in previous articles on this platform. What I identified as the “best transliteration” is not necessarily the right one. To narrow down the choices will require mapping of other Voynich “words”.

Next steps

The next step would be to map some other ubiquitous “words” in the Voynich manuscript. To avoid possible “filler words”, I am inclined to look at “words” of at least three glyphs, such as {1oe}, {2oe} and {1oy}. Again, the objective would be to map these “words” to text strings in selected target languages, and to see whether those strings are real words. If we can map several common Voynich “words” to real words in some medieval languages, we might continue towards mappings of whole lines. We will be looking for such mappings to make grammatical sense.
 •  0 comments  •  flag
Share on Twitter
Published on July 16, 2024 05:48 Tags: d-imperio, voynich, zattera
No comments have been added yet.


Great 20th century mysteries

Robert H. Edwards
In this platform on GoodReads/Amazon, I am assembling some of the backstories to my research for D. B. Cooper and Flight 305 (Schiffer Books, 2021), Mallory, Irvine, Everest: The Last Step But One (Pe ...more
Follow Robert H. Edwards's blog with rss.