Mathematics in Script Research

At the Budapest University of Technology the research work on script relics involves mathematic methods. Dr. Gábor Hosszú co-authored ‘Data Mining and Analysis in the Engineering Field’.


About the Rovas, in English

Until few years ago, the scientifically reliable sources about the Szekely-Hungarian Rovas script were rarely available in English. This situation started to change since 2009, when the members of the Hungarian scientific community started to publish the results of the latest Rovas researches in English in the international script standardization process led by Unicode. Unfortunately, Unicode totally disregarded these sources and based on “highly questionable” amateur theories created a practically useless, mistaken standard that the contemporary Rovas user community need to dodge and hack, in order to provide suitable technical background for the boosting digital Rovas usage.

Mathematics, statistics in the script analysis

The author of the first ever scientific summary (Heritage of scribes) about the Rovas script family – Gábor Hosszú, Phd – has contributed to a new professional source: Data mining and analysis in the engineering field.


This chapter presents statistical evaluations of script relics. Its concept is exploiting mathematical statistical methods to extract hidden correlations among different script relics. Examining the genealogy of the graphemes of scripts is necessary for exploring the evolution of the writing systems, reading undeciphered inscriptions, and deciphering undeciphered scripts. The chapter focuses on the cluster analysis as one of the most popular mathematical statistical methods. The chapter presents the application of the clustering in the classification of Rovash (pronounced ?rove-ash,? an alternative spelling: Rovas) relics. The various Rovash scripts were used by nations in the Eurasian Steppe and in the Carpathian Basin. The specialty of the Rovash paleography is that the Rovash script family shows a vital evolution during the last centuries; therefore, it is ideal to test the models of the evolution of the glyphs. The most important Rovash script is the Szekely-Hungarian Rovash. Cluster analysis algorithms are applied for determining the common sets among the significant Szekely-Hungarian Rovash alphabets. The determined Rovash relic ties prove the usefulness of the clustering methods in the Rovash paleography.


Mathematical statistical tools as the cluster analysis are useful in various scientific fields, including the computational paleography. The chapter deals with computational paleography that means modeling and analyzing methods to explore links among graphemes, deciphering ancient inscriptions, representing various glyphs in digital form by using mathematical algorithms and software tools. The applied mathematical statistical tools may accelerate the research time and provide more accurate results through the automatic process for exploring the genealogical ties among the various branches of a script.

The writing system is a symbolic representation of a language described in terms of linguistic units (Malatesha & Aaron, 2006). The script is a writing system, some examples of the scripts are the following: Aramaic, Arabic, Armenian, Brahmi, Carpathian Basin Rovash, Chinese, Cyrillic, Devanagari, Ethiopic, Georgian, Greek, Hebrew, Kannada, Kharoshthi, Latin, Runic, Szekely-Hungarian Rovash, Telugu, Umbrian, Venetic, etc.

The grapheme is a minimally distinctive unit in a writing system. The symbol is a distinct unit of an inscription, the inscriptions are composed of a series of symbols. Grapheme is the abstraction of a symbol. Graphemes can be letters, ligatures, numerical digits, or punctuation marks. The grapheme has the following properties: (i) the script belonging into, (ii) its glyphs, (iii) its sound values, (iv) periods of use. The glyph is the shape of the grapheme with topological information. One grapheme has usually more glyphs. The symbol in an inscription is an implementation of a glyph.

The orthography is the visual representation of a language, which uses graphemes belonging to a certain script, and it is determined by the features of a language. A script is used for different orthographies. For instance, the Latin script is used for several orthographies, including the French, German, English, Hungarian, etc. orthographies. The orthography of a spoken language changes periodically (Rogers, 1999). The changes can occur as changing set of graphemes, which encompasses the shape transformation of a glyph. A cause of writing system alteration is the establishment of more advanced writing media or instruments. The advanced writing technology introduces new writing technique, which impacts the glyphs of a grapheme.

The script family is a group of scripts, which are closely related to each other. The Rovash (pronounced ?rove-ash?) script family includes the Proto-Rovash, the Early Steppean Rovash, the Carpathian Basin Rovash, the Steppean Rovash, and the Szekely-Hungarian Rovash scripts (Hosszú, 2013a, 2013b). The history and the genealogical ties of the Rovash scripts have been the subject of heavy research efforts from the past until today (Hosszú, 2012). It is a difficult task to determine the accurate genealogy and timelines of the Rovash scripts (Hosszú, 2012). The various Rovash scripts gradually differentiated after the geographical isolation of their users. Carpathian Basin Rovash was in use in the Carpathian Basin by Hungarians mainly, the others were used by nations and tribes of the Eurasian Steppe up to the 10th/13th centuries. Most of the Rovash scripts became extinct in the Medieval Times; however, the Szekely-Hungarian Rovash continued to be used since the 9th century throughout history until the present by the Szekelys?an organized border guard subgroup of the Hungarians.

(igi-global – original article)


Related articles