“Structure of the Frequency of Occurrence of Consonants
in the Speech Sound Chain as an Indicator of the Phono-Typological Closeness of
Languages”
Yuri Tambovtsev
Novosibirsk University
YUTAMB@HOTMAIL.COM
The structure of the frequency of occurrence of consonants in the speech sound
chain can be a good indicator of the typological closeness of languages from the
phonetic point of view.
A human being can tell that one language sounds closer to his or her own native
language than another without understanding the meaning. While it is still hard
to teach a computer to understand a human language, it is quite possible to make
it recognise how close one language sounds to another on the basis of an
analysis of its speech sound chain. We have computed the frequency of phonemic
occurrence in 112 world languages as a training sample for the computer. Then
we took Japanese as the token language. The computer had to analyse the sound
chain of the language and then place it closer to some languages and farther
from others, based on the frequency of occurrence of certain consonantal
groups. We chose Japanese because it is not categorically assigned to any
language family; it is still considered a genetically isolated language.
Therefore, any additional information about it is useful. In this project we
used procedures that are common in pattern recognition.
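The ranking step described above can be illustrated with a short Python sketch. It is only a sketch under stated assumptions: the language names and frequency vectors are invented placeholders rather than data from the study, the vectors have four components only for brevity, and the plain Euclidean distance stands in for the author's own measure, which is not specified here.

```python
import math


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length frequency vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_by_closeness(token, sample):
    """Return (name, distance) pairs sorted from closest to farthest."""
    pairs = ((name, euclidean(token, vec)) for name, vec in sample.items())
    return sorted(pairs, key=lambda pair: pair[1])


# Invented placeholder profiles (percent of all sounds); NOT data from the study.
sample = {
    "lang_a": [10.0, 30.0, 5.0, 10.0],
    "lang_b": [12.0, 38.0, 2.0, 6.0],
    "lang_c": [20.0, 20.0, 10.0, 15.0],
}
token = [11.0, 31.0, 5.0, 9.0]  # hypothetical profile of the token language

ranking = rank_by_closeness(token, sample)
for name, distance in ranking:
    print(f"{name}: {distance:.2f}")
```

The token language is placed closest to whichever sample language has the smallest distance, which is the nearest-neighbour idea commonly used in pattern recognition.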
Japanese, like any other human language, has a specific structure of the speech
sound chain, and it can be distinguished from any other language by this
structure. Every language has a unique distribution of speech sounds in its
phonemic chain. The distribution of Japanese vowels will not be considered until
the second stage of the investigation. Let us point out that it is the
consonants, not the vowels, that bear the semantic load in the word. Therefore,
it is easier to understand the meaning of a message from its consonants than
from its vowels. However, if the consonants fail to distinguish two languages,
we then resort to the structure of occurrence of vowels in the speech sound
chain. When comparing
languages, it is necessary to keep to the principle of commensurability. With
this in mind, languages cannot be compared on the basis of the frequency of
occurrence of separate phonemes, because the sets of phonemes in different
languages are usually different. Articulatory features may instead serve as the
basic features in phono-typological reasoning, since they apply to the
consonants of any language. First, consonants are classified according to the
active organ of speech, or place of articulation (4 features). Second, they are
classified according to the manner of articulation, or type of obstruction
(3 features). Third, they are classified according to the work of the vocal
cords (1 feature). In this way, 8 basic features are obtained: 1) labial;
2) front; 3) mediolingual or palatal; 4) back or velar; 5) sonorant;
6) occlusive; 7) fricative; and 8) voiced consonants. One then takes the
frequencies of occurrence of these 8 features in the speech chain of Japanese
and compares them with those of the other languages. On the basis of the
"chi-square" test and the Euclidean distance, we have developed our own method of
measuring the phono-typological distances between languages (Tambovtsev, 1994-a;
1994-b; 2001-a; 2001-b). It takes into account the frequency of occurrence of
the 8 consonantal groups mentioned above and builds up the overall mosaic of
the sound picture of a language. Having compared Japanese to other languages, we
obtained the following phono-typological distances: Japanese - Ujgur (6.77);
Japanese - Nanaj (8.12); Japanese - Jakut (8.26); Japanese - Sea Dajak (8.86);
Japanese - Kazah (9.02); Japanese - Turkish (9.05); Japanese - Ket (9.52);
Japanese - Baraba Tatar (9.76); Japanese - Uzbek (10.63); Japanese - Hausa
(10.98); Japanese - Georgian (11.05); Japanese - Kazan Tatar (11.07) and so on.
One can see that Ujgur, Jakut, Kazah, Turkish, Baraba Tatar, Uzbek and Kazan
Tatar are Turkic languages, and Nanaj is a Tungus-Manchurian language. Therefore,
one can notice that Japanese is closer to the so-called Altaic languages, which
include the Turkic, Mongolian and Tungus-Manchurian languages. In all, 112
languages were compared to Japanese. We cannot show all the measured distances
here for lack of space. However, the maximum distances were found for
Japanese - German (22.24); Japanese - English (19.83); Japanese - Swedish
(17.03) and Japanese - Rumanian (15.08). Thus, one can see that the consonantal
distribution pattern of Japanese is rather different from that of the Germanic
languages. We can also state that the speech sound picture of Japanese is far
from that of the languages which are geographically close to it: Chinese, Nivh,
Itelmen and Indonesian. This was a surprise to us. Our data indicate that the
speech sound pattern of Japanese most resembles that of Ujgur, one of the Turkic
languages spoken in Central Asia. The Ujgur people are often linked to the Old
Turkic tribes who used to live in the steppes of Southern Russia before the
Tatar-Mongols captured them in the IXth century A.D. We must point out that this
is not a coincidence, since the other Altaic languages show a very similar
closeness to Japanese. The Turkic and Tungus-Manchurian tribes may have had some
sort of common origin with the Japanese. This may support the Altaic hypothesis
of Japanese origin. It is especially striking given that the Austro-Oceanic and
other languages do not show such closeness.
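The feature-based comparison described above can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the author's published method: the table `CONSONANT_FEATURES` covers only a few consonants and is hypothetical, sonorants are assumed to count as voiced, frequencies are taken as a percentage of all sounds in the chain, and `chi_square_distance` is a generic symmetric chi-square-style measure chosen for illustration.

```python
from collections import Counter

# The 8 consonantal features named in the text.
FEATURES = (
    "labial", "front", "palatal", "velar",
    "sonorant", "occlusive", "fricative", "voiced",
)

# Hypothetical feature table for a handful of consonants (an assumption).
CONSONANT_FEATURES = {
    "p": {"labial", "occlusive"},
    "b": {"labial", "occlusive", "voiced"},
    "m": {"labial", "sonorant", "voiced"},
    "t": {"front", "occlusive"},
    "s": {"front", "fricative"},
    "n": {"front", "sonorant", "voiced"},
    "j": {"palatal", "sonorant", "voiced"},
    "k": {"velar", "occlusive"},
    "g": {"velar", "occlusive", "voiced"},
}


def feature_profile(chain):
    """Percent share of each feature among all sounds in the chain."""
    total = len(chain)
    if total == 0:
        raise ValueError("the sound chain is empty")
    counts = Counter()
    for sound in chain:
        # Sounds missing from the table (e.g. vowels) add to the total only.
        counts.update(CONSONANT_FEATURES.get(sound, ()))
    return {feature: 100.0 * counts[feature] / total for feature in FEATURES}


def chi_square_distance(p, q):
    """Symmetric chi-square-style distance between two feature profiles."""
    total = 0.0
    for feature in FEATURES:
        denominator = p[feature] + q[feature]
        if denominator > 0:
            total += (p[feature] - q[feature]) ** 2 / denominator
    return total


# Toy sound chains written one symbol per sound; not real corpus data.
profile = feature_profile(list("katana"))
other = feature_profile(list("bagama"))
print(profile)
print(chi_square_distance(profile, other))
```

Because every language is reduced to the same 8 numbers, the profiles remain commensurable even when the underlying phoneme inventories differ.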
References
Yuri Tambovtsev. Dinamika funktsionirovanija fonem v zvukovyh tsepochkah jazykov razlichnogo stroja. Novosibirsk: Novosibirskij GosUniversitet, 1994.
Yuri Tambovtsev. Tipologija uporjadochennosti zvukovyh tsepej v jazyke. Novosibirsk: Novosibirskij GosUniversitet, 1994.
Yuri Tambovtsev. Kompendium osnovnyh statisticheskih harakteristik funktsionirovanija soglasnyh fonem v zvukovoj tsepochke anglijskogo, nemetskogo, frentsuzskogo i drugih indoevropejskih jazykov. Novosibirsk: Novosibirskij klassicheskij institut, 2001.
Yuri Tambovtsev. Funktsionirovanie soglasnyh fonem v zvukovoj tsepochke uralo-altajskih jazykov. Novosibirsk: Novosibirskij klassicheskij institut, 2001.
Yuri Tambovtsev. Nekotorye teoreticheskie polozhenija tipologii upor'adochennosti fonem v zvukovoj tsepochke jazyka i kompendium statisticheskih harakteristik osnovnyh grupp soglasnyh fonem. Novosibirsk: Novosibirskij klassicheskij institut, 2001.