2 posts tagged “proto-semitic”
One of the typologically puzzling things about Arabic, and Semitic languages in general, is that /i/ and /u/ very often contrast with /a/, but hardly ever with each other. This is usually an indication that these are allophones, but this explanation can not be held if these vowels can't freely interchange, and are perceived as separate vowels.
Although this issue is an issue in the whole of Semitic, as far as I am aware, I am most familiar with Arabic, so I'll stick to using examples from this language.
Of course, there is one extremely productive pattern of 'minimal pairs' of vowels in the form of case endings.
Nom. rajul-un
Gen. rajul-in
Acc. rajul-an
So, sure they seem quite phonemic in that context. But what I find puzzling is that in stem formations we can't find u and i to contrast normally.
To further research this I have made a table of the distribution of Arabic vowels in CVCVC roots. The table looks as follows:
|
V1 \ V2 |
a |
i |
u |
ā |
ī |
ū |
|
a |
+ |
+ |
+ |
+ |
+ |
+ |
|
i |
+ |
- |
- |
+ |
- |
- |
|
u |
+ |
- |
+ |
+ |
- |
+ |
|
ā |
- |
+ |
- |
- |
- |
- |
|
ī |
- |
- |
- |
- |
- |
- |
|
ū |
- |
- |
- |
- |
- |
- |
Several notes can be made about this table. I shaded the entry CaCiC, since it is difficult. The only word I can think of is malik 'king' (although doubtlessly there are more). Some people will probably know that this word is related to Hebrew mĕlĕḵ which paradoxically points to a CVCC root. Is malik perhaps from *malk with an epenthetic vowel? It is very reminiscent of dutch melk 'milk' which by many people is in fact pronounced [ˈmɛ.lǝk] rather than [ˈmɛlk].
Another thing that is strange is that, of the long vowels, only ā can occur in V1 position, and exclusively if it is followed by the vowel i. Could it perhaps be that the CaCiC is indeed from *CaCC, and that CāCiC represents the orignal *CaCiC?
If this were true, the table of vowel distribution would look a lot more elegant.
|
V1 \ V2 |
a |
i |
u |
ā |
ī |
ū |
|
a |
+ |
+ |
+ |
+ |
+ |
+ |
|
i |
+ |
- |
- |
+ |
- |
- |
|
u |
+ |
- |
+ |
+ |
- |
+ |
There is an enormous problem with this reductionist approach though. The Vowel pattern CāCiC is associated with a meaning of nomen agentis. It is quite productive, from the word kataba 'to write' we can form kātib 'writer'. That would be fine, if it wasn't that Hebrew has this exact same pattern. Hebrew has the verb ṣāfăr 'to count' besides ṣôfēr 'scribe, writer (litt. 'counter')' (ô < *ā, ē < *i). If we assume that CāCiC is from *CaCiC this must have been a common shift for Arabic, Hebrew and I've been told, also Aramaic. Could someone with knowledge of Akkadian/Ethiopian Languages let me know whether this pattern exists and whether it has CāCiC or CaCiC?
So, after the discussion on CaCiC, let's continue regarding this vowel table. Maybe not completely surprising, but for allowed vowel distributions, Arabic disregards vowel length. CiCiC isn't allowed, whether the second i is long or not. Same goes for the other disallowed vowel combinations. I wonder what this implies. I have no experience with languages that have long vowels and limitations on their distribution, so I'm not sure what scenario is typologically plausible.
It is good that I made this table, for it has shown me some stuff that I was previously unaware of. I was under the impression that the distribution of u and i was identical, but I have found absolutely no examples of words with CiCiC, while CuCuC is in fact quite a common plural formation. As I knew before writing this combinations with i and u in one root are impossible, which is mysterious. It almost looks like a sort of 'vowel disharmony' if I may coin that term.
I had written a large post of a proposal of a fourth proto-semitic vowel *ǝ , that would be affected by its surroundings, but often simply surface as a or i. But once I put the distribution into a table, I became uncertain if such a proposal would be feasible, and threw away most of this post.
It is true that i and also u sometimes have schwa-like properties, if malik indeed comes from *malk that's obviously an example, but there's even more readily available examples in the form of the 'alif al-waṣl. When a Arabic word starts with a CC cluster a vowel is placed in front of the first consonant to make the cluster pronounceable. For example *sm 'name' becomes (i)sm. When a vowel proceeds it, this vowel is lost again, it is purely epenthetic. When the root contains no vowels, or an a or i the value of the 'alif al-waṣli is i. But if the following vowel is an u the 'alif al-waṣl is also u as in *drus > (u)drus 'learn!'. This is in fact an example of vowel harmony. There are some nouns violate this rule though like (i)mru'' 'man'. Another strange thing is that the a in the definite article (a)l behaves just like 'alif al-waṣl except that it is always a in isolated pronunciation. Nevertheless it is quite obvious that this alif al-waṣl must have come from a subphonemic *ǝ.
Another example of a *ǝ is the i that is often used to break up clusters in a sentence especially the apocopate verb often needs an extra i places in between its final consonant and the following word.
If there was a *ǝ in the middle of words, would that help to explain the distribution of the vowels? It might, if we assume that all i were in fact *ǝ we would understand which CiCuC and CuCiC do not occur, since the u would have affected the *ǝ to become an u. But it still does not explain why CiCiC and CiCīC unless we assume that *ǝ and *ī turned a preceeding *ǝ into a. Such an explanation is entirely ad-hoc. Although it might be true, there is no indication that it was like that, and we would need comparative evidence to prove it.
So to conclude, Arabic gives quite strong indications that i was in fact rather a *ǝ than an *i that was heavily affected by its surroundings. This does not increase or decrease the amount of phonemic vowels, but it may help understanding the vocalic patterns in Arabic better.
There is no conclusive evidence though that i was *ǝ, one would have to look at deeper genetic relations (Afro-Asiatic? Maybe only Berbero-Semitic?). I do feel that one should probably position this *ǝ in proto-semitic times if it exists. Hebrew vowel distribution is as far as I can see it, quite similar to that of Arabic.
I hope to soon dive into correspondences between Arabic and Berber verbal morphology with this hypothesis that i should be interpretead as a *ǝ. But before that I should probably consider the Arabic verbal morphology first, since I've only considered nouns of the type CVCVC so far. The vowel distribution in the verbal morphology becomes quite a bit more difficult though.
I've written about vocalised in Arabic and how this can almost impossible represent the stage of language as it is written. Especially the case endings seem to have not existed in Arabic when the orthography as we know it now was brought into use.
In fact, I believe that the noun cases and nunation are most likely a result of a highly ritualistic language a 'high' language so to speak that was much closer to proto-semitic than the actual Arabic language. I believe that at the time Hebrew and Arabic split, Arabic can't have had case forms in the way we see them today. That's quite a claim, but let me elaborate.
Arabic and Hebrew are close in some ways and very divergent in others. Arabic broken plurals have run rampant for example. Nevertheless the amount of similar shifts that they seem to have undergone after Arabic lost it's noun endings and nunation is so absurdly similar to Hebrew that it's almost as if history redid some shifts it had lying around for those funny Semitic languages anyway.
I don't believe that's the case, I believe endings and nunation must have disappeared and then Arabic and Hebrew (and probably aramaic and a bunch of other languages of which I have too little knowledge) underwent a bunch of common changes.
Let's give some examples. Hebrew masculine plural ending is -īm Arabic masculine plural is nom. -ūna gen./acc. -īna. As we can see a shift of m to n occurred and word final -a was lost. in hebrew while it didn't in Arabic. This makes little sense to me. But now if we look at the 'pausal' and modern pronunciation of these two endings of Arabic we find it without an -a. -ūn/-īn The shift of word final m > n is much more natural than m > n between a long vowel and a.
This word final m > n shift is actually also found in the nunation, which in proto-semitic must have been mimation. I call ritualistic endings that were not part of the real language. But they must have been in use in some contexts. And if a language gets a constraint of final m turning to n it's quite likely that it will abide to this.
But that's just one of many more things I wanted to show. Another point is the loss of the final *t in feminines in both Arabic and Hebrew. These days Arabic has a etymological ة which is simply a h with the dots of a t over it. But dots are a more recent innovation thus only an h was used. The exact same spelling Hebrew uses to write feminines.
The situation is somewhat different though, hebrew always uses a he to write long final a's, while in arabic this orthography is quite restricted. Nevertheless it makes no sense for an Arab to writen a word with a final h if it was pronounced -atun or even -at. Why not write a t? I believe that Arabic and Hebrew both had final *-at that was then simplified to *-a which was later even lengthened in Hebrew to ā.
The accusative indefinite singular -an in Arabic can not be old, unless you feel like an alif is a perfectly logical letter to write -an instead of -ā you'll surely agree. This is not that controversial, it's generally accepted that in Old-Arabic the accusative indefinite singular was probably pronounced -ā and that the -an was reintroduced by analogy of the other cases. I've been told, and I hope someone will back this up, that in colloquial dialects -ā is still the normal pronunciation of the accusative. I personally am unsure whether for Proto-Semitic -an or -ā is reconstruced. But Akkadian has -am. So I guess the development of -ā is a later feature. A feature that Arabic shares with... You guess it, Hebrew!
Now I can hear you guys saying: aha! you made a mistake! And it's true there's no true accusative in Hebrew, but there are some petrified forms. For example BBLH or bāḇĕlā means 'towards Babylon'. Once again then a common feature of Hebrew and Arabic.
This leaves us with one major issue of course. HOW can Arabic have a ritualistic language so much more archaic than the spoken languages alive simultaneously. I'd suggest you go to Egypt, and watch al-jazira while listening to the people around you talking, and you'll find out that this is not all that bizarre as it sounds. It's essentially the same situation as all those years ago.
Luckily there's also some analogies to be made outside of Arabic. What to think of Vedic Sanskrit still existing as an oral tradition well after Classical Sanskrit died, and when the Prakrit languages were alive and kicking?
Or the (somewhat controversial) idea of Runic inscriptions that often look almost proto-germanic while languages must have differentiated into the different germanic languages at the time we find those inscriptions? It's pretty odd that in Old-Norse region not much before the first latin-written attestation we still find runic inscriptions that write the nom. sg. as -az rather than -r.
So this is the point where I ask you: Am I crazy? Am I seeing things? Surely these similarities can't just be coincidence can they? I'm open to all words of praise, critique and whatever else, as long as it doesn't change into a ridiculous mudfight. I want to be proven wrong, or someone to kindly point out that this interpretation has been the status-quo for centuries and has somehow gone past me.
[EDIT]
I should probably add that there's no direct reason to assume that there was no nominative -u and genitive -i or definite accusative -a. Since we also have to account for final vowels in the verbal system to explain certain vocalisms in Hebrew, this must have existed at some point. They may have been lost individually, or around the time that the two groups were starting to move away from each other.