6 posts tagged “arabic”
One of the typologically puzzling things about Arabic, and Semitic languages in general, is that /i/ and /u/ very often contrast with /a/, but hardly ever with each other. This is usually an indication that these are allophones, but this explanation can not be held if these vowels can't freely interchange, and are perceived as separate vowels.
Although this issue is an issue in the whole of Semitic, as far as I am aware, I am most familiar with Arabic, so I'll stick to using examples from this language.
Of course, there is one extremely productive pattern of 'minimal pairs' of vowels in the form of case endings.
Nom. rajul-un
Gen. rajul-in
Acc. rajul-an
So, sure they seem quite phonemic in that context. But what I find puzzling is that in stem formations we can't find u and i to contrast normally.
To further research this I have made a table of the distribution of Arabic vowels in CVCVC roots. The table looks as follows:
|
V1 \ V2 |
a |
i |
u |
ā |
ī |
ū |
|
a |
+ |
+ |
+ |
+ |
+ |
+ |
|
i |
+ |
- |
- |
+ |
- |
- |
|
u |
+ |
- |
+ |
+ |
- |
+ |
|
ā |
- |
+ |
- |
- |
- |
- |
|
ī |
- |
- |
- |
- |
- |
- |
|
ū |
- |
- |
- |
- |
- |
- |
Several notes can be made about this table. I shaded the entry CaCiC, since it is difficult. The only word I can think of is malik 'king' (although doubtlessly there are more). Some people will probably know that this word is related to Hebrew mĕlĕḵ which paradoxically points to a CVCC root. Is malik perhaps from *malk with an epenthetic vowel? It is very reminiscent of dutch melk 'milk' which by many people is in fact pronounced [ˈmɛ.lǝk] rather than [ˈmɛlk].
Another thing that is strange is that, of the long vowels, only ā can occur in V1 position, and exclusively if it is followed by the vowel i. Could it perhaps be that the CaCiC is indeed from *CaCC, and that CāCiC represents the orignal *CaCiC?
If this were true, the table of vowel distribution would look a lot more elegant.
|
V1 \ V2 |
a |
i |
u |
ā |
ī |
ū |
|
a |
+ |
+ |
+ |
+ |
+ |
+ |
|
i |
+ |
- |
- |
+ |
- |
- |
|
u |
+ |
- |
+ |
+ |
- |
+ |
There is an enormous problem with this reductionist approach though. The Vowel pattern CāCiC is associated with a meaning of nomen agentis. It is quite productive, from the word kataba 'to write' we can form kātib 'writer'. That would be fine, if it wasn't that Hebrew has this exact same pattern. Hebrew has the verb ṣāfăr 'to count' besides ṣôfēr 'scribe, writer (litt. 'counter')' (ô < *ā, ē < *i). If we assume that CāCiC is from *CaCiC this must have been a common shift for Arabic, Hebrew and I've been told, also Aramaic. Could someone with knowledge of Akkadian/Ethiopian Languages let me know whether this pattern exists and whether it has CāCiC or CaCiC?
So, after the discussion on CaCiC, let's continue regarding this vowel table. Maybe not completely surprising, but for allowed vowel distributions, Arabic disregards vowel length. CiCiC isn't allowed, whether the second i is long or not. Same goes for the other disallowed vowel combinations. I wonder what this implies. I have no experience with languages that have long vowels and limitations on their distribution, so I'm not sure what scenario is typologically plausible.
It is good that I made this table, for it has shown me some stuff that I was previously unaware of. I was under the impression that the distribution of u and i was identical, but I have found absolutely no examples of words with CiCiC, while CuCuC is in fact quite a common plural formation. As I knew before writing this combinations with i and u in one root are impossible, which is mysterious. It almost looks like a sort of 'vowel disharmony' if I may coin that term.
I had written a large post of a proposal of a fourth proto-semitic vowel *ǝ , that would be affected by its surroundings, but often simply surface as a or i. But once I put the distribution into a table, I became uncertain if such a proposal would be feasible, and threw away most of this post.
It is true that i and also u sometimes have schwa-like properties, if malik indeed comes from *malk that's obviously an example, but there's even more readily available examples in the form of the 'alif al-waṣl. When a Arabic word starts with a CC cluster a vowel is placed in front of the first consonant to make the cluster pronounceable. For example *sm 'name' becomes (i)sm. When a vowel proceeds it, this vowel is lost again, it is purely epenthetic. When the root contains no vowels, or an a or i the value of the 'alif al-waṣli is i. But if the following vowel is an u the 'alif al-waṣl is also u as in *drus > (u)drus 'learn!'. This is in fact an example of vowel harmony. There are some nouns violate this rule though like (i)mru'' 'man'. Another strange thing is that the a in the definite article (a)l behaves just like 'alif al-waṣl except that it is always a in isolated pronunciation. Nevertheless it is quite obvious that this alif al-waṣl must have come from a subphonemic *ǝ.
Another example of a *ǝ is the i that is often used to break up clusters in a sentence especially the apocopate verb often needs an extra i places in between its final consonant and the following word.
If there was a *ǝ in the middle of words, would that help to explain the distribution of the vowels? It might, if we assume that all i were in fact *ǝ we would understand which CiCuC and CuCiC do not occur, since the u would have affected the *ǝ to become an u. But it still does not explain why CiCiC and CiCīC unless we assume that *ǝ and *ī turned a preceeding *ǝ into a. Such an explanation is entirely ad-hoc. Although it might be true, there is no indication that it was like that, and we would need comparative evidence to prove it.
So to conclude, Arabic gives quite strong indications that i was in fact rather a *ǝ than an *i that was heavily affected by its surroundings. This does not increase or decrease the amount of phonemic vowels, but it may help understanding the vocalic patterns in Arabic better.
There is no conclusive evidence though that i was *ǝ, one would have to look at deeper genetic relations (Afro-Asiatic? Maybe only Berbero-Semitic?). I do feel that one should probably position this *ǝ in proto-semitic times if it exists. Hebrew vowel distribution is as far as I can see it, quite similar to that of Arabic.
I hope to soon dive into correspondences between Arabic and Berber verbal morphology with this hypothesis that i should be interpretead as a *ǝ. But before that I should probably consider the Arabic verbal morphology first, since I've only considered nouns of the type CVCVC so far. The vowel distribution in the verbal morphology becomes quite a bit more difficult though.
I've written about vocalised in Arabic and how this can almost impossible represent the stage of language as it is written. Especially the case endings seem to have not existed in Arabic when the orthography as we know it now was brought into use.
In fact, I believe that the noun cases and nunation are most likely a result of a highly ritualistic language a 'high' language so to speak that was much closer to proto-semitic than the actual Arabic language. I believe that at the time Hebrew and Arabic split, Arabic can't have had case forms in the way we see them today. That's quite a claim, but let me elaborate.
Arabic and Hebrew are close in some ways and very divergent in others. Arabic broken plurals have run rampant for example. Nevertheless the amount of similar shifts that they seem to have undergone after Arabic lost it's noun endings and nunation is so absurdly similar to Hebrew that it's almost as if history redid some shifts it had lying around for those funny Semitic languages anyway.
I don't believe that's the case, I believe endings and nunation must have disappeared and then Arabic and Hebrew (and probably aramaic and a bunch of other languages of which I have too little knowledge) underwent a bunch of common changes.
Let's give some examples. Hebrew masculine plural ending is -īm Arabic masculine plural is nom. -ūna gen./acc. -īna. As we can see a shift of m to n occurred and word final -a was lost. in hebrew while it didn't in Arabic. This makes little sense to me. But now if we look at the 'pausal' and modern pronunciation of these two endings of Arabic we find it without an -a. -ūn/-īn The shift of word final m > n is much more natural than m > n between a long vowel and a.
This word final m > n shift is actually also found in the nunation, which in proto-semitic must have been mimation. I call ritualistic endings that were not part of the real language. But they must have been in use in some contexts. And if a language gets a constraint of final m turning to n it's quite likely that it will abide to this.
But that's just one of many more things I wanted to show. Another point is the loss of the final *t in feminines in both Arabic and Hebrew. These days Arabic has a etymological ة which is simply a h with the dots of a t over it. But dots are a more recent innovation thus only an h was used. The exact same spelling Hebrew uses to write feminines.
The situation is somewhat different though, hebrew always uses a he to write long final a's, while in arabic this orthography is quite restricted. Nevertheless it makes no sense for an Arab to writen a word with a final h if it was pronounced -atun or even -at. Why not write a t? I believe that Arabic and Hebrew both had final *-at that was then simplified to *-a which was later even lengthened in Hebrew to ā.
The accusative indefinite singular -an in Arabic can not be old, unless you feel like an alif is a perfectly logical letter to write -an instead of -ā you'll surely agree. This is not that controversial, it's generally accepted that in Old-Arabic the accusative indefinite singular was probably pronounced -ā and that the -an was reintroduced by analogy of the other cases. I've been told, and I hope someone will back this up, that in colloquial dialects -ā is still the normal pronunciation of the accusative. I personally am unsure whether for Proto-Semitic -an or -ā is reconstruced. But Akkadian has -am. So I guess the development of -ā is a later feature. A feature that Arabic shares with... You guess it, Hebrew!
Now I can hear you guys saying: aha! you made a mistake! And it's true there's no true accusative in Hebrew, but there are some petrified forms. For example BBLH or bāḇĕlā means 'towards Babylon'. Once again then a common feature of Hebrew and Arabic.
This leaves us with one major issue of course. HOW can Arabic have a ritualistic language so much more archaic than the spoken languages alive simultaneously. I'd suggest you go to Egypt, and watch al-jazira while listening to the people around you talking, and you'll find out that this is not all that bizarre as it sounds. It's essentially the same situation as all those years ago.
Luckily there's also some analogies to be made outside of Arabic. What to think of Vedic Sanskrit still existing as an oral tradition well after Classical Sanskrit died, and when the Prakrit languages were alive and kicking?
Or the (somewhat controversial) idea of Runic inscriptions that often look almost proto-germanic while languages must have differentiated into the different germanic languages at the time we find those inscriptions? It's pretty odd that in Old-Norse region not much before the first latin-written attestation we still find runic inscriptions that write the nom. sg. as -az rather than -r.
So this is the point where I ask you: Am I crazy? Am I seeing things? Surely these similarities can't just be coincidence can they? I'm open to all words of praise, critique and whatever else, as long as it doesn't change into a ridiculous mudfight. I want to be proven wrong, or someone to kindly point out that this interpretation has been the status-quo for centuries and has somehow gone past me.
[EDIT]
I should probably add that there's no direct reason to assume that there was no nominative -u and genitive -i or definite accusative -a. Since we also have to account for final vowels in the verbal system to explain certain vocalisms in Hebrew, this must have existed at some point. They may have been lost individually, or around the time that the two groups were starting to move away from each other.
[First of all, I'm a big fan of using the original script of languages, but seeing as Arabic can do some crazy stuff with displaying properly on computers I'll use transcription only today]
Classical Arabic has approximately 6 types of noun endings, which can be homophonous in some forms, but in the most basic paradigm there's 6 different forms.
For example the word man 'rajul'
Indefinite
nom rajulun
gen rajulin
acc rajulan
definite
nom ar-rajulu
gen ar-rajuli
acc ar-rajula
As you can see, when a noun is indefinite it is marked with an additional /n/, this is so called nunation. This goes way back to proto-semitic, where it was still an /m/, and thus apporpriately called mimation. Although in fully vocalised texts this nunation is incredibly clear, when we just look at the plain, unvocalised texts, it becomes a lot weirder.
Indefinite
nom RJL
gen RJL
acc RJLA
definite
nom ʔL-RJL
gen ʔL-RJL
acc ʔL-RJL
Where's the nunation now? It disappeared! In unvocalised writing, there's actually no proof that there is such a thing as nunation. And the spelling RJLA seems to indicate a word rajulaa (which incidentally was the form in Old Arabic, or so I've been told).
And although you could say 'that's simply a spelling convention', this is in fact very odd. Yes these days it's a spelling convention, but as some of you may know, Arabic exclusively writes consonants and long vowel. I think we can all agree that in fact this nunation represents consonants. Why then isn't the indefinite paradigm like this?:
nom RJLN
gen RJLN
acc RJLA (RLJN?)
I personally believe that the written Arabic doesn't actually represent the Classical Arabic language, but rather a later stage that already lost its nunation (like all following Arabic languages have).
So the people who first wrote the Arabic language had already lost this nunation, but I believe that there must have been some religious tradition that continued this grammatical structure, which had been long lost when the Arabs started writing.
This is not entirely without precedent, think of the Sanskrit Rig Veda, which retains a much more archaic form of Sanskrit than we find in other texts. Not because it had been written down earlier, it was written down around the same time as classical texts. But it was passed on orally for much longer than the other texts, with virtually no change to the original position, thus retaining arachaisms not seen anywhere else.
The situation isn't 100% the same, as the Rig Veda texts, indeed retained old archaisms, but the language was never used to compose new texts. Which seems to be the case in Arabic.
I know that this idea feels 'far out', but Arabic combines fantastically with the other Semitic languages like this. Hebrew for example also lost its nunation (and case endings all together), but has some vestiges of this /aa/ from the Accusative indefinite.
Take layla evening. This is, although it looks just like a feminine, in fact a masculine word. the -a at the end is actually a retention of the Accusative indefinite long aa. Denoting 'during the night'. This is the exact same form that you see in writing of Arabic LYLA.
Although this all seems so crazy, I can't think of a good reason why nunation would not be written. Recognising that this nunation was a special grammatical ending, is way to linguistically deep to realise for the people who started writing it. That would be like Sanskrit using an s rather than an ḥ to denote the word final s, but then actually quite a bit more extreme.
I've looked for Old Arabic sources, hoping to find some nunation there, but even in those text it seems to be absent (although it's incredibly hard to tell).
One explantion often seen as to why nunation isn't written, and why the ending -an is written as -aa, is that written arabic is written 'pausal' The way you'd write the word in an individual context. This also explains the unpronounced Alif al-Waṣl (Connextive alif). I don't buy it though. In earlier texts for example, the Alif al-Waṣl was not actually written, because in many contexts it was unpronounced. So if back then they didn't write pausally, why didn't they write nunation?
I don't need any of you to agree with this idea, but what I'd like to hear from readers with at least some knowledge of Semitic, is a plausible explanation why nunation wasn't written.
Irish (and the other Celtic languages, where they haven't lost it) has two really eerie features. One, conjugation is done almost exclusively through initial and internal mutation, rather than than word final. Second is the word order, VSO (Verb-Subject-Object).
From an Indo-Europen point of view, these features are both very odd. In Indo-European the word order traditionally is considers SOV, though some argue that it's SVO, either way, it's definitely not VSO. Secondly, Indo-European is heavily reliant on suffixation (and internal mutation to some extent) to conjugate anything from verbs to nouns.
The interesting thing is. In SOV languages, we generally find suffixation, while in VSO languages we find prefixation. These are so called 'language universals'. I'm not a big fan of these universals, because they're disproved quite often, but right here it's not uninteresting.
Another famous VSO language is Arabic. Arabic makes use suffixation, but mostly relies on prefixes and internal mutation for the greatest differentiation. Now, the development of Irish initial mutation was a perfectly normal sub-phonemic process, which later became phonemic and grammatical. The great question is of course. Did these forms become grammatical because the word order shifted to VSO. Or did these initial mutations give rise a VSO word order.
It's as if some invisible force in our minds seems to have moulded this language into the 'universal' for no compelling reason except that it's not unusual to happen. But why? Very very interesting indeed.
Sorry I can't give any definite answers on this, it's currently far to mysterious, but I thought it would be worth mentioning
A question which you have probably never heard is "Do you speak Abaric", though there are singals that once maybe the word for Arabic should have been Abaric. What brought me to this thought is the Hebrew word for the Hebrew language 'ivrit' Or עברית in Hebrew. The Arabic word for the Arabic is 'ʕArabiyya' عربيّة. both -iyya and -it go back to a same suffix which was something like *-iyyat-um. which leaves us with the stems respectively עבר ʕ-b-r and عرب ʕ-r-b.
It's almost unthinkable to think that ʕ-b-r and ʕ-r-b do not go back to the same proto-root. Only one of these two languages must have metathesized it. Problem is if course that I know of absolutely no other roots where this happens. Never the less it's rather odd to think it didn't.
I just looked up both roots in my fantastic Arabic dictionary (The Hans Wehre dictionary of Modern Written Arabic), turn out both the root ʕ-b-r and ʕ-r-b exist in Arabic, with not an uncommon meaning.
ʕabara: to cross, to traverse; to ford, wade; to swim; to pass; to ferry; to carry someone across or over some thing; etc. etc.
ʕaraba: This word has taken a strong meaning to be connected with the Arabic nationality and has meanings like 'to translate into Arabic, to Arabicise etc. But in my dictionary there seems to be some indication of the original meaning if we look at the female noun derivation with suffix -atun
ʕarabatun: swift river; carriage, vehicle, wagon, cart; car, coach
So this also seems to have something to do with crossing/transporting. There's a very odd correlation here. Some of the derivations of the first root seem to refer to the latter root. And the other way around.
I don't have much to add to this, just thought I'd let you see the funny words and their similarities.
Arabic has a very odd quirk. It has so called 'broken plurals', besides the normal plurals, which are simply a suffixed form of the singular, there's plurals which consist of the same root, but vowels just in different places.
An example of this is to be seen in the title 'aflām' أفلام is the plural of 'fīlm' فيلم that's right, it's the loanword 'film'. From this even today, we can conclude that broken plurals are a productive category of the Arabic language. There's several common patterns, and several very rare ones. Which word takes a broken plural and which one takes a 'sound' plural is completely irregular. There's of course some indication, feminine nouns tend to have sound plurals, but this is by no means a fixed rule. Also which pattern goes with which noun is unpredictable.
Experience in historical linguistics teaches us that what are now irregularities were probably once regular patterns which got lost over time. To find such regularities you look back at other languages from the same language family. But here it gets disturbing.
The other Central Semitic languages (which include the languages Hebrew and Aramaic) have next to no indication of so called broken plurals. They're only sound plurals. In fact the split of the Central Semitic groups is done based on the existence of broken plurals:
There are two branches of Central Semitic, North-West Semitic and Arabic (The should really work on their terminology). The greatest distinguishing factor is the appearance of broken plurals in Arabic.
This gives the impression that the so called broken plural patterns just 'emerged' out of no where. This is hard for me to accept. A completely irregular system to create plurals while there was, and still is a perfectly regular system in the proto-language is very hard to accept.
There is no good reason why such a group of plurals would be created. The best explanation would be that a broken plural usually has fewer syllables, but this need for brevity would have also worked if they had just generalised one broken plural pattern instead of several.
Due to the unlikelihood that Arabic just sprouted irregular plurals, I'd like to think that Central Semitic in fact did have broken plurals, maybe not as wide spread as in Arabic, but definitely a big enough number of words to become productive. Then Aramaic and Hebrew generalised the simpler Sound Plural, while the broken plural became productive in Arabic.
I suspect that, at one point the different broken plural patterns attributed to some kind of class system, similar as you see in Bantu languages (maybe this is even areal influence?). Later this distinction become lost, especially in Arabic where several patterns became quite randomly productive.
Maybe the sound plural was an alternative way of forming a plural. Possibly when the speaker didn't feel it was necessary to categorise the plural into a class. Or maybe when he was speaking of a group of these things in general. There's countless scenario's you could think of to explain a 2 types of plurals system within one language.
If anyone has another idea, or can make the 'broken plurals sprung out of nowhere' idea sound plausible, please do reply.