5 posts tagged “tocharian”
An interesting feature of the Indo-European verbal (and sometimes nominal) system is so called 'reduplication'.
Reduplication in Indo-European is the repetition of the first consonant plus an extra vowel before the action root.
A famous example is the Indo-European word for 'wheel' one of the few nominal roots with reduplication:
*kʷe-kʷlo-
Note that the reduplication is solely the first consonant, never the cluster. Well never is, of course, a big word. It does seem to happen sometimes.
Typically in Indo-European we reconstruct two types of reduplication, one with a vowel *e after the reduplicated consonant, one with the vowel *i. Obviously we like to reconstruct some kind of earlier **ə that gave both reflexes, but in the latest phase of PIE these two were definitely there.
Are these reconstructions really that natural though? Maybe not. Many languages seem to indicate a different system. For once Sanskrit was not authoritative in the opinions of the earlier Indo-Europeanists (typically, when there's doubt, the Sanskrit form is considered correct). No, this time it's Greek. Greek knows solely two types of reduplication. i-reduplication in presents (and very rare aorists), and e-reduplication in the perfect.
When we have a look at three other languages with attest reduplication, we get a very different view though. The languages I will discuss are Gothic, Sanskrit and Tocharian B. I've been told that Latin also displays some non-standard reduplication, but since my Latin isn't much better than 'dulce et decorum est pro patria mori', I'll leave that languages for people with more expertise.
First, let's talk about Gothic. Gothic is a nice language, for it displays reduplication often, and consistently throughout it's full history (which is awefully short, but still), unlike any other Germanic language. There's some proof of reduplication in other languages (I believe an odd Old-English form for 'rowed' was 'roard' (though spelling is probably wrong)). The other for is what we see in the preterite of do did < *deda. But the *deda discussion is a different one all together where no consensus has been found yet.
Gothic reduplication, occurs in the preterite. This is exactly the way we like it in Indo-European reconstructions. There is one main issue though. The vowel use for reduplication is <ai> which phonetically is [ɛ] (some people are pro diphthong, especially in the long variant, I disagree, it has to be a monophthong).
This is not the vowel we expect in reduplication. PIE *e consistently became Gothic i in all contexts. Later though i was changed to ai in several contexts (this all happened before our first Gothic textual attestation).
*i > ai/_r,h,ƕ This is linguist speak for 'e became <ai> before r,h and ƕ (Similarly als *u > au/_r,h,ƕ for you symmetry lovers).
Not all reduplication verbs start with r,h or ƕ though. In fact I can't think of any. So a verb like letan you'd expect to have a preterite **lilot. What we actually have is lailot. Some people have desperately tried to explain this to say that the vocalism is analogy from the reduplicating verbs that do start with r,h and ƕ. Personally I do not find this very convincing. Nevertheless, it's very hard to have any <ai> in Gothic outside of the above mentioned position. Ony if it were the long variant of <ai> (not visible in writing), we would be able to have that vowel outside of that position.
If <ai> in reduplicating verbs is not from earlier i, we would expect it to be from an earlier *ai < PIE *oi, which is not at all convincing either. Because of this (and several other reasons), I think there might be reason to assume that reduplication in Gothic is not a direct descendant of Indo-European reduplication (this assumption is going to academically kill me :P),
If it is old though, we have either a very odd Indo-European reduplication, or a very odd development in Gothic. You choose ;).
Now, let us discuss Sanskrit. Sanskrit uses reduplication in much the same contexts as Greek. It's used in some presents/aorists (which, I believe, use exclusively the i-vocalism). Besides that the i-vocalic reduplication is used in desideratives and intensives (some indication of this, is also, and solely available in Greek (even more proof for my pro-Graeco-Iranian theory :P)). In the perfect though, we see a different situation all together. We don't see the velar-palatalising a that we would expect as a reflex of PIE *e. No, it reduplicates the sonorant semi-vowel or vowel of the root in the zero grade.
What does that mean? Here let me show you:
Stem: puṣ- present: puṣyati perfect: pupoṣa (< Vedic pupauṣa). So it actually reduplications the u. This is odd in many ways, especially because the sound u is not actually immediately following the first consonant when the verb is in the perfect. In the perfect plural it is though, so likely it got it from there.
This reduplication is also seen when the semi-vocalic consonant is an i. Also the a is reduplicated when it's in the zero grade of the root, or if the second consonant is a vocalic r. But to be honest, this might be the Indo-European *e, especially since it palatalises the reduplicated velars.
Note that this semi-vocalic root vowel even crosses cluster boundaries. So the root śru- gives śuśro-.
Another distinctive feature of Sanskrit reduplication is the way the consonants are treated. When a voiceless non-aspirate is reduplicated, the same consonant remains. An example was given above.
If a voiced aspirate and h < *gh, is aspirated, a voiced non-aspirate. For example
dhāv- > dadhāv
This is not that different from Greek, it's Grassmann's law. Only Greek has voiceless aspirates, and they thus give the voiceless non-aspirate. (τιθημι).
But in some roots, and this is really weird, a voiceless consonant reduplicated sometimes gives the most bizarre reflexes. One that comes to mind right now is the verb pibati 'he drinks' with the root pā-. That's right, the root shortened it's ā to a, then the root consonant became b while the reduplicative consonant remains p. Absolutely insane, but very real nevertheless. This is but a minority though. Most voiced reduplicated consonant remain voiced.
This is, once again, a very different form of reduplication than the one we see in Greek. I've been told that something similar happens in the few Latin verbs that have reduplication. But someone else is going to have to give examples.
Now we get to the last, and really crazy part. That's right, Tocharian B. Tocharian B (and probably also Tocharian A :P) is always crazy. It fails to conform to any type of normal behaviour. I love it.
Tocharian B has reduplication in only one class. The Preterite Participle.
The most normal Preterite Participle is perfectly normal. CäCaw (a<accented ä). Example:lätaw from the root lät-
The second group is C1eC1aw (a from accented ä). Example śeśaw. Though this looks like a perfectly normal reduplication, this is already bizarre. Because TochB e does not come from IE *e, rather it comes from IE *o. This o is also nowhere to be found in the root. śä-
The third group is C1eC1aC2äw (a from accented ä), For example lyelyakäw from root läk-
Next group is C1eC1éC2äw Example: teteku (u = äw) from täk-, At last here we find a decent reduplication of the root vowel.
(j)VCäw where V is accented. For example: yáku root: yäk-
C1aC1áC2äw for example: lalālu (ā = accented a) from root: la-. TochB. a comes from Indo-European *h2e.
The next class is rather normal (no reduplication!) CäCáu for example: krämpau root: krämp-
The before last class is C1aC1áC2au for example kakārpau root: korp-
The last class I don't have any example of, but luckily isn't reduplicating either. áCau.
So in Tocharian we have reduplication with o, or h2e. And not once with *e or *i. Nor is the root semi-vowel reduplicated as we'd hope to see considering Sanskrit.
So there we have it! Three completely different reduplication systems, which are opposed directly against the Greek and reconstructed Indo-European form. Why we are still reconstructing the reduplication the way we are is actually a bit mysterious. It's the easiest, that's for sure. All these systems might have also undergone some analogy, especially Sanskrit and Gothic.
This leaves us with Tocharian B though, which went about being it's own crazy self (he likes to do that). Now we might want to mention that Tocharian split of from the Indo-European family well before the rest of the languages started branching off (but well after Anatolian branched off). Maybe this was indeed the crazy reduplication system PIE used to have, but then all reduplication was reduced to a schwa.
One thing is sure, the last word has definitely not been spoken about Indo-European reduplication, it's no where near as straight forward as some people like to think (Though in all honesty, to keep your sanity, I can imagine you'd rather think it's that simple than look at Tocharian; What can I say? I'm a linguistic masochist :P).
Tocharian B has a very interesting thing about its accent system, which makes it look like it is absolutely filled with Indo-European-like ablaut (change of vowel in the root). Sadly for us, this often isn't the case, nevertheless, it's a nice system to talk about now.
The Tocharian accent is as we call 'Hysterophobic', afraid to be in the back. In a noun, the accent can technically be anywhere, but when it's in the last syllable of a word, it will be moved a syllable forward.
As an example we'll take the noun 'palsko' 'thought'.
The root of this word is /pəlskó-/.
In the nominative, this word takes a zero-suffix so /pəlskó-/+/Ø/. Because this would mean the accented vowel would be in the back, the accent jumps to the initial syllable and gives a surface form of /pə́lsko/. This is written <palsko> (accented schwa is written <a> unaccented it's written <ä>). The oblique case of palsko is the same as the nominative. The Genitive though, has suffx /ntse/ so /pəlskó-/+/-ntse/, which means the accent isn't in the final syllable, and thus, stays in place. /pəlskóntse/ written <pälskontse>. As you can see we now have something that looks like a a/ä ablaut, while actually it's a accented/non-accented schwa alternation.
This not only plays a vital role in Nouns, also verbs will have this switch between a/ä at times, so it's important to know that it happens. There is also a alternation between <ā> and <a>. Which is to be interpreted as /á/ and /a/. So in writing, one can't distinguish an accented schwa from an unaccented a. And of course, chances are, they were homophones.
I've always found this incredibly fascinating about Tocharian, I hope you do too.
Please note: I have next to no knowledge of Tocharian A, so whenever I talk about Tocharian I mean Tocharian B. Maybe I should start calling it Kuchean.
I come up with the worst puns for my article titles. But okay, let's talk about the the Tocharian verbal root yäm-.
As I was learning my Sanskrit vocabulary I came across a very odd word: dā- दा 'to give'. A perfectly normal Indo-european verbal root related to the Greek root δω- 'to give'. But when we have a look at the present of this verb, is where it gets odd. The present in Greek is δίδωμι, typical reduplicated present athematic verb. No such luck with Sanskrit though, Sanskrit actually uses a completely different root in the present, namely yam- यम् giving a first person present yacchāmi यच्छामि.
This is what we call 'suppletion' when there's multiple roots in one paradigm. Especially in common words it's to be expected to see a lot of suppletion. Also, due to Sanskrit's extremely big vocabulary, it tends to sprout a lot of suppletion. Usualy though, this suppletion is through loans from the Dravidic language, as I also expected to be the case with yam-. But then suddenly, in a moment of clarity, I realised that yam- has a cognate in Tocharian, namely yäm- `achieve, obtain; reach'. I realise this actually means the exact opposite. But it's still the concept of physical transference, and it's hard to believe these words are not related.
The Indo-European root would then be *iem-. Now, the Tocharian languages (A and B) are known for their extensive suppletion. So I was secretly hoping that Tocharian would display the same suppletion as Sanskrit, so we could conclude some kind of ancient Indo-European suppletion, but no such luck. In fact, the Tocharian B verb doesn't have any suppletion at all!
Adams' Tocharian B dictionary:
Paradigm: G Ps. X /yänmāsk'ä/e-/ [A yänmāskau, -, yänmāṣṣäṃ//; APart. yänmāṣṣeñca `achiever'; MPPart. yänmāskemane; Ger. yänmāṣṣälye*]; 1Ko. I /yonm-/ [A -, -, yonmäṃ//]; 2Ko. VI /yänmā-/ [A -, -, yanmaṃ//-, yänmācer, -; MP //-, -, yänmāntär; AOpt. -, -, yanmoy//; Inf. yänmātsi]; Pt. IIIa /yonm-/ [A yonmwa, yonmasta, yonmasa//]; PP /yeyänmu-/;
K Ko. IX /yänmäsk'ä/e-/ [Inf. yanmäs(t)si]
Just a 'normal' verb. Although actually, it's quite a disturbing paradigm. n-infixed ske/o-suffixed present. With accented -a- suffix in between the root and the ske/o-suffix. in the present. Lovely. But in Tocharian it's definitely not unheard of.What is crazy though, is the reduplication in the Past Participle. That's a reduplication with Tocharian B e, this might not look crazy, but believe me it is. TochB. e comes from PIE *o. We've seen *e in reduplication, *i in reduplication, but *o?! And just to mess with us, it also adds 'a' (<h2, h2e) reduplication in some other paradigms.
Makes you wonder what the hell those Tocharians were on.
Now, I figured for completeness I'd add the paradigm of 'to give' in Tocharian B, just for completeness, though I can assure you you're not going to find the preferred *deh3 in there either.
Paradigm: Ps. IXa /āisk'ä/e-/ [A aiskau, -, aiṣṣäṃ//aiskem, aiścer, aiskeṃ; AImpf. -, -, aiṣṣi// -, -, aiṣṣiyeṃ; APart. aiṣṣeñca (see also aiṣṣeñcaññe below); MPPart. aiskemane; Ger. aiṣṣälle]; Ko. I /āi-/ [A āyu, ait, aiṃ// -, -, aiṃ; AOpt. -, -, āyi//; MPOpt. -, -, ayītär//; Inf. aitsi; Ger. aille]; Ipv. /pete-/ [sg. pete; pl. petso ˜ petes]; Pt. I(sg.)/III(pl.) /wäsā-/ [A wsāwa, wsāsta, wasa//wasam, -, wsar ˜ wsare]; PP /āyu-/
ai- joy, what a great root that looks nothing like what we'd like to see. It is sometimes connected to the Greek αἴνυμαι 'to take, seize'.
But luckily this paradigms has some nice suppletion. A Preterite with the root wäs- ! Go make up your own etymology for that, Adams' best guesses are as good as anyone's.
And besides that, there's a very interesting Imperative. pete. Tocharian had a prefix p- or pä- for imperatives (not pe-!) which would be prefixed to the verbal root. How exactly the prefix came to be pe- is uncertain, maybe it was affected by the following root te-. te- is interesting though, it looks like our long lost *deh3. It doesn't go too well though, we're not to sure what to do with PIE *ō's in Tocharian (or *oH or *eh3 that merge with *ō), it tends to become u, but sometimes it becomes e. taking a reconstruction of the root like *dh3e- would give us a much more safe reconstruction as te-. but there's not a clear reason to assume this.
Although not a guide to the Tocharian verbal system, I hope you've gotten an idea how incredibly crazy, and thus, incredibly interesting it really is.
What's that? Want to check out Adams' Tocharian B dictionary yourself? Go to IEED and click the link. It might take a while, the server backing the database is terrible, but after a while it starts working.
Take note though, Adams is notoriously sloppy, and his proposed etymologies are more often than note disposable. He often has rather irregular sound laws. Besides that he doesn't quote textual sources too well every time, he does quote them quite consistently though, which I guess is a plus. The translations he provides can also be rather sloppy.
Some resources:
Indo-European Etymological Dictionary - Nice site with a lot of dictionaries, including the digitalisation of the only Tocharian dictionary out there. But a server you'll instantly come to hate.
TITUS Project - A fantastic resource of an enormous amount of digitalised and transliterated Tocharian texts, both Tocharian A and B. Be sure to check this one out!
One of the puzzling things about Tocharian* is the Alternant gender. This is a group of nouns which in the singular, is a masculine noun, and in the plural is a feminine noun. The adjectives will also agree to different gender depending on whether this noun is in the singular or in the plural. A good example of an alternant is the noun palsko `thought'.
| Singular | Plural | |
|---|---|---|
| Nom | palsko | pälskonta |
| Obl | palsko | pälskonta |
| Gen | pälskontse | pälskontaṃts |
I've been wondering how and why this alternant developed. The answer to the question how is a lot easier to explain than why. So I'll do this first.
The nta suffix comes from indo-european *-nt suffix in the nominative-accusative neuter plural *-nt-h2. All -a's come from the nominative-accusative neuter plural actually.
wa is from the *-u stems. nma is from the *-mn stems. a is from the *-h2 stems it seems. na is from the *-n stems and sa is from the *-s stems.
Many of these words originally seem to be Indo-European neuters. Also most loanwords were placed into this category of nouns. Indo-European Neuters completely merged with the Masculine and Alternant in Tocharian, except for the pronouns, which still have a neuter.
Now we get to several why's. First. Why do these suffixes only appear in the plural. Answer: I don't know. It's also not like these are the suffixes you'd expect in either the singular or the plural. Take "hundred" for example kante which points towards an Indo-European *dḱmtóm, which luckily is the form we expect. But kante takes a -nma suffix. Although the indo-european plural of hundred would technically be *dḱmth2. Although the -h2 suffix is there, there's absolutely no proof except for Tocharian that it should be *dḱmtmnh2. There seems to be some reason behind it sometimes. nta seems to be very productive in loanwords, and words ending in -r. While nma seems to be especially productive with words in a final consonant that isn't r. All the other categories are closed categories (new words/loan words would not be added to this category).
Second question: Why was the Neuter Nominative-Accusative Pleural suffix interpreted as a feminine suffix? This is slightly easier to answer. h2-stems after all, are generally feminine words, and even more so for the ih2-stems. But the nominative plural of feminine h2-stems ends in *h2-es! Luckily Tocharian solves this problem. *h2e > a and word final *s disappears. Resulting in the nominative singular *-eh2 and plural *-h2es being a homophonous "a".
This homophony was later solved by analogy (feminine ih2 stems get a -na suffix), but before this analogy this is the time the neuter plural nominative-accusative was indistinguishable from the feminine nominative plural, this is how the plural of the alternant nouns became feminine.
That was the traditional approach. But FTPlus kindly notified me of another, rather controversial option. His hypothesis was: "What if the *-h2 feminines are actually neuter plurals."
This hypothesis implies that the inanimate neuters were traditionally Singular only. While the plurals of these nouns would either be mass nouns, or abstract nouns. This would also often mean that certain plurals (mass nouns) did not even have a singular equivalent, and the other way around. Reinterpretation would then lead to a new class derived from plural-only neuters. This would then be the h2-stems. which were conjugated like any other animate consonant stem.
How exactly these abstract and mass nouns got to be associated with feminine nouns so far, is a mystery to me. FTPlus suggests that women are more emotional, and would have a closer bond with abstract things than men. But I wonder if this psychological insight would have had awareness among the Indo-Europeans. I thought this insight was more of a modern popular psychology thing.
What does this mean for Tocharian?
First off, it would be better to interpret the Masculine singular as a Neuter in the alternants. Then the expected plural *-h2 Might have been impossible to the Tocharians, for it created abstract and mass nouns, which would not be the same as the plural of a singular Neuter. Therefore the chose to add a derivative suffix behind the original singular, and then add this plural *-h2. How exactly these plural suffixes were selected is yet unclear, but this is in fact quite a plausible theory, and maybe even easier to understand than the above stated form, where we don't derive the *-h2 suffix from the nominative-accustive neuter plural
*Please note that I will almost always mean Tocharian B, and I will use Tocharian B examples. Although most things I say will also count for Tocharian A, I only have deep knowledge of Tocharian B.
All right, my blog hasn't been as active as I would like it to be. So, I'm going to change the way I'm going to go about this blog. From now on, I will make a weekly post on saturday. This post will analyse a sentence, or piece of text, from any language that I've studied, and of which I'm fairly confident I can inform people about.
Today I will start of with Tocharian B a.k.a Kuchean.
A small introduction on Tocharian
Tocharian is an Indo-European language which surprisingly enough, used to be spoken in Chinese Turkestan (around Kucha, hence the name Kuchean). The largest part of the texts that were found, were written in the 8th Century AD. This language phase is generally called Classical Tocharian, and most sentences and texts I will analyse will be from this period.
There are two main Tocharian languages, Tocharian A and Tocharian B (TA and TB respectively). I do not have much knowledge of TA, all I know is that it already seemed to be a dying language which mostly had ritualistic uses, while Tocharian B was still alive and kicking.
The Tocharians were Buddhists, so most texts that were found were Buddhist texts.
Another interesting thing about Tocharian is that it is not a satem language, but a centum language. With in the Indo-European language family, there's one big subdivision, and that's on what the languages do with the velars.
The satem languages merge the velars and labiovelars. While the centum languages merge the velars and palatovelars.
For a long time it was believe that this subdivision indicated a big dialectal isogloss which split the Indo-European community in a western and eastern dialectal region. West being centum, and east being satem. And then there was Tocharian, the furthest in the east of all, and a centum language. This first sparked my interest.
The text
Although I'm currently working on a Tocharian font, I do not think I'll annoy you people with it, and I will just write the texts in the standard transcription. Tocharian was written in an Indic script, and thus the transcription might look similar to Sanskrit for some people, because it was more or less based on it.
tarya akṣā-ne pudñäkte: teki ktstaitsñe srukalñe. toṃ mā tākoṃ śaiṣṣene, mā ṅke tsaṅkoy pudñäkte. toṃ ñyatstsenta wikässiś poyśinta tne tseṅkeṃtär.
The Buddha announced to him three [things]: sickness, old-age and
death. If these would not be in the world, then the Buddha would not arise.
All-knowing ones(Buddha’s) arise here for driving out these dangers.
Analysis
tarya : three (female), meant is 3 things
akṣā-ne : announce preterite 3 singular, with an enclitic 3rd person singular pronoun. This enclitic can either be the direct object or the indirect object.
pudñäkte : Buddha, in the nominative. (though, in morphology it could also be the oblique, the main use of the oblique is to mark the accusative).
teki : sickness
ktstaitsnñe : old-age
srukalñe : death
The Buddha announced to him three things: sickness, old-age and death.
toṃ : 'these' nominative plural.
mā : not, cognate to classical Greek 'μη'
tākoṃ : 'to be' pptative 3 plural
śaiṣṣene : world locative. Formed by the oblique 'śaiṣṣe' with locative suffix 'ne'
mā : not
ṅke : then
tsaṅkoy : `to arise' Optative 3 singular
pudñäkte : Buddha Oblique
If these would not be in the world, then the Buddha would not arise.
A point of interest is the optative here. The optative should be considered the past tense of the subjunctive here. The subjunctive is often used for if-then clauses. The main clause in the subjunctive and the condityional clause in the subjunctive give the structure of a specific condition.
toṃ : 'these' oblique plural
ñyatstsenta 'dangers' oblique plural (although in for could also be nominative plural).
wikässiś : 'to drive out' infinitive with allative suffix(expected is "wikäṣtsiś" which But the cluster ṣts is always found as sts, or ss, as in this case). Allative here giving 'in order to drive out'
poyśinta : All-knowing one in plural. Meaning 'Buddha' comes from po 'all' + aiśi Nomen Agentis of 'to know'.
tne : here, though a more wide translation would be 'thus' or 'so'.
tseṅkeṃtär. 'To arise' present 3rd person plural (Medium)
All-knowing ones(Buddha’s) arise here for driving out these dangers
Feel free to ask any questions.