Boltz on the origin of Chinese characters
Three proposed stages of writing systems:
-
Pictography: Drawings that portray things directly, without reference to language. Not writing, but a “forerunner”.
-
“Zodiography”: Pictography of words. Can be the same graphs as in pictography, but now they’re taken to represent words in a language; i.e. each graph now has a phonetic (P) and a semantic (S) value. This is the first stage in which specific linguistic utterances can be represented, though not any utterance.
-
Graphic multivalence: The zodiographs are extended in one of two ways:
-
Paronomasia or rebus: The character can represent homophonous words that have different meanings; holding a fixed P, acquire one or more new S. In this way the character becomes polysemous (with several possible S¹,S²…Sⁿ meanings, all pronounced the same (or “nearly” the same)).
Chinese example: 象 xiàng ← *dzjangx, a drawing of an ‘elephant’, extended to xiàng ← *dzjangx ‘image’. P = xiàng, S = elephant, S’ = image.
Another: 勿 wù ← *mjət ‘creature, animal’ extended to wù ← *mjət ‘do not’, a grammatical word that would be hard to draw zodiographically.
-
Parasemantics or homeosemous: The character can represent new words that are pronounced very differently, but are in the same semantic field as the original word (i.e. it’s possible to reasonably “depict” them by the same zodiograph). Fix the S (somewhat), vary the P. This makes the character polyphonic.
Example: 口 kǒu ← *khugx ‘mouth’ is based on the depiction of a mouth or opening. It was in the past also used for míng ← *mjing ‘call, name’ (modern 名, see below).
Similarly, 目 mù ← *mjəkw ‘eye’ was also used for kàn ← *khans and jiàn ← *kians, both variants of “to see” (modern 看、見).
With multivalence, the characters can now cover many more words, but there’s a higher degree of ambiguity (both polysemic and polyphonic).
-
-
Determinatives: Ambiguities are resolved with the addition of new graphic components, called determinatives, taken from the inventory of already existing (multivalent) graphs. Determinatives can be either phonetic or semantic.
- If polysemous, the character gets a semantic determinative: 勿 “not, creature” becomes divided into 勿 “not” and 物 “creature”, the latter adding 牛 “cow”.
- If polyphonic, the character gets a phonetic determinative: 口 {kǒu “mouth”, míng “call” } becomes 口 kǒu “mouth” plus 名 míng “call”, the latter adding 夕 míng “brighten” for its sound value (夕 was itself polyphonic, originally the same as 月, moon/night/brighten).
Table summing up how a few words were represent with characters in different stages:
| word | zodiographic stage | multivalent stage | determinative stage |
|---|---|---|---|
| *mjət→wù ‘creature’ | 勿 | 勿 | 物 |
| *mjət→wù ‘do not’ | - | 勿 | 勿 |
| *dzjangx→xiàng ‘elephant’ | 象 | 象 | 象 |
| *dzjangx→xiàng ‘image’ | - | 象 | 像 |
| *khugx→kǒu ‘mouth’ | 口 | 口 | 口 |
| *mjing→míng ‘call’ | - | 口 | 名 |
- Boltz think this pattern is universal, giving equivalent examples in Egyptian hieroglyphs and Mesopotamian cuneiform (and asserting Mayan also fits).
- Stages are accurate for individual graphs, but in the writing system as a whole, all stages coexist in different characters. Once new zodiographs cease to be created and enough ambiguity is remediated with determinatives, the formative period has ended.
- Shang oracle-bone inscriptions already have plenty of graphs in determinative stage.
- Processes are recursive, resulting in characters with >2 components (typically 2–6).
- By specifying in which of various pronunciations a graph should be read, a phonetic determinative also specify which of various words is intended; since words have different meanings, phonetic determinatives are, after all, also a kind of semantic determinative (though restricted to the same semantic field, and with the added property of changing the sound).
- By the same token, polyphonic characters can be sometimes (but unusually) disambiguated with semantic determinatives instead, since a semantic hint can be enough to select one of the (heterophonic) words.
- What’s considered “sufficiently” homophonic? The traditional restriction uses Shijing rhyme groups, which amounts to: same vocalic nucleus, same or homorganic coda (specifically, final nasals are allowed to alternate with corresponding orals). Initials vary a lot more.
- Semantic determinatives are the components traditionally called 部首 bùshǒu or classifiers (often but incorrectly translated as “radicals”—notice the other component is a “radical” or original root). The Kangxi list is arbitrary and limited; many other graphs appear as determinatives (in the formative period, potentially any character can be used, and the same word might be written with different determinatives).
- Notice that, in a composite phonosemantic graph, the phonophoric (“sound-carrying”, phonetic-hint) component can be either the original graph, to which a semantic determinative was added, or it can be a phonetic determinative, added a posteriori to select one of many possible readings of the base graph.
Traditional analyses tend to underestimated the role of phonetic determinatives, because they didn’t notice the possibility of earlier polyphony in the base characters. For example, consider:
| 名 | míng | ‘name, call’ |
| 鳴 | míng | ‘bird-call’ |
| 命 | mìng | ‘fate’ |
By all criteria, this should be considered a phonetic series (諧聲 xiéshēng), with 口 as a phonophoric for míng. The only reason it’s not is that míng is not currently a reading of 口. But, as reconstructed above, it was a valid reading before it became specialized as 名. Boltz call these “elusive phonetic series”.
Sometimes there is legitimate doubt as to which character is the basis and which is the determiner; this happens when the phonetic determiner is also semantically sugestive or appropriate (the example given is 孫; either 系 *gzwən ‘line’ is primary and 子 ‘descendants’ is a later sem.det., or 子 *gzwən ‘descendants’ is primary and 系 is a phon.det.). It seems many Chinese characters were built using such “dual-faced” determiners (I think this is the source of what ja.wikitionary calls 会意形声 kai’i-keisei, “combined-meaning phonosemantic” characters). Boltz is very much opposed to the notion of pure “combined meaning” characters (like 明 taken to be sun+moon=bright), believing there always must be a phonetic rationale if we dig deep enough.
⁂
Thanks to my History of Chinese Thought professor for lending me his copy of Boltz’ The Origin and Early Development of the Chinese Writing System (and damn you, academic presses, for your unbuyably overpriced textbooks; as I spend the holiday note-taking like a madman, I’m blaming you for my repetitive strain injury). Now if only I could somehow get my hands on Lurie’s Realms of Literacy: Early Japan and the History of Writing…