Meaning–text theory

Meaning–text theory (MTT) is a theoretical linguistic framework, first put forward in Moscow by Aleksandr Žolkovskij and Igor Mel’čuk,^[1] for the construction of models of natural language. The theory provides a large and elaborate basis for linguistic description and, due to its formal character, lends itself particularly well to computer applications, including machine translation, phraseology, and lexicography. General overviews of the theory can be found in Mel’čuk (1981)^[2] and (1988).^[3]

Levels of representation

Linguistic models in MTT operate on the principle that language consists in a mapping from the content or meaning (semantics) of an utterance to its form or text (phonetics). Intermediate between these poles are additional levels of representation at the syntactic and morphological levels.

Levels of representation in MTT

Representations at the different levels are mapped, in sequence, from the unordered network of the semantic representation (SemR) through the dependency tree-structures of the Syntactic Representation (SyntR) to a linearized chain of morphemes of the Morphological Representation (MorphR) and, ultimately, the temporally-ordered string of phones of the Phonetic Representation (PhonR) (not generally addressed in work in this theory). The relationships between representations on the different levels are considered to be translations or mappings, rather than transformations, and are mediated by sets of rules, called "components", which ensure the appropriate, language-specific transitions between levels.

Semantic representation

Semantic representations (SemR) in meaning–text theory consist primarily of a web-like semantic structure (SemS) which combines with other semantic-level structures (most notably the Semantic-Communicative Structure [SemCommS],^[4] which represents what is commonly referred to as "Information Structure" in other frameworks). The SemS itself consists of a network of predications, represented as nodes with arrows running from predicate nodes to argument node(s). Arguments can be shared by multiple predicates, and predicates can themselves be arguments of other predicates. Nodes generally correspond to lexical and grammatical meanings as these are directly expressed by items in the lexicon or by inflectional means, but the theory allows the option of decomposing meanings into more fine-grained representation via processes of semantic paraphrasing,^[5] which are also key to dealing with synonymy and translation-equivalencies between languages. SemRs are mapped onto the next level of representation, the Deep-Syntactic Representation, by the rules of the Semantic Component, which allow for a one to many relationship between levels (that is, one SemR can potentially be expressed by a variety of syntactic structures, depending on lexical choice, the complexity of the SemR, etc.). The structural description and the (semi-) automatic generation of SemR are subject to research.^[6] Here the decomposition takes advantage of the Semantic Primes of the Natural Semantic Metalanguage to determine a termination criterion of the decomposition.

Syntactic representation

Syntactic representations in MTT are implemented using dependency trees, which constitute the Syntactic Structure (SyntS). SyntS is accompanied by various other types of structure, most notably the syntactic communicative structure and the anaphoric structure. There are two levels of syntax in MTT, the deep syntactic representation (DSyntR) and the surface syntactic representation (SSyntR). A good overview of MTT syntax, including its descriptive application, can be found in Mel’čuk (1988).^[7] A comprehensive model of English surface syntax is presented in Mel’čuk & Pertsov (1987).^[8]

The deep syntactic representation (DSyntR) is related directly to SemS and seeks to capture the "universal" aspects of the syntactic structure. Trees at this level represent dependency relations between lexemes (or between lexemes and a limited inventory of abstract entities such as lexical functions). Deep syntactic relations between lexemes at DSyntR are restricted to a universal inventory of a dozen or syntactic relations including seven ranked actantial (argument) relations, the modificative relation, and the coordinative relation. Lexemes with purely grammatical function such as lexically-governed prepositions are not included at this level of representation; values of inflectional categories that are derived from SemR but implemented by the morphology are represented as subscripts on the relevant lexical nodes that they bear on. DSyntR is mapped onto the next level of representation by rules of the deep-syntactic component.

The surface-syntactic representation (SSyntR) represents the language-specific syntactic structure of an utterance and includes nodes for all the lexical items (including those with purely grammatical function) in the sentence. Syntactic relations between lexical items at this level are not restricted and are considered to be completely language-specific, although many are believed to be similar (or at least isomorphic) across languages. SSyntR is mapped onto the next level of representation by rules of the surface-syntactic component.

Morphological representation

Morphological Representations in MTT are implemented as strings of morphemes arranged in a fixed linear order reflecting the ordering of elements in the actual utterance. It should be noted that this is the first representational level at which linear precedence is considered to be linguistically significant, effectively grouping word-order together with morphological processes and prosody, as one of the three non-lexical means with which languages can encode syntactic structure. As with Syntactic Representation, there are two levels of Morphological Representation—Deep and Surface Morphological Representation. Detailed descriptions of MTT Morphological Representations are found in Mel’čuk (1993–2000)^[9] and Mel’čuk (2006).^[10]

Deep Morphological Representation (DMorphR) consists of strings of lexemes and morphemes—e.g., THE SHOE+{PL} ON BILL+{POSS} FOOT+{PL}. The deep morphological component of rules maps this string onto the Surface Morphological Representation (SMorphR), converting morphemes into the appropriate morphs and performing morphological operations implementing non-concatenative morphological processes—in the case of our example above, giving us /the shoe+s on Bill+s feet/. Rules of the surface morphological component, a subset of which include morphophonemic rules, map the SMorphR onto a phonetic representation [ðə ʃuz an bɪlz fit].

The lexicon

A crucial aspect of MTT is the lexicon, considered to be a comprehensive catalogue of the lexical units (LUs) of a language, these units being the lexemes, collocations and other phrasemes, constructions, and other configurations of linguistic elements that are learned and implemented in speech by users of language. The lexicon in MTT is represented by the Explanatory Combinatorial Dictionary (ECD)^[11]^[12] which includes entries for all of the LUs of a language along with information speakers must know regarding their syntactics (the LU-specific rules and conditions on their combinatorics). An ECD for Russian was produced by Mel’čuk et al. (1984),^[13] and ECDs for French were published as Mel’čuk et al. (1999)^[14] and Mel’čuk & Polguère (2007).^[15]

Lexical functions

One important discovery of meaning–text linguistics was the recognition that LUs in a language can be related to one another in an abstract semantic sense and that this same relation also holds across many lexically-unrelated pairs or sets of LUs. These relations are represented in MTT as lexical functions (LF).^[16] An example of a simple LF is Magn(L), which represents collocations used in intensification such as heavy rain, strong wind, or intense bombardment. A speaker of English knows that for a given lexeme L such as RAIN the value of Magn(RAIN) = HEAVY, whereas Magn(WIND) = STRONG, and so on. MTT currently recognizes several dozen standard LFs that are known to recur across languages.

References

↑ Žolkovskij, Aleksandr K.; Igor A. Mel’čuk (1965). "O vozmožnom metode i instrumentax semantičeskogo sinteza (On a possible method and instruments for semantic synthesis)". Naučno-texničeskaja informacija. 5: 23–28.
↑ Mel’čuk, Igor A. (1981). "Meaning-Text Models: A recent trend in Soviet linguistics". Annual Review of Anthropology. 10: 27–62. doi:10.1146/annurev.an.10.100181.000331.
↑ Mel’čuk, Igor A. (1988). Dependency syntax: Theory and practice. Albany, NY: SUNY Press.
↑ Mel’čuk, Igor A. (2001). Communicative organization in natural language: The semantic-communicative structure of sentences. Amsterdam: John Benjamins.
↑ Milićević, Jasmina (2007). La paraphrase. Modélisation de la paraphrase langagière. Bern: Peter Lang.
↑ Fähndrich, J. et al. 2014: "Formal Language Decomposition into Semantic Primes." ADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal 3.8 (2014): 56-73.
↑ Mel’čuk, Igor A. (1988). Dependency syntax: Theory and practice. Albany, NY: SUNY Press.
↑ Mel’čuk, Igor A.; Nikolai V. Pertsov (1987). Surface syntax of English: A formal model within the Meaning-Text framework. Amsterdam: John Benjamins.
↑ Mel'čuk, Igor A. (1993–2000). Cours de morphologie générale. Montréal: Les Presses de l’Université de Montréal.
↑ Mel’čuk, Igor A. (2006). Aspects of the Theory of Morphology. Berlin: Mouton de Gruyter.
↑ Mel’čuk, Igor A.; Andre Clas; Alain Polguère (1995). Introduction à la lexicologie explicative et combinatoire. Paris: Duculot.
↑ Mel’čuk, Igor A. (2006). Sica, G, ed. "Explanatory combinatorial dictionary". Open Problems in Linguistics and Lexicography. Monza:: Polimetrica: 222–355.
↑ Mel’čuk, Igor A.; Aleksandr K. Žolkovsky; Juri Apresjan (1984). Толково-комбинаторный словарь современного русского языка: Опыты семантико-синтаксического описания русской лексики. [Explanatory Combinatorial Dictionary of Modern Russian: Semantico-Syntactic Studies of Russian Vocabulary]. Wiener Slawistischer Almanach: Vienna.
↑ Mel’čuk, Igor A.; N. Arbatchewsky-Jumarie, Lida Iordanskaja, S. Mantha & Alain Polguère (1999). Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques IV. Montréal: Les Presses de l’Université de Montréal. Cite uses deprecated parameter |coauthors= (help)
↑ Mel’čuk, Igor A.; Alain Polguère (2007). Lexique actif du français : L'apprentissage du vocabulaire fondé sur 20000 dérivations sémantiques et collocations du français. Paris: Duculot.
↑ Mel’čuk, Igor A. (1996). Wanner, Leo, ed. "Lexical functions: a tool for the description of lexical relations in a lexicon". Lexical Functions in Lexicography and Natural Language Processing: 37–102.

External links

The Meaning-Text Theory web site, hosts the proceedings of the biannual MTT-conference
Observatoire de linguistique Sens-Texte (OLST)
Meaning–Text @ neuvel.net, an excellent introduction to the theory
Meaning–Text on-line library

Meaning–text software

Carabao Linguistic Virtual Machine provided by LinguaSys
ETAP-3 Linguistic Processing System, described as 'a Full-Fledged NLP Implementation of the Meaning-Text Theory' (official site)

This article is issued from Wikipedia - version of the 10/26/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Meaning–text theory

Levels of representation

Semantic representation

Syntactic representation

Morphological representation

The lexicon

Lexical functions

References

Further reading

General

Syntax

Morphology

Lexicography

External links

Meaning–text software