Citizendia

Linguistics
Theoretical linguistics
Phonetics
Phonology
Morphology
Syntax
Lexis
Semantics
Lexical semantics
Statistical semantics
Structural semantics
Prototype semantics
Pragmatics
Applied linguistics
Language acquisition
Psycholinguistics
Sociolinguistics
Linguistic anthropology
Generative linguistics
Cognitive linguistics
Computational linguistics
Descriptive linguistics
Historical linguistics
Comparative linguistics
Etymology
Stylistics
Prescription
Corpus linguistics
History of linguistics
List of linguists
Unsolved problems

'Morphology is the field of linguistics that studies the internal structure of words. Linguistics is the scientific study of Language, encompassing a number of sub-fields (Words as units in the lexicon are the subject matter of lexicology. Not to be mistaken with Lexicography. ) While words are generally accepted as being (with clitics) the smallest units of syntax, it is clear that in most (if not all) languages, words can be related to other words' by rules. In Linguistics, a clitic is a grammatically independent and phonologically dependent Word. In Linguistics, syntax (from Ancient Greek grc συν- syn-, "together" and grc τάξις táxis, "arrangement" is the For example, English speakers recognize that the words dog, dogs, and dog-catcher are closely related. English is a West Germanic language originating in England and is the First language for most people in the United Kingdom, the United States English speakers recognize these relations from their tacit knowledge of the rules of word-formation in English. They intuit that dog is to dogs as cat is to cats; similarly, dog is to dog-catcher as dish is to dishwasher. The rules understood by the speaker reflect specific patterns (or regularities) in the way words are formed from smaller units and how those smaller units interact in speech. In this way, morphology is the branch of linguistics that studies patterns of word-formation within and across languages, and attempts to formulate rules that model the knowledge of the speakers of those languages.

Contents

History

The history of morphological analysis dates back to the ancient Indian linguist Pāṇini, who formulated the 3,959 rules of Sanskrit morphology in the text Aṣṭādhyāyī by using a Constituency Grammar. Pāṇini ( IAST: Pāṇini Dēvanāgarī: sa पाणिनि a Patronymic meaning "descendant of {{IAST|Paṇi}} " was an ancient Sanskrit (sa संस्कृता वाक् saṃskṛtā vāk, for short sa संस्कृतम् saṃskṛtam) is a historical Pāṇini ( IAST: Pāṇini Dēvanāgarī: sa पाणिनि a Patronymic meaning "descendant of {{IAST|Paṇi}} " was an ancient The Graeco-Roman grammatical tradition also engaged in morphological analysis.

The term morphology was coined by August Schleicher in 1859[1]

Fundamental concepts

Lexemes and word forms

The distinction between these two senses of "word" is arguably the most important one in morphology. August Schleicher ( February 19, 1821 – December 6, 1868) was a German linguist born in Meiningen ( Duchy Year 1859 ( MDCCCLIX) was a Common year starting on Saturday (link will display the full calendar of the Gregorian calendar (or a Common The first sense of "word," the one in which dog and dogs are "the same word," is called lexeme. For its use in the context of Computer Science see Lexical analysis. The second sense is called word-form. We thus say that dog and dogs are different forms of the same lexeme. Dog and dog-catcher, on the other hand, are different lexemes; for example, they refer to two different kinds of entities. The form of a word that is chosen conventionally to represent the canonical form of a word is called a lemma, or citation form. In Linguistics a lemma (plural lemmas or lemmata) has two distinct interpretations morphology / Lexicography: the

Prosodic word vs. morphological word

Here are examples from other languages of the failure of a single phonological word to coincide with a single morphological word-form. In Latin, one way to express the concept of 'NOUN-PHRASE1 and NOUN-PHRASE2' (as in "apples and oranges") is to suffix '-que' to the second noun phrase: "apples oranges-and", as it were. An extreme level of this theoretical quandary posed by some phonological words is provided by the Kwak'wala language. [2] In Kwak'wala, as in a great many other languages, meaning relations between nouns, including possession and "semantic case", are formulated by affixes instead of by independent "words". The three word English phrase, "with his club", where 'with' identifies its dependent noun phrase as an instrument and 'his' denotes a possession relation, would consist of two words or even just one word in many languages. But RABBID SQUIRRELS AND CHICKEN NUGGETS affixation for semantic relations in Kwak'wala differs dramatically (from the viewpoint of those whose language is not Kwak'wala) from such affixation in other languages for this reason: the affixes phonologically attach not to the lexeme they pertain to semantically, but to the preceding lexeme. Consider the following example (in Kwakw'ala, sentences begin with what corresponds to an English verb):[3]

kwixʔid-i-da             bəgwanəmai-χ-a            q'asa-s-isi           t'alwagwayu

Morpheme by morpheme translation:

kwixʔid-i-da = clubbed-PIVOT-DETERMINER

bəgwanəma-χ-a = man-ACCUSATIVE-DETERMINER

q'asa-s-is = otter-INSTRUMENTAL-3. PERSON. SINGULAR-POSSESSIVE

t'alwagwayu = club.

"the man clubbed the otter with his club"

(Notation notes:

1. accusative case marks an entity that something is done to.

2. determiners are words such as "the", "this", "that".

3. the concept of "pivot" is a theoretical construct that is not relevant to this discussion. )

That is, to the speaker of Kwak'wala, the sentence does not contain the "words" 'him-the-otter' or 'with-his-club' Instead, the markers -i-da (PIVOT-'the'), referring to man, attaches not to bəgwanəma ('man'), but instead to the "verb"; the markers -χ-a (ACCUSATIVE-'the'), referring to otter, attach to bəgwanəma instead of to q'asa ('otter'), etc. To summarize differently: a speaker of Kwak'wala does not perceive the sentence to consist of these phonological words:

kwixʔid         i-da-bəgwanəma          χ-a-q'asa        s-isi-t'alwagwayu
"clubbed           PIVOT-the-mani         hit-the-otter         with-hisi-club

A central publication on this topic is the recent volume edited by Dixon and Aikhenvald (2007), examining the mismatch between prosodic-phonological and grammatical definitions of "word" in various Amazonian, Australian Aboriginal, Caucasian, Eskimo, Indo-European, Native North American, and West African languages, and in sign languages. Apparently, a wide variety of languages make use of the hybrid linguistic unit clitic, possessing the grammatical features of independent words but the prosodic-phonological lack of freedom of bound morphemes. The intermediate status of clitics poses a considerable challenge to linguistic theory.

Inflection vs. word-formation

Given the notion of a lexeme, it is possible to distinguish two kinds of morphological rules. Some morphological rules relate different forms of the same lexeme; while other rules relate two different lexemes. Rules of the first kind are called inflectional rules, while those of the second kind are called word-formation. In Grammar, inflection or inflexion is the way language handles grammatical relations and relational categories such as tense, mood, voice In Linguistics, word formation is the creation of a new Word. The English plural, as illustrated by dog and dogs, is an inflectional rule; compounds like dog-catcher or dishwasher provide an example of a word-formation rule. Informally, word-formation rules form "new words" (that is, new lexemes), while inflection rules yield variant forms of the "same" word (lexeme).

There is a further distinction between two kinds of word-formation: derivation and compounding. In Linguistics, derivation is "Used to form new words as with happi-ness and un-happy from happy, or determination from In Linguistics, a compound is a Lexeme (less precisely a Word) that consists of more than one stem. Compounding is a process of word-formation that involves combining complete word-forms into a single compound form; dog-catcher is therefore a compound, because both dog and catcher are complete word-forms in their own right before the compounding process has been applied, and are subsequently treated as one form. Derivation involves affixing bound (non-independent) forms to existing lexemes, whereby the addition of the affix derives a new lexeme. An affix is a Morpheme that is attached to a stem to form a word In Etymology, a bound morpheme is a Root morpheme that cannot stand alone as an independent word One example of derivation is clear in this case: the word independent is derived from the word dependent by prefixing it with the derivational prefix in-, while dependent itself is derived from the verb depend.

The distinction between inflection and word-formation is not at all clear-cut. There are many examples where linguists fail to agree whether a given rule is inflection or word-formation. The next section will attempt to clarify this distinction.

Paradigms and morphosyntax

Linguistic typology
Morphological
Isolating
Synthetic
Polysynthetic
Fusional
Agglutinative
Morphosyntactic
Alignment
Accusative
Ergative
Philippine
Active-stative
Tripartite
Inverse marking
Syntactic pivot
Theta role
Word Order
VO languages
Subject Verb Object
Verb Subject Object
Verb Object Subject
OV languages
Subject Object Verb
Object Subject Verb
Object Verb Subject
Time Manner Place
Place Manner Time
This box: view  talk  edit

A paradigm is the complete set of related word-forms associated with a given lexeme. Linguistic Typology is an international Peer-reviewed journal in the field of Linguistic typology, founded in 1997 Morphological typology is a way of classifying the languages of the world (see Linguistic typology) that groups languages according to their common morphological structures In morphological typology (in linguistics an isolating language (also analytic language) is any Language in which words are composed of A synthetic language, in Linguistic typology, is a Language with a high Morpheme -per- word ratio Polysynthetic languages are highly Synthetic languages ie languages in which words are composed of many Morphemes Definition The degree of For fusion in Word formation, see Compound (linguistics. A fusional language (also called inflecting language) is a An agglutinative language is a Language that uses Agglutination extensively most Words are formed by joining Morphemes together Morphology is the field of Linguistics that studies the internal structure of words In Linguistics, morphosyntactic alignment is the system used to distinguish between the arguments of Transitive verbs and those of Intransitive A nominative-accusative Language (or simply accusative language) is one that marks the direct object of Transitive verbs distinguishing them An ergative-absolutive Language (or simply ergative language is a language that treats the argument (" subject " of an Intransitive Austronesian alignment, commonly known as the Philippine- or Austronesian -type voice system, is a typologically unusual Morphosyntactic alignment An active-stative language, or active language for short is one in which the sole argument of an Intransitive verb is sometimes marked in the same way A tripartite language, also called an ergative-accusative language, is one that treats the subject of an intransitive verb the subject of a transitive verb and the object A direct-inverse language is a language where clauses with transitive verbs can be expressed either using a direct or an inverse construction The syntactic pivot is the Verb argument around which sentences "revolve" in a given Language. In Generative grammar, (in particular Government and binding theory and the Standard Theory of Transformational Grammar a theta role or θ-role is the In Linguistics, word order typology refers to the study of the different ways in which languages arrange the constituents of their sentences relative to each other and the systematic In Linguistics, a VO language is a language in which the Verb typically comes before the object (thus including SVO, VOS and In Linguistic typology, subject-verb-object ( SVO) is a sentence structure where the subject comes first the Verb second and the object Verb Subject Object ( VSO) is a term in Linguistic typology. It represents one type of languages when classifying languages according to the sequence of these In Linguistic typology, Verb Object Subject or Verb Object Agent - commonly used in its abbreviated form VOS or VOA - represents the language-classification In Linguistics, an OV language is a language in which the object comes before the Verb. In Linguistic typology, Subject Object Verb (SOV is the type of languages in which the subject, object, and Verb of a sentence appear or usually Object Subject Verb (OSV or Object Agent Verb (OAV is one of the permutations of expression used in Linguistic typology. Object Verb Subject (OVS or Object Verb Agent (OVA is one of the Permutations of expression used in Linguistic typology, although it is rare among Time Manner Place (TMP describes one possible ordering of Adpositional phrases in sentences Place Manner Time is a term used in Linguistic typology to state the general order of Adpositional phrases in a language's sentences "to the store by car The familiar examples of paradigms are the conjugations of verbs, and the declensions of nouns. In Linguistics, conjugation is the creation of derived forms of a Verb, Noun or Adjective from its Principal parts by Inflection In Linguistics, declension (or declination) is the occurrence of Inflection in Nouns Pronouns and Adjectives indicating Accordingly, the word-forms of a lexeme may be arranged conveniently into tables, by classifying them according to shared inflectional categories such as tense, aspect, mood, number, gender or case. Grammatical tense is a temporal linguistic quality expressing the time at during or over which a state or action denoted by a verb occurs In Linguistics, the grammatical aspect of a Verb defines the temporal flow (or lack thereof in the described event or state Grammatical mood is one of a set of distinctive Verb forms that are used to signal modality. In linguistics grammatical number is a Grammatical category of nouns pronouns and adjective and verb agreement that expresses count distinctions (such as "one" In Linguistics, grammatical genders, sometimes also called Noun classes are classes of nouns reflected in the behavior of associated words every noun must belong In Grammar, the case of a Noun or Pronoun indicates its Grammatical function in a greater Phrase or Clause; such as the For example, the personal pronouns in English can be organized into tables, using the categories of person (1st. , 2nd. , 3rd. ), number (singular vs. plural), gender (masculine, feminine, neuter), and case (subjective, objective, and possessive). In Grammar, the case of a Noun or Pronoun indicates its Grammatical function in a greater Phrase or Clause; such as the See English personal pronouns for the details. The personal pronouns of English can have various forms according to gender, number, person, and case.

The inflectional categories used to group word-forms into paradigms cannot be chosen arbitrarily; they must be categories that are relevant to stating the syntactic rules of the language. In Linguistics, syntax (from Ancient Greek grc συν- syn-, "together" and grc τάξις táxis, "arrangement" is the For example, person and number are categories that can be used to define paradigms in English, because English has grammatical agreement rules that require the verb in a sentence to appear in an inflectional form that matches the person and number of the subject. In Languages agreement is a form of cross-reference between different parts of a sentence or phrase In other words, the syntactic rules of English care about the difference between dog and dogs, because the choice between these two forms determines which form of the verb is to be used. In contrast, however, no syntactic rule of English cares about the difference between dog and dog-catcher, or dependent and independent. The first two are just nouns, and the second two just adjectives, and they generally behave like any other noun or adjective behaves.

An important difference between inflection and word-formation is that inflected word-forms of lexemes are organized into paradigms, which are defined by the requirements of syntactic rules, whereas the rules of word-formation are not restricted by any corresponding requirements of syntax. Inflection is therefore said to be relevant to syntax, and word-formation is not. The part of morphology that covers the relationship between syntax and morphology is called morphosyntax, and it concerns itself with inflection and paradigms, but not with word-formation or compounding. In Linguistics, syntax (from Ancient Greek grc συν- syn-, "together" and grc τάξις táxis, "arrangement" is the

Allomorphy

In the exposition above, morphological rules are described as analogies between word-forms: dog is to dogs as cat is to cats, and as dish is to dishes. In this case, the analogy applies both to the form of the words and to their meaning: in each pair, the first word means "one of X", while the second "two or more of X", and the difference is always the plural form -s affixed to the second word, signaling the key distinction between singular and plural entities.

One of the largest sources of complexity in morphology is that this one-to-one correspondence between meaning and form scarcely applies to every case in the language. In English, we have word form pairs like ox/oxen, goose/geese, and sheep/sheep, where the difference between the singular and the plural is signaled in a way that departs from the regular pattern, or is not signaled at all. Even cases considered "regular", with the final -s, are not so simple; the -s in dogs is not pronounced the same way as the -s in cats, and in a plural like dishes, an "extra" vowel appears before the -s. These cases, where the same distinction is effected by alternative forms of a "word", are called allomorphy. This article is about a linguistic term See Pseudomorph for another

Phonological rules constrain which sounds can appear next to each other in a language, and morphological rules, when applied blindly, would often violate phonological rules, by resulting in sound sequences that are prohibited in the language in question. For example, to form the plural of dish by simply appending an -s to the end of the word would result in the form *[dɪʃs], which is not permitted by the phonotactics of English. Phonotactics (in Greek phone = voice and tactic = course is a branch of Phonology that deals with restrictions in a Language on the In order to "rescue" the word, a vowel sound is inserted between the root and the plural marker, and [dɪʃəz] results. Similar rules apply to the pronunciation of the -s in dogs and cats: it depends on the quality (voiced vs. unvoiced) of the final preceding phoneme. The phoneME project is Sun Microsystems reference implementation of Java virtual machine and associated libraries of Java ME with source licensed under the GNU

Lexical morphology

Lexical morphology is the branch of morphology that deals with the lexicon, which, morphologically conceived, is the collection of lexemes in a language. In Linguistics, the lexicon (from Greek Λεξικόν of a language is its Vocabulary, including its words and expressions For its use in the context of Computer Science see Lexical analysis. As such, it concerns itself primarily with word-formation: derivation and compounding.

Models of morphology

There are three principal approaches to morphology, which each try to capture the distinctions above in different ways. These are,

Note that while the associations indicated between the concepts in each item in that list is very strong, it is not absolute.

Morpheme-based morphology

In morpheme-based morphology, word-forms are analyzed as arrangements of morphemes. Morpheme-based morphology is a view on morphology with the following three basic axioms Baudoin’s SINGLE MORPHEME HYPOTHESIS Roots and affixes have the same In Morpheme-based morphology, a morpheme is the smallest linguistic unit that has semantic meaning. A morpheme is defined as the minimal meaningful unit of a language. In a word like independently, we say that the morphemes are in-, depend, -ent, and ly; depend is the root and the other morphemes are, in this case, derivational affixes. The root is the primary lexical unit of a Word, which carries the most significant aspects of semantic content and cannot be reduced into smaller constituents [4] In a word like dogs, we say that dog is the root, and that -s is an inflectional morpheme. This way of analyzing word-forms as if they were made of morphemes put after each other like beads on a string, is called Item-and-Arrangement.

The morpheme-based approach is the first one that beginners to morphology usually think of, and which laymen tend to find the most obvious. This is so to such an extent that very often beginners think that morphemes are an inevitable, fundamental notion of morphology, and many five-minute explanations of morphology are, in fact, five-minute explanations of morpheme-based morphology. This is, however, not so. The fundamental idea of morphology is that the words of a language are related to each other by different kinds of rules. Analyzing words as sequences of morphemes is a way of describing these relations, but is not the only way. In actual academic linguistics, morpheme-based morphology certainly has many adherents, but is by no means the dominant approach.

Lexeme-based morphology

Lexeme-based morphology is (usually) an Item-and-Process approach. Instead of analyzing a word-form as a set of morphemes arranged in sequence, a word-form is said to be the result of applying rules that alter a word-form or stem in order to produce a new one. An inflectional rule takes a stem, changes it as is required by the rule, and outputs a word-form; a derivational rule takes a stem, changes it as per its own requirements, and outputs a derived stem; a compounding rule takes word-forms, and similarly outputs a compound stem.

Word-based morphology

Word-based morphology is a (usually) Word-and-paradigm approach. Word-and-paradigm or' Realizational morphology' concentrates on the word form rather than segments of the word This theory takes paradigms as a central notion. Instead of stating rules to combine morphemes into word-forms, or to generate word-forms from stems, word-based morphology states generalizations that hold between the forms of inflectional paradigms. The major point behind this approach is that many such generalizations are hard to state with either of the other approaches. The examples are usually drawn from fusional languages, where a given "piece" of a word, which a morpheme-based theory would call an inflectional morpheme, corresponds to a combination of grammatical categories, for example, "third person plural. For fusion in Word formation, see Compound (linguistics. A fusional language (also called inflecting language) is a " Morpheme-based theories usually have no problems with this situation, since one just says that a given morpheme has two categories. Item-and-Process theories, on the other hand, often break down in cases like these, because they all too often assume that there will be two separate rules here, one for third person, and the other for plural, but the distinction between them turns out to be artificial. Word-and-Paradigm approaches treat these as whole words that are related to each other by analogical rules. Analogy is both the cognitive process of transferring Information from a particular subject (the analogue or source to another particular subject (the target and Words can be categorized based on the pattern they fit into. This applies both to existing words and to new ones. Application of a pattern different than the one that has been used historically can give rise to a new word, such as older replacing elder (where older follows the normal pattern of adjectival superlatives) and cows replacing kine (where cows fits the regular pattern of plural formation). In Grammar, an adjective is a word whose main syntactic role is to modify a Noun or Pronoun, giving more information about the In Grammar the superlative of an Adjective or Adverb is the greatest form of adjective or adverb which indicates that something has some feature While a Word-and-Paradigm approach can explain this easily, other approaches have difficulty with phenomena such as this.

Morphological typology

In the 19th century, philologists devised a now classic classification of languages according to their morphology. Morphological typology is a way of classifying the languages of the world (see Linguistic typology) that groups languages according to their common morphological structures According to this typology, some languages are isolating, and have little to no morphology; others are agglutinative, and their words tend to have lots of easily-separable morphemes; while others yet are inflectional or fusional, because their inflectional morphemes are said to be "fused" together. In morphological typology (in linguistics an isolating language (also analytic language) is any Language in which words are composed of An agglutinative language is a Language that uses Agglutination extensively most Words are formed by joining Morphemes together For fusion in Word formation, see Compound (linguistics. A fusional language (also called inflecting language) is a This leads to one bound morpheme conveying multiple pieces of information. The classic example of an isolating language is Chinese; the classic example of an agglutinative language is Turkish; both Latin and Greek are classic examples of fusional languages. Turkish ( tr Türkçe IPA) is a language spoken by over 63 million people worldwide making it the most commonly spoken of the Turkic languages. Latin ( lingua Latīna, laˈtiːna is an Italic language, historically spoken in Latium and Ancient Rome. Greek (el ελληνική γλώσσα or simply el ελληνικά — "Hellenic" is an Indo-European language, spoken today by 15-22 million people mainly

Considering the variability of the world's languages, it becomes clear that this classification is not at all clear-cut, and many languages do not neatly fit any one of these types, and some fit in more than one. A continuum of complex morphology of language may be adapted when considering languages.

The three models of morphology stem from attempts to analyze languages that more or less match different categories in this typology. The Item-and-Arrangement approach fits very naturally with agglutinative languages; while the Item-and-Process and Word-and-Paradigm approaches usually address fusional languages.

The reader should also note that the classical typology also mostly applies to inflectional morphology. There is very little fusion going on with word-formation. Languages may be classified as synthetic or analytic in their word formation, depending on the preferred way of expressing notions that are not inflectional: either by using word-formation (synthetic), or by using syntactic phrases (analytic).

Footnotes

  1. ^ Für die Lehre von der Wortform wähle ich das Wort "Morphologie" ("for the science of word formation, I choose the term 'morphology'", Mémoires Acad. Impériale 7/1/7, 35)
  2. ^ Formerly known as Kwakiutl, Kwak'wala belongs to the Northern branch of the Wakashan language family. The term Kwakiutl was usually applied to a group of indigenous peoples of northern Vancouver Island, Queen Charlotte Strait and the Johnstone Strait "Kwakiutl" is still used to refer to the tribe itself, along with other terms.
  3. ^ Example taken from Foley 1998, using a modified transcription. This phenomenon of Kwak'wala was reported by Jacobsen as cited in van Valin and La Polla 1997.
  4. ^ The existence of words like appendix and pending in English does not mean that the English word depend is analyzed into a derivational prefix de- and a root pend. While all those were indeed once related to each other by morphological rules, this was so only in Latin, not in English. English borrowed the words from French and Latin, but not the morphological rules that allowed Latin speakers to combine de- and the verb pendere 'to hang' into the derivative dependere.

See also

Sources

(Abbreviations: CUP = Cambridge: Cambridge University Press; UP = University Press)

© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic