中国英语学习者语料库CLEC(桂诗春杨惠中) - 图文 联系客服

发布时间 : 星期一 文章中国英语学习者语料库CLEC(桂诗春杨惠中) - 图文更新完毕开始阅读ab274a88915f804d2a16c180

中国英语学习者语料库

CLEC收集了包括中学生、大学英语4级和6级、专业英语低年级和高年级在内的5种学生的语料一百多万词,并对言语失误进行标注。其目的就是观察各类学生的英语特征和言语失误的情况,希望通过定量和定性的方法对中国学习者英语作出较为精确的描写,为我国学生的英语教学提供有用的反馈信息。

表1 CLEC语料分布 类型 词次 ST2 208088 ST3 209043 ST4 212855 ST5 214510 ST6 226106 总计 1070602

言语失误标注原则

1. 简单合理,易于系统操作。参与标注的人比较多,分类表过于繁复,就

难于掌握。我们采取两级分类,第一级有11类:词形(fm)、动词短语(vp)、名词短语(np)、代词(pr)、形容词短语(aj)、副词(ad)、介词短语(pp)、连词(cj)、词汇(wd)、搭配(cc)、句子(sn)。每一类里再用数目字细分。如[cc]为词语搭配不当,[cc1]表示名词和名词的搭配,[cc2]表示名词和动词的搭配,[cc3]表示动词和名词的搭配,等等。

2. 分类表的类别要适中。过粗容易统一,但信息太少,不利于分析学习者的失误/过细难以统一,容易把同一种失误归到不同类别。目前我们采取的办法是对常见的失误从细(如vp和np都有9小类),对少见的失误从粗(如cj只有两小类)。现在的分类表有61个失误码,是属于中等规模的分类表。 提供足够的失误信息(失误本身、失误类型和失误发生范围)。例如In the past, people are [vp6, 4-] kind to each other…, 失误用方括号表示,放在失误之后。 [vp6]为vp(动词)第6种(时态)失误,4-为失误发生的范围,-表示失误的位置,4表示失误前有4个词。要联系这4个词,才能判断are这个词用错了。

开放性。容许研究者根据需要对失误类型进行补充或进一步再分出细类。例如[sn8]为句子结构有缺陷,研究者可以对这种失误再分为若干细类来研究。这需要把sn8的失误全部检索出来,然后定出第三级的分类范畴,如sn81,sn82,等等。

5. 对语体或失误的来由暂不作标注,因为这需要标注者较多的主观判断,更难以统一。

言语失误分类表(总数:61)

词形 码 类型 码 vp1 vp2 vp3 vp4 vp5 vp6 vp7 vp8 vp9 动词短语 类型 pattern set phrase agreement non-finite tense voice mood 码 np1 np2 np3 np5 np6 np7 np8 名词短语 类型 pattern set phrase agreement case countability number article quantifiers other determiners 形容词短语 码 类型 pattern degree -ed/-ing confusion aj5 predicative/attributive 词语 码 类型 order 码 cc1 搭配 类型 noun/noun noun/verb verb/noun adj/noun verb/adv adv/adj 码 sn1 sn2 sn3 sn4 sn5 sn6 sn7 sn8 句子 类型 run-on sentence wd2 wd3 wd4 wd5 wd6 wd7 part of speech cc2 substitution absence redundancy repetition ambiguity cc3 cc4 cc5 cc6 sentence fragment dangling modifier illogical comparison topic prominence Coordination Subordination structural deficiency 码 ad1 ad3 副词 类型 order modification degree 码 pp1 pp2 介词短语 类型 pattern set phrase 码 连词 类型 cj1 pattern cj2 set phrase 码 代词 类型 pr1 Reference pr2 anticipatory it fm3 capitalization pr3 Agreement pr4 Case pr5 wh- pr6 Indefinite fm1 Spelling fm2 word building finite/non-finite np4 modal/auxiliary np9 aj1 aj2 aj3 aj4 set phrase ad2 wd1 sn9 Punctuation

标注说明

码 分 类 类 别 说 明 fm1 word Spelling(拼写) spelling, coinage, abbreviation, apostrophe fm2 word word buildingderivation, inflection, compounding, (构词) plurality (noun), irregularity(verb), 3rd person singular form(verb), syllabification, hyphenation, word division or fusion fm3 vp1 word Capitalization(大小写) vb phr Pattern(及物性型式) lower initial letter for upper initial letter or vice versa error in transitivity(vi as vt or vice versa), transitive verb pattern/ grammatical(cf Oxford advanced learner’s dictionary of current English edited by A. S. Hornby) vp2 vp3 vp4 vp5 vb phr set phrase(固定词组) vb phr Agreement(主谓一致性) vb phr finite/non-finite(定式) vb phr non-finite(不定式) phrasal verb and verbal phrase: error in form or use number agreement with its subject (noun or pronoun) finite verb for non-finite verb or vice versa infinitive error: form and use/ infinitive for participle or vice versa/ -ed participle for -ing participle or vice versa error in tense use within a sentence/ the sequence of tenses between sentences error in the use of voice: active for passive or vice versa error in the use of mood: imperative, subjunctive/ improper structure of conditional sentences misuse of modal/auxiliary verbs/ wrong vp6 vb phr Tense(时态) vp7 vp8 vb phr voice (语态) vb phr Mood(语气) vp9 vb phr modal/auxiliaryform of modal verb(or auxiliary verb) and verb combination (e.g tense form, voice form, etc) np1 nn phr Pattern(名词型Error in combination with other 式) words/grammatical np2 nn phr set phrase(固定omission or replacement of a fixed 词组) element that goes after a certain noun np3 nn phr Agreement(主谓number agreement of a noun with its 一致性) determiner or a word that refers to it np4 nn phr Case(格) possessive case error: form or use np5 nn phr Countability(可uncountable noun used as countable 数性) noun np6 nn phr Number(数) countable noun used with no determiner or -s/ a or -s with plural noun np7 nn phr Article(冠词) a/an confusion or definite/indefinite confusion np8 nn phr Quantifiers(数misuse or confusion between many/much, 量词) (a) few/(a) little, some/any, etc np9 nn phr other misuse or confusion of demonstratives, determiners(其wh- determiners, numerals, etc. 他限定词) pr1 pron Reference(指称) incorrect/ambiguous pronoun reference/anaphoric pr2 pron anticipatory itimproper or wrong use of anticipatory (先行it) it / it replaced by a demonstrative, etc pr3 pron Agreement(主谓number agreement with a noun it refers 一致性) to pr4 pron Case(格) case error of any personal pronoun pr5 pron wh-(wh-代词) misuse or confusion of interrogative, relative and conjunctive pronouns pr6 pron Indefinite(不定misuse or confusion of indefinite 式) pronouns such as all/both, few/little, some/any, either/neither, etc aj1 adj Pattern(形容词error in the combination with other 型式) words/grammatical aj2 adj set phrase(固定error in the idiomatic use of an 词组) adjectival phrase/ omission or replacement of a fixed element that goes after a certain adjective aj3 adj Degree(级) adjective degree error: form and use aj4 adj -ed/-ing -ed adjective for -ing adjective or confusionvice versa (情态)