【正文】
on of linguistics from empiricism towards rationalism – “Any natural corpus will be skewed. Some sentences won?t occur because they are obvious, others because they are false, still others because they are impolite. The corpus, if natural, will be so wildly skewed that the description would be no more than a mere list.” (Chomsky 1962) – Our internal knowledge of language in human brain (petence, Ilanguage) replaces observed data (performance, Elanguage) – Intuitions started to be relied on as evidence ? [Xiao, R. (2021) “Theorydriven corpus research: using corpora to inform aspect theory”. In A. L252。 M. Kyto (eds.) Corpus Linguistics: An International Handbook. Berlin: Mouton de Gruyter] A brief history of CL ? Revival of CL – Corpus research was continued in a few centres (Brown, Lancaster) in the 60s70s ? The Brown University Standard Corpus of Presentday American English (Brown corpus) ? LancasterOsloBergen Corpus of BrE (LOB) – The hardware still imposed some restrictions until the real development started in the 1980s ? The marriage of corpora with puter technology rekindled interest in the corpus methodology ? Since then, the number and size of corpora and corpusbased studies have increased dramatically – Nowadays, the corpus methodology enjoys widespread popularity, and has opened up or foregrounded many new areas of research Areas that have used corpora ? Lexicography ? Lexical studies ? Grammatical studies ? Register/genre analysis ? Language variation ? Contrastive analysis ? Translation studies ? Language change ? Language teaching ? Semantics ? Pragmatics ? Stylistics ? Literary study ? Sociolinguistics ? Discourse analysis ? Forensic linguistics ? Computational linguistics ? … Nature of corpusbased approach ? It is empirical, analysing the actual patterns of use from natural texts ? It utilises a large and principled collection of natural texts as the basis for analysis ? It makes extensive use of puters for analysis, using both automatic and interactive techniques ? It integrates both quantitative and qualitative analytical techniques (Biber et al 1998: 45) Why use puters? ? Development of puter technology has revived CL ? Machinereadability is a de facto attribute of modern corpora ? Electronic corpora have advantages unavailable to their “shoebox” ancestors – It is the use of puterized corpora, together with puter programs which facilitate linguistic analysis, that distinguishes modern electronic corpora from early ?drawercumslip? corpora Why use puters? ? Computerized corpora can be processed and manipulated rapidly at minimal cost – . searching, selecting, sorting and formatting ? Computers can process machinereadable data accurately and consistently ? Computers can avoid human bias in an analysis, thus making the result more reliable ? Machinereadability allows further automatic processing to be performed on the corpus so that corpus texts can be enriched with various metadata and linguistic analyses – Corpus markup and corpus annotation A question for Deep Thought “Alright,” said the puter Deep Thought. “The Answer to the Great Question...” “Yes...!” “Of Life, the Universe and Everything ...” said Deep Thought. “Yes...!” “Is...” “Yes...!!!...?” “Fortytwo,” said Deep Thought, with infinite majesty and calm. It was a long time before anyone spoke. “Fortytwo!” yelled someone in the audience. “Is that all you?ve got