Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN: UVA:X004991162


A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Corpora in Language Acquisition Research

Author: Heike Behrens
Publisher: John Benjamins Publishing
Total Pages: 280
Release: 2008
Genre: Language Arts & Disciplines
ISBN: 9027234760


Corpus research forms the backbone of research on children's language development. Leading researchers in the field present a survey on the history of data collection, different types of data, and the treatment of methodological problems. Morphologically and syntactically parsed corpora allow for the concise explorations of formal phenomena, the quick retrieval of errors, and reliability checks. New probabilistic and connectionist computations investigate how children integrate the multiple sources of information available in the input, and new statistical methods compute rates of acquisition as well as error rates dependent on sample size. Sample analyses show how multi-modal corpora are used to investigate the interaction of discourse and linguistic structure, how cross-linguistic generalizations for acquisition can be formulated and tested, and how individual variation can be explored. Finally, ways in which corpus research interacts with computational linguistics and experimental research are presented.

Corpus Linguistics for Writing Development

Author: Philip Durrant
Publisher: Taylor & Francis
Total Pages: 194
Release: 2022-11-30
Genre: Language Arts & Disciplines
ISBN: 9781000790245


Corpus Linguistics for Writing Development provides a practical introduction to using corpora in the study of first and second language learners’ written language over time and across different levels of proficiency. Focusing on development in the use of vocabulary, formulaic language, and grammar, this book
• discusses how corpus research can contribute to our understanding of writing development and to pedagogical practice;
• reviews a range of corpus techniques for studying writing development from the perspectives of vocabulary, grammar, and formulaic language and interrogates the methodological bases of those techniques; and
• guides readers to perform practical analyses of learner writing using the R open-source programming language.
Aimed at the novice researcher, this book will be key reading for advanced undergraduate and postgraduate students in the fields of education, language, and linguistics. It will be of particular interest to those interested in first or second language writing, language assessment, and learner corpus research.

Language Corpora Annotation and Processing

Author: Niladri Sekhar Dash
Publisher: Springer Nature
Total Pages: 135
Release: 2021
Genre: Computational linguistics
ISBN: 9789811629600


This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.

The Development of Corpus Linguistics to Its Present day Concept

Author: Bernadette Wonner
Publisher: GRIN Verlag
Total Pages: 29
Release: 2007-09-30
Genre: Electronic Book
ISBN: 9783638762281


Seminar paper from the year 2005 in the subject English Language and Literature Studies - Linguistics, grade: 1, LMU Munich (Institut für Englische Philologie), course: Corpus linguistics and teaching, 10 entries in the bibliography, language: English, abstract: [...] This paper will provide an overview of the different stages that CL has gone through. Early Corpus Linguistics will be presented first, a term that describes all corpus-based work up to the end of the 1950s. That is the time when Noam Chomsky prompts the early researchers to reflect on their work in ways that call into question what was done up to that point. As a result, corpus research faces a certain discontinuity. Nevertheless, corpus-based work does not cease entirely, and improvements in computer technology provide completely new possibilities for corpus research. Over the decades, a considerable number of machine-readable corpora are created for an increasing variety of purposes, and they enable many kinds of analysis. After the presentation of the chronological development of CL, the penultimate chapter of the paper deals with the concept of modern corpus linguistics and gives the definition of a corpus, which is not yet a settled matter. There is still a lot of work going on to improve the corpus linguistic methodology. The last chapter will give an overview of future prospects.

Corpus Linguistics 25 Years on

Author: Anonymous
Publisher: BRILL
Total Pages: 391
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9789401204347


This volume offers a state-of-the-art picture of work undertaken in the field of computer-aided corpus linguistics. While the focus is on English, central insights can be generalised to other languages, as well. As a work intended to mark the Silver Jubilee of ICAME, the International Computer Archive of Modern and Medieval English, the book combines surveys of the discipline by some of its major pioneers, including founders of ICAME itself, with cutting-edge work by younger scholars. It is divided into three sections: “Overviewing years of corpus linguistic studies”, “Descriptive studies in English syntax and semantics”, and “Second Language Acquisition, parallel corpora and specialist corpora”. The book bears witness to the impressive advances that have characterised the development of corpus linguistics over the past few decades – from terminological issues to practical applications, from theoretical and descriptive research to applied approaches, from monolingual to multilingual and specialist corpora, from corpus design to corpus exploitation tools.

Spoken Corpora and Linguistic Studies

Author: Tommaso Raso, Heliana Mello
Publisher: John Benjamins Publishing Company
Total Pages: 508
Release: 2014-11-14
Genre: Language Arts & Disciplines
ISBN: 9789027270030


The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. The book is organized into four sections: spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure. Within them, the authors present innovative methodologies that focus on the compilation of third generation spoken corpora and on multilevel spoken corpora annotation and its functions, and they initiate a debate about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files. The web site can be found at the following address: DOI: 10.1075/scl.61.media

Creating and Using English Language Corpora

Author: Fries
Publisher: BRILL
Total Pages: 224
Release: 2023-11-20
Genre: Computers
ISBN: 9789004652804
