Corpus Linguistics and Linguistically Annotated Corpora

Author: Sandra Kuebler, Heike Zinsmeister
Publisher: Bloomsbury Publishing
Total Pages: 321
Release: 2014-12-18
Genre: Language Arts & Disciplines
ISBN: 9781441119919

Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, part-of-speech, syntactic, semantic and discourse-level annotation, as well as the advantages and challenges of such annotations. It covers the main annotated corpora for English (the Penn Treebank, the International Corpus of English, and OntoNotes) as well as a wide range of corpora for other languages. In its third part, the book explores the search strategies required for different types of data. All chapters are accompanied by exercises and by sections on further reading.

Corpus Linguistics 25 Years on

Author: Anonymous
Publisher: BRILL
Total Pages: 391
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9789401204347

This volume offers a state-of-the-art picture of work undertaken in the field of computer-aided corpus linguistics. While the focus is on English, central insights can be generalised to other languages as well. As a work intended to mark the Silver Jubilee of ICAME, the International Computer Archive of Modern and Medieval English, the book combines surveys of the discipline by some of its major pioneers, including founders of ICAME itself, with cutting-edge work by younger scholars. It is divided into three sections: “Overviewing years of corpus linguistic studies”, “Descriptive studies in English syntax and semantics”, and “Second Language Acquisition, parallel corpora and specialist corpora”. The book bears witness to the impressive advances that have characterised the development of corpus linguistics over the past few decades – from terminological issues to practical applications, from theoretical and descriptive research to applied approaches, from monolingual to multilingual and specialist corpora, from corpus design to corpus exploitation tools.

Spoken Corpora and Linguistic Studies

Author: Tommaso Raso, Heliana Mello
Publisher: John Benjamins Publishing Company
Total Pages: 508
Release: 2014-11-14
Genre: Language Arts & Disciplines
ISBN: 9789027270030

The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. The book is divided into four sections (spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure), in which the authors present innovative methodologies for the compilation of third generation spoken corpora and for multilevel spoken corpora annotation and its functions, and open a debate about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files, available at DOI: 10.1075/scl.61.media

Understanding Corpus Linguistics

Author: Danielle Barth, Stefan Schnell
Publisher: Routledge
Total Pages: 276
Release: 2021-11-18
Genre: Language Arts & Disciplines
ISBN: 9781000466751

This textbook introduces the fundamental concepts and methods of corpus linguistics for students approaching this topic for the first time, putting specific emphasis on the enormous linguistic diversity represented by approximately 7,000 human languages and broadening the scope of current concerns in general corpus linguistics. Including a basic toolkit to help the reader investigate language in different usage contexts, this book shows the relevance of corpora to a range of linguistic areas from phonology to sociolinguistics and discourse; covers recent developments in the application of corpus linguistics to the study of understudied languages and linguistic typology; features exercises, short problems, and questions; and includes examples from real studies in over 15 languages plus multilingual corpora. Providing the necessary corpus linguistics skills to critically evaluate and replicate studies, this book is essential reading for anyone studying corpus linguistics.

Corpus Linguistics and Linguistic Theory

Author: Christian Mair, Marianne Hundt
Publisher: Rodopi
Total Pages: 408
Release: 2000
Genre: Computers
ISBN: 9042014938

From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern and Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age. Contributions cover all levels of linguistic description, from phonology/prosody through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. The atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).

Corpus Annotation

Author: R.G. Garside, Geoffrey Leech, Anthony Mark McEnery
Publisher: Routledge
Total Pages: 292
Release: 2014
Genre: Computational linguistics
ISBN: 1317885651

Corpus Annotation gives an up-to-date picture of this fascinating new area of research, and will provide essential reading for newcomers to the field as well as those already involved in corpus annotation. Early chapters introduce the different levels and techniques of corpus annotation. Later chapters deal with software developments, applications, and the development of standards for the evaluation of corpus annotation. While the book takes detailed account of research world-wide, its focus is particularly on the work of the UCREL (University Centre for Computer Corpus Research on Language) team at Lancaster University, which has been at the forefront of developments in the field of corpus annotation since its beginnings in the 1970s.

Corpus Linguistics

Author: Geoffrey Sampson, Diana McCarthy
Publisher: A&C Black
Total Pages: 541
Release: 2005-10-01
Genre: Language Arts & Disciplines
ISBN: 9781441139375

Corpus Linguistics seeks to provide a comprehensive sampling of real-life usage in a given language, and to use these empirical data to test language hypotheses. Modern corpus linguistics began fifty years ago, but the subject has seen explosive growth since the early 1990s. These days corpora are being used to advance virtually every aspect of language study, from computer processing techniques such as machine translation, to literary stylistics, social aspects of language use, and improved language-teaching methods. Because corpus linguistics has grown fast from small beginnings, newcomers to the field often find it hard to get their bearings. Important papers can be difficult to track down. This volume reprints forty-two articles on corpus linguistics by an international selection of authors, which comprehensively illustrate the directions in which the subject is developing. It includes articles that are already recognized as classics, and others which deserve to become so, supplemented with editorial introductions relating the individual contributions to the field as a whole. This collection of readings will be useful to students of corpus linguistics at both undergraduate and postgraduate level, as well as academics researching this fascinating area of linguistics.

Computational Methods for Corpus Annotation and Analysis

Author: Xiaofei Lu
Publisher: Springer
Total Pages: 186
Release: 2014-03-24
Genre: Language Arts & Disciplines
ISBN: 9401786445

In the past few decades the use of increasingly large text corpora has grown rapidly in language and linguistics research. This was enabled by remarkable strides in natural language processing (NLP) technology, technology that enables computers to automatically and efficiently process, annotate and analyze large amounts of spoken and written text in linguistically and/or pragmatically meaningful ways. It has become more desirable than ever before for language and linguistics researchers who use corpora in their research to gain an adequate understanding of the relevant NLP technology to take full advantage of its capabilities. This volume provides language and linguistics researchers with an accessible introduction to the state-of-the-art NLP technology that facilitates automatic annotation and analysis of large text corpora at both shallow and deep linguistic levels. The book covers a wide range of computational tools for lexical, syntactic, semantic, pragmatic and discourse analysis, together with detailed instructions on how to obtain, install and use each tool in different operating systems and platforms. The book illustrates how NLP technology has been applied in recent corpus-based language studies and suggests effective ways to better integrate such technology in future corpus linguistics research. This book provides language and linguistics researchers with a valuable reference for corpus annotation and analysis.