Computational Methods for Corpus Annotation and Analysis

Computational Methods for Corpus Annotation and Analysis
Author: Xiaofei Lu
Publsiher: Springer
Total Pages: 192
Release: 2014-07-08
Genre: Language Arts & Disciplines
ISBN: 9789401786454

Download Computational Methods for Corpus Annotation and Analysis Book in PDF, Epub and Kindle

In the past few decades the use of increasingly large text corpora has grown rapidly in language and linguistics research. This was enabled by remarkable strides in natural language processing (NLP) technology, technology that enables computers to automatically and efficiently process, annotate and analyze large amounts of spoken and written text in linguistically and/or pragmatically meaningful ways. It has become more desirable than ever before for language and linguistics researchers who use corpora in their research to gain an adequate understanding of the relevant NLP technology to take full advantage of its capabilities. This volume provides language and linguistics researchers with an accessible introduction to the state-of-the-art NLP technology that facilitates automatic annotation and analysis of large text corpora at both shallow and deep linguistic levels. The book covers a wide range of computational tools for lexical, syntactic, semantic, pragmatic and discourse analysis, together with detailed instructions on how to obtain, install and use each tool in different operating systems and platforms. The book illustrates how NLP technology has been applied in recent corpus-based language studies and suggests effective ways to better integrate such technology in future corpus linguistics research. This book provides language and linguistics researchers with a valuable reference for corpus annotation and analysis.

Corpus Annotation

Corpus Annotation
Author: R. G. Garside,Emeritus Professor of English Linguistics Geoffrey Leech,Geoffrey Leech,Anthony Mark Mcenery
Publsiher: Routledge
Total Pages: 0
Release: 2016-07-10
Genre: Computational linguistics
ISBN: 113814858X

Download Corpus Annotation Book in PDF, Epub and Kindle

Corpus Annotation gives an up-to-date picture of this fascinating new area of research, and will provide essential reading for newcomers to the field as well as those already involved in corpus annotation. Early chapters introduce the different levels and techniques of corpus annotation. Later chapters deal with software developments, applications, and the development of standards for the evaluation of corpus annotation. While the book takes detailed account of research world-wide, its focus is particularly on the work of the UCREL (University Centre for Computer Corpus Research on Language) team at Lancaster University, which has been at the forefront of developments in the field of corpus annotation since its beginnings in the 1970s.

Corpus based Language Studies

Corpus based Language Studies
Author: Tony McEnery,Richard Xiao,Yukio Tono
Publsiher: Taylor & Francis
Total Pages: 412
Release: 2006
Genre: Foreign Language Study
ISBN: 0415286220

Download Corpus based Language Studies Book in PDF, Epub and Kindle

Covering the major approaches to the use of corpus data, this work gathers together influential readings from leading names in the discipline, including Biber, Widdowson, Sinclair, Carter and McCarthy.

Natural Language Annotation for Machine Learning

Natural Language Annotation for Machine Learning
Author: James Pustejovsky,Amber Stubbs
Publsiher: "O'Reilly Media, Inc."
Total Pages: 344
Release: 2013
Genre: Computers
ISBN: 9781449306663

Download Natural Language Annotation for Machine Learning Book in PDF, Epub and Kindle

Includes bibliographical references (p. 305-315) and index.

Language Corpora Annotation and Processing

Language Corpora Annotation and Processing
Author: Niladri Sekhar Dash
Publsiher: Springer Nature
Total Pages: 135
Release: 2021
Genre: Computational linguistics
ISBN: 9789811629600

Download Language Corpora Annotation and Processing Book in PDF, Epub and Kindle

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.

Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora
Author: Sandra Kuebler,Heike Zinsmeister
Publsiher: Bloomsbury Publishing
Total Pages: 321
Release: 2014-12-18
Genre: Language Arts & Disciplines
ISBN: 9781441119919

Download Corpus Linguistics and Linguistically Annotated Corpora Book in PDF, Epub and Kindle

Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

News Discourse and Digital Currents

News Discourse and Digital Currents
Author: Antonio Fruttaldo
Publsiher: Cambridge Scholars Publishing
Total Pages: 250
Release: 2017-05-11
Genre: Language Arts & Disciplines
ISBN: 9781443893404

Download News Discourse and Digital Currents Book in PDF, Epub and Kindle

In recent years, journalistic practices have undergone a radical change due to the increasing pressure of new digital media on the professional practice. The ever-growing development of new technologies and the ceaseless fluctuation of social practices have challenged some of the traditional genres found in these professional contexts. On the basis of these premises, this book investigates a particular genre found in the context of TV newscasts. The genre under investigation is that of news tickers (or crawlers), that is, the graphic elements that scroll at the bottom of the screen during newscasts. The book introduces readers to this under-researched genre through a year-long collection of the news tickers displayed on BBC World News. Thanks to a corpus-based genre analysis, the generic status of news tickers is better defined by highlighting the presence of given strategies of marketization. Additionally, this volume investigates if news tickers can be seen as a mixed (sub-)genre that interdiscursively combines traditional linguistic elements of headlines and lead paragraphs to achieve, from a (Critical) Genre Analysis point of view, a specific private intention in the context of the BBC.

Corpus Annotation

Corpus Annotation
Author: Roger Garside,Geoffrey N. Leech
Publsiher: Routledge
Total Pages: 304
Release: 1997
Genre: Computers
ISBN: UOM:39015040361480

Download Corpus Annotation Book in PDF, Epub and Kindle

This is a text which surveys the growing field of research known as corpus annotation - an electronic collection of texts. Corpus annotation is a central resource in linguisticsi̧nformation technology and the processing of human language. The book seeks to show the nature of language and the most effective means of analysing it. A bibliography lists relevant e-mail addresses and Web sites.