The Open Handbook of Linguistic Data Management

The Open Handbook of Linguistic Data Management
Author: Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller, Lauren B. Collister
Publisher: MIT Press
Total Pages: 687
Release: 2022-01-18
Genre: Language Arts & Disciplines
ISBN: 9780262362177

A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data. "Doing language science" depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current trends toward the transformation of linguistics into a more data-driven and reproducible scientific endeavor. It offers both principles and methods, presenting the conceptual foundations of linguistic data management and a series of case studies, each of which demonstrates a concrete application of abstract principles in a current practice. In part 1, contributors bring together knowledge from information science, archiving, and data stewardship relevant to linguistic data management. Topics covered include implementation principles, archiving data, finding and using datasets, and the valuation of time and effort involved in data management. Part 2 presents snapshots of practices across various subfields, with each chapter presenting a unique data management project with generalizable guidance for researchers. The Open Handbook of Linguistic Data Management is an essential addition to the toolkit of every linguist, guiding researchers toward making their data FAIR: Findable, Accessible, Interoperable, and Reusable.

The Open Handbook of Linguistic Data Management

The Open Handbook of Linguistic Data Management
Author: Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller, Lauren B. Collister
Publisher: MIT Press
Total Pages: 687
Release: 2022-01-18
Genre: Language Arts & Disciplines
ISBN: 9780262045261

Linked Data in Linguistics

Linked Data in Linguistics
Author: Christian Chiarcos, Sebastian Nordhoff, Sebastian Hellmann
Publisher: Springer Science & Business Media
Total Pages: 218
Release: 2012-02-21
Genre: Computers
ISBN: 9783642282492

The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects. The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., DBpedia, a machine-readable edition of Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows information from such a diverse set of resources to be integrated. The contributions assembled in this volume illustrate the bandwidth of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) through applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology). This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).

Linked Data in Linguistics

Linked Data in Linguistics
Author: Anonymous
Publisher: Springer
Total Pages: 232
Release: 2012-02-22
Genre: Electronic Book
ISBN: 3642282504

Development of Linguistic Linked Open Data Resources for Collaborative Data Intensive Research in the Language Sciences

Development of Linguistic Linked Open Data Resources for Collaborative Data Intensive Research in the Language Sciences
Author: Antonio Pareja-Lora, Maria Blume, Barbara C. Lust, Christian Chiarcos
Publisher: MIT Press
Total Pages: 273
Release: 2020-01-07
Genre: Language Arts & Disciplines
ISBN: 9780262536257

Making diverse data in linguistics and the language sciences open, distributed, and accessible: perspectives from language/language acquisition researchers and technical LOD (linked open data) researchers. This volume examines the challenges inherent in making diverse data in linguistics and the language sciences open, distributed, integrated, and accessible, thus fostering wide data sharing and collaboration. It is unique in integrating the perspectives of language researchers and technical LOD (linked open data) researchers. Reporting on both active research needs in the field of language acquisition and technical advances in the development of data interoperability, the book demonstrates the advantages of an international infrastructure for scholarship in the field of language sciences. With contributions by researchers who produce complex data content and scholars involved in both the technology and the conceptual foundations of LLOD (linguistics linked open data), the book focuses on the area of language acquisition because it involves complex and diverse data sets, cross-linguistic analyses, and urgent collaborative research. The contributors discuss a variety of research methods, resources, and infrastructures. Contributors: Isabelle Barrière, Nan Bernstein Ratner, Steven Bird, Maria Blume, Ted Caldwell, Christian Chiarcos, Cristina Dye, Suzanne Flynn, Claire Foley, Nancy Ide, Carissa Kang, D. Terence Langendoen, Barbara Lust, Brian MacWhinney, Jonathan Masci, Steven Moran, Antonio Pareja-Lora, Jim Reidy, Oya Y. Rieger, Gary F. Simons, Thorsten Trippel, Kara Warburton, Sue Ellen Wright, Claus Zinn

Linguistic Linked Data

Linguistic Linked Data
Author: Philipp Cimiano, Christian Chiarcos, John P. McCrae, Jorge Gracia
Publisher: Springer Nature
Total Pages: 286
Release: 2020-01-13
Genre: Computers
ISBN: 9783030302252

This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, thereby making more of each single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources.
Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.

Natural Language Data Management and Interfaces

Natural Language Data Management and Interfaces
Author: Yunyao Li, Davood Rafiei
Publisher: Morgan & Claypool Publishers
Total Pages: 158
Release: 2018-08-13
Genre: Computers
ISBN: 9781681734095

The volume of natural language text data has been rapidly increasing over the past two decades, due to factors such as the growth of the Web, the low cost associated with publishing, and the progress in the digitization of printed texts. This growth, combined with the proliferation of natural language systems for searching and retrieving information, provides tremendous opportunities for studying some of the areas where database systems and natural language processing systems overlap. This book explores two interrelated and important areas of overlap: (1) managing natural language data and (2) developing natural language interfaces to databases. It presents relevant concepts and research questions, state-of-the-art methods, related systems, and research opportunities and challenges covering both areas. Relevant topics discussed on natural language data management include data models, data sources, queries, storage and indexing, and transforming natural language text. Under natural language interfaces, it presents the anatomy of these interfaces to databases, the challenges related to query understanding and query translation, and relevant aspects of user interactions. Each of the challenges is covered in a systematic way: first starting with a quick overview of the topics, followed by a comprehensive view of recent techniques that have been proposed to address the challenge along with illustrative examples. It also reviews some notable systems in detail in terms of how they address different challenges and their contributions. Finally, it discusses open challenges and opportunities for natural language management and interfaces. The goal of this book is to provide an introduction to the methods, problems, and solutions that are used in managing natural language data and building natural language interfaces to databases.
It serves as a starting point for readers who are interested in pursuing additional work on these exciting topics in both academic and industrial environments.

Linguistic Databases

Linguistic Databases
Author: John A. Nerbonne
Publisher: Center for the Study of Language and Information Publications
Total Pages: 255
Release: 1998-01-28
Genre: Language Arts & Disciplines
ISBN: 1575860937

Linguistic Databases explores the increasing use of databases in linguistics. The enormous potential in linguistic data - billions of utterances and messages daily - has been difficult to exploit. Many linguists have had to concentrate on introspective data with its inevitable blinders toward frequency, variation, and naturalness. Applications of linguistics have been handicapped. This volume explores the potential advantages of database applications to linguistics. Included in this volume are reports on database activities in phonetics, phonology, lexicography and syntax, comparative grammar, second-language acquisition, linguistic fieldwork, and language pathology. The book presents the specialized problems of multi-media (especially audio) and multi-lingual texts, including those in exotic writing systems. Implemented solutions are also discussed. The opportunities to use existing, minimally structured text repositories are presented.