Text Analytics for Corpus Linguistics and Digital Humanities
Author: Gerold Schneider
Publisher: Bloomsbury Publishing
Total Pages: 164
Release: 2024-05-02
Genre: Computers
ISBN: 9781350370845

Do you want to gain a deeper understanding of how big tech analyses and exploits our text data, or investigate how political parties differ by analysing textual styles, associations and trends in documents? Or create a map of a text collection and write a simple QA system yourself? This book explores how to apply state-of-the-art text analytics methods to detect and visualise phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports.

Applying Language Technology in Humanities Research
Author: Barbara McGillivray,Gábor Mihály Tóth
Publisher: Springer Nature
Total Pages: 133
Release: 2020-07-13
Genre: Language Arts & Disciplines
ISBN: 9783030464936

This book presents established and state-of-the-art methods in Language Technology (including text mining, corpus linguistics, computational linguistics, and natural language processing), and demonstrates how they can be applied by humanities scholars working with textual data. The landscape of humanities research has recently changed thanks to the proliferation of big data and large textual collections such as Google Books, Early English Books Online, and Project Gutenberg. These resources have yet to be fully explored by new generations of scholars, and the authors argue that Language Technology has a key role to play in the exploration of large-scale textual data. The authors use a series of illustrative examples from various humanistic disciplines (mainly but not exclusively from History, Classics, and Literary Studies) to demonstrate basic and more complex use-case scenarios. This book will be useful to graduate students and researchers in humanistic disciplines working with textual data, including History, Modern Languages, Literary studies, Classics, and Linguistics. This is also a very useful book for anyone teaching or learning Digital Humanities and interested in the basic concepts from computational linguistics, corpus linguistics, and natural language processing.

Python Programming for Linguistics and Digital Humanities
Author: Martin Weisser
Publisher: John Wiley & Sons
Total Pages: 295
Release: 2024-01-31
Genre: Computers
ISBN: 9781119907947

Learn how to use Python for linguistics and digital humanities research, perfect for students working with Python for the first time. Python programming is no longer only for computer science students; it is now an essential skill in linguistics, the digital humanities (DH), and social science programs that involve text analytics. Python Programming for Linguistics and Digital Humanities provides a comprehensive introduction to this widely used programming language, offering guidance on using Python to perform various processing and analysis techniques on text. Assuming no prior knowledge of programming, this student-friendly guide covers essential topics and concepts such as installing Python, using the command line, working with strings, writing modular code, designing a simple graphical user interface (GUI), annotating language data in XML and TEI, creating basic visualizations, and more. This invaluable text explains the basic tools students will need to perform their own research projects and tackle various data analysis problems. Throughout the book, hands-on exercises provide students with the opportunity to apply concepts to particular questions or projects in processing textual data and solving language-related issues. Each chapter concludes with a detailed discussion of the code applied, possible alternatives, and potential pitfalls or error messages.

- Teaches students how to use Python to tackle the types of problems they will encounter in linguistics and the digital humanities
- Features numerous practical examples of language analysis, gradually moving from simple concepts and programs to more complex projects
- Describes how to build a variety of data visualizations, such as frequency plots and word clouds
- Focuses on the text processing applications of Python, including creating word and frequency lists, recognizing linguistic patterns, and processing words for morphological analysis
- Includes access to a companion website with all Python programs produced in the chapter exercises and additional Python programming resources

Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields is a must-have resource for students pursuing text-based research in the humanities, the social sciences, and all subfields of linguistics, particularly computational linguistics and corpus linguistics.

Corpus Linguistics and Translation Tools for Digital Humanities
Author: Stefania M. Maci,Michele Sala
Publisher: Bloomsbury Publishing
Total Pages: 248
Release: 2022-07-14
Genre: Language Arts & Disciplines
ISBN: 9781350275232

Presenting the digital humanities as both a domain of practice and as a set of methodological approaches to be applied to corpus linguistics and translation, chapters in this volume provide a novel and original framework to triangulate research for pursuing both scientific and educational goals within the digital humanities. They also highlight more broadly the importance of data triangulation in corpus linguistics and translation studies. Putting forward practical applications for digging into data, this book is a detailed examination of how to integrate quantitative and qualitative approaches through case studies, sample analysis and practical examples.

Text Analytics
Author: Domenica Fioredistella Iezzi,Damon Mayaffre,Michelangelo Misuraca
Publisher: Springer Nature
Total Pages: 298
Release: 2020-11-24
Genre: Social Science
ISBN: 9783030526801

Focusing on methodologies, applications and challenges of textual data analysis and related fields, this book gathers selected and peer-reviewed contributions presented at the 14th International Conference on Statistical Analysis of Textual Data (JADT 2018), held in Rome, Italy, on June 12-15, 2018. Statistical analysis of textual data is a multidisciplinary field of research that has been mainly fostered by statistics, linguistics, mathematics and computer science. The respective sections of the book focus on techniques, methods and models for text analytics, dictionaries and specific languages, multilingual text analysis, and the applications of text analytics. The interdisciplinary contributions cover topics including text mining, text analytics, network text analysis, information extraction, sentiment analysis, web mining, social media analysis, corpus and quantitative linguistics, statistical and computational methods, and textual data in sociology, psychology, politics, law and marketing.

Data Analytics in Digital Humanities
Author: Shalin Hai-Jew
Publisher: Springer
Total Pages: 295
Release: 2017-05-03
Genre: Computers
ISBN: 9783319544991

This book covers computationally innovative methods and technologies including data collection and elicitation, data processing, data analysis, data visualizations, and data presentation. It explores how digital humanists have harnessed hypersociality and social technologies, benefited from the open-source sharing not only of data but of code, and made technological capabilities a critical part of humanities work. Chapters are written by researchers from around the world, bringing perspectives from diverse fields and subject areas. The respective authors describe their work, their research, and their learning. Topics include semantic web for cultural heritage valorization, machine learning for parody detection by classification, psychological text analysis, crowdsourcing imagery coding in natural disasters, and creating inheritable digital codebooks. Designed for researchers and academics, this book is suitable for those interested in methodologies and analytics that can be applied in literature, history, philosophy, linguistics, and related disciplines. Professionals such as librarians, archivists, and historians will also find the content informative and instructive.

Corpus Linguistics and Statistics with R
Author: Guillaume Desagulier
Publisher: Springer
Total Pages: 353
Release: 2017-11-17
Genre: Computers
ISBN: 9783319645728

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.

Corpus Linguistics and Translation Tools for Digital Humanities
Author: Stefania M. Maci,Michele Sala
Publisher: Bloomsbury Publishing
Total Pages: 249
Release: 2022-07-14
Genre: Language Arts & Disciplines
ISBN: 9781350275249

Presenting the digital humanities as both a domain of practice and as a set of methodological approaches to be applied to corpus linguistics and translation, chapters in this volume provide a novel and original framework to triangulate research for pursuing both scientific and educational goals within the digital humanities. They also highlight more broadly the importance of data triangulation in corpus linguistics and translation studies. Putting forward practical applications for digging into data, this book is a detailed examination of how to integrate quantitative and qualitative approaches through case studies, sample analysis and practical examples.