Multiword Units in Machine Translation and Translation Technology

Multiword Units in Machine Translation and Translation Technology
Author: Ruslan Mitkov,Johanna Monti,Gloria Corpas Pastor,Violeta Seretan
Publsiher: John Benjamins Publishing Company
Total Pages: 259
Release: 2018-07-15
Genre: Language Arts & Disciplines
ISBN: 9789027264206

Download Multiword Units in Machine Translation and Translation Technology Book in PDF, Epub and Kindle

The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French, Romanian, German, Dutch and Croatian, among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.

Machine Translation and the Lexicon

Machine Translation and the Lexicon
Author: Petra Steffens
Publsiher: Springer Science & Business Media
Total Pages: 268
Release: 1995-03-06
Genre: Computers
ISBN: 3540590404

Download Machine Translation and the Lexicon Book in PDF, Epub and Kindle

This volume constitutes the proceedings of the Third International Workshop of the European Association for Machine Translation, held in Heidelberg, Germany in April 1993. The EAMT Workshops traditionally aim at bringing together researchers, developers, users, and others interested in the field of machine or computer-assisted translation research, development and use. The volume presents thoroughly revised versions of the 15 best workshop contributions together with an introductory survey by the volume editor. The presentations are centered primarily on questions of acquiring, sharing, and managing lexical data, but also address aspects of lexical description.

Computer Aided Translation Technology

Computer Aided Translation Technology
Author: Lynne Bowker
Publsiher: University of Ottawa Press
Total Pages: 202
Release: 2002-04-17
Genre: Language Arts & Disciplines
ISBN: 9780776615677

Download Computer Aided Translation Technology Book in PDF, Epub and Kindle

Lynne Bowker introduces the world of technology to the world of translation in this unique book, the first of its kind. Bowker reveals the role of technology in translation and how to use this ever developing tool.

Multiword Expressions Acquisition

Multiword Expressions Acquisition
Author: Carlos Ramisch
Publsiher: Springer
Total Pages: 230
Release: 2014-09-24
Genre: Computers
ISBN: 9783319092072

Download Multiword Expressions Acquisition Book in PDF, Epub and Kindle

​This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications. The second part introduces a new generic framework for automatic acquisition of multiword expressions from texts. Furthermore, it describes the accompanying free software tool, the mwetoolkit, which comes in handy when looking for expressions in texts (regardless of the language). Evaluation is greatly emphasized, underlining the fact that results depend on parameters like corpus size, language, MWE type, etc. The last part contains solid experimental results and evaluates the mwetoolkit, demonstrating its usefulness for computer-assisted lexicography and machine translation. This is the first book to cover the whole pipeline of multiword expression acquisition in a single volume. It is addresses the needs of students and researchers in computational and theoretical linguistics, cognitive sciences, artificial intelligence and computer science. Its good balance between computational and linguistic views make it the perfect starting point for anyone interested in multiword expressions, language and text processing in general.

Computational Phraseology

Computational Phraseology
Author: Gloria Corpas Pastor,Jean-Pierre Colson
Publsiher: John Benjamins Publishing Company
Total Pages: 341
Release: 2020-05-15
Genre: Language Arts & Disciplines
ISBN: 9789027261397

Download Computational Phraseology Book in PDF, Epub and Kindle

Whether you wish to deliver on a promise, take a walk down memory lane or even on the wild side, phraseological units (also often referred to as phrasemes or multiword expressions) are present in most communicative situations and in all world’s languages. Phraseology, the study of phraseological units, has therefore become a rare unifying theme across linguistic theories. In recent years, an increasing number of studies have been concerned with the computational treatment of multiword expressions: these pertain among others to their automatic identification, extraction or translation, and to the role they play in various Natural Language Processing applications. Computational Phraseology is a comparatively new field where better understanding and more advances are urgently needed. This book aims to address this pressing need, by bringing together contributions focusing on different perspectives of this promising interdisciplinary field.

Translation Brains and the Computer

Translation  Brains and the Computer
Author: Bernard Scott
Publsiher: Springer
Total Pages: 241
Release: 2018-06-06
Genre: Computers
ISBN: 9783319766294

Download Translation Brains and the Computer Book in PDF, Epub and Kindle

This book is about machine translation (MT) and the classic problems associated with this language technology. It examines the causes of these problems and, for linguistic, rule-based systems, attributes the cause to language’s ambiguity and complexity and their interplay in logic-driven processes. For non-linguistic, data-driven systems, the book attributes translation shortcomings to the very lack of linguistics. It then proposes a demonstrable way to relieve these drawbacks in the shape of a working translation model (Logos Model) that has taken its inspiration from key assumptions about psycholinguistic and neurolinguistic function. The book suggests that this brain-based mechanism is effective precisely because it bridges both linguistically driven and data-driven methodologies. It shows how simulation of this cerebral mechanism has freed this one MT model from the all-important, classic problem of complexity when coping with the ambiguities of language. Logos Model accomplishes this by a data-driven process that does not sacrifice linguistic knowledge, but that, like the brain, integrates linguistics within a data-driven process. As a consequence, the book suggests that the brain-like mechanism embedded in this model has the potential to contribute to further advances in machine translation in all its technological instantiations.

Lexical Collocation Analysis

Lexical Collocation Analysis
Author: Pascual Cantos-Gómez,Moisés Almela-Sánchez
Publsiher: Springer
Total Pages: 140
Release: 2018-08-21
Genre: Social Science
ISBN: 9783319925820

Download Lexical Collocation Analysis Book in PDF, Epub and Kindle

This book re-examines the notion of word associations, more precisely collocations. It attempts to come to a potentially more generally applicable definition of collocation and how to best extract, identify and measure collocations. The book highlights the role played by (i) automatic linguistic annotation (part-of-speech tagging, syntactic parsing, etc.), (ii) using semantic criteria to facilitate the identification of collocations, (iii) multi-word structured, instead of the widespread assumption of bipartite collocational structures, for capturing the intricacies of the phenomenon of syntagmatic attraction, (iv) considering collocation and valency as near neighbours in the lexis-grammar continuum and (v) the mathematical properties of statistical association measures in the automatic extraction of collocations from corpora. This book is an ideal guide to the use of statistics in collocation analysis and lexicography, as well as a practical text to the development of skills in the application of computational lexicography. Lexical Collocation Analysis: Advances and Applications begins with a proposal for integrating both collocational and valency phenomena within the overarching theoretical framework of construction grammar. Next the book makes the case for integrating advances in syntactic parsing and in collocational analysis. Chapter 3 offers an innovative look at complementing corpus data and dictionaries in the identification of specific types of collocations consisting of restricted predicate-argument combinations. This strategy complements corpus collocational data with network analysis techniques applied to dictionary entries. Chapter 4 explains the potential of collocational graphs and networks both as a visualization tool and as an analytical technique. Chapter 5 introduces MERGE (Multi-word Expressions from the Recursive Grouping of Elements), a data-driven approach to the identification and extraction of multi-word expressions from corpora. Finally the book concludes with an analysis and evaluation of factors influencing the performance of collocation extraction methods in parsed corpora.

Computational and Corpus Based Phraseology

Computational and Corpus Based Phraseology
Author: Ruslan Mitkov
Publsiher: Springer
Total Pages: 463
Release: 2017-11-03
Genre: Computers
ISBN: 9783319698052

Download Computational and Corpus Based Phraseology Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the International Conference on Computational and Corpus-Based Phraseology, Europhras 2017, held in London, UK, in November 2017. The 31 full papers presented were carefully reviewed and selected from numerous submissions and are organized into the following thematic sessions: Phraseology in translation and contrastive studies, Lexicography and terminography, Exploitation of corpora in phraseological studies, Development of corpora for phraseological studies, Phraseology and language learning, Cognitive and cultural aspects of phraseology, Theoretical and descriptive approaches to phraseology, and Computational approaches to phraseology. The chapter 'Frequency Consolidation Among Word N-Grams' is available open access under a CC BY 4.0 license.