Linguistic Features of Genre and Method Variation in Translation: A   Computational Perspective

Ekaterina Lapshninova-Koltunski; Marcos Zampieri

arXiv:1709.04359·cs.CL·September 14, 2017

Linguistic Features of Genre and Method Variation in Translation: A Computational Perspective

Ekaterina Lapshninova-Koltunski, Marcos Zampieri

PDF

Open Access

TL;DR

This study employs text classification with linguistically motivated features to analyze genre and method variation in English-German translation, revealing key linguistic differences through a Bayesian classifier approach.

Contribution

It introduces a computational method using part-of-speech n-grams and Bayesian classification to distinguish translation genres and methods, providing detailed feature analysis.

Findings

01

Successful classification of translation genres and methods

02

Identification of key linguistic features differentiating genres

03

Insights into linguistic variation in translation

Abstract

In this paper we describe the use of text classification methods to investigate genre and method variation in an English - German translation corpus. For this purpose we use linguistically motivated features representing texts using a combination of part-of-speech tags arranged in bigrams, trigrams, and 4-grams. The classification method used in this paper is a Bayesian classifier with Laplace smoothing. We use the output of the classifiers to carry out an extensive feature analysis on the main difference between genres and methods of translation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Translation Studies and Practices