Language & NLP

Latest Machine Translation Research Papers

The newest Machine Translation papers from across the field — arXiv, NeurIPS, CVPR, Nature, and more — refreshed daily and ranked by relevance. Distill AI tracks Machine Translation so you don’t have to: get the standout work delivered to your inbox every morning, with 2-sentence summaries and the option to chat with any paper.

Get the latest Machine Translation papers in your inbox — free →

Recent papers

Human-Based Machine Translation Evaluation: A Multi-Dimensional Approach to Sentiment, Emotion, and Argumentation Preservation in Chinese-English Translation
Jingshi Zhou · Open MIND · Jan 1, 2028
The research landscape of Machine Translation Evaluation (MTE) has traditionally been dominated by automated metrics that, while computationally efficient, often fail to capture the nuanced aspects of translation quality paramount to human …
DONDO: Open w2v-BERT Speech-Recognition Base Models for African Languages
Paul Azunre · arXiv · Jul 23, 2026
We present DONDO, a family of open, permissively licensed automatic speech recognition (ASR) base models for African languages, built on the w2v-BERT 2.0 self-supervised speech encoder. DONDO comprises twenty-one monolingual models and five…
When Trivia Is Not Trivial: Everyday Knowledge Failures in Multilingual LLMs
Anna Mosolova, Djamé Seddah · arXiv · Jul 23, 2026
Quiz rooms, trivia nights, and quiz shows challenge human knowledge across a wide range of topics, from canonical facts to everyday culture. In this paper, we examine whether large language models (LLMs) can perform competitively in such se…
A Comparative Evaluation of Embeddings and LLMs in a Greek Book Publisher Setting - The CUP Dataset
Katerina Papantoniou, Panagiotis Papadakos, Theodore Patkos, Dimitris Garefalakis et al. · arXiv · Jul 23, 2026
We present CUP, a Greek book retrieval benchmark consisting of 868 catalog records and 104 expert-annotated queries with graded relevance judgments. We evaluate sparse (BM25), dense (sentence-transformers), hybrid, and LLM-assisted retrieva…
From a Word-Level Dictionary to Sentence-Level Semantics: Multilingual Grievance Labelling with Contextual Models
Lin Tian, Marian-Andrei Rizoiu · arXiv · Jul 23, 2026
Grievance is one of the warning signs analysts look for when assessing threats of violence. It is increasingly measured at scale from online text, most often with word-level lexicons like the Grievance Dictionary that score by matching weig…
LKValues: Aligning Large Language Models with Sri Lankan Societal Values
Nethmi Muthugala, Supryadi, Surangika Ranathunga, Nisansa de Silva et al. · arXiv · Jul 22, 2026
Value alignment of Large Language Models (LLMs) has been shown to be culturally biased toward Western norms. This results in the mishandling of local values in multilingual societies such as Sri Lanka that have their unique cultural dynamic…
On the Systematic Challenges of Culturally Loaded Machine Translation: Dream of the Red Chamber as the Cultural Lens
Yiming Wang, Jiayuan Di · arXiv · Jul 22, 2026
Culturally loaded translation poses unique challenges for machine translation (MT), as meanings are deeply embedded in socio-cultural contexts beyond surface linguistic forms. Although large language models (LLMs) have enabled MT systems to…
Inference-Time Steering for Cross-Lingual Factual Consistency in LLMs
Alexander Manev · arXiv · Jul 21, 2026
Although Large Language Models (LLMs) demonstrate remarkable multilingual fluency, their internal knowledge representations remain disproportionately biased toward high-resource languages. This leads to cross-lingual factual inconsistency, …
Translation as Augmentation: Effect of Translated Data on Assessment of Difficulty
Yiheng Wu, Jue Hou, Roman Yangarber · arXiv · Jul 21, 2026
Reliable Text Difficulty Assessment is a prerequisite for valid text simplification workflows and personalized learning applications. However, the development of robust assessment models is severely hindered by a critical bottleneck: the sc…
From a Multilingual Streaming ASR Backbone to Kenyan-Language Systems: Data-Centric Adaptation of Nemotron 3.5 for Kikuyu, Dholuo, and Kalenjin
Mark Gatere · arXiv · Jul 21, 2026
Automatic speech recognition (ASR) for African languages is constrained by orthographic inconsistency, annotation artifacts, missing audio, speaker and domain imbalance, and evaluation procedures that differ from deployment. We present an e…
AgentDebugX: An Open-Source Toolkit for Failure Observability, Attribution, and Recovery in LLM Agents
Kunlun Zhu, Xuyan Ye, Zhiguang Han, Yuchen Zhao et al. · arXiv · Jul 21, 2026
LLM agent failures are difficult to debug because the step where an error surfaces is often not the one that caused it. Existing observability tools replay execution traces but provide little support for identifying the root cause or transl…
Is EEG-to-Text Feasible in Real-World Scenarios? An In-Depth Analysis Using a Neuropsychology-Inspired Benchmark
Zihan Zhang, Yu Bao, Xiao Ding, Tianyi Jiang et al. · arXiv · Jul 21, 2026
Translating brain signals into text could restore communication for people with severe paralysis, yet practically usable systems to date rely on invasive electrocorticography (ECoG). Electroencephalography (EEG) offers a non-invasive altern…
LatentMT: Machine Translation with Latent Reasoning
Wei-Rui Chen, Samar M. Magdy, Chiyu Zhang, Wenhui Zhu et al. · arXiv · Jul 21, 2026
Latent-reasoning looped language models (LoopLMs) offer a different scaling path for machine translation (MT): instead of increasing parameter count or emitting explicit chain-of-thought tokens, they spend additional recurrent computation i…
Building a European Multilingual Evaluation Dataset: The MMLU Localisation Project within the EMT Network
Pilar Sánchez-Gijón, Susana Valdez, Sofía Calvo Del Barrio, Florence Bellemont et al. · arXiv · Jul 20, 2026
This paper reports on a collaboration between the Directorate-General for Translation (DGT) and the European Master's in Translation (EMT) to localise the MMLU dataset into 11 European languages. Beyond creating a more inclusive benchmark f…
Enabling Multilingual Privacy Policy Audits: Large-Scale Analysis of Spanish Mobile Apps
Marcos Moran, David Rodriguez, Luka Nenadic, Norman Sadeh et al. · arXiv · Jul 20, 2026
Automated analyses of privacy policies enable large-scale assessments of transparency in digital ecosystems, yet existing auditing pipelines remain predominantly English-centric. This limits their ability to systematically evaluate multilin…
Expanding the Lexicon of Ge'ez Based African Languages: A Comparative Study of Amharic and Tigrinya
Hailay Kidu Teklehaymanot, Debela Desalegn Yadeta, Wolfgang Nejdl · arXiv · Jul 16, 2026
Multilingual pre-trained language models (PLMs) exhibit degraded performance on low-resource, non-Latin-script languages, driven by high out-of-vocabulary (OOV) rates and excessive subword fragmentation that result from Latin-script-centric…
LLM Evaluators are Biased across Languages
Ej Zhou, Lucas Resck, Zheng Hui, Anna Korhonen · arXiv · Jul 16, 2026
LLM evaluators (trained reward models and prompted LLM-as-a-Judge) are routinely validated via pairwise accuracy. In a multilingual setting, this operates under the premise that high pairwise accuracy implies reliable, language-neutral scor…
Can an Old Dog Be Taught New Tricks? Taking LLMs Beyond Sentence Level Translation
Alaina Brandt · arXiv · Jul 15, 2026
Automatic translation systems, from CAT tools to MT, overwhelmingly treat translation as a sentence-by-sentence act. This paper asks whether LLMs can be moved beyond that paradigm through whole-document, corpus-informed translation. We pres…
DeltaMerge-LowRes: Composing Language and Task Deltas for Low-Resource Adaptation
Son Ha Xuan, Xuan-Bach Le, Phat T. Tran-Truong · arXiv · Jul 15, 2026
Adapting a multilingual encoder to a new language \emph{and} a new task with only a few hundred gold examples is a common low-resource NLP setting, yet the two axes are usually fused via an expensive language--task fine-tuning run. We ask w…
High-Order Question Generation in a Multilingual Educational Context
Suna-Şeyma Uçar, Itziar Aldabe, Nora Aranberri, Orphée De Clercq · arXiv · Jul 15, 2026
Critical thinking is a fundamental skill that helps learners move beyond simple memorization. One way to develop this skill is through high-order questioning. However, crafting such questions remains a challenge for educators, and classroom…
The Test Oracle Problem in Synthetic LLM-as-Judge Corpora: Disappearance, Distortion and a Validation Protocol
Serkan Ballı · arXiv · Jul 15, 2026
Studies of bias in LLM-as-judge systems typically build synthetic corpora by prompting an LLM to generate a hallucinated answer to pair with a factual one, then presenting both to a judge. We report a case in which this generation step sile…
MET: Theory-Grounded and Culture-Aware Multilingual Moral Reasoning
Ayoung Lee, Ryan Kwon, Yunxiang Zhang, Yuxuan Liu et al. · arXiv · Jul 13, 2026
Language models are increasingly used for moral decision-making across diverse linguistic and cultural contexts, yet existing work overlooks multilinguality on three aspects: 1) multilingual evaluation benchmarks use direct translation, fai…
STEP: Career-Path Recommendation via Temporal and Educational Trajectory Modeling
Iman Johary, Guillaume Bied, Alexandru C. Mara, Tijl De Bie · arXiv · Jul 13, 2026
Career paths encode decades of skill acquisition, role transitions, and educational investment, and understanding them at scale underpins workforce planning, labor market policy, and job recommendation. Resumes are a rich source of informat…
Direct Image-to-Modern Vietnamese Translation of Han-Nom Manuscripts via Multimodal RLHF Preference Alignment
Thi Kim Trang Vo, Nghia Hieu Nguyen, Ha Minh Tan · arXiv · Jul 13, 2026
Translating Han-Nom manuscripts into modern Vietnamese is challenging because historical pages are often degraded, the script contains rare logographic characters, and parallel supervision is limited. We propose a multimodal RLHF preference…
Q-BridgeNet: A Quantization Network for Cross-Lingual Sign Language Translation
Liqian Feng, Lintao Wang, Xiaochen Liu, Anusha Withana et al. · arXiv · Jul 13, 2026
Most sign language translation (SLT) methods focus on isolated native sign-spoken pairs (e.g., American Sign Language - English). Extending language-specific SLT models to multilingual translation would improve accessibility by enabling com…
Unified Gradient Projection: Language-Balanced Continual Learning for Multilingual Low-Resource ASR
Ziang Ren, Guodong Lin, Yuchen Ai, Kaize Tan et al. · arXiv · Jul 13, 2026
Large-scale pretrained ASR models such as Whisper exhibit strong multilingual capabilities. However, fine-tuning on low-resource languages often causes catastrophic forgetting. Although continual learning mitigates this issue, existing meth…
The Nuts and Bolts of Natural Language to SQL Translation: A Systematic Analysis of Model Pipeline Optimisation Approaches and their Interactions
Filip Klubicka, Vasudevan Nedumpozhimana, Sneha Rautmare, Bora Caglayan et al. · arXiv · Jul 12, 2026
In the age of large language models, Natural Language to SQL (NL2SQL) translation remains an open problem with many useful applications. We explore interactions between several NL2SQL pipeline extensions to inspire development of more light…
Toward Real-Time Sentence-Level Sign Language Translation
Thanh-Hoang Nguyen Doan · arXiv · Jul 10, 2026
Most sign language understanding systems operate at the level of isolated signs, limiting their usefulness in natural communication. We study sentence-level sign language translation (SLT) with the primary goal of real-time deployment rathe…
Test-Time Scaling for Small VLMs on Multilingual Visual MCQ
Spiros Baxevanakis, Peng-Jian Yang · arXiv · Jul 10, 2026
Test-time scaling (TTS) reliably improves reasoning in large language models, but whether it transfers to small open vision-language models remains unclear. We examine this on EXAMS-V, a multilingual visual multiple-choice benchmark, compar…
VTaMo: Video-Text Alignment Model for Sign Language Translation
Junyi Hu, Zhewen He, Haomian Huang, Aoxiang Yang et al. · arXiv · Jul 10, 2026
Sign language translation (SLT) converts continuous sign videos into spoken language text. Gloss-free approaches leverage pre-trained visual encoders and language models but rely on implicit cross-modal alignment from translation supervisio…

Track Machine Translation on Distill AI — start free →

Latest Machine Translation Research Papers

Recent papers

Related topics