SAIMSARA Journal

Machine Generated Science • ISSN 3054-3991

Limitations of Medical Machine Translation: Scoping Review with ☸️SAIMSARA.

Editorial note
• Last update: 2026-03-29 21:55:05
What is this paper about
Medical machine translation may look fluent, but this review shows where it still breaks: semantic precision, cultural nuance, audience adaptation, and high-stakes clinical reliability. The full read is worth it because it separates the real strengths of MT in constrained tasks from the specific failure modes that still make expert human oversight essential.

DOI: 10.62487/saimsarae1a843d5

Abstract: The aim of this review is to synthesize the documented limitations of medical machine translation across various architectures, including LLMs and NMT systems, focusing on terminological accuracy, contextual interpretation, and audience adaptation. The review utilises 15 references. This evidence map suggests that the dominant limitation of medical machine translation is not simple lexical inaccuracy alone, but a broader failure to preserve semantic precision, contextual meaning, and audience-appropriate expression in high-stakes settings. Even when machine performance appears strong on conventional metrics, important residual weaknesses remain, as illustrated by a TER of 0.99 in a classical medical translation task and lower syntactic complexity in LLM output than in human translation (CP/T 1.16 vs 1.30). Across the mapped topics, recurrent signals pointed to terminology errors, polysemy-related mistranslation, discourse-level simplification, and loss of cultural nuance, with especially important implications for sensitive communication such as mental healthcare and for low-resource or unevenly supported language pairs. Taken together, the literature supports using medical MT as an assistive tool that still requires expert human oversight rather than as a stand-alone substitute for clinical or specialist translation judgment. Future research should focus on clinically sensitive evaluation frameworks, register-adaptive multilingual systems, and stronger real-world validation in complex medical communication tasks.

Keywords: Medical machine translation; Neural machine translation; Large language models; Medical terminology accuracy; Contextual ambiguity; Syntactic simplification; Translation quality metrics; Post-editing; Domain-specific translation; Cultural connotations

Review Stats

Get access to the full paper

temporary link or permanent access.

The remaining part of the paper opens after purchase or unlock.

Reference Index (15)