基于BERTScore语义匹配度的大语言模型中医药典籍英译效能研究

    Performance Assessment of Large Language Model-Generated English Translations of Traditional Chinese Medicine Classics Based on BERTScore Semantic Matching

    • 摘要: 中国古代医学典籍传承两千年整体医学范式,为全球慢病治理与个性化医疗提供借鉴,其哲学意蕴和医学价值不言而喻。文章从《黄帝内经·素问》《银海精微》《本草纲目》《伤寒论》《金匮要略》五部中医药典籍中选取100条例句为测试语料,选用DeepSeek、豆包、通义千问、文心一言、Kimi、腾讯混元六款国产大语言模型,基于BERTScore语义匹配值对其翻译性能进行实证研究。定量分析结果显示:文心一言ERNIE4.5领跑榜单,与本测试中评分末位的通义千问Qwen3差异达到统计学显著水平,其余模型之间差异均不显著,可见国产大语言模型生成的中医药典籍英译文本整体达到可理解性阈值,满足最低译后编辑起点要求。基于定量分析数据筛选最高和最低得分例句进行定性研究,可明确大语言模型翻译在中医术语识解、时空概念映射、文本语境呼应及文化隐喻传递方面的短板及原因。鉴于此,提出搭建术语知识图谱网络、构建时空概念映射模块、改进文本语境智能框架与增强文化隐喻模型训练四条路径。

       

      Abstract: Traditional Chinese Medicine(TCM) classics have preserved a holistic medical paradigm for over two millennia, offering valuable insights for global chronic disease management and personalized healthcare.Their philosophical and medical significance is undeniable.This study selected 100 sample sentences from five representative TCM classics—Huangdi Neijing:Suwen(Yellow Emperor's Inner Canon: Basic Questions),Yinhai Jingwei(Essential Subtleties on the Silver Sea),Bencao Gangmu(Compendium of Materia Medica),Shanghan Lun(Treatise on Cold Damage),and Jingui Yaolue(Essential Prescriptions from the Golden Cabinet)—as test corpora.Six domestic large language models(LLMs)—DeepSeek, Doubao, Qwen3,ERNIE 4.5,Kimi, and Hunyuan T1—were evaluated for translation performance using BERTScore semantic matching scores.Quantitative analysis revealed that ERNIE 4.5 ranked first, showing statistically significant superiority over the lowest-ranking Qwen3,while differences among other models were not significant.Overall, the LLM-generated translations met the comprehensibility threshold and minimum post-editing requirements.Qualitative analysis of the highest-and lowest-scoring sentences identified key weaknesses in TCM term interpretation, spatiotemporal concept mapping, contextual coherence, and cultural metaphor transfer.Based on these findings, this paper proposes four improvement pathways: constructing terminological knowledge graph networks, developing spatiotemporal concept mapping modules, enhancing contextual intelligence frameworks, and strengthening cultural metaphor model training.These strategies aim to advance the English translation and global dissemination of TCM classics, promoting the worldwide sharing and inheritance of TCM culture.

       

    /

    返回文章
    返回