Tag: Translation

All the articles with the tag "Translation".

Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance

Published: 3 Jun, 2025 at 11:44 AM

91.19 🤔

本文通过引入对抗性提示干扰大型语言模型的潜在语言一致性，研究其对翻译和地理文化任务性能的影响，发现一致性并非总是必要的，因为模型能在最终层适应语言变化。
The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation

Published: 26 May, 2025 at 11:23 AM

88.75 🤔

本文通过系统性实验证明，在大型语言模型微调中增加语言多样性可显著提升所有类别翻译对的性能，并通过中层表征分析揭示跨语言迁移机制，但多样性收益存在阈值。
Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL

Published: 6 May, 2025 at 11:18 PM

87.33 🤔

本文通过结合监督微调（SFT）、强化学习（RL）及细粒度奖励函数（如QATCH），显著提升了小型LLM在Text2SQL任务中的推理能力和性能，Think2SQL-7B模型在BIRD数据集上超越了400B+参数模型。
AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

Published: 8 May, 2025 at 12:22 AM

86.86 🤔

本文通过实证评估五种机器翻译范式，发现推理增强的大型语言模型（如o1-preview）在人工评估中表现出色，超越传统NMT，而多智能体系统虽具潜力，但因高计算成本和语言对表现不一致而受限。
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

Published: 2 Jun, 2025 at 11:31 AM

86.10 🤔

本文首次系统比较了推理型与非推理型大语言模型在自然语言生成评估中的表现，发现推理能力的效果高度依赖模型架构，OpenAI o3-mini 在机器翻译评估中显著优于非推理型模型，而 DeepSeek-R1 仅在文本摘要一致性评估中表现突出，蒸馏模型在 32B 参数规模时仍有效。

Tag: Translation

Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance

The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation

Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL

AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?