Tag: Large Language Model

All the articles with the tag "Large Language Model".

HAIR: Hardness-Aware Inverse Reinforcement Learning with Introspective Reasoning for LLM Alignment

Published: 11 May, 2025 at 11:12 AM

67.37 🤔

HAIR introduces a novel LLM alignment method using hardness-aware inverse reinforcement learning and introspective reasoning, constructing a balanced safety dataset and training category-specific reward models with GRPO-S, achieving state-of-the-art harmlessness while preserving usefulness across multiple benchmarks.
LZ Penalty: An information-theoretic repetition penalty for autoregressive language models

Published: 6 May, 2025 at 11:19 PM

67.26 🤔

本文提出LZ惩罚方法，基于LZ77压缩算法的码长变化动态调整自回归语言模型的采样分布，在贪婪解码下有效消除退化重复，同时保持推理基准性能。
X-Fusion: Introducing New Modality to Frozen Large Language Models

Published: 4 May, 2025 at 04:31 PM

66.52 🤔

本文提出X-Fusion框架，通過凍結LLM參數並添加雙塔結構，高效實現多模態理解和生成，同時保留原始語言能力。
Prompt-Based Cost-Effective Evaluation and Operation of ChatGPT as a Computer Programming Teaching Assistant

Published: 4 May, 2025 at 04:26 PM

66.50 🤔

本文通过设计基于ICL和CoT的提示模板，实现了ChatGPT在编程教育中的成本效益评估和操作，显著降低了手动评估需求并提升了反馈的结构化分析。
Less is More: Towards Green Code Large Language Models via Unified Structural Pruning

Published: 4 May, 2025 at 04:27 PM

66.29 🤔

本文提出Flab-Pruner，一种结合词汇、层和FFN剪枝的统一结构剪枝方法，通过KL散度优化和自定义微调策略，在减少代码LLM参数的同时保持高性能和效率。

Tag: Large Language Model

HAIR: Hardness-Aware Inverse Reinforcement Learning with Introspective Reasoning for LLM Alignment

LZ Penalty: An information-theoretic repetition penalty for autoregressive language models

X-Fusion: Introducing New Modality to Frozen Large Language Models

Prompt-Based Cost-Effective Evaluation and Operation of ChatGPT as a Computer Programming Teaching Assistant

Less is More: Towards Green Code Large Language Models via Unified Structural Pruning