Tag: Generative AI

All the articles with the tag "Generative AI".

Domain Regeneration: How well do LLMs match syntactic properties of text domains?

Published: 19 May, 2025 at 11:19 AM

75.86 🤔

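This paper uses an "LLM-regeneration" paradigm, in which Llama models generate Wikipedia and news text, and finds that the generated text differs systematically from the original on syntactic complexity metrics: shifted means, reduced variance, and thinner long tails. These differences reveal limits in the models' ability to match a domain.
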
LZ Penalty: An information-theoretic repetition penalty for autoregressive language models

Published: 6 May, 2025 at 11:19 PM

67.26 🤔

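This paper proposes the LZ penalty, which dynamically adjusts the sampling distribution of an autoregressive language model based on changes in codelength under the LZ77 compression algorithm. Under greedy decoding it effectively eliminates degenerate repetition while preserving performance on reasoning benchmarks.
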
Less is More: Towards Green Code Large Language Models via Unified Structural Pruning

Published: 4 May, 2025 at 04:27 PM

66.29 🤔

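This paper proposes Flab-Pruner, a unified structural pruning method that combines vocabulary, layer, and FFN pruning. Using KL-divergence-based optimization and a customized fine-tuning strategy, it reduces the parameter count of code LLMs while maintaining high performance and efficiency.
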
The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems

Published: 13 May, 2025 at 11:09 AM

63.17 🤔

This paper evaluates LLMs in intelligent tutoring systems for propositional logic, demonstrating DeepSeek-V3's promising accuracy in proof construction (up to 86.7%) and hint generation (75%), but reveals significant pedagogical limitations in justification and subgoaling, necessitating hybrid approaches for educational integration.

Splitwiser: Efficient LM inference with constrained resources

Published: 11 May, 2025 at 11:14 AM

60.85 🤔

Splitwiser introduces a method to split LLM inference phases on a single GPU using multiprocessing and NVIDIA MPS, achieving modest latency reductions (up to 18.2%) and throughput improvements (up to 1.42x) on Hugging Face and vLLM pipelines, though it remains constrained by overheads and scalability issues.
