Tag: Classification
All the articles with the tag "Classification".
-
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
本文通过实证实验指导在医疗专业应用中语言模型的选择,强调微调小语言模型和领域特定预训练的显著优势,使其在特定任务上超越零-shot 大语言模型。
-
Compact Recurrent Transformer with Persistent Memory
This paper introduces the Compact Recurrent Transformer (CRT), which combines shallow Transformers with RNNs to efficiently process long sequences using a single persistent memory vector, achieving superior or comparable performance to full-length Transformers and Transformer-XL on language and video tasks with significantly reduced computational cost.
-
How do Humans and Language Models Reason About Creativity? A Comparative Analysis
This paper conducts a comparative analysis of creativity evaluation in STEM, revealing that human experts and LLMs prioritize different facets of originality (cleverness vs. remoteness/uncommonness) and are differentially influenced by contextual examples, with LLMs showing higher predictive accuracy but poorer construct validity due to homogenized facet correlations.
-
Survey of Abstract Meaning Representation: Then, Now, Future
本文综述了抽象意义表示(AMR)作为一种图结构语义表示框架的发展、解析与生成方法、多语言扩展及下游应用,揭示其在提升机器语言理解中的潜力与局限。
-
Do We Need a Detailed Rubric for Automated Essay Scoring using Large Language Models?
本文通过对比详细、简化和无评分标准在四个大型语言模型上的自动作文评分表现,发现简化标准在大多数模型中能保持与详细标准相似的准确性并显著降低token使用量,但模型特异性和整体性能不足仍需关注。