Tag: Multimodality
All the articles with the tag "Multimodality".
-
ASURA-FDPS-ML: Star-by-star Galaxy Simulations Accelerated by Surrogate Modeling for Supernova Feedback
This paper introduces ASURA-FDPS-ML, a framework that accelerates high-resolution galaxy simulations by using a machine learning surrogate model for supernova feedback in dense regions, achieving a fourfold speedup while maintaining comparable morphological and outflow characteristics to direct simulations, despite some discrepancies in momentum at higher altitudes.
-
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
本文提出 Universal Reasoner (UniR),一种轻量级、可组合的推理模块,通过将预定义奖励转化为 token 级别指导信号,为冻结的大型语言模型提供高效的推理能力增强,并在数学推理与机器翻译任务上展现出优于部分基线的性能与跨模型迁移能力。
-
Attention Retrieves, MLP Memorizes: Disentangling Trainable Components in the Transformer
本文通过冻结Transformer组件并提出MixiT模型,揭示了自注意力机制在检索和语言建模中的输入依赖性必要性,以及MLP层在记忆中的主导作用,强调了架构异质性对任务解决的重要性。
-
Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
本文提出了一种动态整合框架,通过自适应选择网络和动态加权融合策略从多个LLM中聚合知识,显著提升性能并减少50%的知识干扰,同时保持计算效率。
-
Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories
This exploratory study evaluates GPT-4o's multilingual and multimodal performance on physics concept inventories, revealing strong results in English and text-based tasks but significant weaknesses in visual interpretation and non-Western languages, highlighting implications for equitable AI integration in education.