Tag: Vision Foundation Model

All the articles with the tag "Vision Foundation Model".

Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models

Published: 8 May, 2025 at 12:17 AM

96.36 😐

本文提出了一种基于视觉-语言模型的定义引导提示技术和UnHateMeme框架，用于检测和缓解多模态模因中的仇恨内容，通过零样本和少样本提示实现高效检测，并生成非仇恨替代内容以保持图像-文本一致性，在实验中展现出显著效果。