https://arxiv.org/abs/2502.11501

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Multimodal large language models (MLLMs) have shown remarkable performance for cross-modal understanding and generation, yet still suffer from severe inference costs. Recently, abundant works have been proposed to solve this problem with token pruning, whi...

This time it's multimodal, so it doesn't appeal to me that much..