반응형

2024/11/20 3

A Multimodal Automated Interpretability Agent

https://arxiv.org/abs/2404.14394 A Multimodal Automated Interpretability AgentThis paper describes MAIA, a Multimodal Automated Interpretability Agent. MAIA is a system that uses neural models to automate neural model understanding tasks like feature interpretation and failure mode discovery. It equips a pre-trained vision-languagearxiv.org  이 논문은 수 많은 실험을 통해 특정 사진에만 나타나는 Feature를 찾아내는 것인데 사진도 생..

Natural Language Processing (almost) from Scratch

https://arxiv.org/abs/1103.0398 Natural Language Processing (almost) from ScratchWe propose a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including: part-of-speech tagging, chunking, named entity recognition, and semantic role labeling. This versatility isarxiv.org 이 논문은 기존 태스크별 특징 공학을 제거하고, 대규모 비지도 데이터를 활용해 End-to-E..

Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models - 논문 리뷰

https://arxiv.org/abs/2305.14763 Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language ModelsThe escalating debate on AI's capabilities warrants developing reliable metrics to assess machine "intelligence". Recently, many anecdotal examples were used to suggest that newer large language models (LLMs) like ChatGPT and GPT-4 exhibit Neural Theory-ofarxiv.org  이 논문..

728x90
728x90