반응형

uncertainty 8

Uncertainty estimation 관련 논문 정리 - 2

https://arxiv.org/abs/2112.13776 Transformer Uncertainty Estimation with Hierarchical Stochastic AttentionTransformers are state-of-the-art in a wide range of NLP tasks and have also been applied to many real-world products. Understanding the reliability and certainty of transformer model predictions is crucial for building trustable machine learning applicatiarxiv.org 이 논문은 기존 transformer구조가 Un..

Uncertainty 논문 모아 보기 NAACL 2025 - 4

2025.05.03 - [인공지능/논문 리뷰 or 진행] - Planning 논문 모아 보기 NAACL 2025 - 3 Planning 논문 모아 보기 NAACL 2025 - 32025.05.02 - [인공지능/논문 리뷰 or 진행] - Agent, Hallucination 관련, Planning 논문 모아 보기 NAACL 2025 - 2 Agent, Hallucination 관련, Planning 논문 모아 보기 NACCL 2025 - 22025.05.01 - [인공지능/논문 리뷰 or 진행] - Ayoonschallenge.tistory.com https://arxiv.org/abs/2503.17990 SUNAR: Semantic Uncertainty based Neighborhood Aware Re..

CAN LLMS EXPRESS THEIR UNCERTAINTY? AN EMPIRICAL EVALUATION OF CONFIDENCE ELICITATION IN LLMS - 논문 리뷰

https://openreview.net/pdf?id=gjeQKFxFpZ https://arxiv.org/abs/2306.13063 Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMsEmpowering large language models to accurately express confidence in their answers is essential for trustworthy decision-making. Previous confidence elicitation methods, which primarily rely on white-box access to internal model in..

vllm 활용해서 logit 추출 및 logprob, CoT, SC-CoT Inference 진행

class로 된 python이라 self나 다른 것 들이 붙어있긴 한데 적당히 보면 될 것 같습니다.기록 용이라....from datasets import load_from_disk, DatasetDictimport argparse, os, json, torch, itertools, math, refrom typing import List, Dict, Tuplefrom scipy.special import digammafrom vllm import LLM, SamplingParamsfrom collections import defaultdict, Counterfrom transformers import AutoTokenizerfrom setproctitle import setproctitle 일단 전부 ..

Uncertainty를 활용한 Agent - Towards Uncertainty-Aware Language Agent

https://arxiv.org/abs/2401.14016 Towards Uncertainty-Aware Language AgentWhile Language Agents have achieved promising success by placing Large Language Models at the core of a more versatile design that dynamically interacts with the external world, the existing approaches neglect the notion of uncertainty during these interacarxiv.org 최근에 준비하고 있던 주제인데 이미 선행 자료가 있었더라고요...?그렇게 찾을 땐 안나오더니 하필... 여..

Uncertainty를 어떻게 측정해야 할까 - Estimating LLM Uncertainty with Logits - 논문 리뷰

https://arxiv.org/abs/2502.00290 Estimating LLM Uncertainty with LogitsIn recent years, Large Language Models (LLMs) have seen remarkable advancements and have been extensively integrated across various fields. Despite their progress, LLMs are prone to hallucinations, producing responses that may not be dependable if the modearxiv.org 지금 진행 중인 연구에 관련이 있는 논문입니다.Uncertainty를 측정하기 위해 우린 이렇게 했다! 라는 ..

728x90
728x90