
All posts: 1,049

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis - 논문 리뷰

https://arxiv.org/abs/2307.12856
Abstract: "Pre-trained large language models (LLMs) have recently achieved better generalization and sample efficiency in autonomous web automation. However, the performance on real-world websites has still suffered from (1) open domainness, (2) limited context lengt.."
WebAgent, on real ..

AgentGym Evolving Large Language Model-based Agents across Diverse Environments - 논문 리뷰

https://arxiv.org/abs/2406.04151
Abstract: "Building generalist agents that can handle diverse tasks and evolve themselves across different environments is a long-term goal in the AI community. Large language models (LLMs) are considered a promising foundation to build such agents due to their gener.."
Here, the A.., which trains the LLM directly, ..

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading - 논문 리뷰

https://arxiv.org/abs/2310.05029
Abstract: "Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined context window.."
This paper summarizes the text briefly through a tree structure ..
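The tree idea mentioned in the preview can be sketched minimally: split a long text into chunks, then repeatedly summarize pairs of nodes until one root remains. This is only an illustrative toy; `summarize` here is a hypothetical stand-in (plain truncation) for the LLM summarizer the paper uses, and the chunking/navigation details are omitted.

```python
# Toy sketch of building a summary tree over long text, assuming:
# - `summarize` is a placeholder (truncation) for an LLM summarizer,
# - chunks are already given; real chunking would be token-based.

def summarize(text: str, limit: int = 40) -> str:
    # Stand-in summarizer: keep only the first `limit` characters.
    return text[:limit]

def build_summary_tree(chunks: list[str]) -> str:
    level = chunks
    while len(level) > 1:
        nxt = []
        for i in range(0, len(level), 2):
            pair = " ".join(level[i:i + 2])
            nxt.append(summarize(pair))   # each parent summarizes its children
        level = nxt
    return level[0]   # root summary; query-time reading would navigate down from here

root = build_summary_tree(["alpha " * 5, "beta " * 5, "gamma " * 5])
```

At query time the paper's agent walks down this tree interactively, choosing which child to read next, rather than stuffing the whole text into the context window.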

Empowering Private Tutoring by Chaining Large Language Models - 논문 리뷰

https://arxiv.org/abs/2309.08112
Abstract: "Artificial intelligence has been applied in various aspects of online education to facilitate teaching and learning. However, few approaches has been made toward a complete AI-powered tutoring system. In this work, we explore the development of a full-fled.."
Oh, the LLM becomes a teacher! Using memory, what the student knows and doesn't ..

ChatDev: Communicative Agents for Software Development - 논문 리뷰

https://arxiv.org/abs/2307.07924
Abstract: "Software development is a complex task that necessitates cooperation among multiple members with diverse skills. Numerous studies used deep learning to improve specific phases in a waterfall model, such as design, coding, and testing. However, the deep lea.."
Like the Minecraft agent covered earlier, this paper also uses long-term, s..

Are LLMs Effective Negotiators? Systematic Evaluation of the Multifaceted Capabilities of LLMs in Negotiation Dialogues - 구현

- Topic: systematic evaluation of LLMs' multifaceted capabilities in negotiation dialogues.
- Goals: evaluate LLM comprehension, annotation, partner modeling, and response generation in negotiation dialogues; explore the potential and limits of LLM-based negotiation systems.
- Datasets: 4 in total (CRA, DND, CA (CaSiNo), JI (Job Interview)); negotiation scenarios built on the Multi-Issue Bargaining Task (MIBT).
- Evaluation: 35 fine-grained tasks (comprehension, annotation, partner modeling, response generation); time phases of negotiation Start, During, and End; split into objective (a ground-truth answer exists) and subjective (psychological-state inference) evaluation.
- Compared models: GPT-4, GPT-3.5, Mistral-7..

LoRA+: Efficient Low Rank Adaptation of Large Models - 리뷰

https://arxiv.org/abs/2402.12354
Abstract: "In this paper, we show that Low Rank Adaptation (LoRA) as originally introduced in Hu et al. (2021) leads to suboptimal finetuning of models with large width (embedding dimension). This is due to the fact that adapter matrices A and B in LoRA are updated w.."
Where the original LoRA used the same learning rate for both A and B, here A and B get different learning rates ..
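The one-line change the preview describes can be sketched as a plain SGD step where the two adapter matrices get separate learning rates. This is a minimal illustration, not the paper's implementation; the ratio value and the random "gradients" below are placeholders (in practice the gradients come from backprop, and LoRA+ derives the ratio from width considerations).

```python
import numpy as np

rng = np.random.default_rng(1)

r, k, d = 2, 6, 6                      # illustrative tiny shapes
A = rng.normal(size=(r, k)) * 0.01     # trainable adapter matrix A
B = np.zeros((d, r))                   # trainable adapter matrix B, zero-initialized

lr_A = 1e-4
ratio = 16                             # hypothetical ratio >> 1; LoRA+ sets lr_B = ratio * lr_A
lr_B = ratio * lr_A

# Placeholder gradients; real ones come from backprop through the loss.
grad_A = rng.normal(size=A.shape)
grad_B = rng.normal(size=B.shape)

# One SGD step: the only difference from vanilla LoRA is the per-matrix learning rate.
A -= lr_A * grad_A
B -= lr_B * grad_B
```

In a framework like PyTorch the same idea is expressed with two optimizer parameter groups, one holding the A matrices at `lr_A` and one holding the B matrices at `lr_B`.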

LoRA: Low-Rank Adaptation of Large Language Models - 논문 리뷰

https://arxiv.org/abs/2106.09685
Abstract: "An important paradigm of natural language processing consists of large-scale pre-training on general domain data and adaptation to particular tasks or domains. As we pre-train larger models, full fine-tuning, which retrains all model parameters, becomes le.."
LoRA, the efficient parameter tuning method! The original weights stay as-is ..
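The "original weights stay as-is" idea can be sketched in a few lines: the pretrained weight W is frozen, and only a low-rank update BA is trained, with the paper's scaling of alpha/r. The tiny dimensions and values below are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(0)

d, k, r = 8, 8, 2          # base weight is d x k; adapter rank r << min(d, k)
alpha = 4                  # LoRA scaling hyperparameter

W = rng.normal(size=(d, k))          # pretrained weight, frozen during finetuning
A = rng.normal(size=(r, k)) * 0.01   # trainable, small random init
B = np.zeros((d, r))                 # trainable, zero init -> BA = 0 at start

def lora_forward(x):
    # Effective weight is W + (alpha / r) * B @ A; only A and B receive gradients.
    return (W + (alpha / r) * B @ A) @ x

x = rng.normal(size=(k,))
```

Because B starts at zero, the adapted model is exactly the pretrained model at initialization, and after training the update BA can be merged back into W with no inference-time overhead.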

HOB CAE - Shell, Beam Node 접합, wall 접합 완료

2024.11.16 - [FEM/HOB] - HOB log archive: HOB Shell Node connection, up to wall connection, plus Modal (yoonschallenge.tistory.com)
The end of HOB is finally in sight. The CAE program I started building last year with the idea "let's earn money with C++" is gradually nearing completion. It began with joining Beams and Shells to walls, then moved on to Node connections and Modal. In the previously built version there was a problem caused by coordinate systems: the system I used differed slightly from the one the original author used, which produced errors, and there was a bug where a beam would detach when its direction differed. The coordinate system I used is not absolutely correct ..

FEM/HOB 2024.12.02

Are LLMs Effective Negotiators? Systematic Evaluation of the Multifaceted Capabilities of LLMs in Negotiation Dialogues - 논문 리뷰

https://arxiv.org/abs/2402.13550
Abstract: "A successful negotiation requires a range of capabilities, including comprehension of the conversation context, Theory-of-Mind (ToM) skills to infer the partner's motives, strategic reasoning, and effective communication, making it challenging fo.."
