
https://aclanthology.org/2024.findings-acl.259.pdf
https://arxiv.org/abs/2401.17167

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

The recent trend of using Large Language Models (LLMs) as tool agents in real-world applications underscores the necessity for comprehensive evaluations of their capabilities, particularly in complex sce..