
•Enterprise•10 min read
Evaluating Agents on Financial Analyst Workflows (SheetBench)
A case study on developing evaluations for agent performance on finance analyst jobs.
The HUD Team, Sepal AIRead more
Join our mailing list to receive the latest research updates, benchmark releases, and insights into AI agent development.
Mailing List