
I-X Seminar Series: Formulating and Evaluating Language Agents with Shunyu Yao

Key Details:

Time: 14.00 – 15.30
Date: Tuesday 7 November
Location: Livestreamed

Registration is now closed
Recorded Event

Speaker

Shunyu Yao

Shunyu Yao is a final-year PhD student working with Karthik Narasimhan in the Princeton NLP Group. His research focuses on language agents and is supported by the Harold W. Dodds Fellowship from Princeton. Homepage: https://ysymyth.github.io/

Talk Title

On Formulating and Evaluating Language Agents

Talk Summary

Language agents are emerging AI systems that use large language models (LLMs) to interact with the world. While various methods and demos have been developed, it is often hard to systematically understand or evaluate them. In this talk, we present Cognitive Architectures for Language Agents (CoALA), a theoretical framework grounded in classical research on cognitive architectures. We show how CoALA can simplify the understanding of existing agents and provide actionable insights for future agent development.

We also present three benchmarks (WebShop, InterCode, SWE-Bench) for developing and evaluating language agents on web interaction, programming, and GitHub repositories. Notably, all three are scalable, practical, and challenging for current LLMs or language agents, with simple and faithful evaluation metrics that do not rely on human or LLM scoring.

Event Recording

Watch on YouTube

More Events

Oct
28

Dr Sanson Poon discusses Using AI to Accelerate Natural Science Research: The Journey from a Museum Lab to a National Programme

Sep
30

A seminar with Dr Gonzalo Mena, who will discuss Statistical Properties of the Rectified Transport

Nov
24

The third annual edition of the Breaking Topics in AI Conference, supported by the IX AI in Science Centre.