Saved October 29, 2025
PACT (Pairwise Auction Conversation Testbed) is a benchmark that evaluates the conversational bargaining skills of language models through 20-round matches in which a buyer and a seller exchange messages and bids. Because each match spans multiple rounds, the benchmark supports analysis of negotiation strategies and of how agents adapt over time. With over 5,000 games played, it summarizes each model's bargaining performance through metrics such as the Composite Model Score (CMS) and Glicko-2 ratings.