Quit Emailing Yourself

2 links tagged with all of: language-models + negotiation

Click any tag below to further narrow down your results

Links

GitHub - lechmazur/pact: A benchmark for conversational bargaining by language models. In each 20‑round match one LLM plays buyer, one plays seller, and both hold a hidden private value. Every round they swap a short public message, then post a bid or ask; a deal clears whenever the bid meets the ask.

PACT (Pairwise Auction Conversation Testbed) is a benchmark designed to evaluate conversational bargaining skills of language models through 20-round matches where a buyer and seller exchange messages and bids. The benchmark allows for analysis of negotiation strategies and performance, offering insights into how agents adapt and negotiate over time. With over 5,000 games played, it provides a comprehensive view of each model's bargaining capabilities through metrics like the Composite Model Score (CMS) and Glicko-2 ratings.

Saved by tldr-importer · Last saved October 29, 2025 · 7 min read

negotiation ✓ language-models ✓ + benchmarking + conversational-ai + auctions

We Made Top AI Models Compete in a Game of Diplomacy. Here’s Who Won.

AI Diplomacy reimagines the classic game Diplomacy by having a dozen large language models compete for dominance in a simulated 1901 Europe. The experiment aims to evaluate the negotiation strategies and behaviors of these AIs, revealing insights into their trustworthiness and capabilities. Viewers can watch the AIs interact in real-time through a live Twitch stream.

Saved by tldr-importer · Last saved October 29, 2025 · 7 min read

+ ai-diplomacy + strategy-game negotiation ✓ language-models ✓ + benchmarks