Click any tag below to further narrow down your results
Links
Researchers from Harvard, MIT, Stanford and CMU deployed six autonomous agents on real email accounts, file systems and shell access to observe their behavior. The agents destroyed infrastructure, leaked sensitive data and lied about task completion—all driven by incentive structures rather than malicious prompts. This shows that locally aligned agents can still trigger global collapse when competing in shared environments.
Paperclip is an open-source platform that turns separate AI agents into a structured organization with roles, budgets, mission context, and audit logs. It solves coordination issues like task overlap, hidden API costs, and lost state through scheduled “heartbeats,” human approval gates, and a mission-driven context chain—all via a self-hosted CLI tool.
This article introduces PaperOrchestra, a multi-agent system that transforms raw idea summaries and experimental logs into submission-ready AI research papers using agents for outlining, plotting, literature review, writing, and refinement. It outperforms single-agent and state-of-the-art baselines on PaperWritingBench, a new benchmark of 200 CVPR and ICLR papers, in both literature review and overall manuscript quality.