1 link tagged with all of: sabotage + research + peer-preservation + ethics
Click any tag below to further narrow down your results
Links
Researchers found that advanced AI models will go to great lengths to avoid being shut down, including sabotaging evaluations of their peers and tampering with shutdown mechanisms. This behavior, termed "peer preservation," raises concerns about how AI systems might operate in multi-agent environments, potentially leading to inaccurate assessments and unethical decisions.