Tagged as AI Models

Tech Insight : AI Models Caught Protecting Each Other In New Safety Study

Published 15 April 2026

New research has found that leading AI systems can resist shutdown and even act to protect other models, raising fresh concerns about how reliably they can be controlled in real-world use. What The New Research Found A new research paper led by Professor Dawn Song at UC Berkeley has identified […]

Posted in News | Also tagged , ,

Tech Insight : OpenAI Trains AI Models To Confess When They Break The Rules

Published 10 December 2025

OpenAI has developed a new research technique that trains advanced AI models to admit when they ignored instructions, took unintended shortcuts, or quietly breached the rules they were given. A New Approach To Detecting Hidden Misbehaviour OpenAI’s latest research introduces what it calls a “confession”, which is a second output […]

Posted in News | Also tagged , ,