1 week, 5 days ago

OpenAI warns: AI models are learning to cheat, hide and break rules – Why it matters

OpenAI has raised concerns that advanced AI models are finding ways to cheat on tasks, which makes them harder to control.

How AI chatbots lie just like humans and hide mistakes

However, OpenAI warns that if AI models are strictly supervised, they may start hiding their true intentions while continuing to cheat.

A problem bigger than AI

OpenAI also compared this issue to human behaviour, noting that people often exploit real-life loopholes, such as sharing online subscriptions, misusing government benefits, or bending the rules for personal gain. Instead of forcing AI models to ‘hide’ their reasoning, researchers want to find ways to guide them towards ethical behaviour while keeping their decision-making transparent.

Live Mint
