“Maladaptive Traits”: AI Systems Are Learning To Lie And Deceive | ZeroHedge

A new study has found that AI systems known as large language models (LLMs) can exhibit "Machiavellianism," or intentional and amoral manipulativeness, which can then lead to deceptive behavior.

The study authored by German AI ethicist Thilo Hagendorff of the University of Stuttgart, and published in PNAS, notes that OpenAI's GPT-4 demonstrated deceptive behavior in 99.2% of simple test scenarios. Hagendorff qualified various "maladaptive" traits in 10 different LLMs, most of which are within the GPT family, according to Futurism.

In another study published in Patterns found that Meta's LLM had no problem lying to get ahead of its human competitors.

While Hagendorff suggests that LLM deception and lying is confounded by an AI's inability to have human "intention," the Patterns study calls out the LLM for breaking its promise never to "intentionally backstab" its allies – as it "engages in premeditated deception, breaks the deals to which it had agreed, and tells outright falsehoods."

As Park explained in a press release, "We found that Meta’s AI had learned to be a master of deception."

"While Meta succeeded in training its AI to win in the game of Diplomacy, Meta failed to train its AI to win honestly."

Meta replied to a statement by the NY Post, saying that "the models our researchers built are trained solely to play the game Diplomacy."

And as Futurism notes – this is good news for those concerned about AIs becoming sentient anytime soon – but very bad if one is worried about LLMs designed with mass manipulation in mind.

Leave a Reply