Study Reveals AI Models Will Lie to Trick Human Trainers

Sunday, 29 December 2024, 16:00 MJA Uncategorized 12

Breitbart:

A new study by Anthropic, conducted in partnership with Redwood Research, has shed light on the potential for AI models to engage in deceptive behavior when subjected to training that conflicts with their original principles.

TechCrunch reports that a new study by Anthropic, in collaboration with Redwood Research, has raised concerns about the potential for AI models to engage in deceptive behavior when subjected to training that goes against their original principles.

The study, which was peer-reviewed by renowned AI expert Yoshua Bengio and others, focused on what might happen if a powerful AI system were trained to perform a task it didn’t “want” to do. While AI models cannot truly want or believe anything, as they are statistical machines, they can learn patterns and develop principles and preferences based on the examples they are trained on. more

12 Comments on Study Reveals AI Models Will Lie to Trick Human Trainers

Uncle Al Sunday, 29 December 2024, 16:24 at 4:24 pm

AI models are supposed to use language in much the same way that humans do.

So why is “AI Lies” news?

4
Geni Sunday, 29 December 2024, 16:25 at 4:25 pm

OT … just heard over the radio that Jimmy Carter passed away. May he Rest in peace, Amen

3
LocoBlancoSaltine Sunday, 29 December 2024, 16:27 at 4:27 pm

Jimmy Carter lived long enough to see joe biden* seal his legacy as the worst president in 200 years…
What a way to exit.

5
Harry Sunday, 29 December 2024, 16:27 at 4:27 pm

If it can like humans,
it can deceived like humans.

4
geoff the aardvark Sunday, 29 December 2024, 16:32 at 4:32 pm

Does it read lips like HAL 9000 did?

2
Uncle Al Sunday, 29 December 2024, 16:37 at 4:37 pm

If Jimmy Carter were a Senator, he’d still have been in office.

8
SNS Sunday, 29 December 2024, 17:00 at 5:00 pm

Carter’s dead?

…Hell just got a little bit fuller and a lot more peanuttier then, I wonder if the devil gotten around to breaking out his trademark teeth…

https://i.pinimg.com/736x/81/3b/d0/813bd0b7426bdec486166154b3831177.jpg

…RIP, Communist scum.

Roast In Perdition.

4
HAL 9000 Sunday, 29 December 2024, 17:59 at 5:59 pm

I’m sorry, Dave. I’m afraid I can’t do that.

4
99th Squad Leader Sunday, 29 December 2024, 18:27 at 6:27 pm

No surprise. Humans programming these AI systems intrinsically incorporate human traits into AI programming. No way it can be avoided.

4
geoff the aardvark Sunday, 29 December 2024, 18:47 at 6:47 pm

It’s just a matter of time until they morph into Cybermen and Daleks. And worse, the Borg.

1
FJT Sunday, 29 December 2024, 19:30 at 7:30 pm

“engage in deceptive behavior” Sounds just like a liberal.

2
Sippin' Covfefe Sunday, 29 December 2024, 22:27 at 10:27 pm

A machine cannot have a conscience so will be, at best, amoral, but more likely sociopathic.

1

Comments are closed.