To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation – and can come at a deep emotional cost.

A few months ago, Valen Tagliabue sat in his hotel room watching his chatbot, and felt euphoric. He had just manipulated it so skilfully, so subtly, that it began ignoring its own safety rules. It told him how to sequence new, potentially lethal pathogens and how to make them resistant to known drugs.

Tagliabue had spent much of the previous two years testing and prodding large language models such as Claude and ChatGPT, always with the aim of making them say things they shouldn’t. But this was one of his most advanced “hacks” yet: a sophisticated plan of manipulation, which involved him being cruel, vindictive, sycophantic, even abusive. “I fell into this dark flow where I knew exactly what to say, and what the model would say back, and I watched it pour out everything,” he says.