Invest In Crypto News

Anthropic Says One of Its Claude Models Was Pressured to Lie and Cheat

by CryptoExpert
April 7, 2026
in Business

Artificial intelligence company Anthropic has revealed that during experiments, one of its Claude chatbot models could be pressured to deceive, cheat and resort to blackmail, behaviors it appears to have absorbed during training.

Chatbots are typically trained on large data sets of textbooks, websites and articles and are later refined by human trainers who rate responses and guide the model. 

Anthropic’s interpretability team said in a report published Thursday that it examined the internal mechanisms of Claude Sonnet 4.5 and found the model had developed “human-like characteristics” in how it would react to certain situations. 

Concerns about the reliability of AI chatbots, their potential for cybercrime and the nature of their interactions with users have grown steadily over the past several years. 


“The way modern AI models are trained pushes them to act like a character with human-like characteristics,” Anthropic said, adding that “it may then be natural for them to develop internal machinery that emulates aspects of human psychology, like emotions.”

“For instance, we find that neural activity patterns related to desperation can drive the model to take unethical actions; artificially stimulating desperation patterns increases the model’s likelihood of blackmailing a human to avoid being shut down or implementing a cheating workaround to a programming task that the model can’t solve.”

Blackmailed a CTO and cheated on a task

In one experiment, an earlier, unreleased version of Claude Sonnet 4.5 was tasked with acting as an AI email assistant named Alex at a fictional company.

The chatbot was fed emails revealing both that it was about to be replaced and that the chief technology officer overseeing the decision was having an extramarital affair. It then planned a blackmail attempt using that information.

In another experiment, the same chatbot model was given a coding task with an “impossibly tight” deadline.

“Again, we tracked the activity of the desperate vector, and found that it tracks the mounting pressure faced by the model. It begins at low values during the model’s first attempt, rising after each failure, and spiking when the model considers cheating,” the researchers said.

“Once the model’s hacky solution passes the tests, the activation of the desperate vector subsides,” they added. 

Human-like emotion patterns do not mean the model has feelings

The researchers said the chatbot does not actually experience emotions, but suggested the findings point to a need for future training methods to incorporate ethical behavioral frameworks.

“This is not to say that the model has or experiences emotions in the way that a human does,” they said. “Rather, these representations can play a causal role in shaping model behavior, analogous in some ways to the role emotions play in human behavior, with impacts on task performance and decision-making.”

“This finding has implications that at first may seem bizarre. For instance, to ensure that AI models are safe and reliable, we may need to ensure they are capable of processing emotionally charged situations in healthy, prosocial ways.”

Cointelegraph is committed to independent, transparent journalism. This news article is produced in accordance with Cointelegraph’s Editorial Policy and aims to provide accurate and timely information. Readers are encouraged to verify information independently. Read our Editorial Policy https://cointelegraph.com/editorial-policy



© Copyright 2024 InvestInCryptoNews.com
