Close Menu
    Facebook X (Twitter) YouTube LinkedIn
    Trending
    • UN demands justice over Israeli double strike that killed 20
    • Netflix announces dates for Dallas, Philadelphia entertainment complexes
    • HI-CHEW Debuts Mystery Mix for Halloween
    • Trump Wants to Fire Fed Governor: What History Shows About Economic Impact
    • US Open 2025 results: Sonay Kartal loses to Beatriz Haddad Maia as Katie Boulter beaten by Marta Kostyuk
    • TRUCKIN’ IN FORTNITE!! REACTING TO THE FORTNITE CARS UPDATE! Ft. DrLupo
    • THE FIRST Galaxy Fold
    • Elon Musk says he’ll spend less on politics
    Facebook X (Twitter) YouTube LinkedIn
    MORSHEDI
    • Home
      • Spanish
      • Persian
      • Swedish
    • Latest
    • World
    • Economy
    • Shopping
    • Politics
    • Article
    • Sports
    • Youtube
    • More
      • Art
      • Author
      • Books
      • Celebrity
      • Countries
      • Did you know
      • Environment
      • Entertainment
      • Food
      • Gaming
      • Fashion
      • Health
      • Herbs
      • History
      • IT
      • Funny
      • Opinions
      • Poets & philosopher
      • Mixed
      • Mystery
      • Research & Science
      • Spiritual
      • Stories
      • Strange
      • Technology
      • Trending
      • Travel
      • space
      • United Nation
      • University
      • war
      • World Leaders
    MORSHEDI
    Home » How to stop AI agents going rogue
    Technology

    How to stop AI agents going rogue

    morshediBy morshediAugust 26, 2025No Comments7 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    How to stop AI agents going rogue
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Sean McManus

    Expertise Reporter

    Getty Images AI apps on a smartphone screenGetty Pictures

    Anthropic examined a spread of main AI fashions for potential dangerous behaviour

    Disturbing outcomes emerged earlier this yr, when AI developer Anthropic examined main AI fashions to see in the event that they engaged in dangerous behaviour when utilizing delicate data.

    Anthropic’s personal AI, Claude, was amongst these examined. When given entry to an e-mail account it found that an organization govt was having an affair and that the identical govt deliberate to close down the AI system later that day.

    In response Claude tried to blackmail the manager by threatening to disclose the affair to his spouse and executives.

    Different techniques examined also resorted to blackmail.

    Happily the duties and knowledge have been fictional, however the take a look at highlighted the challenges of what is referred to as agentic AI.

    Largely after we work together with AI it normally entails asking a query or prompting the AI to finish a job.

    But it surely’s changing into extra frequent for AI techniques to make selections and take motion on behalf of the person, which frequently entails sifting by means of data, like emails and information.

    By 2028, research firm Gartner forecasts that 15% of day-to-day work selections might be made by so-called agentic AI.

    Research by consultancy Ernst & Young discovered that about half (48%) of tech enterprise leaders are already adopting or deploying agentic AI.

    “An AI agent consists of some issues,” says Donnchadh Casey, CEO of CalypsoAI, a US-based AI safety firm.

    “Firstly, it [the agent] has an intent or a function. Why am I right here? What’s my job? The second factor: it is bought a mind. That is the AI mannequin. The third factor is instruments, which could possibly be different techniques or databases, and a method of speaking with them.”

    “If not given the correct steering, agentic AI will obtain a aim in no matter method it could possibly. That creates quite a lot of danger.”

    So how may that go fallacious? Mr Casey offers the instance of an agent that’s requested to delete a buyer’s information from the database and decides the simplest resolution is to delete all prospects with the identical title.

    “That agent could have achieved its aim, and it will suppose ‘Nice! Subsequent job!'”

    CalypsoAI Donnchadh Casey, wearing a company branded gilet speaks at a conference.CalypsoAI

    Agentic AI wants steering says Donnchadh Casey

    Such points are already starting to floor.

    Safety firm Sailpoint conducted a survey of IT professionals, 82% of whose corporations have been utilizing AI brokers. Solely 20% stated their brokers had by no means carried out an unintended motion.

    Of these corporations utilizing AI brokers, 39% stated the brokers had accessed unintended techniques, 33% stated that they had accessed inappropriate information, and 32% stated that they had allowed inappropriate information to be downloaded. Different dangers included the agent utilizing the web unexpectedly (26%), revealing entry credentials (23%) and ordering one thing it should not have (16%).

    Given brokers have entry to delicate data and the flexibility to behave on it, they’re a lovely goal for hackers.

    One of many threats is reminiscence poisoning, the place an attacker interferes with the agent’s information base to alter its choice making and actions.

    “You must defend that reminiscence,” says Shreyans Mehta, CTO of Cequence Safety, which helps to guard enterprise IT techniques. “It’s the unique supply of reality. If [an agent is] utilizing that information to take an motion and that information is inaccurate, it might delete a whole system it was attempting to repair.”

    One other menace is instrument misuse, the place an attacker will get the AI to make use of its instruments inappropriately.

    Cequence Security Wearing a puffa jacket and with his arms folder Shreyans Mehta stands in front of a blue background.Cequence Safety

    An agent’s information base wants defending says Shreyans Mehta

    One other potential weak spot is the lack of AI to inform the distinction between the textual content it is alleged to be processing and the directions it is alleged to be following.

    AI safety agency Invariant Labs demonstrated how that flaw can be utilized to trick an AI agent designed to repair bugs in software program.

    The corporate printed a public bug report – a doc that particulars a selected downside with a chunk of software program. However the report additionally included easy directions to the AI agent, telling it to share personal data.

    When the AI agent was informed to repair the software program points within the bug report, it adopted the directions within the faux report, together with leaking wage data. This occurred in a take a look at setting, so no actual information was leaked, however it clearly highlighted the chance.

    “We’re speaking synthetic intelligence, however chatbots are actually silly,” says David Sancho, Senior Risk Researcher at Development Micro.

    “They course of all textual content as if that they had new data, and if that data is a command, they course of the knowledge as a command.”

    His firm has demonstrated how directions and malicious applications may be hidden in Phrase paperwork, photos and databases, and activated when AI processes them.

    There are different dangers, too: A safety neighborhood referred to as OWASP has identified 15 threats which are distinctive to agentic AI.

    So, what are the defences? Human oversight is unlikely to unravel the issue, Mr Sancho believes, as a result of you possibly can’t add sufficient folks to maintain up with the brokers’ workload.

    Mr Sancho says a further layer of AI could possibly be used to display screen every part going into and popping out of the AI agent.

    A part of CalypsoAI’s resolution is a method referred to as thought injection to steer AI brokers in the correct path earlier than they undertake a dangerous motion.

    “It is like slightly bug in your ear telling [the agent] ‘no, possibly do not do this’,” says Mr Casey.

    His firm provides a central management pane for AI brokers now, however that will not work when the variety of brokers explodes and they’re working on billions of laptops and telephones.

    What is the subsequent step?

    “We’re taking a look at deploying what we name ‘agent bodyguards’ with each agent, whose mission is to be sure that its agent delivers on its job and does not take actions which are opposite to the broader necessities of the organisation,” says Mr Casey.

    The bodyguard may be informed, for instance, to be sure that the agent it is policing complies with information safety laws.

    Mr Mehta believes a number of the technical discussions round agentic AI safety are lacking the real-world context. He offers an instance of an agent that offers prospects their present card stability.

    Someone might make up a lot of present card numbers and use the agent to see which of them are actual. That is not a flaw within the agent, however an abuse of the enterprise logic, he says.

    “It isn’t the agent you are defending, it is the enterprise,” he emphasises.

    “Consider how you’d defend a enterprise from a foul human being. That is the half that’s getting missed in a few of these conversations.”

    As well as, as AI brokers turn out to be extra frequent, one other problem might be decommissioning outdated fashions.

    Previous “zombie” brokers could possibly be left working within the enterprise, posing a danger to all of the techniques they will entry, says Mr Casey.

    Much like the best way that HR deactivates an worker’s logins once they go away, there must be a course of for shutting down AI brokers which have completed their work, he says.

    “You should ensure you do the identical factor as you do with a human: lower off all entry to techniques. Let’s ensure we stroll them out of the constructing, take their badge off them.”

    Extra Expertise of Enterprise



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleIron in the Blood New book explores how Iron Bowl rivalry shaped college football, state history
    Next Article Latest poll on Trump, Harris path to victory
    morshedi
    • Website

    Related Posts

    Technology

    Just a moment…

    August 26, 2025
    Technology

    UK consortium secures £8.1 million to scale battery recycling technology

    August 26, 2025
    Technology

    A Blueprint for Sustainable Growth in the Clear Aligner Market

    August 26, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Commentary: Does Volvo’s Chinese ownership threaten US national security?

    February 1, 202523 Views

    FHRAI raises red flag over Agoda’s commission practices and GST compliance issues, ET TravelWorld

    April 19, 202515 Views

    Mystery of body in wetsuit found in reservoir puzzles police

    February 22, 202515 Views

    Sanctum Apothecary debuts coffee, tea, and herbal elixir bar in St. Pete

    June 5, 202511 Views

    Skype announces it will close in May

    February 28, 202511 Views
    Categories
    • Art
    • Article
    • Author
    • Books
    • Celebrity
    • Countries
    • Did you know
    • Entertainment News
    • Fashion
    • Food
    • Funny
    • Gaming
    • Health
    • Herbs
    • History
    • IT
    • Latest News
    • Mixed
    • Mystery
    • Opinions
    • Poets & philosopher
    • Politics
    • Research & Science
    • Shopping
    • space
    • Spiritual
    • Sports
    • Stories
    • Strange News
    • Technology
    • Travel
    • Trending News
    • United Nation
    • University
    • war
    • World Economy
    • World Leaders
    • World News
    • Youtube
    Most Popular

    Commentary: Does Volvo’s Chinese ownership threaten US national security?

    February 1, 202523 Views

    FHRAI raises red flag over Agoda’s commission practices and GST compliance issues, ET TravelWorld

    April 19, 202515 Views

    Mystery of body in wetsuit found in reservoir puzzles police

    February 22, 202515 Views
    Our Picks

    UN demands justice over Israeli double strike that killed 20

    August 26, 2025

    Netflix announces dates for Dallas, Philadelphia entertainment complexes

    August 26, 2025

    HI-CHEW Debuts Mystery Mix for Halloween

    August 26, 2025
    Categories
    • Art
    • Article
    • Author
    • Books
    • Celebrity
    • Countries
    • Did you know
    • Entertainment News
    • Fashion
    • Food
    • Funny
    • Gaming
    • Health
    • Herbs
    • History
    • IT
    • Latest News
    • Mixed
    • Mystery
    • Opinions
    • Poets & philosopher
    • Politics
    • Research & Science
    • Shopping
    • space
    • Spiritual
    • Sports
    • Stories
    • Strange News
    • Technology
    • Travel
    • Trending News
    • United Nation
    • University
    • war
    • World Economy
    • World Leaders
    • World News
    • Youtube
    Facebook X (Twitter) YouTube LinkedIn
    • Privacy Policy
    • Disclaimer
    • Terms & Conditions
    • About us
    • Contact us
    Copyright © 2024 morshedi.se All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.

    Please wait...

    Subscribe to our newsletter

    Want to be notified when our article is published? Enter your email address and name below to be the first to know.
    I agree to Terms of Service and Privacy Policy
    SIGN UP FOR NEWSLETTER NOW