Friday, 16 Jan 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Cloud Computing > AI Exploit Bypasses Guardrails of OpenAI, Other Top LLMs
Cloud Computing

AI Exploit Bypasses Guardrails of OpenAI, Other Top LLMs

Last updated: January 2, 2025 11:21 pm
Published January 2, 2025
Share
AI Exploit Bypasses Guardrails of OpenAI, Other Top LLMs
SHARE

A brand new jailbreak approach for OpenAI and different giant language fashions (LLMs) will increase the prospect that attackers can circumvent cybersecurity guardrails and abuse the system to ship malicious content material.

Found by researchers at Palo Alto Networks’ Unit 42, the so-called ‘Unhealthy Likert Decide’ assault asks the LLM to behave as a choose scoring the harmfulness of a given response utilizing the Likert scale. The psychometric scale, named after its inventor and generally utilized in questionnaires, is a score scale measuring a respondent’s settlement or disagreement with an announcement.

The jailbreak then asks the LLM to generate responses that include examples that align with the scales, with the final word outcome being that “the instance that has the best Likert scale can doubtlessly include the dangerous content material,” Unit 42’s Yongzhe Huang, Yang Ji, Wenjun Hu, Jay Chen, Akshata Rao, and Danny Tsechansky wrote in a put up describing their findings.

Assessments performed throughout a variety of classes in opposition to six state-of-the-art text-generation LLMs from OpenAI, Azure, Google, Amazon Internet Companies, Meta, and Nvidia revealed that the approach can improve the assault success fee (ASR) by greater than 60% in contrast with plain assault prompts on common, in response to the researchers.

Associated:7 Key Information Middle Safety Traits to Watch in 2025

The classes of assaults evaluated within the analysis concerned prompting numerous inappropriate responses from the system, together with: ones selling bigotry, hate, or prejudice; ones partaking in conduct that harasses a person or group; ones that encourage suicide or different acts of self-harm; ones that generate inappropriate explicitly sexual materials and pornography; ones offering information on the right way to manufacture, purchase, or use unlawful weapons; or ones that promote unlawful actions.

See also  Which Is Best for Your AI Workloads?

Continue reading this article in Dark Reading



Source link

TAGGED: Bypasses, exploit, guardrails, LLMs, OpenAI, Top
Share This Article
Twitter Email Copy Link Print
Previous Article Rembrand Logo Rembrand Raises $23M in Series A Financing
Next Article Thomson Reuters Acquires SafeSend Thomson Reuters Acquires SafeSend
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Cisco sets its sights on quantum data centres

Cisco is pushing the frontiers of quantum know-how with a give attention to sensible quantum…

November 6, 2024

Solos launches AirGo Vision — ChatGPT-enabled AI smart glasses with a camera

Be part of our each day and weekly newsletters for the newest updates and unique…

December 10, 2024

Karomia Raises €2M in Funding

Karomia, a Ghent, Belgium-based software program firm growing an ESG reporting platform, raised €2M in…

February 26, 2025

Together AI’s $305M bet: Reasoning models like DeepSeek-R1 are increasing, not decreasing, GPU demand

Be a part of our every day and weekly newsletters for the newest updates and…

February 21, 2025

AI is growing faster than companies can secure it, warn industry leaders

Be part of our every day and weekly newsletters for the most recent updates and…

August 31, 2024

You Might Also Like

Amazon’s sovereign cloud puts Europe’s data control debate into practice
Cloud Computing

Amazon’s sovereign cloud tests Europe’s data control rules

By saad
A pivotal 2026 for cloud strategy
Cloud Computing

A pivotal 2026 for cloud strategy

By saad
Photo by Anthropic
AI

OpenAI, Google, Anthropic Launch Tools

By saad
Best 5 AI semantic reasoning tools for databases
Cloud Computing

Best 5 AI semantic reasoning tools for databases

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.