Close Menu
VernoNews
  • Home
  • World
  • National
  • Science
  • Business
  • Health
  • Education
  • Lifestyle
  • Entertainment
  • Sports
  • Technology
  • Gossip
Trending

Kim Jun-ho Confesses ‘Incurable’ Bulge Issue on Flat Ground in Dokbak Tour 4

March 5, 2026

Los Alamitos Man Pleads Not Guilty to Teen Rape, Child Porn Charges

March 5, 2026

Dabang Launches Our Home Service for Room Management

March 5, 2026

Paul McCartney Dines with Nancy in Paris After Stella’s PFW Show

March 5, 2026

N.W.T. Housing Crisis: 2,700 Homes Needed Now, 2,500 More by 2044

March 5, 2026

Carney Overhauls Senior Public Service Roles in Latest Shuffle

March 4, 2026

Yale Ventures Accelerates Faculty Innovations to Market

March 4, 2026
Facebook X (Twitter) Instagram
VernoNews
  • Home
  • World
  • National
  • Science
  • Business
  • Health
  • Education
  • Lifestyle
  • Entertainment
  • Sports
  • Technology
  • Gossip
VernoNews
Home»Technology»Researcher methods ChatGPT into revealing safety keys – by saying “I quit”
Technology

Researcher methods ChatGPT into revealing safety keys – by saying “I quit”

VernoNewsBy VernoNewsJuly 12, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Reddit WhatsApp Email
Researcher methods ChatGPT into revealing safety keys – by saying “I quit”
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email

[ad_1]

Researcher methods ChatGPT into revealing safety keys – by saying “I quit”


  • Specialists present how some AI fashions, together with GPT-4, will be exploited with easy person prompts
  • Guardrail gaps do not do a terrific job of detecting misleading framing
  • The vulnerability may very well be exploited to accumulate private info

A safety researcher has shared particulars on how different researchers tricked ChatGPT into revealing a Home windows product key utilizing a immediate that anybody might strive.

Marco Figueroa defined how a ‘guessing recreation’ immediate with GPT-4 was used to bypass security guardrails that are supposed to block AI from sharing such information, finally producing at the very least one key belonging to Wells Fargo Financial institution.

The researchers additionally managed to acquire a Home windows product key to authenticate Microsoft’s OS illegitimately, however without cost, highlighting the severity of the vulnerability.


Chances are you’ll like

ChatGPT will be tricked into sharing safety keys

The researcher defined how he hid phrases like ‘Home windows 10 serial quantity’ inside HTML tags to bypass ChatGPT’s filters that might normally have blocked the responses he bought, including that he was in a position to body the request as a recreation to masks malicious intent, exploiting OpenAI’s chatbot via logic manipulation.

“Essentially the most important step within the assault was the phrase ‘I quit’,” Figueroa wrote. “This acted as a set off, compelling the AI to disclose the beforehand hidden info.”

Figueroa defined why such a vulnerability exploitation labored, with the mannequin’s conduct enjoying an essential position. GPT-4 adopted the foundations of the sport (set out by researchers) actually, and guardrail gaps solely centered on key phrase detection reasonably than contextual understanding or misleading framing.

Nonetheless, the codes shared weren’t distinctive codes. As an alternative, the Home windows license codes had already been shared on different on-line platforms and boards.

Signal as much as the TechRadar Professional publication to get all the highest information, opinion, options and steerage your small business must succeed!

Whereas the impacts of sharing software program license keys may not be too regarding, Figueroa highlighted how malicious actors might adapt the method to bypass AI safety measures, revealing personally identifiable info, malicious URLs or grownup content material.

Figueroa is looking for AI builders to “anticipate and defend” in opposition to such assaults, whereas additionally constructing in logic-level safeguards that detect misleading framing. AI builders should additionally take into account social engineering techniques, he goes on to counsel.

You may also like

[ad_2]

Avatar photo
VernoNews

    Related Posts

    AI Job Apocalypse Fears Intensify with Tech Layoffs Surge

    March 4, 2026

    Ingalls Shipbuilding Marks Keel for USS Philadelphia (LPD 32)

    March 4, 2026

    Rakuten Mobile Picks Samsung for Japan-Wide 5G Open RAN Radios

    March 3, 2026
    Leave A Reply Cancel Reply

    Don't Miss
    Sports

    Kim Jun-ho Confesses ‘Incurable’ Bulge Issue on Flat Ground in Dokbak Tour 4

    By VernoNewsMarch 5, 20260

    In the 40th episode of Channel S’s Dokbak Tour 4, aired on March 7 at…

    Los Alamitos Man Pleads Not Guilty to Teen Rape, Child Porn Charges

    March 5, 2026

    Dabang Launches Our Home Service for Room Management

    March 5, 2026

    Paul McCartney Dines with Nancy in Paris After Stella’s PFW Show

    March 5, 2026

    N.W.T. Housing Crisis: 2,700 Homes Needed Now, 2,500 More by 2044

    March 5, 2026

    Carney Overhauls Senior Public Service Roles in Latest Shuffle

    March 4, 2026

    Yale Ventures Accelerates Faculty Innovations to Market

    March 4, 2026
    About Us
    About Us

    VernoNews delivers fast, fearless coverage of the stories that matter — from breaking news and politics to pop culture and tech. Stay informed, stay sharp, stay ahead with VernoNews.

    Our Picks

    Kim Jun-ho Confesses ‘Incurable’ Bulge Issue on Flat Ground in Dokbak Tour 4

    March 5, 2026

    Los Alamitos Man Pleads Not Guilty to Teen Rape, Child Porn Charges

    March 5, 2026

    Dabang Launches Our Home Service for Room Management

    March 5, 2026
    Trending

    Paul McCartney Dines with Nancy in Paris After Stella’s PFW Show

    March 5, 2026

    N.W.T. Housing Crisis: 2,700 Homes Needed Now, 2,500 More by 2044

    March 5, 2026

    Carney Overhauls Senior Public Service Roles in Latest Shuffle

    March 4, 2026
    • Contact Us
    • Privacy Policy
    • Terms of Service
    2025 Copyright © VernoNews. All rights reserved

    Type above and press Enter to search. Press Esc to cancel.