Close Menu
    What's Hot

    Judge Demands Answers About Plans for Trump’s East Potomac Golf Course

    Rare Copy of the Declaration of Independence Is Discovered in London

    Trump administration indicts Olympic athlete for Reflecting Pool vandalism | Donald Trump News

    Facebook X (Twitter) Instagram
    Trending
    • Judge Demands Answers About Plans for Trump’s East Potomac Golf Course
    • Rare Copy of the Declaration of Independence Is Discovered in London
    • Trump administration indicts Olympic athlete for Reflecting Pool vandalism | Donald Trump News
    • As Ukraine War Escalates, Witkoff and Kushner Are Focused on Iran
    • Mark Zuckerberg tells staff that AI agents haven’t progressed as quickly as he’d hoped
    • Man City set to beat Arsenal to Leicester winger Jeremy Monga as move closes in – Paper Talk | Football News
    • Ransomware Groups Turn to Citrix Bleed 2, BYOVD, and Supply Chain Credentials
    • National Parks Can Continue to Remove Signs That Trump Calls ‘Negative’
    interluknewsinterluknews
    • Home
    • Business
      • Corporate News
      • Industry Insights
      • Startups & Entrepreneurship
      • Technology & Innovation
    • Economy
      • Economic Policy
      • Financial Analysis
      • Inflation & Interest Rates
      • Trade & Markets
    • Global
      • Conflicts & Security
      • Diplomacy
      • Global Trends
      • International Affairs
    • Lifestyle
      • Fashion
      • Food & Dining
      • Personal Development
      • Travel
    • Opinion
      • Columns
      • Editorials
      • Expert Opinions
      • Reader Voices
    • More
      • Politics
        • Elections
        • Government & Policy
        • International Relations
        • Political Analysis
      • Sports
        • Cricket
        • Football / Soccer
        • International Sports
        • Local Sports
      • Technology
        • Artificial Intelligence
        • Cybersecurity
        • Gadgets & Reviews
        • Tech News
      • South Africa News
    Facebook X (Twitter) Instagram
    interluknewsinterluknews
    Technology & Innovation

    You Can Now Sound the Alarm on AI Behaving Badly

    adminBy adminJuly 1, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
    You Can Now Sound the Alarm on AI Behaving Badly
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Writing AI Lab each week means I occasionally encounter AI models that behave badly and bizarrely. Usually, there’s nothing to be done about it, save for sharing those tales with you. But that could soon change.

    A group of AI researchers has set up a crowdsourced website, Flaw Reporting for AI (FLARE-AI), for reporting and tracking AI harms. If, for example, a chatbot generates malware or a bomb-making recipe, leaks personal information, or triggers delusional thinking in users, FLARE-AI could be used to sound the alarm. The open source code behind the system allows others to verify an issue and route reports to model makers, as well as organizations like MITRE, a nonprofit that tracks problems with technical systems. It’s a bit like Downdetector, which compiles real-time user reports for global service outages affecting things like apps and websites.

    The website is another step in the group’s ongoing work with AI reporting, which I first wrote about last year. Members of the group also consulted on a congressional bill announced in June, which would see the US government take a central role in tracking this kind of AI misbehavior.

    “Right now, there is no centralized, accountable way to report flaws in AI systems,” says Avijit Ghosh, an artificial intelligence policy researcher at HuggingFace who co-led development of FLARE-AI with computer scientists Elaine Zhu and Shayne Longpre.

    The alarm system was developed in collaboration with 49 AI experts from 32 different organizations. In a paper outlining the work, the researchers argue that their initiative could prove crucial as AI is adopted more widely and as agentic systems gain greater power. The lack of a consistent way to report AI flaws is a significant problem, they believe.

    “I think it’s a really good initiative,” says Jessica Ji, a researcher at the think tank Center for Security and Emerging Technology. Ji says the researchers are right to note that existing reporting mechanisms are fragmented and that AI models are black boxes. “I’m in support of anything that makes AI more transparent,” she says.

    Though bugs and cybersecurity problems get a lot of attention—especially of late—Ghosh tells me that problems with AI systems span topics like psychological harm, discrimination or bias, and misinformation. He adds that different companies have different standards around such issues, which means some problems go unrecognized. “In the absence of a coordinated disclosure system, there are no external mechanisms to enforce transparency,” Ghosh says.

    A spate of recent incidents involving popular AI tools shows how easily the technology can go bad.

    This week, a company called LayerX disclosed a way to dupe AI-infused web browsers, including OpenAI’s Atlas and Perplexity’s Comet, into vaulting their guardrails. Convincing the AI model behind the browser that it was playing a game, for example, could lead to the browser going rogue and trying to hack a website. (The companies responsible for the affected browsers have fixed the issue, LayerX says.) And this April, Johann Rehberger, a security researcher, discovered a way to trick Claude into divulging personal data using images generated by ChatGTP.

    AI introduces bizarre new kinds of problems, too. Last year, OpenAI was forced to update its models after it discovered that they were overly sycophantic, which sometimes appeared to encourage delusional thinking.

    Rumman Chowdhury, the CEO and founder of Humane Intelligence PBC, says FLARE-AI could be a useful way for many AI developers to implement ways of reporting issues with their tools. But she adds that such initiatives often come with serious challenges.

    alarm badly Behaving Sound
    Follow on Google News Follow on Flipboard
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
    Previous ArticleCharles H. Townsend Dies at 82; Led Condé Nast During Digital Transition
    Next Article As Earthquake Death Toll Mounts, Venezuela Grapples With Recovering and Burying Bodies
    admin
    • Website

    Related Posts

    Mark Zuckerberg tells staff that AI agents haven’t progressed as quickly as he’d hoped

    July 2, 2026

    Tesla Driver Using Autopilot in Texas Crash Is Charged With Manslaughter

    July 2, 2026

    How Big Is ‘Love Island USA’? More Than 10 Million People Are Already on Its App

    July 2, 2026
    Leave A Reply Cancel Reply

    Demo
    Latest Posts

    Judge Demands Answers About Plans for Trump’s East Potomac Golf Course

    Rare Copy of the Declaration of Independence Is Discovered in London

    Trump administration indicts Olympic athlete for Reflecting Pool vandalism | Donald Trump News

    As Ukraine War Escalates, Witkoff and Kushner Are Focused on Iran

    Latest Posts

    Subscribe to News

    Get the latest sports news from NewsSite about world, sports and politics.

    Advertisement
    Demo

    We are a digital news platform delivering timely, accurate, and insightful coverage of politics, global affairs, business, economy, sports, and more. Our mission is to keep readers informed with reliable news, clear analysis, and stories that truly matter.
    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Type above and press Enter to search. Press Esc to cancel.

    Powered by
    ...
    ►
    Necessary cookies enable essential site features like secure log-ins and consent preference adjustments. They do not store personal data.
    None
    ►
    Functional cookies support features like content sharing on social media, collecting feedback, and enabling third-party tools.
    None
    ►
    Analytical cookies track visitor interactions, providing insights on metrics like visitor count, bounce rate, and traffic sources.
    None
    ►
    Advertisement cookies deliver personalized ads based on your previous visits and analyze the effectiveness of ad campaigns.
    None
    ►
    Unclassified cookies are cookies that we are in the process of classifying, together with the providers of individual cookies.
    None
    Powered by