Close Menu
    What's Hot

    Buccaneers’ Godwin ‘didn’t believe’ news of Evans’ exit at first

    Squishmallows, dentures, and an ‘I Heart Hot Dads’ bag: Uber has found thousands of items left in robotaxis

    Did You Know Wearing Camouflage Is Against the Law in…

    Facebook X (Twitter) Instagram
    Trending
    • Buccaneers’ Godwin ‘didn’t believe’ news of Evans’ exit at first
    • Squishmallows, dentures, and an ‘I Heart Hot Dads’ bag: Uber has found thousands of items left in robotaxis
    • Did You Know Wearing Camouflage Is Against the Law in…
    • Opinion | China Is ‘a Loss’
    • Tom Steyer: 5 Facts About the Candidate for California Governor
    • Deadly Mining Accidents in Shanxi, Yunnan Highlight Dangers
    • Work-Life Balance Is the Wrong Goal
    • Cyera eyes $12B valuation at 80x ARR multiple despite operating losses
    interluknewsinterluknews
    • Home
    • Business
      • Corporate News
      • Industry Insights
      • Startups & Entrepreneurship
      • Technology & Innovation
    • Economy
      • Economic Policy
      • Financial Analysis
      • Inflation & Interest Rates
      • Trade & Markets
    • Global
      • Conflicts & Security
      • Diplomacy
      • Global Trends
      • International Affairs
    • Lifestyle
      • Fashion
      • Food & Dining
      • Personal Development
      • Travel
    • Opinion
      • Columns
      • Editorials
      • Expert Opinions
      • Reader Voices
    • More
      • Politics
        • Elections
        • Government & Policy
        • International Relations
        • Political Analysis
      • Sports
        • Cricket
        • Football / Soccer
        • International Sports
        • Local Sports
      • Technology
        • Artificial Intelligence
        • Cybersecurity
        • Gadgets & Reviews
        • Tech News
      • South Africa News
    Facebook X (Twitter) Instagram
    interluknewsinterluknews
    Startups & Entrepreneurship

    New Microsoft tool lets devs spin up AI behavior tests using text descriptions

    adminBy adminJune 2, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
    New Microsoft tool lets devs spin up AI behavior tests using text descriptions
    Share
    Facebook Twitter LinkedIn Pinterest Email

    AI researchers and labs have advanced by leaps and bounds in evaluating AI models for everything from safety and compliance to sycophancy and alignment. But it appears companies and developers are faced with a new, specific need: making sure their AI system behaves as intended for their specific product or service.

    In a bid to make that testing process simpler, Microsoft on Tuesday took the wraps off ASSERT, short for Adaptive Spec-driven Scoring for Evaluation and Regression Testing.

    The open source framework, Microsoft says, makes evaluating application-specific AI behavior easy by using AI to turn high-level, natural-language descriptions of goals, policies, or intended behaviors into thorough, scored tests that can be investigated.

    ASSERT takes plain-language descriptions of an AI model’s expected behavior and policies, turns them into a structured set of acceptable and unacceptable behaviors, generates problem scenarios and test cases, runs them against the target system, and scores the results. It can also record the paths the AI system takes, including intermediate actions and tool calls, so developers can inspect where failures happen.

    Devs can provide system context, tools, and constraints, too, if they want to further customize what the evaluations cover.

    For example, a developer could specify that a document research AI agent shouldn’t send emails to people outside the company, and it should limit confidential information to C-level executives and provide concise summaries with prior context in mind. ASSERT will use those rules to generate test cases that check whether the system follows those rules on an ongoing basis.

    Image Credits:Microsoft

    The framework, according to Microsoft, fills a gap that broader, more general evaluations cannot when AI models are intended to behave in a manner that is shaped by an application or product’s context, policies, and tools.

    “One of the things we’ve learned is that evaluations are absolutely critical to making good decisions,” said Sarah Bird, chief product officer of Responsible AI at Microsoft. “Because if you don’t understand the behavior of the AI system, it’s really hard to know if it’s meeting your organization’s bar … What we found is that if you really want to have a trustworthy system, you should evaluate many more dimensions that are application-specific.”

    Bird said ASSERT can be used to evaluate systems when they’re being built, after deployment, and even for continuous monitoring.

    The release comes amidst a gradual but broader shift in the AI industry. As models grow more capable, researchers are focusing on repeatable testing and regression checks, with Stanford’s HELM, MLCommons’ AILuminate, and evaluation groups like METR rolling out benchmarks to measure how models behave under different conditions.

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

    behavior descriptions devs Lets Microsoft Spin Tests text tool
    Follow on Google News Follow on Flipboard
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
    Previous ArticleNIQ Global Intelligence plc (NIQ) Presents at 2026 Baird Global Consumer, Technology & Services Conference Transcript
    Next Article U.S. Treasury Imposes Sanctions on Iran’s Biggest Crypto Exchange
    admin
    • Website

    Related Posts

    Work-Life Balance Is the Wrong Goal

    June 2, 2026

    Perplexity AI unveils hybrid local-cloud inference system at Computex 2026

    June 2, 2026

    WTIA selects 21 startups for 14th Founder Cohort Accelerator Program – GeekWire

    June 2, 2026
    Leave A Reply Cancel Reply

    Demo
    Latest Posts

    Buccaneers’ Godwin ‘didn’t believe’ news of Evans’ exit at first

    Squishmallows, dentures, and an ‘I Heart Hot Dads’ bag: Uber has found thousands of items left in robotaxis

    Did You Know Wearing Camouflage Is Against the Law in…

    Opinion | China Is ‘a Loss’

    Latest Posts

    Subscribe to News

    Get the latest sports news from NewsSite about world, sports and politics.

    Advertisement
    Demo

    We are a digital news platform delivering timely, accurate, and insightful coverage of politics, global affairs, business, economy, sports, and more. Our mission is to keep readers informed with reliable news, clear analysis, and stories that truly matter.
    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Type above and press Enter to search. Press Esc to cancel.

    Powered by
    ...
    ►
    Necessary cookies enable essential site features like secure log-ins and consent preference adjustments. They do not store personal data.
    None
    ►
    Functional cookies support features like content sharing on social media, collecting feedback, and enabling third-party tools.
    None
    ►
    Analytical cookies track visitor interactions, providing insights on metrics like visitor count, bounce rate, and traffic sources.
    None
    ►
    Advertisement cookies deliver personalized ads based on your previous visits and analyze the effectiveness of ad campaigns.
    None
    ►
    Unclassified cookies are cookies that we are in the process of classifying, together with the providers of individual cookies.
    None
    Powered by