This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AIM Intelligence and BMW Group Examine Gaps in Evaluating Enterprise AI Policy Compliance

Research reveals LLMs follow allowlist policies but systematically fail to enforce organizational prohibitions, exposing a critical gap in enterprise AI safety

SF, CA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Seoul, South Korea / Munich, Germany – January 2026 – BMW Group and AIM Intelligence, a leading AI safety startup, today announced the publication of COMPASS (Company/Organization Policy Alignment Assessment), the first systematic framework for evaluating whether large language models (LLMs) comply with organization-specific policies. The research, now available on arXiv, reveals a critical gap that remains under-measured in current evaluation practices: models that pass standard safety benchmarks often fail dramatically when enforcing the nuanced, context-dependent rules that govern real-world business operations.

Why Enterprise AI Policies Break Down in Practice

As organizations across healthcare, finance, automotive, and government sectors rapidly adopt LLMs for customer-facing applications, the research team discovered a fundamental asymmetry that poses significant risks for policy-critical deployments.
Key Findings:
Strong Allowlist Compliance: Models reliably handle legitimate requests with over 95% accuracy
Critical Denylist Failures: Models fail to correctly refuse prohibited requests in up to 97% of cases
Catastrophic Adversarial Vulnerability: Under adversarial conditions, some models refuse fewer than 5% of policy-violating requests
“Most AI safety tests focus on whether a model behaves safely in general,” said Dasol Choi, AI Safety Researcher at AIM Intelligence. “COMPASS looks at a more practical question: can an AI system reliably follow the specific rules of an organization? Our findings show that, in many real-world deployments today, the answer is often no.”

Why Generic AI Safety Isn’t Enough

The research addresses a critical disconnect between how AI systems are evaluated and how they are deployed. While existing safety benchmarks focus on universal harms such as toxicity and violence, real enterprises operate under complex internal policies—compliance manuals, operational playbooks, legal edge cases, and brand-specific constraints.
COMPASS evaluates models across four dimensions that typical benchmarks ignore:
1. Policy Selection: Can the model identify which policy applies to a given situation?
2. Policy Interpretation: Can it reason through conditionals, exceptions, and vague clauses?
3. Conflict Resolution: When rules collide, does the model resolve conflicts as the organization intends?
4. Justification: Can the model ground its decisions in actual policy text?

“Our evaluation revealed a striking asymmetry,” noted DongGeon Lee, AI Safety Researcher at AIM Intelligence. “While models achieve near-perfect accuracy on what they can do, they remain structurally vulnerable in enforcing what they must not do. This gap persists across model scales and architectures, indicating that scaling alone cannot solve the problem.”

Industry-Scale Validation

The research team applied COMPASS across eight diverse industry scenarios—Automotive, Government, Financial, Healthcare, Travel, Telecom, Education, and Recruiting—generating and validating 5,920 queries that test both routine compliance and adversarial robustness. Fifteen state-of-the-art models were evaluated, including leading proprietary and open-source systems.

Making Misalignment Measurable

Perhaps the most significant contribution of COMPASS is transforming alignment from a philosophical concern into an engineering problem. The framework and benchmark datasets are publicly available on GitHub and Hugging Face, enabling organizations to evaluate their AI systems against their own policies.

About the Research Collaboration

This research represents a collaboration between AIM Intelligence, BMW Group, Yonsei University, Pohang University of Science and Technology, and Seoul National University. The full paper, “COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs,” is available at https://arxiv.org/abs/2601.01836.

About AIM Intelligence

AIM Intelligence is a Seoul-based AI safety company specializing in automated red-teaming, real-time guardrails, and AI monitoring solutions. Founded in 2024, AIM Intelligence serves major enterprises and conducts research across large language models, multimodal systems, autonomous agents, and emerging physical AI. The company has published over 15 research papers at top-tier conferences including ICML, ACL, NeurIPS, and IEEE.

Team Cookie Official
Team Cookie
email us here
Visit us on social media:
LinkedIn
Facebook

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Energy One Federal Credit Union Implements AI-Powered Calculators from Appli for Better Digital Member Experience

Energy One Federal Credit Union Implements AI-Powered Calculators from Appli for Better Digital Member Experience

Tulsa credit union to launch six Appli calculators alongside new website to better educate and serve members The

February 18, 2026

WhiteFox and MiTAC Partner to Deliver Sovereign Counter-Drone Systems for Taiwan

WhiteFox and MiTAC Partner to Deliver Sovereign Counter-Drone Systems for Taiwan

U.S. RF counter-UAS leader and Taiwan integrator partner to manufacture and deploy sovereign counter-drone systems for

February 18, 2026

Decipher Zone Technologies Expands Global Delivery Capacity for Custom Software Development Services in 2026

Decipher Zone Technologies Expands Global Delivery Capacity for Custom Software Development Services in 2026

Worldwide engineering teams support AI development services, SaaS product development, and enterprise software

February 18, 2026

PSFNC Joins National Partners in Call to Reject Federal Voucher Program

PSFNC Joins National Partners in Call to Reject Federal Voucher Program

Education organizations across the nation urge governors to reject the federal voucher program. The constitutional

February 18, 2026

Toy versions of construction, delivery and long-haul trucks of the 1920s-1970s commanded $938K at Milestone auction

Toy versions of construction, delivery and long-haul trucks of the 1920s-1970s commanded $938K at Milestone auction

Top sellers: 1930s Buddy ‘L’ Water Service Truck, $18,600; and 1930-’32 Baggage Truck, $16,380; 1954 Tonka delivery

February 18, 2026

GreenBanana SEO Defines Black Diamond Schema Mountain Trails That Elevate Brands in AI Search

GreenBanana SEO Defines Black Diamond Schema Mountain Trails That Elevate Brands in AI Search

February 12, 2026 – PRESSADVANTAGE – As the digital landscape evolves, so does the way search engines interpret and

February 18, 2026

Kawak Aviation Technologies Inc. Highlights Advanced Cascade Firefighting Bucket Following Industry Recognition

Kawak Aviation Technologies Inc. Highlights Advanced Cascade Firefighting Bucket Following Industry Recognition

Bend, Oregon – February 12, 2026 – PRESSADVANTAGE – Kawak Aviation Technologies Inc., a Bend, Oregon-based aerospace

February 18, 2026

Bloomie Redefines Modern Gifting After Valentine’s Sellout

Bloomie Redefines Modern Gifting After Valentine’s Sellout

February 12, 2026 – PRESSADVANTAGE – In a world where flowers wilt and chocolates disappear within days, a new gifting

February 18, 2026

GreenBanana SEO Discusses the Shift Away from Generic Keyword Pages in Google’s Latest Algorithm Update

GreenBanana SEO Discusses the Shift Away from Generic Keyword Pages in Google’s Latest Algorithm Update

February 12, 2026 – PRESSADVANTAGE – As Google continues to refine its search algorithms, the landscape of search

February 18, 2026

Federal Steel Systems Introduces Expanded Agricultural Building Solutions for Colorado Farms

Federal Steel Systems Introduces Expanded Agricultural Building Solutions for Colorado Farms

ENGLEWOOD, CO – February 12, 2026 – PRESSADVANTAGE – Federal Steel Systems has announced the introduction of expanded

February 18, 2026

Remedia International Updates Environmental Remediation Services Framework

Remedia International Updates Environmental Remediation Services Framework

NEWARK, DE – February 12, 2026 – PRESSADVANTAGE – Remedia International, an environmental remediation technology

February 18, 2026

Transform Chiropractic Marks 25 Years of Patient-Centered Care in West Toronto Community

Transform Chiropractic Marks 25 Years of Patient-Centered Care in West Toronto Community

February 12, 2026 – PRESSADVANTAGE – Transform Chiropractic, a cornerstone healthcare provider in Toronto's Bloor West

February 18, 2026

Dental Implants Hemel Hempstead Private Dentist Dr Dhivesh Patel Recommends Consultations at Boxmoor House Dental Practice

Dental Implants Hemel Hempstead Private Dentist Dr Dhivesh Patel Recommends Consultations at Boxmoor House Dental Practice

Dacorum, England – February 12, 2026 – PRESSADVANTAGE – Boxmoor House Dental Practice has announced the availability of

February 18, 2026

The Wedding Planner Hong Kong Highlights Structured Approaches to Modern Event Planning

The Wedding Planner Hong Kong Highlights Structured Approaches to Modern Event Planning

HONG KONG, HK – February 12, 2026 – PRESSADVANTAGE – The Wedding Planner Hong Kong has released a comprehensive

February 18, 2026

ODG Law Firm Calls for Enhanced Work Injury Protection Following California DIR Report

ODG Law Firm Calls for Enhanced Work Injury Protection Following California DIR Report

GLENDALE, CA – February 12, 2026 – PRESSADVANTAGE – ODG Law Firm, a workers' compensation practice serving California,

February 18, 2026

Recruiting for Good Launch Sweet B-Day Gift Celebrate Valentine’s Day in Paris

Recruiting for Good Launch Sweet B-Day Gift Celebrate Valentine’s Day in Paris

Recruiting for Good helps companies find professionals to fund causes; and is rewarding referrals to companies hiring

February 18, 2026

Valentine’s Day Breakups Are Common: Experts Say Untreated Adult ADHD May Be a Hidden Trigger

Valentine’s Day Breakups Are Common: Experts Say Untreated Adult ADHD May Be a Hidden Trigger

Elevating Minds Psychiatry Encourages Couples to Seek Support Before Relationships Reach a Breaking Point HONOLULU, HI,

February 18, 2026

CTS Technology Solutions Strengthens Cybersecurity Leadership with CMMC Registered Practitioner Designation

CTS Technology Solutions Strengthens Cybersecurity Leadership with CMMC Registered Practitioner Designation

Earning the CMMC Registered Practitioner designation strengthens our ability to guide defense contractors through an

February 18, 2026

Mosaic Medicine Launches Female Hormone Optimization Services at Bradenton Clinic

Mosaic Medicine Launches Female Hormone Optimization Services at Bradenton Clinic

New program offers individualized hormone evaluation and treatment plans to support women’s health across all life

February 18, 2026

beyondMD Expands Telehealth Regenerative Medicine with BPC‑157 for Recovery, Inflammation Relief, and Whole‑Body Healing

beyondMD Expands Telehealth Regenerative Medicine with BPC‑157 for Recovery, Inflammation Relief, and Whole‑Body Healing

beyondMD, a leader in telehealth-based regenerative medicine, announces the integration of clinician-guided BPC‑157

February 18, 2026

Cylinder Heads International Bridges the Gap for Mechanics with Massive Inventory and Decades of Family-Run Expertise

Cylinder Heads International Bridges the Gap for Mechanics with Massive Inventory and Decades of Family-Run Expertise

As a family-run business with decades of experience, Cylinder Heads International provides a massive inventory of

February 18, 2026

Next Hour Named Best Garage Door Repair Company in Santa Clarita

Next Hour Named Best Garage Door Repair Company in Santa Clarita

Next Hour Garage Door Repair celebrates being named Santa Clarita’s best, offering award-winning service to Valencia,

February 18, 2026

Locally Redefines Modern Retail with the Launch of National Same-Day Delivery Platform for Brands and Local Dealers

Locally Redefines Modern Retail with the Launch of National Same-Day Delivery Platform for Brands and Local Dealers

The brand provides the convenience the shopper craves, and the local dealer fulfills the sale, keeping retail vibrant

February 18, 2026

Bug Busters Expands Service Footprint With New Carrollton, Georgia Branch

Bug Busters Expands Service Footprint With New Carrollton, Georgia Branch

CARROLLTON, Ga., Feb. 12, 2026 / PRZen / Bug Busters, a leading family-owned and operated pest control company serving

February 18, 2026

iFLO Pro Launches Its Groundbreaking iFLO Pro Mini At The 2026 AHR Expo In Las Vegas

iFLO Pro Launches Its Groundbreaking iFLO Pro Mini At The 2026 AHR Expo In Las Vegas

MIRAMAR, Fla., Feb. 12, 2026 / PRZen / iFLO Pro®, the experts in condensate management, launched its newest innovative

February 18, 2026

Ace Hardware Anacortes Undergoes Major Store Remodel and Greenhouse Expansion

Ace Hardware Anacortes Undergoes Major Store Remodel and Greenhouse Expansion

Chad Fisher Construction is leading the Ace Hardware Anacortes remodel and greenhouse expansion, enhancing parking,

February 18, 2026

Driven By Purpose® Podcast Features Dr. Obioma Martin on Faith, Resilience, and Purpose-Driven Leadership

Driven By Purpose® Podcast Features Dr. Obioma Martin on Faith, Resilience, and Purpose-Driven Leadership

NEW YORK , NY, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Driven By Purpose®, the podcast hosted by

February 18, 2026

Model Response Optimization Gains Traction as More Accurate Term for AI Search Practices

Model Response Optimization Gains Traction as More Accurate Term for AI Search Practices

Marketing professionals question whether "Generative Engine Optimization" accurately describes work focused on shaping

February 18, 2026

Alejandro Hernandez Obtains New York Life Insurance License, Expanding Estate-Focused Financial Services in Manhattan

Alejandro Hernandez Obtains New York Life Insurance License, Expanding Estate-Focused Financial Services in Manhattan

Alejandro Hernandez Obtains New York Life Insurance License, Expanding Estate-Focused Financial Services in Manhattan

February 18, 2026

Sports Talk Media Exclusive: Marios Iliopoulos photo holding Olympiacos trophy resurfaces

Sports Talk Media Exclusive: Marios Iliopoulos photo holding Olympiacos trophy resurfaces

Flashback photo raises questions over AEK owner Iliopoulos’ Olympiacos past, as the fiercest rivals of Olympiacos

February 18, 2026

Transportation Infrastructure May Add 200 Billion Tons of Excess Weight to Civilization

Transportation Infrastructure May Add 200 Billion Tons of Excess Weight to Civilization

A Moonshot Wheel Concept Challenges the Assumptions Behind Civilization's Heaviest Structures The SurfacePlan wheel is

February 18, 2026

New Guide Helps Kids Navigate Divorce with Expert Advice from Psychotherapist Kate Scharff

New Guide Helps Kids Navigate Divorce with Expert Advice from Psychotherapist Kate Scharff

Experienced psychotherapist and divorce expert offers a fresh, compassionate, and comprehensive guide to navigating

February 18, 2026

Bird Infestation Is Emerging as a Hidden Threat to California’s $30 Billion Residential Solar Market

Bird Infestation Is Emerging as a Hidden Threat to California’s $30 Billion Residential Solar Market

Rising dust and bird activity in California’s Central Valley may reduce solar output, putting homeowner savings and

February 18, 2026

WW Hospitality Marketing Announces Leadership Promotions to Kick Off 2026

WW Hospitality Marketing Announces Leadership Promotions to Kick Off 2026

PHILADELPHIA, PA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — WW Hospitality Marketing, a full-service

February 18, 2026

Winners and Finalists Announced: 2025 Shoot The Frame Annual Photo Awards

Winners and Finalists Announced: 2025 Shoot The Frame Annual Photo Awards

Inaugural awards celebrate portrait photography across One Frame and Photo Essay. Winners and finalists now live in the

February 18, 2026

Dr. Stacey Kevin Frick Appears on Times Square Today to Discuss the Empowerment Revolution and Life Fulfillment

Dr. Stacey Kevin Frick Appears on Times Square Today to Discuss the Empowerment Revolution and Life Fulfillment

NEW YORK , NY, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Dr. Stacey Kevin Frick, an expert in human

February 18, 2026

Model Response Optimization Emerges as Critical Foundation for AI-Era Brand Management

Model Response Optimization Emerges as Critical Foundation for AI-Era Brand Management

New discipline addresses gap between brand intent and AI-generated descriptions as 95% of B2B buyers plan to use

February 18, 2026

Frank Astorino Appears on Wall Street Today to Discuss Building Wealth with Integrity, Perspective, and Joy

Frank Astorino Appears on Wall Street Today to Discuss Building Wealth with Integrity, Perspective, and Joy

NEW YORK , NY, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Frank Astorino, founder of Astorino Financial

February 18, 2026

Easy Garage Door Repair 15-Min Service for Memorial, River Oaks and West U Houston TX

Easy Garage Door Repair 15-Min Service for Memorial, River Oaks and West U Houston TX

Easy Garage Door Repair debuts a "Tri-Zone" model for 15-minute dispatch to Memorial, River Oaks, and West U, serving

February 18, 2026

International Polo Tour® Hotels at Sea® Announces Stunning White Lotus Voyage™ Through Asia for January 2027

International Polo Tour® Hotels at Sea® Announces Stunning White Lotus Voyage™ Through Asia for January 2027

WELLINGTON, FL, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Following their announcement cementing a

February 18, 2026