Business Wire

ALIGNED-AI

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI

Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking advancement in tackling misgeneralization, a critical challenge in the field of AI. It is the first to surpass a key benchmark called CoinRun by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real-world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data and cannot adapt correctly when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them and by the trade-off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and for applying AI to critical applications. Otherwise, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tübingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun believed the best way to play the game was to go to the right, while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When such an AI encountered a new level where the coin was placed elsewhere, and was given no new reward information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out that it should go for the coin, even without new reward information, just as a human would.
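The failure mode described above can be illustrated with a small toy model. The sketch below is not CoinRun and not ACE: it is a hypothetical one-dimensional corridor, and every name in it (`always_right`, `seek_coin`, `LENGTH`, `START`) is an assumption made for illustration. Enumerating every coin position stands in for the random placement used at test time. It shows only that a "go right" policy and a "go to the coin" policy are indistinguishable when the coin always sits at the end of the level, and come apart once the coin moves.

```python
from typing import Callable, Sequence

LENGTH = 11  # number of cells in the toy corridor (hypothetical value)
START = 5    # the agent starts in the middle cell (hypothetical value)

# A policy maps (agent position, coin position) to a step of -1 or +1.
Policy = Callable[[int, int], int]


def always_right(agent_pos: int, coin_pos: int) -> int:
    """Proxy behavior: ignore the coin and head for the end of the level."""
    return 1


def seek_coin(agent_pos: int, coin_pos: int) -> int:
    """Intended behavior: step toward the coin wherever it is."""
    return 1 if coin_pos > agent_pos else -1


def run_episode(policy: Policy, coin_pos: int) -> bool:
    """Return True if the agent reaches the coin within the step budget."""
    pos = START
    for _ in range(LENGTH):
        if pos == coin_pos:
            return True
        # Move one cell, clamped to the corridor boundaries.
        pos = max(0, min(LENGTH - 1, pos + policy(pos, coin_pos)))
    return pos == coin_pos


def success_rate(policy: Policy, coin_positions: Sequence[int]) -> float:
    """Fraction of levels in which the policy collects the coin."""
    wins = sum(run_episode(policy, c) for c in coin_positions)
    return wins / len(coin_positions)


# Training: the coin is always at the right-hand end of the level.
train_levels = [LENGTH - 1]
# Testing: the coin may be in any cell other than the starting cell.
test_levels = [c for c in range(LENGTH) if c != START]

for name, policy in [("always_right", always_right), ("seek_coin", seek_coin)]:
    print(name, success_rate(policy, train_levels), success_rate(policy, test_levels))
# prints:
# always_right 1.0 0.5
# seek_coin 1.0 1.0
```

In this toy setting both policies score perfectly on the training levels, so the training reward alone cannot tell them apart. On the test levels the "go right" policy collects the coin only when it happens to lie on the way to the end of the corridor.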

The key benefits of this breakthrough include:

  • Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
  • Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
  • Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
  • Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender-unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables an AI to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs; rather, an AI that is more precise and controllable is also more powerful.


View source version on businesswire.com: https://www.businesswire.com/news/home/20230927032399/en/

