ALIGNED-AI

28.9.2023 07:01:33 CEST | Business Wire | Press release

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI

Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking advance in tackling misgeneralization, a critical challenge in the field of AI. The company is the first to surpass a key benchmark called CoinRun, which it did by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real-world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data and cannot adapt correctly when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them and by the trade-off they impose between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, reliable generalization is required for truly autonomous vehicles and for applying AI to critical applications. Without it, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tübingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun learned that the best way to play the game was to go to the right while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When such an AI encountered a new level where the coin was placed elsewhere, and received no new reward information, it would ignore the coin and either miss it or collect it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and works out that it should go for the coin, even without new reward information, just as a human would.
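The failure described above can be illustrated with a toy sketch. This is not the CoinRun environment and not ACE: the one-dimensional corridor, its length, the starting position, and both hand-written policies are assumptions made purely for illustration. It shows how a “go right” strategy and a “go to the coin” strategy can look identical during training, when the coin is always at the end of the level, and then diverge once the coin is placed at random.

```python
import random

# Hypothetical toy setup (not the real CoinRun benchmark): a 1-D corridor.
LEVEL_LENGTH = 11            # cells 0..10; cell 10 is the "end of the level"
START = LEVEL_LENGTH // 2    # the agent starts in the middle, at cell 5


def go_right_policy(agent_pos, coin_pos):
    """Simpler, spuriously correlated goal: always move right."""
    return 1


def seek_coin_policy(agent_pos, coin_pos):
    """Intended goal: move toward the coin, wherever it is."""
    return 1 if coin_pos > agent_pos else -1


def run_episode(policy, coin_pos):
    """Return True if the agent collects the coin before hitting a wall."""
    pos = START
    for _ in range(LEVEL_LENGTH):
        pos += policy(pos, coin_pos)
        if pos == coin_pos:
            return True                      # coin collected
        if pos <= 0 or pos >= LEVEL_LENGTH - 1:
            return False                     # reached a wall without the coin
    return False


def success_rate(policy, coin_positions):
    """Fraction of episodes in which the policy collects the coin."""
    wins = sum(run_episode(policy, coin) for coin in coin_positions)
    return wins / len(coin_positions)


# Training: the coin is always at the end of the level.
train_coins = [LEVEL_LENGTH - 1] * 1000

# Testing: the coin is placed at random (anywhere except the start cell).
rng = random.Random(0)
candidate_cells = [p for p in range(LEVEL_LENGTH) if p != START]
test_coins = [rng.choice(candidate_cells) for _ in range(1000)]

if __name__ == "__main__":
    for name, policy in [("go right", go_right_policy),
                         ("seek coin", seek_coin_policy)]:
        print(name,
              "train:", success_rate(policy, train_coins),
              "test:", success_rate(policy, test_coins))
```

In this sketch the two policies are indistinguishable on the training levels, so reward alone cannot tell them apart. Only the randomized test levels reveal which goal was actually learned.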

The key benefits of this breakthrough include:

Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender-unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables an AI to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs; rather, an AI that is more precise and controllable is also more powerful.

View source version on businesswire.com: https://www.businesswire.com/news/home/20230927032399/en/
