Business Wire

TII

Share
Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model

Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/

TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.

Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”

Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”

Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”

To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.

Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.

Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.

The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.

Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.

About Technology Innovation Institute (TII)

For more information, visit www.tii.ae

*Source: AETOSWire

Link:

ClickThru

About Business Wire

Business Wire
Business Wire
101 California Street, 20th Floor
CA 94111 San Francisco

http://businesswire.com

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

EIG Acquires a 49.87% Stake in Transportadora de Gas del Perú (TgP)19.12.2025 18:42:00 CET | Press release

EIG, through its managed investment vehicles, acquired a 49.87% equity stake in Transportadora de Gas del Perú S.A. (“TgP”) from Canada Pension Plan Investment Board today. TgP operates Peru’s principal natural gas and natural gas liquids pipelines under a long-term concession, supplying approximately 40% of the country’s power generation. “We are delighted to complete this transaction and embark on the next chapter of our partnership with TgP,” said Matt Hartman, EIG’s Global Head of Infrastructure. “Our priority is to support TgP’s operational excellence and long-term stability, delivering value for customers and stakeholders throughout Peru.” About EIG EIG is a leading institutional investor in the global energy and infrastructure sectors with $24.3 billion assets under management as of September 30, 2025. EIG specializes in private investments in energy and energy-related infrastructure on a global basis. During its 43-year history, EIG has committed over $51.7 billion to the energ

CyberArk Named a Leader in IDC MarketScape: Worldwide Integrated Solutions for Identity Security 202519.12.2025 17:00:00 CET | Press release

Unified platform uses AI and automation to accelerate time-intensive workflows, streamline operations and improve threat detectionEnables CISOs to consolidate cybersecurity stack, optimizing total cost of ownership CyberArk (NASDAQ: CYBR), the global leader in identity security, today announced that it has been recognized as a Leader in the IDC MarketScape: Worldwide Integrated Solutions for Identity Security 2025 Vendor Assessment. CyberArk extends dynamic privilege controls across all identity types with its unified platform, enabling organizations to improve efficiencies and streamline security operations. This IDC MarketScape report notes, “More change has occurred in the identity security marketplace in the past two years than in almost a decade. Vendors are entering a new phase defined by the emergence of intelligence technologies, none of which are specifically defined by any industry standards. Though different by design, the new adjacent IAM offerings are largely focused on im

New York Liberty and Ant International’s Alipay+ Announce Multiyear Partnership Focused on Empowerment, Sustainability and Youth Development19.12.2025 14:30:00 CET | Press release

Ant International’s Alipay+ Named an Official Sponsor and Innovation Partner for Sustainability of the Team The New York Liberty and Ant International’s Alipay+, a leading cross-border fintech services platform based in Singapore, today announced a multiyear partnership, making Alipay+ an Official Sponsor and Innovation Partner for Sustainability of the New York Liberty. Through this partnership, Alipay+ and the Liberty will jointly support community programs designed to advance community empowerment, environmental sustainability and youth development across New York City. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251219678825/en/ Peng Yang, CEO, Ant International and Clara Wu Tsai, Vice Chair, Brooklyn Sports and Entertainment; Governor, New York Liberty “Our partnership with Alipay+ goes beyond the game,” said Keia Clarke, Chief Executive Officer, New York Liberty. “Together, we are investing in the future of New York

Parse Biosciences and Codebreaker Labs Partner to Apply Whole Transcriptome Single Cell Profiling and Causal Genomics at Scale19.12.2025 14:00:00 CET | Press release

Collaboration pairs robust synthetic biology platform with massive scale single cell sequencing to overcome long-standing challenges in variant mapping Parse Biosciences, the leading provider of scalable and accessible single cell sequencing solutions, today announced a collaboration with Codebreaker Labs to develop and validate a breakthrough platform capable of testing thousands of genetic variants in parallel and measuring their effects at single cell resolution. By combining Codebreaker’s synthetic biology platform and variant engineering capabilities with the scale and accessibility of Parse’s Evercode™ technology, the collaboration aims to generate the causal data increasingly sought by AI developers, drug discovery teams, and clinical researchers. Today’s genomic studies rely heavily on observational data, or variants that appear in large populations. But rare and private variants, often only seen in one individual or family, are nearly impossible to study this way because too f

Cinemo Launches Cinemo ICO™, Accelerating the AI-Driven Intelligent Cockpit19.12.2025 11:00:00 CET | Press release

The future of in-car intelligence, delivered today for hyper-personalized, safer, smarter, and more exceptional journeys Cinemo, a global leader and highly innovative one-stop-shop provider for fully integrated digital media products announces today the launch of its next-generation, AI-powered cockpit solutions - Cinemo ICO™. By bringing agentic AI, Cinemo unlocks a truly intelligent cockpit - connecting vehicles, drivers, passengers, and their digital ecosystems into one seamless, personal and context-aware flow. The first product launched within the Cinemo ICO™ portfolio is Cinemo ICO™ MediaMind, enabling advanced intelligent media discovery. It combines the latest agentic AI technology with Cinemo’s core expertise of providing automotive-grade media management, helping users effortlessly discover the right content for every ride - perfectly matched to their taste, context, and environment. With Cinemo ICO™ MediaMind, the digital media experience evolves: using cutting-edge large la

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye