Business Wire

Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model

Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/

TII’s team of advanced researchers and Artificial Intelligence (AI) specialists has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language, offering an end-to-end pipeline for high-quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving to deliver applications with efficient inference and model specialization.

Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”

Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters, the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scraping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”

Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”

Curated to be the world’s largest high-quality cross-domain Arabic dataset, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.

Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is designed to tackle generative tasks, with an architecture updated to reflect the latest developments in machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text that resembles high-quality reference sources and safeguard the model from exposure to spam content.
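
The release does not describe how the filtering pipeline works internally. As a rough illustration of the general idea only, the sketch below trains a tiny word-frequency (naive Bayes style) classifier on a few reference-like and spam-like examples and keeps crawled text that looks more like the references. Everything here is an assumption made for illustration: the function names, the toy English strings (used for readability in place of Arabic text), and the choice of classifier are hypothetical and are not TII’s method.

```python
import math
from collections import Counter

def train(docs):
    """Count word frequencies over a list of documents."""
    counts = Counter(word for doc in docs for word in doc.lower().split())
    return counts, sum(counts.values())

def log_likelihood(doc, counts, total, vocab_size):
    """Log-probability of a document under a unigram model with add-one smoothing."""
    return sum(
        math.log((counts[word] + 1) / (total + vocab_size))
        for word in doc.lower().split()
    )

def keep(doc, ref_model, spam_model, vocab_size):
    """Keep a document if it is at least as likely under the reference model."""
    ref_score = log_likelihood(doc, *ref_model, vocab_size)
    spam_score = log_likelihood(doc, *spam_model, vocab_size)
    return ref_score >= spam_score

# Hypothetical toy training data (not from the NOOR dataset).
reference_docs = [
    "the museum opened a new exhibition on classical poetry",
    "researchers published a study on regional water resources",
]
spam_docs = [
    "click here to win a free prize now",
    "buy cheap followers now click here",
]

ref_model = train(reference_docs)
spam_model = train(spam_docs)
vocab_size = len(set(ref_model[0]) | set(spam_model[0]))

crawled = [
    "a new study on classical poetry was published",
    "win a free prize click now",
]
filtered = [doc for doc in crawled if keep(doc, ref_model, spam_model, vocab_size)]
```

In this toy run, the first crawled string is kept and the second is dropped. A production pipeline would rely on far larger training sets and stronger models.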

Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
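
3D parallelism generally refers to combining data, pipeline, and tensor parallelism. The release gives only the total of 128 GPUs, not how they were split across the three dimensions. The sketch below shows one way a GPU rank can be mapped to a coordinate in such a layout; the degrees `DP`, `PP`, and `TP` and the function `rank_to_coord` are hypothetical choices for illustration, not the configuration used for NOOR.

```python
from typing import NamedTuple

class Coord(NamedTuple):
    data: int      # index of the data-parallel replica
    pipeline: int  # index of the pipeline stage
    tensor: int    # index of the tensor-parallel shard

def rank_to_coord(rank: int, dp: int, pp: int, tp: int) -> Coord:
    """Map a global rank to (data, pipeline, tensor) coordinates.

    Tensor-parallel ranks vary fastest, so the shards of one layer sit on
    neighbouring ranks; data-parallel replicas vary slowest.
    """
    world_size = dp * pp * tp
    if not 0 <= rank < world_size:
        raise ValueError(f"rank {rank} outside world of size {world_size}")
    tensor = rank % tp
    pipeline = (rank // tp) % pp
    data = rank // (tp * pp)
    return Coord(data, pipeline, tensor)

WORLD_SIZE = 128        # number of A100 GPUs stated in the release
DP, PP, TP = 4, 4, 8    # assumed split; any factorisation of 128 would do
assert DP * PP * TP == WORLD_SIZE

coords = [rank_to_coord(rank, DP, PP, TP) for rank in range(WORLD_SIZE)]
```

Under this assumed layout, each of the 4 model replicas spans 32 GPUs (4 pipeline stages of 8 tensor shards each), and every GPU receives a unique coordinate.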

The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.

NOOR takes its name from the Arabic word for "light", chosen to link the Arabic language model with the idea of enlightening the mind.

About Technology Innovation Institute (TII)

For more information, visit www.tii.ae

*Source: AETOSWire
