TII
11.4.2022 13:18:04 CEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Link:
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Imagine Dragons to Perform at Abu Dhabi Grand Prix21.5.2026 17:51:00 CEST | Press release
Ethara, organiser of the Formula 1 Etihad Airways Abu Dhabi Grand Prix, have announced that one of the world’s biggest bands, Imagine Dragons, will headline the Saturday After-Race Concerts at the F1 Season Finale in Abu Dhabi. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260521214839/en/ Imagine Dragons to perform at Formula 1 Etihad Airways Abu Dhabi Grand Prix (Photo: AETOSWire) The announcement is another landmark moment for the Abu Dhabi Grand Prix, whose thrilling Yasalam presented by e& fan entertainment offering has become synonymous with the F1 Championship finale in Abu Dhabi and is recognised as one of the most compelling sports and entertainment crossovers globally. The global chart-toppers join Lewis Capaldi and Zara Larsson, who are set to kick off a blockbuster line-up of performances on Yas Island on Thursday, 3 December, with more major international artists to be revealed. With their popular top hits, Ima
Carnegie Mellon University and Cleveland Clinic Develop AI System to Interpret Cardiac MRI Scans with Enhanced Accuracy21.5.2026 14:05:00 CEST | Press release
Trained on more than 13,000 patient studies, novel system significantly outperforms existing models by up to 35% A team of researchers from Carnegie Mellon University, in collaboration with Cleveland Clinic’s Cardiovascular Innovation Research Center, has developed an artificial intelligence (AI) system capable of interpreting some of the most complex heart scans in medicine, cardiac magnetic resonance imaging (MRI), without the need for manually labeled training data. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260521762286/en/ A team of researchers from Carnegie Mellon University, in collaboration with Cleveland Clinic’s Cardiovascular Innovation Research Center, has developed an artificial intelligence (AI) system capable of interpreting some of the most complex heart scans in medicine, cardiac magnetic resonance imaging (MRI), without the need for manually labeled training data. The novel system, called CMR-CLIP, is d
Otovo Hits 30,000 Customers in Under a Year, Tackling the Growing ‘Solar Service Crisis’21.5.2026 14:00:00 CEST | Press release
A growing wave of unsupported solar systems and rising electricity prices are creating strong demand for Otovo’s energy service platform Otovo ASA (“Otovo”), a leading global energy service provider for residential and commercial customers, today announced it has reached 30,000 customers across the U.S. and Europe. A total of 20,000 customers have enrolled in Otovo Care, the Company’s membership-based home and commercial energy service, which is powered by Otovo’s industry-leading AI platform, Endurance™. “Reaching 30,000 customers in less than year is proof positive that home and business owners value their power systems,” said William J. (John) Berger, CEO of Otovo. “The ‘solar service crisis’ that is leaving millions of orphaned energy systems without support is driving strong interest in our Otovo Care membership program. Every day your home or commercial power system is not working, you are throwing money away. Otovo’s rapid response service platform keeps you up and running, ensu
The Live Moment Effect: Genius Sports and MediaScience Study Finds Specific Moments in Live Sports Can Double Unaided Brand Recall21.5.2026 14:00:00 CEST | Press release
New research shows that brands aligned with emotionally heightened moments in live sports can improve ad effectiveness Genius Sports Limited (NYSE: GENI), a global leader in real-time sports data, today released new biometric research conducted with MediaScience showing that ads delivered immediately after emotionally heightened moments in live sports can double unaided brand recall. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260521475265/en/ The Live Moment Effect report from Genius Sports and MediaScience. The study, The Live Moment Effect, finds that advertising effectiveness is significantly influenced by a viewer’s emotional state immediately before an ad is shown. In controlled biometric testing, ads shown after high-intensity sporting moments, such as near-scoring plays or crucial momentum shifts, delivered approximately double the unaided brand recall of baseline conditions. The Moment Before the Ad Matters The r
Merck Announces First Patient Dosed in Phase 3 Study for Investigational Antibody-Drug Conjugate in Colorectal Cancer21.5.2026 14:00:00 CEST | Press release
Precemtabart tocentecan (Precem-TcT) is investigated as a potential first-in-class anti-CEACAM5 ADC, for the treatment of metastatic CRC (mCRC) CEACAM5 is overexpressed in the majority of colorectal tumors (~90%), and requires no patient selection Significant unmet need remains for clinically meaningful innovation in colorectal cancer (CRC), the second leading cause of cancer death worldwide Not intended for Canada-, UK- or US-based media Merck, a leading science and technology company, today announced that the first patient has been dosed in the Phase 3 PROCEADE®-CRC-03 trial (NCT07549412). The study is evaluating precemtabart tocentecan (Precem‑TcT), a potential first‑in‑class investigational anti‑CEACAM5 antibody‑drug conjugate (ADC), for the treatment of metastatic colorectal cancer (mCRC). “Leveraging our novel payload‑linker technology, Precem‑TcT is the first CEACAM5‑targeted ADC in clinical studies with an exatecan payload, rationally designed for stability and enhanced cancer
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
