TII
11.4.2022 13:18:04 CEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Link:
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Eurofins Biomnis Launches New Clinical LC‑MS/MS Method for the Detection of Cereulide Toxin in Stool Samples11.3.2026 09:00:00 CET | Press release
Eurofins Biomnis, a leading European provider of specialised clinical diagnostics services, and part of the global network of Eurofins laboratories, announces the successful development and validation of a new liquid chromatography tandem mass spectrometry (LC‑MS/MS) method for the detection and quantification of cereulide toxin in human stool samples. This innovation underlines Eurofins Biomnis’ diagnostic innovation, and its commitment to contributing its expertise to reducing diagnostic uncertainty with solutions that support clinicians and laboratories nationwide and internationally. Eurofins Biomnis has fully validated the method for human stool samples, taking into account matrix effects and meeting the requirements of ISO 15189 (with the exception of inter‑method comparison on pathological clinical samples). Cereulide, a toxin produced by specific strains of Bacillus cereus, has recently raised significant public attention following contamination events involving infant formula
1NCE and Netmore Combine Cellular and LoRaWAN Access to Deliver Global IoT Coverage11.3.2026 08:40:00 CET | Press release
The two largest global providers of massive IoT networks partner to provide a combined offering of LoRaWAN® and Cellular connectivity. 1NCE now integrates Netmore’s LoRaWAN into the 1NCE OS platform, allowing customers to use both services seamlessly through its software stack. The new network addresses 90% of the LPWAN market and offers an unparalleled ability to eliminate coverage blind spots around the globe. 1NCE, a company offering a plug-and-play platform for creating and managing the world’s best IoT products, today opened access for its customers to the LoRaWAN® services of Netmore, the world’s leading low power wide area network operator for massive IoT. With growing demand for low power long range connectivity, the Netmore LoRaWAN Network Server (LNS) Plugin provides 1NCE customers access to cellular and LoRaWAN IoT coverage options through one platform. The launch of the Netmore Plugin marks the beginning of strategic collaboration to expand the combined offering of the two
1NCE and Netmore Combine Cellular and LoRaWAN Access to Deliver Global IoT Coverage11.3.2026 08:40:00 CET | Press release
The two largest global providers of massive IoT networks partner to provide a combined offering of LoRaWAN® and Cellular connectivity. 1NCE now integrates Netmore’s LoRaWAN into the 1NCE OS platform, allowing customers to use both services seamlessly through its software stack. The new network addresses 90% of the LPWAN market and offers an unparalleled ability to eliminate coverage blind spots around the globe 1NCE, a company offering a plug-and-play platform for creating and managing the world’s best IoT products, today opened access for its customers to the LoRaWAN® services of Netmore, the world’s leading low power wide area network operator for massive IoT. With growing demand for low power long range connectivity, the Netmore LoRaWAN Network Server (LNS) Plugin provides 1NCE customers access to cellular and LoRaWAN IoT coverage options through one platform. The launch of the Netmore Plugin marks the beginning of strategic collaboration to expand the combined offering of the two b
Codethink Opens Early Access to IEC 61508 Mapping for the Eclipse Trustable Software Framework11.3.2026 08:07:00 CET | Press release
Preview release invites industry collaboration on open source approach to functional safety assessment EMBEDDED WORLD--Codethink today opened early access to its mapping between the Eclipse Trustable Software Framework (TSF) and IEC 61508, the international standard governing the functional safety of electrical and electronic systems. The mapping establishes a transparent relationship between the engineering principles of the Trustable Software Framework and the objectives defined in IEC 61508. By making this work available as an early preview, Codethink is inviting organisations interested in applying open source approaches to functional safety to review and begin working with the mapping while the work continues to mature. IEC 61508 forms the foundation of many domain-specific safety standards, including ISO 26262 for automotive systems. The early access reflects Codethink’s long-standing commitment to open development of software engineering methods. “This preview release reflects o
Galderma Buys Back Shares Worth CHF 232 Million in the Context of Accelerated Bookbuild Offering11.3.2026 07:00:00 CET | Press release
Ad hoc announcement pursuant to Art. 53 LR Galderma (SIX: GALD), the pure-play dermatology category leader, today announced that it has agreed to repurchase 1.6 million shares at a price of CHF 143.75 per share for a total consideration of CHF 232 million in the context of the accelerated bookbuild offering (“ABO”) of Galderma shares by Sunshine SwissCo GmbH (“EQT”), Abu Dhabi Investment Authority (Private Equities Department) and Auba Investment Pte. Ltd. (all together the “Selling Shareholders”) launched yesterday evening. The repurchase was made at the same price per share determined by the bookbuilding offering. As a result of yesterday evening’s ABO, the Selling Shareholders have fully divested their remaining stake in Galderma. The repurchase, which is expected to settle on March 13 is being financed by Galderma’s existing liquidity on hand and will not affect the company’s ability to deliver on its strategic and financing priorities. The shares will be held in treasury for futur
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
