TII
11.4.2022 13:18:04 CEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Link:
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Rubedo’s RLS-1496 Reduces Actinic Keratosis Pre-Cancerous Skin Lesions by 46% at Four Weeks with Minimal Irritation in Preliminary Results of Phase 1b/2a Study28.5.2026 14:30:00 CEST | Press release
RLS-1496 is an investigational, first-in-class, disease-modifying, selective glutathione peroxidase 4 (GPX4) modulator that targets pathologic senescent and other stressed, aging cells that drive chronic, age-dependent diseases, such as AK, and represents a novel drug category — Adaptive SenoTherapeutics In recognition of May as Skin Cancer Awareness Month, Rubedo is calling attention to the myths and facts surrounding AKs — and to the urgent need for a new generation of treatments that are effective without the side-effect burden of today's options Rubedo Life Sciences, Inc. (Rubedo), an AI-driven, clinical-stage biotech focused on selective cellular rejuvenation medicines targeting aging cells, today announced preliminary results from a Phase 1b/2a study of RLS-1496 in patients with actinic keratosis (AK), a common age-related condition resulting in precancerous skin lesions, that is most commonly seen after age 65.1 The open-label multi-center trial, conducted in the United States,
ExaGrid Wins 5 Industry Awards at Network Computing Awards 202628.5.2026 14:00:00 CEST | Press release
ExaGrid named “Company of the Year” for seventh year in a row ExaGrid®, the world’s largest independent backup storage vendor providing Tiered Backup Storage with the most Comprehensive Security and AI-Powered Retention Time-Lock for Ransomware Recovery, today announced that company was honored with five industry awards, including Air-gapped Ransomware Recovery Product of the Year, Bench Tested Product of the Year, Company of the Year, Data Protection Product of the Year, and the Storage Product of the Year during the Network Computing Awards ceremony, held in London on May 21, 2026. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260528849813/en/ The ExaGrid team headed to the stage five times throughout the Network Computing Awards ceremony in London to accept awards for ExaGrid Tiered Backup Storage. Photo courtesy of Network Computing Awards. The Network Computing Awards are determined by public vote. The 2026 awards mark
Ardoq Launches AI-First Enterprise Architecture Platform28.5.2026 14:00:00 CEST | Press release
Custom Agents, Omnipresent AI Assistant, and AI Import Builder Automate an Estimated 40% of Routine EA Work; Tenneco Already Achieving 292% ROI on Ardoq AI Ardoq, named a 5x Leader in the Gartner® Magic Quadrant™ for Enterprise Architecture Tools, today launched its AI-first enterprise architecture (EA) platform. The release grounds every Ardoq AI output in customers' live architecture data and introduces a new generation of AI agents capable of automating an estimated 40% of routine EA work. Architects today are being asked to defend decisions that generic AI is generating in seconds. Application rationalization choices. ERP transformation roadmaps. AI governance reviews. The questions land on the architect's desk, but the analysis underneath increasingly comes from AI assistants that do not know the architecture. Generic agents reason on whatever document is in front of them, not on the live relationships between applications, dependencies, capabilities, and risks. Ask a generic LLM
European DataWarehouse Launches DealDox®, a Next-Generation Virtual Data Room Built specifically for the Securitisation Market28.5.2026 14:00:00 CEST | Press release
European DataWarehouse (EDW) announced today the launch of DealDox®, a secure virtual data room uniquely tailored to the needs of the securitisation and structured finance market. Developed in response to long‑standing challenges around transaction data and document management, DealDox provides a single, secure environment where all parties throughout the deal lifecycle can collaborate efficiently while maintaining high standards of security, governance, and regulatory alignment. DealDox enables the centralised management of transaction data and documentation, offering robust security, granular access controls, and clear audit trails. The platform integrates seamlessly with EDW’s existing regulatory reporting ecosystem, supporting smoother workflows from deal preparation through to disclosure and compliance. “As a market infrastructure, our role is to reduce complexity and make processes simpler and more transparent for all participants,” said Dr. Christian Thun, CEO of European DataWa
SLB and Vår Energi Expand Digital Collaboration to Scale Well and Integrated Field Development Planning28.5.2026 13:47:00 CEST | Press release
Agreement supports Vår Energi’s ambition to reduce time to first oil, building on multi-discipline, collaborative well planning workflows that reduce cycle times from months to days Global energy technology company SLB (NYSE: SLB) today announced an expanded collaboration with Vår Energi to scale well planning and integrated field development planning across its Norwegian Continental Shelf operations. With collaborative well planning already reducing cycle times from months to days and integrated field development planning expected to support similar benefits, the expanded deployment is designed to support faster, more consistent decision-making as operators work to sustain production from mature offshore assets while managing increasing development complexity. As part of the expanded collaboration, Vår Energi is deploying the Delfi™ digital platform to connect exploration, subsurface evaluation, well planning, subsea design, field development planning, and production in a cloud-native
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
