TII
11.4.2022 13:18:04 CEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Link:
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Cognite and ABB Collaborate to Integrate Agentic AI into Industrial Applications to Deliver Faster Workflows21.5.2026 12:00:00 CEST | Press release
Aker BP joins as the first customer to scale agent-to-agent operations through a new generation of industrial agentic workflow applications Cognite, the leader in Industrial AI, today announced a collaboration with ABB to assess how advanced industrial AI and data capabilities can be integrated to solve key use cases in the energy sector. By adding an agentic layer to established industrial applications, including ABB Ability™ SafetyInsight™ and ABB Ability™ AlarmInsight™, using the Cognite Industrial AI and Data platform, the collaboration aims to enable "agent-to-agent" orchestration. Leading energy producer Aker BP has signed on as the first customer to implement this new generation of intelligent offerings as part of its strategy to further increase its current production efficiency of 96% and achieve a production growth target of 525,000 barrels of oil equivalent per day by 2028. Transforming Data into Actionable Business Value By breaking down traditional data silos and shifting
BeOne Medicines Sets the Pace in Oncology at ASCO and EHA 2026 with 60+ Abstracts21.5.2026 12:00:00 CEST | Press release
Long-term and real-world evidence reinforce BRUKINSA as the foundation of CLL treatmentThree oral presentations at ASCO highlight rapid acceleration of BeOne’s solid tumor pipeline BeOne Medicines Ltd. (Nasdaq: ONC; HKEX: 06160; SSE: 688235), a global oncology company, today announced that more than 60 abstracts across hematologic malignancies and solid tumors have been accepted for presentation at the 2026 American Society of Clinical Oncology (ASCO) Annual Meeting (May 29–June 2, Chicago) and the 2026 European Hematology Association (EHA) Congress (June 11–14, Stockholm). Continuing to raise the bar in CLL At ASCO and EHA 2026, BeOne will showcase its hematology leadership with data spanning foundational therapies and next-generation innovation across CLL, mantle cell lymphoma and other B-cell malignancies. The data emphasize impressive long-term outcomes, durability across patient populations, and a disciplined approach to advancing future regimens. Collectively, these data undersco
Cranium AI and ISTARI Forge Global Alliance to Drive Enterprise AI Security and Governance21.5.2026 12:00:00 CEST | Press release
The collaboration integrates Cranium’s AI security and governance platform with ISTARI’s global cyber-resilience expertise to provide enterprises with a comprehensive framework for AI risk management and compliance. Cranium AI, the leading end-to-end AI Security and Governance platform, and ISTARI, a leading cyber resilience advisory firm, today announced a strategic partnership to provide global organizations with an end-to-end AI security & governance solution. As organizations accelerate AI adoption, they face a critical challenge: implementing actionable, operational AI governance while keeping pace with the speed of the AI landscape. This collaboration bridges that gap by merging Cranium’s cutting-edge AI security and governance platform with ISTARI’s deep advisory expertise in cyber risk and operating model design. Together, the firms provide a powerful, end-to-end solution for enterprises navigating the complexities of AI transformation. A Unified Vision for Sustainable AI Gover
From Broadcast to OTT: Norsk Rikstoto Unlocks the Full Potential of NEP Mediabank Across its Media Ecosystem21.5.2026 11:00:00 CEST | Press release
NEP Europe today announced that Norsk Rikstoto has expanded its long-standing partnership with NEP Mediabank to support the launch of its new direct-to-consumer OTT streaming service, Play. NEP Mediabank now serves as the central media asset management (MAM) platform across Rikstoto’s entire media ecosystem, supporting content management and distribution for broadcast, its branded TV channel, digital platforms and OTT services. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260521866777/en/ With the launch of NEP Mediabank's 'Play' direct-to-consumer OTT streaming service, Rikstoto now brings all distribution, broadcast, TV channel, digital, and OTT, into a single environment. Left to right are Geir Lilleberg, Lena Pettersen, Maria Otterlei, Sondre Skandsen, Espen Stensrud and Frode Martnes at the launch of Rikstoto. Photo: Mari Bull/NEP The extension marks an important step in a partnership that began in 2008, when Rikstoto
Audiencerate: Riccardo Fabbri Joins as Chief Technology Officer—The AI-Driven Phase of the Platforms for SMEs and Media Agencies Begins21.5.2026 09:00:00 CEST | Press release
The co-founder and former managing partner of Nohup (acquired by Havas Group in 2021) will lead the development of the artificial intelligence infrastructure that integrates first-party and third-party data, powering the platform delivered with Postel and Microsoft to Italian SMEs and the platform with the DV360 offering for global media agencies. Audiencerate Ltd, one of the few globally certified Google Customer Match Upload Partners and a Microsoft IP Co-sell certified partner with MACC eligibility, today announced the appointment of Riccardo Fabbri as Chief Technology Officer. The appointment marks a phase of dual expansion: the Audiencerate–Postel–Microsoft platform for Italian SMEs, and the Data platform integrated with Google DV360 for Agencies and Data Providers — both evolving toward a model that natively leverages first-party and third-party data through AI and machine learning. This press release features multimedia. View the full release here: https://www.businesswire.com/n
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
