TII
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Link:
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Binarly to Unveil “Broken Trust” Research: Firmware Bypass Chains, BMC Persistence, and EDR Evasion15.1.2026 23:04:00 CET | Press release
Binarly, the industry leader in software and firmware supply-chain security, today announced an upcoming DistrictCon presentation “Broken Trust: Firmware Bypass Chains, BMC Persistence, and EDR Evasion.” The session will detail how firmware-level attack chains observed in shipped enterprise devices can effectively undermine modern endpoint defenses, enabling stealthy compromise and long-lived persistence. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260115834965/en/ Binarly Unveils Broken Trust Research: Firmware Bypass, BMC Persistence In this presentation, the Binarly REsearch team will dismantle the assumption of hardware trust by presenting multiple real-world firmware bypass chains. Alex Matrosov and Fabio Pagani will provide a deep dive into the specific vulnerability classes and exploitation primitives that make these attacks reliable in practice. The team will also deliver a live demonstration compromising a fully
Coolbrook Named on the 2026 Global Cleantech 10015.1.2026 18:14:00 CET | Press release
Coolbrook, a transformational technology and engineering company on a mission to decarbonise major industrial sectors like petrochemicals and chemicals, iron and steel, aluminium, and cement, has been named on Cleantech Group’s 2026 Global Cleantech 100. This annual list recognizes companies poised to deliver market-ready solutions that advance a cleaner, more resilient global future. The report highlights innovators addressing some of the world’s most urgent environmental and infrastructure challenges. The complimentary report introduces you to innovators advancing groundbreaking technologies and business models to enable us to act on the ever-increasing climate and environmental crisis. Following a 2025 marked by geopolitical volatility and shifting economic signals, the global cleantech ecosystem enters 2026 with slightly greater certainty - yet heightened competitive pressure. Growth is concentrating around two dominant themes: AI infrastructure and critical minerals. “The 2026 Glo
World Economic Forum and Salesforce Empower Global Leaders With First-of-its-Kind Agentic Assistant for the 2026 Annual Meeting in Davos15.1.2026 18:01:00 CET | Press release
The Forum activates its vast data stores through Agentforce 360, enabling a level of preparation and decision-making for its over 3,000 attendees previously unachievable by human processing alone Salesforce (NYSE: CRM), the world’s #1 CRM, today announced the activation of the World Economic Forum’s institutional knowledge powered by Agentforce 360 to support over 3,000 of the world’s most influential leaders at the 2026 World Economic Forum Annual Meeting. The Forum has launched a new proactive, high-precision concierge app, “EVA,” built on the Agentforce 360 Platform, Salesforce’s agentic platform. EVA will empower attendees to move beyond traditional information access, with an AI agent that doesn’t just answer questions, but can reason, prioritize, and act on a leader’s behalf for the 2026 Annual Meeting. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260115571119/en/ Scheduled for January 19–23 in Davos, Switzerland, th
Frasca to Supply Four New Flight Training Devices to Global Medical Response15.1.2026 16:05:00 CET | Press release
New Level 7 FTDs will support pilot training for emergency medical operations Frasca International, Inc., a FlightSafety International company, today announced it has signed a contract with Global Medical Response (GMR) to supply four new Level 7 Flight Training Devices (FTDs). The new devices include an Airbus EC135, a Pilatus PC-12, a Beechcraft C90, and a Beechcraft B200. Each FTD will feature Frasca's unique motion system to provide enhanced realism in training. The devices will be installed at GMR’s new training facility currently under construction in Denton, Texas. Frasca has supported GMR’s pilot training efforts for nearly two decades, beginning with the delivery of their first device in 2005 for Air Evac Lifeteam, a GMR company. Since then, Frasca simulators have played a central role in preparing GMR’s flight crews for the complex and high-stakes environments they encounter in emergency medical operations. With the delivery of these new devices, GMR will operate a total of 1
illumynt Reports 60% Revenue Growth and Launches Global Innovation Center to Meet Rising Enterprise Security and Sustainability Demands15.1.2026 15:11:00 CET | Press release
illumynt an intelligent, security-first technology lifecycle partner, today announced significant growth and innovation milestones that position the company as a leader in the next evolution of the IT Asset Disposition (ITAD) industry—an industry increasingly shaped by artificial intelligence, accelerated hardware refresh cycles, and heightened regulatory scrutiny. Under the leadership of CEO Joerg Herbarth, illumynt continues to execute its mission to deliver intelligent, technology-driven lifecycle solutions that maximize sustainability, security, and recovery value for the world’s most compute-intensive organizations. In 2025, ITAD became a strategic imperative. AI-driven workloads have dramatically compressed infrastructure lifecycles, while updates to NIST SP 800-88 Rev. 2, adoption of R2v3, and the expansion of global privacy frameworks have raised expectations for auditability, transparency, and verified data security. As a result, ITAD has evolved from a back-end operational fu
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
