Dataocean AI Launched High-Quality Off-the-Shelf Datasets and Frontier Data Solutions at Interspeech 2024
In the rapidly growing AI market, which is especially focused on foundation models and generative AI, the quality of datasets directly impacts model performance. In real-world applications, data is messy, and improving models is not the only way to get better performance. As AI continues to transform industries, the need for high-quality datasets has become critical for developing responsive, adaptable, and intelligent systems.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240919575026/en/
Dataocean AI at Interspeech 2024 (Photo: Business Wire)
At Interspeech 2024, Dataocean AI, a global leader in AI data solutions, officially launched its latest offerings: high-quality off-the-shelf datasets. The announcement further illustrates the company's position as a pioneer in the AI technology domain.
Dataocean AI introduced its newest corpus, designed to meet the demands of various application scenarios: the “Massively Multilingual Speech Corpus”. The corpus was recorded from 215,891 speakers, with a total of 259,672 hours covering over 100 languages. Along with this corpus, Dataocean AI also showcased its datasets in European languages. These meticulously labeled, high-quality datasets cover English, French, Spanish, Turkish, and Swedish. Known for their diversity and accuracy, they promise to enhance the performance of AI models across industries and applications such as smart finance, AI assistants, in-cabin systems, smart home, and other trending AI topics.
The key strength of Dataocean AI’s datasets lies in their ability to deliver high precision across different fields.
- For data collection, Dataocean AI leverages its extensive global network of native speakers, who record professionally in more than 200 languages. The company maintains a team of native, professional speakers for these recordings and uses high-fidelity equipment in professional recording environments, including indoor, outdoor, and in-cabin settings.
- For data labeling, the company offers datasets labeled on its advanced, self-developed platform with a human in the loop. The expert team consists of scholars and specialists covering multiple scenarios, and it has successfully built over 1100 speech datasets that meet top quality standards, fulfilling the evolving needs of the AI industry.
In addition to speech datasets, Dataocean AI also owns over 1600 high-quality training datasets with proprietary intellectual property rights, covering a wide range of fields including foundation models, autonomous driving, finance, healthcare, and law. Its self-developed data processing platform, DOTS, is equipped with more than 200 algorithms and hundreds of data processing tools, and provides functions such as automated labeling and assisted labeling that help customers reduce costs and increase efficiency. The company also complies with data security regulations such as the European GDPR and has obtained certifications including ISO 9001 and ISO 27001, ensuring safety and compliance.
Along with its high-quality datasets, Dataocean AI also empowers LLMs through world-class live data collection for pre-training, SFT/RLHF/red teaming for fine-tuning, and model evaluation.
Dataocean AI’s goal is to deliver a one-stop data solution that ensures its partners and clients can build reliable, adaptable AI models. This commitment to excellence is central to the company's mission of driving innovation in AI.
For more information about Dataocean AI’s latest datasets and their innovative data solutions, visit their official website at www.dataoceanai.com.
About Dataocean AI
With nearly 20 years of project experience, Dataocean AI empowers more than 1000 internet companies, AI enterprises, and academic institutes with total data solutions. We offer over 1600 high-quality off-the-shelf datasets and frontier data services, including data collection and data labeling for deep learning technology, helping clients’ AI models lead the market.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240919575026/en/