Innodata Inc.
NEW YORK, NY / ACCESSWIRE / April 25, 2024 / Innodata Inc. (NASDAQ:INOD), a leading data engineering company, today announced that it has released an open-source LLM Evaluation Toolkit, together with a repository of 14 semi-synthetic and human-crafted evaluation datasets, that enterprises can utilize for evaluating the safety of their Large Language Models (LLMs) in the context of enterprise tasks.
Using the toolkit and the datasets, data scientists can automatically test the safety of underlying LLMs across multiple harm categories simultaneously. By identifying the precise input conditions that generate problematic outputs, developers can understand how their AI systems respond to a variety of prompts and can identify remedial fine-tuning required to align the systems to the desired outcomes. Innodata encourages enterprise LLM developers to begin utilizing the toolkit and the published data sets as-is. Innodata expects a commercial version of the toolkit and more extensive, continually-updated benchmarking datasets to become available later this year.
Together with the release of the toolkit and the datasets, Innodata published its underlying research around its methods for benchmarking LLM safety. In the paper, Innodata shares the reproduceable results it achieved using the toolkit to benchmark Llama2, Mistral, Gemma, and GPT for factuality, toxicity, bias, and hallucination propensity.
The toolkit, the datasets, and the research are available on GitHub at https://github.com/innodatalabs/innodata-llm-safety.
Innodata began working on trust and safety for one of its Big Tech customers in Q4-2023. In Q1-2024, Innodata won two additional engagements for LLM safety and evaluation - one for a hyperscaler's own foundation models and one for an enterprise customer of the hyperscaler through Innodata's white label program with the hyperscaler. In addition, in Q1-2024, Innodata started pilots for a new customer and an existing customer around LLM trust and safety.
For additional information about Evaluation and Red Teaming in LLMs, see: https://innodata.com/red-teaming-in-llms-unveiling-ai-vulnerabilities/.
About Innodata
Innodata (NASDAQ:INOD) is a global data engineering company delivering the promise of AI to many of the world's most prestigious companies. We provide AI-enabled software platforms and managed services for AI data collection/annotation, AI digital transformation, and industry-specific business processes. Our low-code Innodata AI technology platform is at the core of our offerings. In every relationship, we honor our 30+ year legacy delivering the highest quality data and outstanding service to our customers. Visit www.innodata.com to learn more.
Forward Looking Statements
This press release may contain certain forward-looking statements within the meaning of Section 21E of the Securities Exchange Act of 1934, as amended, and Section 27A of the Securities Act of 1933, as amended. These forward-looking statements include, without limitation, statements concerning our operations, economic performance, and financial condition. Words such as "project," "believe," "expect," "can," "continue," "could," "intend," "may," "should," "will," "anticipate," "indicate," "predict," "likely," "estimate," "plan," "potential," "possible," "promises," or the negatives thereof, and other similar expressions generally identify forward-looking statements.
These forward-looking statements are based on management's current expectations, assumptions and estimates and are subject to a number of risks and uncertainties, including, without limitation, impacts resulting from the continuing conflict between Russia and the Ukraine and Hamas' attack against Israel and the ensuing conflict; investments in large language models; that contracts may be terminated by customers; projected or committed volumes of work may not materialize; pipeline opportunities and customer discussions which may not materialize into work or expected volumes of work; the likelihood of continued development of the markets, particularly new and emerging markets, that our services support; the ability and willingness of our customers and prospective customers to execute business plans that give rise to requirements for our services; continuing reliance on project-based work in the Digital Data Solutions (DDS) segment and the primarily at-will nature of such contracts and the ability of these customers to reduce, delay or cancel projects; potential inability to replace projects that are completed, canceled or reduced; continuing DDS segment revenue concentration in a limited number of customers; our dependency on content providers in our Agility segment; the Company's ability to achieve revenue and growth targets; difficulty in integrating and deriving synergies from acquisitions, joint ventures and strategic investments; potential undiscovered liabilities of companies and businesses that we may acquire; potential impairment of the carrying value of goodwill and other acquired intangible assets of companies and businesses that we acquire; a continued downturn in or depressed market conditions; changes in external market factors; changes in our business or growth strategy; the emergence of new, or growth in existing competitors; various other competitive and technological factors; our use of and reliance on information technology systems, including potential security breaches, cyber-attacks, privacy breaches or data breaches that result in the unauthorized disclosure of consumer, customer, employee or Company information, or service interruptions; and other risks and uncertainties indicated from time to time in our filings with the Securities and Exchange Commission.
Our actual results could differ materially from the results referred to in forward-looking statements. Factors that could cause or contribute to such differences include, but are not limited to, the risks discussed in Part I, Item 1A. "Risk Factors," Part II, Item 7. "Management's Discussion and Analysis of Financial Condition and Results of Operations," and other parts of our Annual Report on Form 10-K, filed with the Securities and Exchange Commission on March 4, 2024, as updated or amended by our other filings that we may make with the Securities and Exchange Commission. In light of these risks and uncertainties, there can be no assurance that the results referred to in the forward-looking statements will occur, and you should not place undue reliance on these forward-looking statements. These forward-looking statements speak only as of the date hereof.
We undertake no obligation to update or review any guidance or other forward-looking statements, whether as a result of new information, future developments or otherwise, except as may be required by the U.S. federal securities laws.
Company Contact
Marcia Novero
Innodata Inc.
Mnovero@innodata.com
(201) 371-8015
SOURCE: Innodata Inc.
View the original press release on accesswire.com
To view this piece of content from www.accesswire.com, please give your consent at the top of this page.
About ACCESSWIRE
Subscribe to releases from ACCESSWIRE
Subscribe to all the latest releases from ACCESSWIRE by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from ACCESSWIRE
Beyond Work Unveils Next-Generation Memory-Augmented AI Agent (MATRIX) for Enterprise Document Intelligence23.12.2024 08:00:00 CET | Press release
Matrix streamlines document processing by cutting manual labor and operational costs, using AI agents in the enterprise. LONDON, GB / ACCESSWIRE / December 23, 2024 / Today, Beyond Work, an enterprise AI company, announced the record-setting results of Matrix, a novel memory-augmented AI framework for automating business document processing. Developed in collaboration with researchers from Penn State University, Oregon State University, and Kuehne+Nagel, one of the world's largest logistics providers, Matrix addresses the complex, time-intensive task of extracting transport references from Universal Business Language (UBL) invoices.MATRIX Results Comparing the success rates of four methods (CoT, Two-agent, Reflexion, Matrix) across GPT-4o-mini and GPT-4o, with Matrix achieving the highest performance. By harnessing an iterative, memory-centric learning strategy, Matrix achieves a 30.3% improvement over chain-of-thought prompting, outperforms a standard Large Language Model agent by 35.
Brightline Interactive Successfully Delivers A Scalable Immersive Simulation To A Global Government Service Integrator, Positioning Itself As A Leading Operating System For Processing And Visualizing Complex Information In 3D Space23.12.2024 07:00:00 CET | Press release
NEW YORK, NY / ACCESSWIRE / December 23, 2024 / The Glimpse Group, Inc. ("Glimpse") (NASDAQ:VRAR)(FSE:9DR), a diversified Immersive Technology platform company providing enterprise-focused Virtual Reality ("VR"), Augmented Reality ("AR") and Spatial Computing software and services, today announced that its subsidiary company Brightline Interactive, LLC ("BLI") successfully delivered a paid for advanced immersive simulation through its cutting-edge middleware platform - SpatialCore - to a large government services integrator ("GSI"). Leveraging the power of SpatialCore's spatial computing and AI platform, BLI was able to create a sophisticated spatial simulation in record time, setting what we believe has the potential to become a new industry standard. This initial simulation project was developed with the goal of allowing the GSI to gather simulation needs from others and to then add to this build, or for further deployment, in a cost effective and scalable manner. Tyler Gates, Genera
MicroVision Increases Production Capacity to Meet Anticipated Demand19.12.2024 09:20:00 CET | Press release
REDMOND, WA / ACCESSWIRE / December 19, 2024 / MicroVision, Inc. (NASDAQ:MVIS), a leader in MEMS-based solid-state automotive lidar and ADAS solutions, today announced that it has increased production capacity for its MOVIA L sensor to meet anticipated demand from the industrial sector. Building on the relationship with its existing automotive Tier 1 manufacturing partner, MicroVision expects output of MOVIA L sensors for 2025 to significantly increase compared to 2024. The continued acceleration of production capacity throughout 2025 will result in a reduced average cost per sensor, while maintaining a high-quality product suitable for industrial applications. "Securing this production capacity is critical to support high-volume orders from industrial customers, so we feel good closing out the year with this commitment in hand," said Sumit Sharma, Chief Executive Officer. "We are pleased with this scaling, particularly from a cost perspective, and our Tier 1 automotive supplier, ZF, i
Brightline Interactive Enters into an Agreement with the U.S. Navy for an Immersive, AI-Driven Simulator System19.12.2024 07:00:00 CET | Press release
NEW YORK, NY / ACCESSWIRE / December 19, 2024 / The Glimpse Group, Inc. ("Glimpse") (NASDAQ:VRAR)(FSE:9DR), a diversified Immersive Technology platform company providing enterprise-focused Virtual Reality ("VR"), Augmented Reality ("AR") and Spatial Computing software and services, today announced that its subsidiary company Brightline Interactive, LLC ("BLI") has entered into an initial six figure dollar contract with the U.S. Navy for an Immersive Simulator, to be delivered in the first half of 2025. Tyler Gates, General Manager of BLI and Chief Futurist of Glimpse, commented: "Powered by BLI's cutting-edge spatial computing platform ("SpatialCore"), we have created a game-changing technology that pushes the boundaries of what's possible via the integration of AI and Spatial Computing. The Immersive Simulator system seamlessly integrates AI into both the full motion simulation and the spatial computing environments in which they operate, offering unparalleled realism, responsiveness,
BioNxt Solutions Expands Patent Protection for Drug Delivery Innovations Backed by Positive IPRP19.12.2024 03:05:00 CET | Press release
VANCOUVER, BC / ACCESSWIRE / December 19, 2024 / BioNxt Solutions Inc. ("BioNxt" or the "Company") (CSE:BNXT)(OTC PINK:BNXTF)(FSE:BXT), a bioscience innovator specializing in advanced drug delivery systems, is pleased to announce the expansion of its intellectual property portfolio with the filing of new international patents for sublingual delivery technologies targeting autoimmune neurodegenerative diseases. Building upon the positive International Preliminary Report on Patentability (IPRP) issued by the European Patent Office (EPO) in September 2024, BioNxt has initiated national-level filings in key jurisdictions, including the United States, Canada, Europe, and Japan. These patents are designed to protect the Company's proprietary sublingual formulations of anticancer drugs repurposed for the treatment of conditions such as Multiple Sclerosis (MS). "Securing robust intellectual property rights across major markets is a critical component of our strategy to bring innovative, patien
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom