Start Free Pilot

fill up this form to send your pilot request

Email is not valid.

Email is not valid

Phone is not valid

Some error text

Referrer domain is wrong

Thank you for contacting us!

Thank you for contacting us!

We'll get back to you shortly

TU Dublin Quotes

Label Your Data were genuinely interested in the success of my project, asked good questions, and were flexible in working in my proprietary software environment.

Quotes
TU Dublin
Kyle Hamilton

Kyle Hamilton

PhD Researcher at TU Dublin

Trusted by ML Professionals

Trusted by ML Professionals
Back to blog Back to blog
Published June 27, 2024

iMerit vs. Appen: An In-Depth Comparison for 2024

iMerit vs. Appen: An In-Depth Comparison for 2024

Selecting the right data labeling service can make or break your AI/ML projects. If you're deciding between iMerit and Appen, this article will help you make an informed choice.

We’ll provide an in-depth comparison of these two data labeling companies, covering their strengths, weaknesses, pricing, service quality, and more. Read on to get the insights you need to select the best data labeling company for your ML needs.

iMerit vs. Appen: Company Profiles

Feature

iMerit

Appen

Founded

2012

1996

Headquarters

San Jose, California

Chatswood, New South Wales, Australia

Market Focus

  • Autonomous vehicles

  • Medical AI

  • Geospatial technology

  • Financial services

  • Commerce

  • Government

  • Agricultural AI

  • LLM Optimization

  • Automotive

  • Finance

  • Government

  • Retail

  • Healthcare

iMerit Company

iMerit in numbers

Founded by Dipak Basu in 2012, iMerit employs over 5,000 people and offers comprehensive data annotation services across multiple industries. With its headquarters in San Jose, California, and additional offices in New Orleans, Kolkata, and Bengaluru, iMerit delivers high-quality data for AI applications in agriculture, autonomous vehicles, healthcare, and more. Their end-to-end solutions support Fortune 500 companies in developing robust AI models.

Appen Company

Appen in numbers

Founded in 1996 by linguist Dr. Julie Vonwiller, Appen Ltd is a leading Australian company specializing in human-annotated data for AI and machine learning. Appen has a global network of over 1 million contributors, providing services in 130+ countries and 180 languages. With 25+ years of experience, Appen company offers tailored data solutions for various industries, attracting clients from tech giants to automotive manufacturers.

Services and Products

iMerit Services and Products

iMerit offers a wide range of data labeling services, including:

  • Reinforcement Learning from Human Feedback (RLHF): Domain expertise and expert feedback for LLMS and LVMS.

  • Image Annotation: Bounding boxes, keypoint annotation, polygon annotation, image classification, semantic segmentation, and LiDAR.

  • Video Annotation: Bounding-box, polygon, keypoint, and semantic segmentation annotation.

  • Text Annotation: Sentiment analysis, intent analysis, named entity recognition (NER), and entity classification.

  • Audio Transcription: Converts audio data into text and labels it for machine learning.

  • LiDAR Annotation: Semantic segmentation, landmark annotation, 3D cuboids/box annotation, polygon, and polyline annotation.

  • Content Moderation: Monitors, assesses, and filters user-generated content.

  • Product Categorization: Categorizes images, videos, and text for product suggestions and personalized recommendations.

  • Image Segmentation: Methods include bounding boxes, grayscale, segmentation masks, and Gaussian blur.

iMerit’s multilingual annotators are based in India, the US, Bhutan, Germany, and Latin America.

Appen Services and Products

Appen provides a crowded workforce and a specialized software

Appen provides a comprehensive suite of annotation services. The company also offers an end-to-end platform to support the entire AI development lifecycle.

SERVICES

  • Data Collection. Appen sources various data types, including text, audio, video, and geospatial data.

  • Data Annotation. Human-in-the-loop annotation for NLP, speech processing, and computer vision.

  • Search Relevance. Enhancing search engine algorithms through model evaluation, content moderation, and related search refinement.

  • Reinforcement Learning. RLHF services specifically for developing LLMs.

  • Document Intelligence. Improves document processing through data curation and annotation.

  • Location-Based Services. Geospatial data annotation for accurate mapping and location intelligence.

  • Pre-Labeled Datasets. A library of 270+ pre-labeled audio, image, video, and text datasets in over 80 languages.

  • Data for LLMs: Custom data solutions for LLM builders.

PLATFORM

A global crowdsourcing network with over 1 million contributors across 170+ countries. It offers features like project management, performance monitoring, data quality control, and security management. It also provides access to subject-matter experts (SMEs).

The platform handles various tasks:

  • Data Collection

  • Data Annotation

  • Transcription

  • Translation

  • Speech Modeling

  • Model Evaluation

Appen’s AI data platform (Source: Appen)

Pricing Models

Feature

iMerit

Appen

Pricing Structure

Subscription-based, per task, discounts for high volume

Flexible pricing: unit-based and hourly pricing interchangeable

Pricing Details

  • Monthly subscription for Ango Hub platform

  • Universal annotation task pricing varies by dataset volume

  • Discounts for more than 100k objects per month

  • One-time charge for custom output export

  • Project-by-project basis

  • Type of work, volume of data, and worker location affect the cost

  • Higher accuracy requirements may increase cost

  • No minimum budget or time requirements

  • Consistent data flow needed for annotator availability

Free pilot

Free Analysis

Label 1,000 objects for free

Additional Notes

  • Discounts based on annotator’s language and location

  • Pricing flexibility depending on the scope of annotation

  • Clients can estimate costs using a formula:

  • (Judgments per row * (Pages of work * Price per page)) + transaction fee + buffer = estimated job cost

Dataset Types

iMerit Dataset Types

iMerit handles a diverse range of dataset types, offering data curation, generation, annotation, and evaluation for various AI applications. They specialize in preparing datasets for computer vision, sentiment analysis, natural language processing, categorization, and LiDAR annotation. Their annotation capabilities include polygons, bounding boxes, keypoints, polylines, classification, semantic segmentation, and text extraction. Additionally, iMerit provides audits and quality assurance (QA) for generative AI systems.

Appen Dataset Types

Appen offers a variety of datasets to meet diverse ML needs. They process and annotate text data, including emails, documents, and chat conversations in over 80 languages. Audio data, such as phone calls and voice commands, are transcribed and labeled for features like speaker identification. Visual data expertise includes image annotation for objects and scenes, facial recognition, and video data analysis for object tracking and activity recognition.

Supported Data Formats:

  • CSV (Comma-separated values)

  • TSV (Tab-separated values)

  • XLSX (Microsoft Excel spreadsheets)

  • ODS (OpenDocument Spreadsheets)

Encoding:

  • All files must use UTF-8 encoding.

Formatting:

  • Each column in your data file must have a clear and descriptive header.

Data Annotation Tools

iMerit Annotation Tools

iMerit company uses its proprietary tool, Ango Hub, for most annotations, preferring it over client-specific tools. Available by subscription, Ango Hub handles image, video, and text annotation, facilitating tasks such as:

  • Image and Video Annotation: Features include autodetect, OCR, and magnetic lasso.

  • Radiology Annotation: iMerit Radiology Editor supports medical imaging, offering data compliance and partial automation.

  • In-Cabin Monitoring: Annotates driver behaviors for driver monitoring systems.

  • Defect Detection: Automated surface inspection for manufacturing defects.

  • Crop and Weed Detection: Pre-labeling and auto-labeling for agriculture using built-in ML models.

  • Ground Control: Provides analytics, metrics, and seamless annotated data transfer.

  • Edge Cases: Manages complex scenarios like reflections, hidden signs, and ambiguous objects.

Appen Annotation Tools

Appen's CVP for AI Adopters (Source: Mopoke Cloud Newsletter)

Appen doesn’t offer standalone data annotation tools for public use, but they provide a comprehensive data annotation platform as part of their services. This platform combines human annotators with machine learning to deliver high-quality training data for AI models.

The vendor can also integrate with client-provided tools, although these may not always be scalable. If project demands increase and client tools can’t keep up, Appen may switch to their platform to handle the workload. Additionally, Appen has demonstrated the ability to automate tasks using Large Language Models (LLM), though it’s unclear if this is a standard feature or a custom solution.

Integrations

iMerit Integrations

Seamless integration

iMerit provides seamless no-code integrations, primarily through APIs and plugins. They can integrate with a variety of applications and MLOps platforms, ensuring efficient data pipelines that can accommodate custom requests.

Appen Integrations

Within the Appen projects, you get two main integration options: API and Live Large Language Model (LLM) APIs.

API Integration:

  • Automate tasks related to Appen’s data annotation services.

  • Programmatically create, edit, and launch annotation jobs.

  • Download results from completed jobs.

  • Integrate seamlessly with your existing workflow.

Appen uses a RESTful API with JSON data format and key-based authentication. They recommend trying out the process manually before setting up the API.

Live LLM API Integration:

  • Connect your Large Language Models (LLMs) with Appen’s platform.

  • Test your LLMs against real-world data.

  • Gather insights to improve your models.

  • Create a feedback loop with human experts to fine-tune your LLM.

This integration ensures your LLMs are accurate, relevant, and aligned with your specific needs.

Annotation Process

iMerit Annotation Process

While iMerit offers machine-assisted labeling for specific cases, they believe human annotation is often faster and more efficient. Here are the main steps in their data annotation process:

  1. Consultation with an Expert: Customers register on iMerit’s platform and prepare their technical task with expert guidance.

  2. Trial and Annotator Training: Annotators undertake a pilot project or proof of concept, with specific training provided, especially for tasks requiring industry-specific knowledge.

  3. Workflow Customization: Annotation is carried out on a project segment defined during the pilot stage.

  4. Feedback Cycle: Clients provide feedback, which informs the final offer.

  5. Evaluation: Each project undergoes evaluation before final submission.

Clients can be involved in the annotation process at any stage they choose.

Appen Annotation Process

Appen’s data annotation platform offers customizable workflows for efficient management from start to finish:

  1. Onboarding and Training: A Customer Success Manager helps your team get acquainted with the platform, enabling them to quickly create and launch annotation jobs.

  2. Setting Up Annotation Jobs: Use customizable templates to define the type of data (text, image, audio, etc.) and specific annotations needed. Jobs can be set up via Appen’s user interface or API for programmatic control.

  3. Data Annotation by Global Contributors: Jobs are assigned to a global network of qualified contributors. The platform includes tools and guidelines to ensure consistency in annotations.

  4. Monitoring and Adjustment: Track job progress and review data to make necessary adjustments, such as refining instructions or tweaking annotation requirements.

  5. Data Download and Reporting: Once complete, download the labeled data for your AI or ML projects. Appen provides dashboards and reports to help optimize jobs for cost, quality, and efficiency.

Appen’s workflow aims to streamline data annotation with a user-friendly platform, a global workforce, and comprehensive management tools.

Appen's CVP for AI Builders (Source: Mopoke Cloud Newsletter)

Quality Assurance

iMerit QA

At every stage of data annotation, iMerit utilizes reports, dashboards, and tracking systems to ensure efficient project management, troubleshoot issues, and monitor KPI metrics. Their quality assurance process includes the following steps:

  • Setting a Gold Standard with Mini-Sets: Establishing benchmark datasets for quality control.

  • Using Annotator Consensus: Ensuring accuracy through multiple annotators agreeing on labels.

  • Applying Scientific Methods for Label Consistency: Utilizing proven methods to maintain uniformity in labeling.

  • Implementing Subsampling: Randomly selecting 5-10% of the labeled dataset to check for errors.

A solution architect reviews these subsamples to identify any errors. Additionally, iMerit uses AI-based frameworks to assist annotators, enabling multiple reviews of labeled datasets before project submission.

Appen QA

Appen guarantees high-quality training data for AI projects through a multi-layered QA process, customized to your needs with a global network of contributors.

  • Customizable Solutions: Tailored QA plans with human expertise and advanced options like real-time learning for continuous improvement.

  • Seamless Workflow: Integrates your team, labeling tools, and workers. Offers automated or managed solutions for performance tracking and quality analysis.

  • Rigorous Quality Control: Appen reviews your data through multiple checks from pre-production to post-production to ensure top-notch data quality.

  • AI-Powered Crowd Management: Utilizes a network of over 1 million contributors and AI to match tasks to worker skills and streamline labeling.

  • Detailed Quality Reports: Comprehensive dashboards track project progress, including labeled units and overall accuracy. Data is segmented by annotator to identify training needs.

Security and Data Compliance

Feature

iMerit

Appen

Access Controls

  • Security Manager sets and monitors security measures

  • Role-based access

  • Temporary link for view-only access (though this measure can’t fully prevent screenshots or photos)

  • Direct storage access is not granted to annotators

Worker Screening

Security Manager trains employees

Background checks and verification, ongoing training and monitoring

Compliance

  • SOC 2

  • ISO 27001

  • ISO 9001:2015

  • GDPR

  • TISAX

  • GDPR

  • HIPAA

  • SOC 2 Type II

  • ISO 27001

TL;DR

Aspect

iMerit Pros

iMerit Cons

Appen Pros

Appen Cons

Services

Comprehensive range of services

Higher cost for premium services

Additional charges for custom output exports

Wide range of services

End-to-end platform support for AI development

Extensive pre-labeled datasets library

Some services may be costly for smaller projects

Complex service offerings can be overwhelming to navigate

Tools

Advanced features with proprietary tool Ango Hub

Steeper learning curve for advanced tools

Integration with client-provided tools

AI-powered task management

No standalone data annotation tools

Potential issues with scalability of client-provided tools

Pricing

Monthly subscription with flexible pricing based on volume, language, and location

High-volume discounts

One-time charge for custom data export

Total cost depends on project scope and personnel involved

No minimum budget or time requirements

Free pilot and analysis available

Higher accuracy requirements may increase costs

Pricing varies significantly based on project specifics

Variable costs

QA

Rigorous QA processes with multiple layers of quality checks

Time-consuming QA processes

AI-powered crowd management for matching tasks to skills

Detailed quality reports with performance tracking

Specific QA methods vary by project

Dependence on a large, global workforce may impact consistency

Summarizing about the two data annotation companies, iMerit offers a wide range of services and uses its proprietary tool, Ango Hub. Appen, with over 25 years of experience, provides comprehensive data solutions through a global network and end-to-end platform. Both prioritize data security and compliance. iMerit excels in quality assurance, while Appen offers flexible pricing and a broader service range. Your choice will depend on your project's requirements, budget, and preferred features.

And if you’re looking for a vendor with these qualities:

  • No commitment

  • Flexible pricing

  • Tool-agnostic

  • Data-compliant

Run a free pilot to put our team’s labeling expertise to the test.

FAQ

How do iMerit and Appen ensure the security and confidentiality of sensitive data during the annotation process?

iMerit uses role-based access controls, strict security protocols, and compliance with SOC 2, ISO 27001, and GDPR. Data access is managed by a Security Manager, and workers undergo background checks and training. Appen uses temporary view-only access links, complies with GDPR, HIPAA, SOC 2 Type II, and holds ISO 27001 certification, ensuring annotators don't have direct storage access.

What types of projects or use cases are best suited for iMerit versus Appen's data annotation services?

iMerit excels in autonomous vehicles, medical AI, geospatial tech, financial services, and agriculture, offering high-quality data annotation. Appen is ideal for large-scale projects with a global reach, such as automotive, finance, government, retail, and healthcare, supported by extensive pre-labeled datasets and a robust platform.

What are the onboarding and training processes like for new clients working with iMerit and Appen?

iMerit provides expert consultation, trial and annotator training, workflow customization, and a feedback cycle for process refinement. Appen offers onboarding with a Customer Success Manager, customizable job templates, and management tools, enabling clients to set up, monitor, and adjust annotation jobs easily.

Written by

Yuliia Kniazieva
Yuliia Kniazieva Editor-at-Large

One of the technical writers at Label Your Data, Yuliia has been gradually delving into the intricate aspects of AI. With her strong passion for the written word and technical expertise, Yuliia has developed a keen interest in the evolving field of data annotation and the power of machine learning in today's tech-savvy world. Check out her articles to learn more about the complex world of technology and find the solutions that work best for your AI project!