Appendix
This section discusses our thinking on guiding values and considering business models types when developing the standards for data work. Please get in touch with us with any questions or ideas you have!
I. Glossary
Data workers
Working people who handle data materials as per the specifications of their companies and paying clients, for purposes like the training of artificial intelligence (AI) or the moderation of user-generated content in online spaces. Data workers fulfill numerous roles, ranging from labelling images to responding to tickets regarding disturbing content on social media to helping evaluate large language models (LLMs). Data work can be conducted through different business models, like business process outsourcing (BPO) and digital labour platforms (DLPs). It is important to note that data work is not a fixed category of roles. It refers to a wide variety of roles, with the possibility of new roles emerging that exhibit the characteristics of data work.
Workflow
The processes, policies, infrastructure, and people that manage the working environment in which data workers operate.
Disturbing Materials and Content/ Riskier Work
Tasks and data that feature content that can cause physical and mental health harms, or put people at risk of adverse effects, either through short-term or sustained exposure. Some examples of disturbing materials include hate speech, footage and imagery of graphic violence, content featuring sexual abuse, and online posts displaying the abuse of children.
At-Risk Workers
The workers who routinely handle disturbing materials, be it for content moderation roles or otherwise, are classified as at-risk workers. This term is not being used with intent to reduce the experiences of other categories of data workers, but to highlight that some people have to face not only precarious work engagements but also handle large volumes of content that people should not have to endure under any circumstances.
Care Mechanism
The various services and measures companies design, adopt, and manage to provide aid and support to data workers. Care mechanisms can range from providing insurance to reducing or covering operating costs involved in data work.
II. Core Values
Based on our research and understanding of the precarities faced by data workers in the ecosystem, we developed a set of values that act as the fundamental criteria for creating a safe working environment and promoting workers’ well-being. These values form the core basis on which the standards are formulated. Standards are different pathways to make these values actionable. Hence, every standard is associated with one or more of these core values.
Freedom
Workers should be able to exercise their rights and seek external recourse, remedy, and counsel without interference, coercion, or intimidation from workplaces. Restrictions on workers’ discussing their work and experiences with external parties should be applied only when it is absolutely necessary for the sake of the client or sensitivities surrounding the data.
Recognition
The companies running workplaces should have a full grasp of the realities of data work and the production processes. Businesses must be cognizant of the time, effort, skill, risks, and even danger that data work can require, and provide their workforces with adequate levels of compensation, support, and aid.
Transparency
Data work relationships need to be transparent in ways that empower and equip workers with key information about their roles, their engagement with companies, and the production processes they enable. Businesses need to work to provide workers with a variety of information, ranging from clear explanations of their relationship with the companies and how they are managed to how wages are calculated and how worker performance is measured, recognized, and penalized.
Stability
Data work roles should be designed and managed in ways that provide a meaningful level of stability to people fulfilling companies data requirements. Stability itself would arise from companies achieving adequacy in critical labour areas, such as compensation, managing the time workers spend “benched,” social security and hazard-related measures, and worker assistance.
Support
Workplaces need to offer data workers tools, processes, and assistance that help them fulfill their roles safely and with minimal barriers. Companies need to build and maintain measures that protect workers’ safety, address emerging safety issues and hazards, and ensure that workers have the means and information necessary to complete work and earn.
Collaboration
The process of arranging projects and delivering AI-related services should be governed such that there is plenty of space for discussions and co-designing the workflow. Companies should engage workers as more than a source of feedback, and work with labour stakeholders to shape production pipelines and workers’ arrangements. Furthermore, companies themselves should move to open themselves to external initiatives geared towards impact assessments and understanding working conditions.
III. Model cognizance in developing standards
The data enrichment industry consists of a range of business models. Data work’s challenges as well as the pathways for pursuing ethical treatment of labor, can differ based on companies' business models. There are a number of “model-specific standards” that are meant for following two broad type of business models:
Business Process Outsourcing/ BPO: Set-ups where workers produce data under company supervision, possibly in a designated location rather than an online environment. BPO models may run offices outfitted with various assets and infrastructures that allow workers to function in their roles. One prominent example of the BPO model is the French firm, Teleperformance.
Digital Labor Platforms/ DLP: Arrangements that utilize the internet and various software to link different players and processes of data production together, allowing people otherwise separated to collaborate and exchange money, goods, and services. DLP models treat workers as independent entities who may need to arrange things like computers, phones, and internet needed for work through their own resourcefulness and funds. Amazon Mechanical Turk and Remotasks are examples of DLPs offering data work solutions.
These model-specific standards are marked by a tag that reads as “BPO” for Business Process Outsourcing, and “DLP” for Digital Labor Platforms.
Aapti Institute
37, Aga Abbas Ali Rd, Halasuru
Yellappa Chetty Layout, Sivanchetti Gardens Bengaluru, Karnataka 560042
For Inquiries
contact@aapti.in
© 2025 Aapti Institute. All rights reserved.