Police Officers Faces

Police Officers Faces (POF)

  • A large-scale facial recognition dataset encompassing approximately 90,000 images of thousands of law enforcement officers. This dataset has been meticulously curated to serve as a tool, equipping investigative journalists, human rights researchers, and digital activists with the necessary means to advance justice, foster transparency, and strengthen advocacy efforts. By facilitating critical inquiry and enabling informed action, the dataset aims to contribute to the ongoing efforts to hold systems of power accountable and promote the protection of fundamental rights.
  • Posted: Jun 17, 2024. Modified: Jan 11, 2025
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the Big Lips facial attribute label. In compliance with privacy considerations, all facial features have been redacted.
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the Mouth Closed facial attribute label. In compliance with privacy considerations, all facial features have been redacted.
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the Square Face facial attribute label. In compliance with privacy considerations, all facial features have been redacted.
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the No Eyewear facial attribute label. In compliance with privacy considerations, all facial features have been redacted.
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the Arched Eyebrows facial attribute label. In compliance with privacy considerations, all facial features have been redacted.
The GIF above displays 100 randomly selected sample images from the “POF” face recognition dataset, each annotated with the Middle Aged facial attribute label. In compliance with privacy considerations, all facial features have been redacted.

Context #

“We live in the age of digital data, and in that age mathematics has become the parliament of politics. The social law has become interwoven with models, theorems and algorithms. With digital data, mathematics has become the dominant means in which human beings coordinate with technology … Mathematics is a human activity after all. Like any other human activity, it carries the possibilities of both emancipation and oppression.” — Politically Mathematics Manifesto, 20191

Across Europe and globally, artificial intelligence (AI) systems are increasingly developed and deployed in ways that facilitate harmful and discriminatory state surveillance practices. These systems often leverage biometric technologies for identification, recognition, and categorization, alongside predictive algorithms for decision-making and resource allocation. In law enforcement, these applications disproportionately target marginalized communities, erode legal and procedural protections, and facilitate mass surveillance. The deployment of AI in law enforcement, security, and migration control—including the policing of social security—exacerbates power asymmetries between authorities and those under surveillance. This heightened imbalance significantly increases the risk of harm, undermines fundamental rights, and poses serious threats to the rule of law.

Artificial intelligence (AI) not only undermines due process but also fosters a form of thoughtlessness, as political philosopher Hannah Arendt described when analyzing the actions of Nazi war criminal Adolf Eichmann. This thoughtlessness reflects an inability to critically evaluate instructions, a failure to reflect on the potential consequences, and an adherence to the belief that a correct and just process is being followed. In the context of AI’s real-world impact, the primary concerns are discrimination and segregation. AI can be seen as a technology that perpetuates racism, reinforcing systems of racial and social segregation. In this sense, racism itself may be conceptualized as a technology of segregation, with AI playing a central role in its contemporary manifestation.

This is particularly evident in the deployment of facial recognition technology, which stands as one of the most contentious and ethically problematic applications of artificial intelligence (AI) to date. The concern extends beyond the mere fact that facial recognition systems exhibit lower accuracy rates when applied to individuals of color; more fundamentally, it lies in the way these technologies facilitate what Simone Browne terms ‘digital epidermalization’—’the exercise of power cast by the disembodied gaze of certain surveillance technologies… that can be employed to do the work of alienating the subject by producing a truth about the racial body and one’s identity (or identities) despite the subject’s claims.’2 In this context, facial recognition technology is implicated not only in reinforcing racialized power structures but also in the construction of a racialized truth that undermines individual autonomy and agency.

Facial recognition technology is a profoundly invasive, discriminatory, and biased mechanism. It reinforces structural power imbalances, perpetuates racial discrimination, sustains systems of racism, and facilitates authoritarian control. Within the European Union, AI-driven facial recognition systems are deployed in public spaces to capture and analyze facial features, classifying individuals into racial and ethnic categories such as Roma and Sinti. This process implicitly associates non-white individuals with suspicion, reinforcing racialized surveillance practices that exacerbate social inequalities and deepen the marginalization of racial and ethnic minorities.

The Police Officers Faces (POF) dataset is an investigative counter-surveillance artistic project that critically interrogates the deployment of facial recognition technology, with particular focus on its use by law enforcement agencies. The dataset comprises 88,783 facial images of thousands of police officers, meticulously collected from publicly accessible online sources and repurposed for facial recognition. By repurposing this data; the project challenges the ethical ramifications of surveillance technologies and critiques their capacity to normalize state power. In a bold inversion of the traditional surveillance paradigm—where the powerful observe and monitor the powerless—the ‘POF’ dataset shifts the surveillance gaze, enabling citizens to directly scrutinize those in positions of authority. This inversion disrupts the inherently one-sided nature of surveillance, stimulating critical reflection on the power dynamics underpinning these technologies, and raising essential questions about accountability, transparency, and the ethical use of surveillance in democratic societies.

To counter the dehumanizing effects of automated discrimination, systemic segregation, and the amplification of harm inherent in mass surveillance, the project proposes a radical alternative technical framework—a collective ‘counter-intelligence.’ This approach challenges the centralized control of surveillance technologies and emphasizes the transformative potential of citizens and grassroots movements in resisting and dismantling unjust systems. By employing a multifaceted methodology that integrates targeted data collection, social engineering, and rigorous analytical techniques, the ‘POF’ dataset offers a comprehensive structural alternative to prevailing surveillance models that disproportionately target marginalized communities. It not only empowers individuals engaged in copwatch, counter-surveillance, and other forms of resistance, but also equips them with the tools necessary to challenge and resist authoritarian oversight mechanisms. Furthermore, the project provides a platform for collective action, enabling grassroots movements to advocate for transparency, accountability, and justice within law enforcement practices.

Dataset Description #

The Police Officers Faces (POF) dataset constitutes a meticulously curated collection of facial images of law enforcement officers, derived from publicly accessible online sources. The dataset comprises 88,783 aligned and cropped facial images, each formatted to a resolution of 200×280 pixels. Each image is centered on a single face, with the majority being labeled with the name of the individual depicted. On average, each individual is represented by 12.22 facial images. Unlike most existing facial recognition datasets, the images in this collection were obtained in entirely uncontrolled settings, with subjects who were uncooperative. Consequently, the dataset exhibits considerable variability across multiple dimensions, including pose, resolution, lighting conditions, facial expressions, scene composition, imaging conditions, and other technical parameters.

The dataset includes JPEG-formatted images supplemented by detailed metadata stored in JSON and CSV formats. Part of this metadata encompasses facial bounding box coordinates and biometric landmark data, providing a robust framework for advanced analysis. Furthermore, the dataset includes comprehensive personal identifiers, such as the full names of the depicted individuals, enhancing its utility for identification, verification, and recognition tasks. In addition, each facial image is annotated with seventy-three distinct attributes, organized into three primary recognition domains: age group, gender, and a diverse array of visual characteristics. These visual traits encompass attributes such as hair color, face shape, and the presence of makeup, thereby enabling multifaceted recognition and analytical capabilities.

* Please be advised that the information presented on this page serves as an initial overview. It is subject to updates and revisions as new data and insights become available, ensuring the accuracy and comprehensiveness of the content.

License Agreement #

Any entity utilizing the Police Officers Faces (POF) dataset acknowledges and agrees to adhere to the following terms and conditions:

  • Non-Commercial Use: The dataset, along with its subsets, is made available exclusively for non-commercial research purposes. The reproduction, duplication, copying, sale, trade, resale, or any form of exploitation—whether direct or indirect—of the dataset or any derived data for commercial purposes is strictly prohibited.

  • No Warranty: The dataset is made available without any express or implied warranties. The user assumes full responsibility for their use of the dataset and acknowledges the inherent risks associated with the exercise of the rights granted under this license. The administrator of the Police Officers Faces (POF) dataset explicitly disclaims any liability for damages—whether physical, financial, or otherwise—arising from the use of the dataset.

  • No Distribution: Redistribution or sublicensing of the dataset, in whole or in part, to any third party is prohibited. This prohibition extends to any form of publication, copying, dissemination, or distribution to any organization, entity, or individual.

  • No Modification: Users are expressly prohibited from modifying, altering, or adapting the dataset in any form or manner.

Supplemental Information #

Annotated Sample Images #

The following section presents a randomly selected subset of sample images, each annotated with the corresponding seventy-three attribute labels. The frequency distribution of these labels has been visualized using the Matplotlib library. To preserve the privacy of the individuals depicted, the faces in the images have been redacted.

 The plot above illustrates the frequency distribution of the attribute labels corresponding to the image identified by "image_id": "g8ltccwb". Download data as CSV
 
 The plot above illustrates the frequency distribution of the attribute labels corresponding to the image identified by "image_id": "8g5e6p7g". Download data as CSV
 
 The plot above illustrates the frequency distribution of the attribute labels corresponding to the image identified by "image_id": "lo7zii72". Download data as CSV
 
 The plot above illustrates the frequency distribution of the attribute labels corresponding to the image identified by "image_id": "vnqvu49b". Download data as CSV
 
 The plot above illustrates the frequency distribution of the attribute labels corresponding to the image identified by "image_id": "ek1qpg59". Download data as CSV

* It is important to note that this specific section of the dataset employs a model with inherent limitations, which categorizes faces in a strictly discriminative manner. Additionally, the ground truth for the face attribute annotations is derived through automatic estimation, and these annotations have not been independently verified by human annotators.

Access #

The Police Officers Faces (POF) dataset, version 0.1.0, is currently not accessible to the general public. It is specifically designed to equip investigative journalists, human rights researchers, and digital activists with the means to strengthen justice and advocacy efforts.

Citations #

  • ICEspy by Kyle McDonald, 2018: “Concerned you may know an ICE employee? Use this helpful app to check. Just point your camera and press the photo button. We’ll check against hundreds of ICE employee photos from LinkedIn and show you the closest match. A direct subversion of face recognition technology to watch those who wield power without responsibility.”

  • Capture by Paolo Cirio, 2020: “The series of photos Capture is composed of French police officers’ faces. Paolo Cirio collected 1000 public images of police in photos taken during protests in France and processed them with Facial Recognition software. Cirio then created an online platform with a database of the resulting 4000 faces of police officers to crowdsource their identification by name.”

  • NYPD COPPELGÄNGER by Sam Lavigne, 2024: “Facial recognition on an archive of ~10,000 photos of New York cops. Images are publicly available and sourced from 50-a.org and then processed using Deepface with the yunet model for facial detection and Facenet512 for recognition.”

Credits #

Software #

The majority of this project was developed utilizing open-source software tools. Core software application development and data analysis were conducted using Python, alongside key software libraries and tools such as Pandas, Markdown, and Matplotlib, as well as other widely recognized open-source packages. All of these resources were freely available to support the project’s objectives. Communication throughout the project was facilitated using Signal, an open-source service for encrypted instant messaging, voice, and video calls, while face redaction was performed using the open-source DFACE.app web application.

Image #

Front Page Image: A mosaic comprising approximately 63,000 facial images of law enforcement officers from the Police Officers Faces (POF) facial recognition dataset. The images have been strategically downsized to protect the privacy and confidentiality of the individuals depicted, ensuring compliance with ethical standards and privacy considerations. Mosaic: © Algorithmic Sabotage Research Group (ASRG).


  1. Politically Mathematics collective, “Politically Mathematics Manifesto,” 2019. https://politicallymath.in/manifesto/↩︎

  2. Browne, S. 2015. Dark Matters: On the Surveillance of Blackness. Durham, NC: Duke University Press Books. ↩︎