Case Studies / AI-Powered Face Search for Mass Event

AI-Powered Face Search for Mass Event

Client Industry Public Relations / Large Scale Events
Tech Stack Python, Dlib (C++ Library), OpenCV, Django, Apache

The Challenge

During a massive, nationwide public awareness march, hundreds of photos were uploaded daily. Participants struggled to find their specific pictures among thousands of unsorted uploads. Scrolling through endless cloud folders was inefficient and led to a poor user experience.

Key Impact

Sub-second Face Matching on 50k+ Photos

The Challenge: Finding a Needle in a Digital Haystack

Our client was organizing a massive, nationwide march spanning several months. Every day, photographers uploaded hundreds of high-resolution images of the leader interacting with citizens.

The archive grew to tens of thousands of photos. For a participant to find their specific moment with the leader, they had to manually scroll through endless Google Drive folders. The user experience was broken.

The Engineering: Custom Computer Vision Pipeline

Off-the-shelf solutions were too expensive or didn't offer the privacy controls we needed. We engineered a custom solution:

  • Face Encoding: We used the Dlib library to generate 128-dimensional vector embeddings for every face in the dataset.
  • Vector Search: Instead of pixel matching (which is slow), we performed Euclidean distance calculations on these vectors, allowing us to search thousands of photos in milliseconds.
  • Dynamic Thresholding: Facial recognition is rarely black and white. Lighting and angles affect accuracy. We built a "Strict vs. Liberal" slider that let users control the algorithm's sensitivity. If "Strict" returned no results, they could slide to "Liberal" to see partial matches.

The Outcome

The platform became the central hub for digital engagement during the event.

Thousands of participants successfully retrieved their photos, turning a chaotic dump of images into a personalized, searchable memory bank. The system handled the daily influx of new data without performance degradation.

Facing a similar challenge?

We can architect this solution for you.

Discuss Engineering