Batching on `.represent` to improve performance and utilize GPU in full #1433

galthran-wq · 2025-02-11T14:03:02Z

Tickets

What has been done

With this PR, `.represent` accepts a list of image paths or NumPy arrays and processes them all in one batch.

How to test

make lint && make test

I've made a Colab notebook:
https://colab.research.google.com/drive/1bV0yyrdT1a0a4dyemoeL5xf28w3ql1fd#scrollTo=lVndmFF5Kls7
It shows a >10x performance improvement, and also that with batch_size=1 the GPU is barely utilized.
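The difference between per-image and batched calls can be illustrated with a toy example. This is only a sketch, not DeepFace's code: `toy_forward` and the random `weights` are hypothetical stand-ins for a model's forward pass. Stacking the inputs lets a single call handle every image, and the result matches the per-image loop.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical stand-in for model weights: maps a flattened 8x8x3 image
# to a 4-dimensional embedding.
weights = rng.standard_normal((8 * 8 * 3, 4))


def toy_forward(batch: np.ndarray) -> np.ndarray:
    """Stand-in forward pass for a batch of shape (N, 8, 8, 3)."""
    return batch.reshape(batch.shape[0], -1) @ weights


images = [rng.random((8, 8, 3)) for _ in range(6)]

# One call per image (batch size 1).
one_by_one = np.concatenate([toy_forward(img[np.newaxis]) for img in images])

# One call for all images stacked into a single (6, 8, 8, 3) batch.
batched = toy_forward(np.stack(images))
```

On a GPU, the single stacked call is what keeps the device busy; the loop issues six small calls instead.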

serengil · 2025-02-11T14:22:18Z

Would you please write a unit test to make this easier to review?

serengil · 2025-02-11T17:39:57Z

tests/test_represent.py

@@ -81,3 +83,49 @@ def test_max_faces():
    max_faces = 1
    results = DeepFace.represent(img_path="dataset/couple.jpg", max_faces=max_faces)
    assert len(results) == max_faces
+
+
+@pytest.mark.parametrize("model_name", [


Please do this with only one model, e.g. Facenet.

I excluded some of those models from the tests; otherwise the tests would take too long to run on GitHub.

It's just that some models have custom forward logic, which I also had to tweak a little bit.

Maybe still keep those models (Dlib, SFace, VGGFace), along with a Keras one such as Facenet?

Okay, but some are optional (e.g. dlib) and are not installed in GitHub Actions, so your tests would fail.
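One possible way to keep optional models in a parametrized test without breaking CI is to filter the model list by what is importable. This is a sketch only: the `OPTIONAL_PACKAGES` mapping and `installed_models` helper are hypothetical and not part of this PR.

```python
import importlib.util

# Hypothetical mapping from model name to the optional package it needs.
OPTIONAL_PACKAGES = {"Dlib": "dlib"}


def installed_models(candidates):
    """Keep models whose optional package (if any) can be found."""
    kept = []
    for name in candidates:
        package = OPTIONAL_PACKAGES.get(name)
        # Models without an optional dependency are always kept.
        if package is None or importlib.util.find_spec(package) is not None:
            kept.append(name)
    return kept


models_to_test = installed_models(["Facenet", "VGG-Face", "Dlib"])
```

The resulting list could then be passed to `pytest.mark.parametrize`, so a missing dlib install shrinks the test matrix instead of failing it.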

serengil · 2025-02-11T19:18:20Z

There are some linting issues that broke the Actions checks.

serengil · 2025-02-16T19:58:07Z

LGTM

Thank you for your contribution

serengil · 2025-02-18T13:10:27Z

tests/test_represent.py

+        "dataset/img2.jpg",
+        "dataset/img3.jpg",
+        "dataset/img4.jpg",
+        "dataset/img5.jpg",


If you add couple.jpg here, there are 6 input images, but the response will have 7 items.

The bad part is that we cannot tell which image has 2 faces. I am creating a PR to store the input image's index in the response payload.

We may consider a List of List of Dict response type for batch inputs in the future.

I think it is a good idea to have List of List of Dict. I could make a PR now, or perhaps later, once batched detection is merged, because I also had in mind to use batched detection for .represent (currently it is done in a for loop).

I will do the initial changes. PRs are always welcome!

I merged a workaround PR for this.

great work!

I will optimize this, because if the batch is large, this approach has O(n^2) complexity.

I plan to use a dict, which reduces the complexity to O(n).
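A minimal sketch of dict-based grouping, assuming each item in the flat response carries its input image's index. The key name `img_index` and the helper `group_by_image` are hypothetical and may not match what was merged.

```python
from collections import defaultdict


def group_by_image(flat_results, num_images, index_key="img_index"):
    """Turn a flat list of face dicts into one list per input image.

    Each item is visited once, so the cost is O(n) in the number of faces,
    rather than rescanning the whole list for every image.
    """
    grouped = defaultdict(list)
    for item in flat_results:
        grouped[item[index_key]].append(item)
    # Images with no detected face get an empty list.
    return [grouped.get(i, []) for i in range(num_images)]


flat = [
    {"img_index": 0, "embedding": [0.1]},
    {"img_index": 1, "embedding": [0.2]},  # first face in image 1
    {"img_index": 1, "embedding": [0.3]},  # second face in image 1
]
nested = group_by_image(flat, num_images=3)
```

Here `nested` is a List of List of Dict: one face for image 0, two for image 1, and none for image 2.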

galthran-wq added 4 commits February 11, 2025 13:01

batched inputs in representation

0ef420b

typo

72919d9

update DeepFace represent method

d7a985b

compatibility

bb134b2

galthran-wq added 7 commits February 11, 2025 16:58

batched represent

c60152e

VGGFace batched inference

8fb70eb

SFace pseudo-batched inference

035d3c8

List->Sequence typing

3a9385f

dlib pseudo-batched forward

a4a579e

dlib true-batched forward

8becc97

refactor test

9e12c92

serengil reviewed Feb 11, 2025

remove unnecessary models from the test

da03b47

linting

f1734b2

This was referenced Feb 12, 2025

[FEATURE]: Batching .extract_faces #1434

Open

Batching on .extract_faces to improve performance and utilize GPU in full #1435

Open

serengil merged commit ca73032 into serengil:master Feb 16, 2025
2 checks passed

serengil reviewed Feb 18, 2025
