Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The problem of feature dimension #1

Open
LuMenCat opened this issue Jun 1, 2022 · 3 comments
Open

The problem of feature dimension #1

LuMenCat opened this issue Jun 1, 2022 · 3 comments

Comments

@LuMenCat
Copy link

LuMenCat commented Jun 1, 2022

In ./datasets/CASIA_CK+_data_gen.py file:
you set the dimension of sample_numpy is 4, then through some concat operations, I did not see where the dimension could be changed. But in data_normalization(), the dimension of Data is 5.

I feel a little confused here. Can you answer it?
Thanks.

@negarhdr
Copy link
Owner

negarhdr commented Jun 1, 2022

sample_numpy represents the landmarks of each video/image_sequence and its 4 dimensions represent the number of frames, number of faces in each frame, number of landmarks in each face, and the feature dimension (channels) for each landmark!
After the concatenation of several samples in a fold, we will have an extra dimension "N" which shows the number of concatenated samples! So the total number of dimensions will be 5.

@LuMenCat
Copy link
Author

LuMenCat commented Jun 2, 2022

image
This is the mistake when I run this code.
On line 38 of CASIA_CK+_data_gen.py, you create a 4-dimensional array, then through the following concat operations,such as line 49, 97. But np.concatenate() could't increased the dimension. And in data_normalization(), the dimension of Data is 5, can you introduce it in detail?And describe what N, V, C, T, M represent respectively.

Thanks.

@negarhdr
Copy link
Owner

negarhdr commented Jun 2, 2022

You're right. Thanks for raising the issue. It should be fixed now.

The dimensions are as follows:
N: number of samples,
C: number of channels (dimensions) of each facial landmark
T: number of frames in an image sequence
V: number of landmarks in each face
M: number of faces in each frame

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants