我正在为人脸识别系统构建一个原型,在编写算法时,我有几个问题。
算法:


这些学习参数将确保相同图像的平坦的n-昏暗编码之间的距离会很小,而不同的图像之间的距离会很大。
- You will pass both of the test images through the CNN and get the corresponding encodings!
- Now, **The question comes** that how would you find the similarity between the test-picture-encoding and all the training-picture-encoding in the database? - **First question**, Would you do cosine similarity or I need to do something else? Can you add more clarity on it?
- **Second question**, Also, in terms of efficiency, how would you handle a scenario wherein you have 100,000 employees training-picture-encoding in the database present and for every new person you need to look these 100,000 encodings and compute cosine similarity and give result in <2 secs? Any suggestion on this part?
- **Third question** usually for face recognition task if we use approach(Image-->CNN-->SoftMax--> output), Each time a new person joins your organization you need to retrain your network, that's why it's a bad approach! - This problem can be mitigated by using the 2nd approach wherein we are using a learned distance function "d(img1, img2)" over a pair of images of employees as stated above on in **point 1 to 3**.
-
- **My question** is in case of a new employee joining the organization, How this learned distance function would be able to generalize when it was not been used in the training set at all? Isn't a problem of changed data distribution of test and train set? Any suggestion in this regards
有人能帮助理解这些概念上的小故障吗?
发布于 2019-03-27 19:36:45
在对人脸验证和识别/检测的相关文献进行综述的基础上,提出了计算机视觉领域的研究论文。我想我所有的问题都有答案,所以我在这里试着回答。
第一个问题,你会做余弦相似吗?你能给它增加更多的清晰度吗?
第二个问题,在效率方面,您将如何处理这样一个场景:数据库中有100,000名员工进行培训-图像编码,对于每一个需要查看这100,000个编码和计算余弦相似性的新人员,并给出<2秒?的结果。
第三个问题:-首先,我们正在通过最小化三重态损失函数来学习深层CNN的网络参数!
https://stackoverflow.com/questions/55291009
复制相似问题