Tencent Youtu Lab recently made another breakthrough in person re-identification (ReID) technology. By introducing cross-scene ReID, its model set new records on three authoritative mainstream public ReID datasets, CUHK03, DukeMTMC and Market-1501, and achieved the best results in the industry on the two key metrics, rank-1 accuracy and mean average precision (mAP).
Person re-identification (Person ReID) means establishing identity correspondence (that is, associating pedestrian IDs) between pedestrian images captured by different cameras, so that a pedestrian's path of movement can be traced across the whole scene. In short, people can still be recognized by their body appearance and posture in complex multi-scene settings where the face cannot be seen. Compared with face recognition, ReID is highly robust to occlusion, orientation and image clarity of the human body, and places no hard requirements on the resolution, mounting position or angle of the camera. As a result, ReID has become a hot topic in computer vision, following face recognition.
Given the technical advantages of ReID and its broad application prospects in many fields, Tencent Youtu has invested heavily in this direction in recent years and built a comprehensive technical portfolio. It has published more than 15 academic papers on related topics in top international conferences and journals such as CVPR, TPAMI, AAAI and IJCAI.
Although ReID technology has evolved over many years, the complex and changeable scenes of the real world make cross-domain person re-identification a major open problem. The cross-scene ReID that Tencent Youtu introduced in setting new records on the three datasets is a technical breakthrough on exactly this difficulty.
The difficulty of cross-scene recognition is that factors such as ambient illumination, camera angle and background change the visual features of pedestrian images from one scene to another. Examples include indoor shopping malls, the side-view and high-angle cameras of small stores, outdoor roads, and residential communities under strong light or at night. Adapting ReID to such complex and changeable scenes and achieving cross-scene pedestrian image retrieval is a major technical challenge. It is also a key technology for linking indoor and outdoor pedestrian tracking and for city-wide tracking. Overcoming this difficulty greatly helps to expand the scenarios and business formats in which ReID can be deployed and to achieve large-scale pedestrian identification.
Visual differences between indoor and outdoor pedestrian images in the public dataset MSMT17
To solve this technical difficulty, Tencent Youtu proposed a cross-scene person re-identification framework. It builds on targeted optimization for business problems such as occlusion matching, full-angle matching and cross-domain retrieval, and on extensive accumulation and innovation in model structure, loss functions, training algorithms and other techniques. The framework adopts a model based on graph convolution and a Siamese network, which gives the neural network stronger recognition ability for human bodies in multiple orientations and poses. The technology learns a unified feature representation for pedestrian visual features across different scenes, shooting angles and lighting conditions, and effectively improves the accuracy of ReID in indoor-outdoor and cross-scene pedestrian image retrieval.
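To illustrate what retrieval over a unified feature representation looks like in practice, here is a minimal sketch. It is not Tencent Youtu's implementation: it assumes that some model has already mapped each pedestrian image to a feature vector, and it uses cosine similarity as the matching score, which is a common choice but an assumption here. The function names and the toy 3-dimensional features are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a feature vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def rank_gallery(query_feat, gallery_feats):
    """Return gallery indices sorted from most to least similar to the query."""
    scores = [cosine_similarity(query_feat, g) for g in gallery_feats]
    return sorted(range(len(gallery_feats)), key=lambda i: scores[i], reverse=True)


# Toy, made-up features; real ReID embeddings have far more dimensions.
query = [0.9, 0.1, 0.0]
gallery = [
    [0.0, 1.0, 0.0],  # a different-looking person
    [0.8, 0.2, 0.1],  # assumed to be the same person seen by another camera
    [0.1, 0.0, 1.0],  # a different-looking person
]
ranking = rank_gallery(query, gallery)
print(ranking)
```

If the learned features are consistent across scenes, images of the same person from different cameras land close together and therefore appear at the top of the ranking, which is what cross-scene retrieval depends on.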
By introducing cross-scene ReID, Tencent Youtu achieved the best results in the industry on three datasets, reaching a Rank-1 accuracy of 98.99% on Market-1501. Rank-1 and mAP are the core metrics for measuring the technical level of ReID. A high Rank-1, or first-hit rate, means that the image the algorithm ranks first among many candidates is a correct match for the query.
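The two metrics can be made concrete with a small worked example. The sketch below is a simplified illustration: it assumes each query already comes with a ranked list of gallery identity labels, and it omits the extra filtering that standard benchmark protocols apply (for example, excluding gallery images taken by the same camera as the query). The function names and toy data are hypothetical.

```python
def average_precision(ranked_ids, query_id):
    """AP for one query: mean of the precision at each rank where a correct match appears."""
    hits = 0
    precisions = []
    for rank, gallery_id in enumerate(ranked_ids, start=1):
        if gallery_id == query_id:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def evaluate(results):
    """results: list of (query_id, ranked_gallery_ids). Returns (rank1, mAP)."""
    n = len(results)
    # Rank-1: fraction of queries whose top-ranked gallery image is a correct match.
    rank1_hits = sum(1 for qid, ranked in results if ranked and ranked[0] == qid)
    # mAP: average precision, averaged over all queries.
    aps = [average_precision(ranked, qid) for qid, ranked in results]
    return rank1_hits / n, sum(aps) / n


# Toy rankings with made-up identity labels.
results = [
    ("A", ["A", "B", "A", "C"]),  # first hit at rank 1; AP = (1/1 + 2/3) / 2
    ("B", ["C", "B", "A", "B"]),  # first hit at rank 2; AP = (1/2 + 2/4) / 2
]
rank1, mean_ap = evaluate(results)
print(f"Rank-1 = {rank1:.2f}, mAP = {mean_ap:.4f}")
```

Rank-1 only looks at the top result, while mAP rewards placing all correct matches of a query near the top, which is why the two are usually reported together.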
On this basis, Tencent Youtu's ReID algorithm also leads in multi-scene pedestrian image retrieval: on the cross-scene ReID dataset MSMT17 it surpasses existing algorithms and reaches the state-of-the-art level.
Tencent Youtu's ReID technology has not only achieved leading performance on the relevant datasets, but has also reached commercial grade and been widely deployed in a variety of scenarios. In the future, as cross-scene person re-identification gradually matures, Tencent's ReID technology will deliver value in more scenarios and business formats.