{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T20:29:24Z","timestamp":1771705764001,"version":"3.50.1"},"reference-count":44,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,12,19]],"date-time":"2023-12-19T00:00:00Z","timestamp":1702944000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"\u201cPioneer\u201d and \u201ceading Goose\u201d R&D Program of Zhejiang","award":["2023C01030"],"award-info":[{"award-number":["2023C01030"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62122011, U21A20514"],"award-info":[{"award-number":["62122011, U21A20514"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2024,2,29]]},"abstract":"<jats:p>The pedestrian attribute recognition aims to generate a structured description of pedestrians, which serves an important role in surveillance. Current works usually assume that the images and the specific pedestrian states, including pedestrian occlusion and pedestrian orientation, are given. However, we argue that the current works ignore the guidance of the pedestrian state and cannot achieve the appropriate performance since the appearance feature will become unreliable due to the variance of the pedestrian state, which is common in practice. Therefore, this paper proposes the Explicit State Representation (ExSR) Guided Pedestrian Attribute Recognition to improve the accuracy through state learning and attribute fusion among frames. Firstly, the pedestrian state is explicitly represented by concatenating the pedestrian orientation and occlusion, which can be accurately determined via analyzing the pose. Secondly, the state-aware pedestrian attribute fusion method is proposed and divided into two cases, namely the inter-state case and the intra-state case. In the intra-state case, the appearance feature will remain stable and the attribute relations are propagated to refine. The method of exploiting attribute relations within a single frame is the Graph Neural Network. In the inter-state case, the state changes, the attribute relationship propagation is prevented, and the advantages of attribute recognition in each frame are complemented to make a reliable judgment on the invisible region. The experimental results demonstrate that the ExSR outperforms the state-of-the-art methods on two public databases, benefiting from the explicit introduction of the state into the attribute recognition.<\/jats:p>","DOI":"10.1145\/3626240","type":"journal-article","created":{"date-parts":[[2023,10,26]],"date-time":"2023-10-26T22:13:09Z","timestamp":1698358389000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Explicit State Representation Guided Video-based Pedestrian Attribute Recognition"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1143-6745","authenticated-orcid":false,"given":"Wei-Qing","family":"Lu","sequence":"first","affiliation":[{"name":"Hangzhou Innovation Institute of Beihang University, Hangzhou 310051, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6811-9209","authenticated-orcid":false,"given":"Hai-Miao","family":"Hu","sequence":"additional","affiliation":[{"name":"the State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing 100191, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0836-6181","authenticated-orcid":false,"given":"Jinzuo","family":"Yu","sequence":"additional","affiliation":[{"name":"Hangzhou Innovation Institute of Beihang University, Hangzhou 310051, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8065-5901","authenticated-orcid":false,"given":"Shifeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"HIKVISION, Hangzhou 310051, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6913-9786","authenticated-orcid":false,"given":"Hanzi","family":"Wang","sequence":"additional","affiliation":[{"name":"Xiamen University, Xiamen 361005, China"}]}],"member":"320","published-online":{"date-parts":[[2023,12,19]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.2975417"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-31723-2_18"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00532"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654966"},{"key":"e_1_3_1_6_2","article-title":"Correlation graph convolutional network for pedestrian attribute recognition","author":"Fan Haonan","year":"2020","unstructured":"Haonan Fan, Hai-Miao Hu, Shuailing Liu, Weiqing Lu, and Shiliang Pu. 2020. Correlation graph convolutional network for pedestrian attribute recognition. IEEE Transactions on Multimedia (2020).","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123424"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00356"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.248"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.59"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2017.8296261"},{"key":"e_1_3_1_13_2","article-title":"Rethinking of pedestrian attribute recognition: Realistic datasets with efficient method","author":"Jia Jian","year":"2020","unstructured":"Jian Jia, Houjing Huang, Wenjie Yang, Xiaotang Chen, and Kaiqi Huang. 2020. Rethinking of pedestrian attribute recognition: Realistic datasets with efficient method. arXiv preprint arXiv:2005.11909 (2020).","journal-title":"arXiv preprint arXiv:2005.11909"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3061538"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1007\/s42979-021-00493-z"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459250"},{"key":"e_1_3_1_17_2","first-page":"111","volume-title":"ACPR","author":"Li Dangwei","year":"2015","unstructured":"Dangwei Li, Xiaotang Chen, and Kaiqi Huang. 2015. Multi-attribute learning for pedestrian attribute recognition in surveillance scenarios. In ACPR. 111\u2013115."},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2018.8486604"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2878349"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1117\/12.2588381"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_41"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2019.06.006"},{"key":"e_1_3_1_23_2","article-title":"Sequence-based person attribute recognition with joint CTC-Attention model","author":"Liu Hao","year":"2018","unstructured":"Hao Liu, Jingjing Wu, Jianguo Jiang, Meibin Qi, and Bo Ren. 2018. Sequence-based person attribute recognition with joint CTC-Attention model. arXiv preprint arXiv:1811.08115 (2018).","journal-title":"arXiv preprint arXiv:1811.08115"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.46"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.148"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01390"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1049\/ipr2.12195"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01282"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-48881-3_2"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2018.02.028"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01252-6_42"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.3390\/fi11110245"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP40778.2020.9191264"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2015.51"},{"key":"e_1_3_1_35_2","article-title":"Sequence to sequence learning with neural networks","volume":"27","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems 27 (2014).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6883"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00510"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2021.108220"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00543"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/AMCON.2018.8614752"},{"key":"e_1_3_1_41_2","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1109\/ICMEW.2017.8026292","volume-title":"2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)","author":"Yao Chunfeng","year":"2017","unstructured":"Chunfeng Yao, Bailan Feng, Defeng Li, and Jian Li. 2017. Hierarchical pedestrian attribute recognition based on adaptive region localization. In 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 471\u2013476."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICME46284.2020.9102757"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2812605"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_52"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.5555\/2919332.2919877"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3626240","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3626240","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:05:47Z","timestamp":1750291547000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3626240"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,19]]},"references-count":44,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,2,29]]}},"alternative-id":["10.1145\/3626240"],"URL":"https:\/\/doi.org\/10.1145\/3626240","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,19]]},"assertion":[{"value":"2022-08-08","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-08-24","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}