#computervision #artificialintelligence #innovation

Oracle•105K followers

𝗗𝗮𝘆-𝟯𝟲𝟲 𝗖𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗩𝗶𝘀𝗶𝗼𝗻 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos SenseTime 商汤科技 Follow me for a similar post: 🇮🇳 Ashish Patel ------------------------------------------------------------------- 𝗜𝗻𝘁𝗲𝗿𝗲𝘀𝘁𝗶𝗻𝗴 𝗙𝗮𝗰𝘁𝘀 : 🔸 Paper: SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos 🔸 This paper is published arxiv2021. 🔸 SMOOTHNET, a simple yet ef- fective pose refinement network to improve the temporal smoothness and per-frame precision of existing pose/body estimators. Compared to existing solutions, SMOOTH- NET can deal with long-term significant jitters that occurred often with rare or occluded poses, as verified with compre- hensive experiments on a large number of backbone net- works and datasets. ------------------------------------------------------------------- 𝗜𝗠𝗣𝗢𝗥𝗧𝗔𝗡𝗖𝗘 🔸 When analyzing human motion videos, the output jitters from existing pose estimators are highly-unbalanced. 🔸Most frames only suffer from slight jitters, while significant jitters occur in those frames with occlusion or poor image quality. 🔸Such complex poses often persist in videos, leading to consecutive frames with poor estimation results and large jitters. 🔸Existing pose smoothing solutions based on temporal convolutional networks, recurrent neural networks, or low-pass filters cannot deal with such a long-term jitter problem without considering the significant and persistent errors within the jittering video segment. 🔸Motivated by the above observation, we propose a novel plug-and-play refinement network, namely SMOOTHNET, which can be attached to any existing pose estimators to improve its temporal smoothness and enhance its per-frame precision simultaneously. 🔸Especially, SMOOTHNET is a simple yet effective data-driven fully-connected network with large receptive fields, effectively mitigating the impact of long-term jitters with unreliable estimation results. 🔸We conduct extensive experiments on twelve backbone networks with seven datasets across 2D and 3D pose estimation, body recovery, and downstream tasks. Our results demonstrate that the proposed SMOOTHNET consistently outperforms existing solutions, especially on those clips with high errors and long-term jitters. #computervision #artificialintelligence #innovation