|
[1] A. Toshev, and C. Szegedy, Deeppose: Human Pose Estimation via Deep Neural Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 1653–1660, 2014. [2] A. Krizhevsky, I. Sutskever, and G. Hinton, Imagenet Classification with Deep Convolutional Neural Networks. In Neural Information Processing Systems NIPS, 2012. [3] J. Carreira, P. Agrawal, K. Fragkiadaki, and J. Malik. Human Pose Estimation with Iterative Error Feedback. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR,2016 [4] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, and A. Rabinovich, Going Deeper with Convolutions. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2015. [5] X. Sun, J. Shang, S. Liang, and Y. Wei. Compositional Human Pose Regression. In IEEE International Conference on Computer Vision ICCV, 2017. [6] K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2015. [7] D. C. Luvizon, H. Tabia, and D. Picard. Human Pose Regression by Combining Indirect Part Detection and Contextual Information. In Clinical Orthopaedics and Related Research CoRR, abs/1710.02322, 2017. [8]T. Pfister, K. Simonyan, J. Charles, and A. Zisserman, Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos. In Asian Conference on Computer Vision ACCV, 2014. [9]A. Nibali, Z. He, S. Morgan, and L. Prendergast, Numerical Coordinate Regression with Convolutional Neural Networks. In arXiv:1801.07372, 2018. [10]S. Li, Z.-Q. Liu, and A. B. Chan, Heterogeneous Multi-Task Learning for Human Pose Estimation with Deep Convolutional Neural Network, In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2014. [11]X. Fan, K. Zheng, Y. Lin, and S. Wang, Combining Local Appearance and Holistic View: Dual-Source Deep Neural Networks for Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2015. [12]D. C. Luvizon, D. Picard, and H. Tabia, 2d/3d Pose Estimation and Action Recognition Using Multitask Deep Learning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2018. [13]F. Zhang, X. Zhu, and M. Ye, Fast Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019. [14] J. J. Tompson, A. Jain, Y. LeCun, and C. Bregler, Joint Training of a Convolutional Network and A Graphical Model for Human Pose Estimation. In Neural Information Processing Systems NIPS, 2014. [15]I. Lifshitz, E. Fetaya, and S. Ullman, Human Pose Estimationusing Deep Consensus Voting. In European Conference on Computer Vision ECCV, 2016. [16] S. E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional Pose Machines. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 4724-4732, 2016. [17] V. Ramakrishna, D. Munoz, M. Hebert, J. A. Bagnell, and Y. Sheikh, Pose Machines: Articulated Pose Estimation via Inference Machines. In European Conference on Computer Vision ECCV, pp. 33-47, September 2014. [18] A. Newell, K. Yang, and J. Deng, Stacked Hourglass Networks for Human Pose Estimation. In European Conference on Computer Vision ECCV, pp. 483-499, October 2016. [19] X. Chu, W. Yang, W. Ouyang, C. Ma, A. L. Yuille, and X. Wang, Multi-Context Attention for Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2017. [20]W. Yang, S. Li, W. Ouyang, H. Li, and X. Wang, Learning Feature Pyramids for Human Pose Estimation. In IEEE International Conference on Computer Vision ICCV, 2017. [21]J. Tompson, R. Goroshin, A. Jain, Y. LeCun, and C. Bregler, Efficient Object Localization Using Convolutional Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2015. [22]A. Bulat and G. Tzimiropoulos, Human Pose Estimation Via Convolutional Part Heatmap Regression. In European Conference on Computer Vision ECCV, 2016. [23]G. Gkioxari, A. Toshev, and N. Jaitly, Chained Predictions Using Convolutional Neural Networks. In European Conference on Computer Vision ECCV, 2016. [24]V. Belagiannis and A. Zisserman, Recurrent Human Pose Estimation. In IEEE International Conference on Automatic Face and Gesture Recognition, 2017. [25]Y. Luo, J. Ren, Z. Wang, W. Sun, J. Pan, J. Liu, J. Pang, and L. Lin, LSTM Pose Machines. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2018. [26]B. Debnath, M. O'Brien, M. Yamaguchi, and A. Behera, Adapting Mobilenets for Mobile Based Upper Body Pose Estimation. In IEEE International Conference on Advanced Video and Signal-based Surveillance AVSS, 2018. [27]B. Xiao, H. Wu, and Y. Wei, Simple Baselines for Human Pose Estimation and Tracking. In European Conference on Computer Vision ECCV, 2018. [28]H. Zhang, H. Ouyang, S. Liu, X. Qi, X. Shen, R. Yang, and J. Jia, Human Pose Estimation with Spatial Contextual Information. In arXiv:1901.01760, 2019. [29]B. Artacho and A. Savakis, Unipose: Unified Human Pose Estimation in Single Images and Videos. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2020. [30]X. Chu, W. Yang, W. Ouyang, C. Ma, A. L. Yuille, and X.Wang, Multi-Context Attention For Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2017. [31] G. Papandreou, T. Zhu, N. Kanazawa, A. Toshev, J. Tompson, C. Bregler, and K. Murphy, Towards Accurate Multiperson Pose Estimation in the Wild. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 4903-4911, 2017. [32] H. S. Fang, S. Xie, Y. W. Tai, and C. Lu, RMPE: Regional Multi-Person Pose Estimation. In Proceedings of IEEE International Conference on Computer Vision ICCV, pp. 2334-2343, 2017. [33] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. E. Reed, SSD: Single Shot Multibox Detector, In European Conference on Computer Vision, pp. 21-37, October 2016. [34] M. Jaderberg, K. Simonyan, A. Zisserman, and K. Kavukcuoglu, Spatial Transformer Networks, Advances in Neural Information Processing System, Vol. 28, pp. 2017-2015, 2015. [35] B. Xiao, H. Wu, and Y. Wei, Simple Baselines For Human Pose Estimation And Tracking. In European Conference on Computer Vision ECCV, p.472–487, 2018. [36] K. Sun, B. Xiao, D. Liu, and J. Wang. Deep High-Resolution Representation Learning for Human Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019. [37] Y. Chen, Z. Wang, Y. Peng, Z. Zhang, G. Yu, and J. Sun, Cascaded Pyramid Network for Multi-Person Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 7103-7112, 2018. [38] S. Huang, M. Gong, and D. Tao, A Coarse-Fine Network for Keypoint Localization. In IEEE International Conference on Computer Vision ICCV, 2017. [39] K. Sun, B. Xiao, D. Liu, and J. Wang, Deep High-Resolution Representation Learning for Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019. [40] W. Li, Z.Wang, B. Yin, Q. Peng, Y. Du, T. Xiao, G. Yu, H. Lu, Y.Wei, and J. Sun, Rethinking On Multi-Stage Networks For Human Pose Estimation. In arXiv:1901.00148, 2019. [41] G. Moon, J. Y. Chang, and K. M. Lee, Posefix: Model-Agnostic General Human Pose Refinement Network. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019. [42] J. Wang, X. Long, Y. Gao, E. Ding, and S. Wen, Graph-Pcnn: Two Stage Human Pose Estimation with Graph Pose Refinement. In arXiv:2007.10599, 2020. [43] J. Huang, Z. Zhu, F. Guo, and G. Huang, The Devil Is in The Details: Delving Into Unbiased Data Processing For Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2020. [44] Y. Cai, Z. Wang, Z. Luo, B. Yin, A. Du, H. Wang, X. Zhou, E. Zhou, X. Zhang, and J. Sun, Learning Delicate Local Representations For Multi-Person Pose Estimation. In arXiv:2003.04030, 2020. [45] F. Zhang, X. Zhu, H. Dai, M. Ye, and C. Zhu, Distribution-Aware Coordinate Representation for Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2020. [46] L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P. Gehler, and B. Schiele, Deepcut: Joint Subset Partition and Labeling for Multi Person Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 4929-4937, 2016. [47] E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A Deeper, Stronger, and Faster Multiperson Pose Estimation Model. In European Conference on Computer Vision, Springer, pp. 34-50, October 2016. [48] Z. Cao, T. Simon, S. E. Wei, and Y. Sheikh, Realtime Multiperson 2d Pose Estimation Using Part Affinity Fields. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition CVPR, pp. 7291-7299, 2017. [49] Z. Cao, G. Hidalgo, T. Simon, S. E. Wei, and Y. Sheikh, OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 43, No. 1, pp. 172-186, July 2019. [50] X. Zhu, Y. Jiang, and Z. Luo, Multi-Person Pose Estimation for Posetrack with Enhanced Part Affinity Fields. In IEEE International Conference on Computer Vision ICCV, 2017. [51] S. Kreiss, L. Bertoni, and A. Alahi, Pifpaf: Composite Fields for Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019. [52] E. Insafutdinov, M. Andriluka, L. Pishchulin, S. Tang, E. Levinkov, B. Andres, and B. Schiele, Arttrack: Articulated Multi-Person Tracking in The Wild. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2017. [53] A. Newell, Z. Huang, and J. Deng, Associative Embedding: End-to-End Learning for Joint Detection and Grouping. In Neural Information Processing Systems NIPS, 2017. [54] M. Fieraru, A. Khoreva, L. Pishchulin, and B. Schiele, Learning to Refine Human Pose Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2018. [55] Z. Tian, H. Chen, and C. Shen, Directpose: Direct End-To-End Multi-Person Pose Estimation. In arXiv:1911.07451, 2019. [56] X. Nie, J. Feng, J. Zhang, and S. Yan, Single-Stage Multi-Person Pose Machines. In IEEE International Conference on Computer Vision ICCV, 2019. [57] S. Jin, W. Liu, E. Xie, W. Wang, C. Qian, W. Ouyang, and P. Luo, Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation. In arXiv:2007.11864, 2020. [58] B. Cheng, B. Xiao, J. Wang, H. Shi, T. S. Huang, and L. Zhang, Higherhrnet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation. In arXiv:1908.10357, 2019. [59] K. Simonyan and A. Zisserman, Two-Stream Convolutional Networks for Action Recognition in Videos, Advances in Neural Information Processing System, Vol. 27, pp. 568-576, 2014. [60] C. Feichtenhofer, A. Pinz, and A. Zisserman, Convolutional Two-Stream Network Fusion for Video Action Recognition. In IEEE International Conference on Computer Vision and Pattern Recognition CVPR, pp. 1933-1941, 2016. [61] L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. V. Gool, Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. In European Conference on Computer Vision ECCV, pp. 20-36, October 2016. [62] D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, Learning Spatial-Temporal Features with 3D Convolutional Networks. In IEEE International Conference on Computer Vision ICCV, pp. 4489–4497, 2015. [63] C. Feichtenhofer, H. Fan, J. Malik, and K. He. Slowfast Networks for Video Recognition. In arXiv:1812.03982, 2018. [64] Y. Du, W. Wang, and L. Wang, Hierarchical Recurrent Neural Network for Skeleton Based Action Recognition. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 1110-1118, 2015. [65] M. Li, S. Chen, X. Chen, Y. Zhang, Y. Wang, and Q. Tian, Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2019, pp. 3595–3603. [66] S. Yan, Y. Xiong, and D. Lin, Spatial-Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32, No. 1, April 2018. [67] Joseph Redmon and Ali Farhadi, Yolov3: An Incremental Improvement, In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, abs/1804.02767, April 2018. [68] T.-Y. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and 'S. Belongie. Feature Pyramid Networks for Object Detection. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2017. [69] N. Wojke, A. Bewley, and D. Paulus, Simple Online and Realtime Tracking with a Deep Association Metric. In Proceedings of IEEE Conference on Image Processing ICIP, pp. 3645–3649, 2017. [70] S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon. CBAM: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision ECCV, pages 3–19, 2018. [71] Wang, P., Chen, P., Yuan, Y., Liu, D., Huang, Z., Hou, X., Cottrell, G.: Understanding convolution for semantic segmentation. In arXiv:1702.08502, 2017. [72] W. Shi, J. Caballero, F. Huszar, J. Totz, A. P. Aitken, R. Bishop,D. Rueckert, and Z. Wang. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, pages 1874–1883, 2016. [73] T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick. Microsoft COCO: Common Objects in Context. In European Conference on Computer Vision ECCV, 2014. [74] M. Andriluka, L. Pishchulin, P. Gehler, and B. Schiele. 2D Human Pose Estimation: New Benchmark and State of The Art Analysis. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2014. [75] K. He, G. Gkioxari, P. Doll´ar, and R. Girshick, Mask R-CNN, In IEEE International Conference on Computer Vision ICCV, 2017. [76] U. Iqbal and J. Gall, “Multi-person pose estimation with local joint-to- person associations,” In European Conference on Computer Vision ECCV, 2016. [77] E. Levinkov, J. Uhrig, S. Tang, M. Omran, E. Insafutdinov, A. Kirillov, C. Rother, T. Brox, B. Schiele, and B. Andres, Joint graph decomposition & node labeling: Problem, algorithms, applications, In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2017. [78] M. Fieraru, A. Khoreva, L. Pishchulin, and B. Schiele, Learning to refine human pose estimation, In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 2018. [79] Kay, W. Carreira, J. Simonyan, K. Zhang, B. Hillier, C. Vijayanarasimhan, S. Viola, F. Green, T. Back, T. Natsev, P. The kinetics human action video dataset. In arXiv:1705.06950. [80] Fernando, B. Gavves, E. Oramas, J. M. Ghodrati, A. and Tuytelaars, T. Modeling video evolution for action recognition. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 5378–5387. [81] Shahroudy, A. Liu, J. Ng, T.-T. and Wang, G. NTU RGB+ D: A large scale dataset for 3D human activity analysis. In IEEE Conference on Computer Vision and Pattern Recognition CVPR, 1010–1019. [82] Kim, T. S., and Reiter, A. Interpretable 3D human action analysis with temporal convolutional networks. In arXiv: 1704.04516.
|