MultiPoseNet: Fast Multi-Person Pose Estimation Using Pose Residual Network

2018-09-14
KOCABAŞ, Muhammed
KARAGÖZ, Salih
Akbaş, Emre
In this paper, we present MultiPoseNet, a novel bottom-up multi-person pose estimation architecture that combines a multi-task model with a novel assignment method. MultiPoseNet can jointly handle person detection, person segmentation and pose estimation problems. The novel assignment method is implemented by the Pose Residual Network (PRN), which receives keypoint and person detections and produces accurate poses by assigning keypoints to person instances. On the COCO keypoints dataset, our pose estimation method outperforms all previous bottom-up methods both in accuracy (+4-point mAP over previous best result) and speed; it also performs on par with the best top-down methods while being at least 4x faster. Our method is the fastest real-time system, running at ∼23 frames/sec.
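
To make the assignment problem concrete, the following is a minimal Python sketch of the task that the PRN addresses: given person boxes and detected keypoints, decide which keypoint belongs to which person. It is not the PRN itself, whose internals the abstract does not describe. It is a hypothetical hand-written baseline that gives each keypoint to the nearest-centered box containing it. The names `Box`, `Keypoint` and `assign_keypoints`, the data layout and the tie-breaking rule are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]     # (x_min, y_min, x_max, y_max)
Keypoint = Tuple[str, float, float, float]  # (joint name, x, y, confidence)
Pose = Dict[str, Tuple[float, float]]       # joint name -> (x, y)


def assign_keypoints(boxes: List[Box], keypoints: List[Keypoint]) -> List[Pose]:
    """Naive geometric stand-in for keypoint-to-person assignment.

    Assumption: this is NOT the learned PRN from the paper, only a
    baseline that illustrates the inputs and outputs of the task.
    """
    poses: List[Pose] = [{} for _ in boxes]
    best_conf: List[Dict[str, float]] = [{} for _ in boxes]

    for name, x, y, conf in keypoints:
        # Candidate persons are the boxes that contain the keypoint.
        candidates = [
            i for i, (x0, y0, x1, y1) in enumerate(boxes)
            if x0 <= x <= x1 and y0 <= y <= y1
        ]
        if not candidates:
            continue  # keypoint lies outside every detected person

        def dist2(i: int) -> float:
            # Squared distance from the keypoint to the center of box i.
            x0, y0, x1, y1 = boxes[i]
            cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
            return (x - cx) ** 2 + (y - cy) ** 2

        person = min(candidates, key=dist2)

        # Keep only the most confident detection per joint and person.
        if conf > best_conf[person].get(name, -1.0):
            best_conf[person][name] = conf
            poses[person][name] = (x, y)

    return poses


# Two overlapping person boxes and three candidate "nose" detections.
boxes = [(0, 0, 10, 10), (8, 0, 20, 10)]
keypoints = [("nose", 5, 2, 0.9), ("nose", 14, 2, 0.8), ("nose", 9, 2, 0.5)]
poses = assign_keypoints(boxes, keypoints)
```

In this example the third detection falls inside both boxes, and the baseline resolves the ambiguity purely by distance to the box center. A rule like this is fragile when people overlap heavily, which is the situation where a learned assignment method would be expected to help.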

Citation Formats
M. KOCABAŞ, S. KARAGÖZ, and E. Akbaş, “MultiPoseNet: Fast Multi-Person Pose Estimation Using Pose Residual Network,” 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/48698.