Recognizing Conversational Interaction Based on 3D Human Pose

Jingjing Deng, Xianghua Xie*, Ben Daubney, Hui Fang, Phil W. Grant

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we take a bag of visual words approach to investigate whether it is possible to distinguish conversational scenarios from observing human motion alone, in particular gestures in 3D. The conversational interactions concerned in this work have rather subtle differences among them. Unlike typical action or event recognition, each interaction in our case contain many instances of primitive motions and actions, many of which are shared among different conversation scenarios. Hence, extracting and learning temporal dynamics are essential. We adopt Kinect sensors to extract low level temporal features. These features are then generalized to form a visual vocabulary that can be further generalized to a set of topics from temporal distributions of visual vocabulary. A subject-specific supervised learning approach based on both generative and discriminative classifiers is employed to classify the testing sequences to seven different conversational scenarios. We believe this is among one of the first work that is devoted to conversational interaction classification using 3D pose features and to show this task is indeed possible.

Original languageEnglish
Title of host publicationAdvanced Concepts for Intelligent Vision Systems
Subtitle of host publication15th International Conference, ACIVS 2013, Poznań, Poland, October 28-31, 2013. Proceedings
EditorsJ BlancTalon, A Kasinski, W Philips, D Popescu, P Scheunders
PublisherSpringer
Pages138-149
Number of pages12
ISBN (Print)978-3-319-02894-1
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event15th International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS) - Poznan, Poland
Duration: 28 Oct 201331 Oct 2013

Publication series

NameLecture Notes in Computer Science
PublisherSPRINGER-VERLAG BERLIN
Volume8192
ISSN (Print)0302-9743

Conference

Conference15th International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS)
Country/TerritoryPoland
CityPoznan
Period28/10/1331/10/13

Keywords

  • 3D human pose
  • conversational interaction classification
  • interaction analysis
  • Kinect sensor
  • MOTION CAPTURE
  • RECOGNITION
  • REPRESENTATION

Fingerprint

Dive into the research topics of 'Recognizing Conversational Interaction Based on 3D Human Pose'. Together they form a unique fingerprint.

Cite this