TY - GEN
T1 - Finger Spelling Recognition from Depth Data Using Direction Cosines and Histogram of Cumulative Magnitudes
AU - Cardenas, Edwin Jonathan Escobedo
AU - Chavez, Guillermo Camara
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/10/30
Y1 - 2015/10/30
N2 - In this paper, we propose a new approach for finger spelling recognition using depth information captured by Kinect sensor. We only use depth information to characterize hand configurations corresponding to alphabet letters. First, we use depth data to generate a binary hand mask which is used to segment the hand area from background. Then, the major hand axis is determined and aligned with Y axis in order to achieve rotation invariance. Later, we convert the depth data in a 3D point cloud. The point cloud is divided into sub regions and in each one, using direction cosines, we calculated three histograms of cumulative magnitudes Hx, Hy and Hz corresponding to each axis. Finally, these histograms were concatenated and used as input to our Support Vector Machine (SVM) classifier. The performance of this approach is quantitatively and qualitatively evaluated on a dataset of real images of American Sign Language (ASL) hand shapes. The dataset used is composed of 60000 depth images. According to our experiments, our approach has an accuracy rate of 99.37%, outperforming other state-of-the-art methods.
AB - In this paper, we propose a new approach for finger spelling recognition using depth information captured by Kinect sensor. We only use depth information to characterize hand configurations corresponding to alphabet letters. First, we use depth data to generate a binary hand mask which is used to segment the hand area from background. Then, the major hand axis is determined and aligned with Y axis in order to achieve rotation invariance. Later, we convert the depth data in a 3D point cloud. The point cloud is divided into sub regions and in each one, using direction cosines, we calculated three histograms of cumulative magnitudes Hx, Hy and Hz corresponding to each axis. Finally, these histograms were concatenated and used as input to our Support Vector Machine (SVM) classifier. The performance of this approach is quantitatively and qualitatively evaluated on a dataset of real images of American Sign Language (ASL) hand shapes. The dataset used is composed of 60000 depth images. According to our experiments, our approach has an accuracy rate of 99.37%, outperforming other state-of-the-art methods.
KW - depth information
KW - directional cosines
KW - Finger spelling recognition
KW - points cloud
KW - support vector machine (SVM)
UR - http://www.scopus.com/inward/record.url?scp=84959348870&partnerID=8YFLogxK
U2 - 10.1109/SIBGRAPI.2015.49
DO - 10.1109/SIBGRAPI.2015.49
M3 - Articulo (Contribución a conferencia)
AN - SCOPUS:84959348870
T3 - Brazilian Symposium of Computer Graphic and Image Processing
SP - 173
EP - 179
BT - Proceedings - 2015 28th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2015
PB - IEEE Computer Society
T2 - 28th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2015
Y2 - 26 August 2015 through 29 August 2015
ER -