Graduate student (English name): YU, JIE-REN
Thesis title (English): The Recognition of Moving Gesture in Three Dimensional Space Base on AI
Advisor (English name): CHEN, WEI-MING
Keywords (Chinese): HCI, 3D gesture, data augmentation, Leap Motion, CNN
Keywords (English): HCI, 3D-gesture, data augmentation, Leap Motion, CNN
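  In an era when smartphones are ubiquitous, research on Human Computer Interaction (HCI) has grown rapidly in recent years. 2D gesture recognition and unlocking techniques have matured and become a common mode of operation and a common security measure. With the rise of 3D movies, AR, and VR, however, the creativity and variety of 2D gestures no longer meet people's needs, and researchers have turned to 3D gesture recognition. 3D space has one more dimension than 2D space, and the complexity rises sharply as a result. This thesis proposes a recognition method for dynamic gesture locks that does not depend on where the 3D gesture starts: as long as the gesture is close to the originally set gesture password, it is recognized and the lock opens.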
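  This thesis uses Leap Motion, a camera device that provides depth information, to capture hand position and depth. The three-dimensional data are normalized, and the resulting gesture serves as a template from which data augmentation builds a gesture training set in a short time. Each set of 3D data in the training set is then split and recombined into a single image 100 pixels long and 50 pixels wide, and the images are fed to a convolutional neural network (CNN) for training, which completes one password setting. To unlock, a user performs any gesture in front of the gesture detection device built for this thesis; if the CNN model predicts a similarity above 70% to the enrolled gesture lock, the lock opens.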
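The normalization step is what makes recognition independent of where the gesture starts. The abstract does not state the exact formula, so the following is only a minimal sketch under an assumed min-max scheme: the trajectory is shifted to its bounding-box corner and scaled by its largest extent. The function name `normalize_trajectory` and the choice of a single shared scale are illustrative assumptions, not the thesis's implementation.

```python
from typing import List, Tuple

Point3D = Tuple[float, float, float]


def normalize_trajectory(points: List[Point3D]) -> List[Point3D]:
    """Shift and scale a 3D trajectory into the unit cube [0, 1]^3.

    Assumed scheme (not taken from the thesis): subtract the per-axis
    minimum, then divide every axis by the largest bounding-box extent.
    """
    if not points:
        return []
    mins = [min(p[i] for p in points) for i in range(3)]
    maxs = [max(p[i] for p in points) for i in range(3)]
    # One shared scale keeps the gesture's aspect ratio intact.
    # Fall back to 1.0 when all points coincide, to avoid dividing by zero.
    extent = max(maxs[i] - mins[i] for i in range(3)) or 1.0
    return [
        ((p[0] - mins[0]) / extent,
         (p[1] - mins[1]) / extent,
         (p[2] - mins[2]) / extent)
        for p in points
    ]


# The same gesture drawn from two different starting positions
# normalizes to the same coordinates.
gesture = [(0.0, 0.0, 0.0), (10.0, 20.0, 5.0)]
shifted = [(x + 100.0, y - 50.0, z + 30.0) for x, y, z in gesture]
same = normalize_trajectory(gesture) == normalize_trajectory(shifted)
```

Because only differences from the bounding-box minimum are kept, any translation of the whole gesture cancels out before the data reach the model.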

Keywords: HCI, 3D gesture, data augmentation, Leap Motion, CNN

  These days, mobile devices can be seen everywhere, especially the smartphone, which has become part of everyday life. Human Computer Interaction (HCI), the study of how people interact with computers, has therefore been actively developed and researched in recent years. Many HCI technologies have been designed; among them, 2D-gesture recognition and gesture-based security locks have drawn attention for more than twenty years and now play a considerable role as an operating method and a security measure for mobile devices.
  However, with the rapid rise of 3D movies, augmented reality (AR), and virtual reality (VR), people are no longer satisfied with the creativity and diversity of 2D technology, and experts and scholars have begun to research 3D-gesture recognition. Three-dimensional space is the direct information of the real world, so it has a stronger impact on people than two-dimensional information. Recognizing 3D gestures is more complex than recognizing 2D gestures because the added dimension leads to more combinations and more possibilities. The purpose of this study is to investigate a system that recognizes moving gestures in three-dimensional space for gesture unlocking; in addition, the same gesture performed from a different origin is recognized as the same pattern.
  Three-dimensional information is often fed directly into a machine learning model for deep learning without any data preprocessing. Training on unprocessed data, however, causes the same gesture performed from a different origin to be predicted differently. The method proposed in this study trains on the data in another way: the xy-plane and zy-plane of the 3D gesture are used as a front view and a side view, and the two views are merged into one wider image. To increase the prediction accuracy of the CNN, image processing and data augmentation are added.
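The two-view encoding can be illustrated with a short sketch. It assumes the points are already normalized to [0, 1] on every axis and that each view is 50 by 50 pixels, so that the merged image matches the 100 by 50 pixel size given in the Chinese abstract. The names `two_view_image` and `SIZE`, the binary pixel values, and the plotting of isolated points rather than connected strokes are simplifying assumptions, not the thesis's rendering code.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]
SIZE = 50  # assumed side length of each view, in pixels


def _to_pixel(value: float) -> int:
    """Map a coordinate in [0, 1] to a pixel index in [0, SIZE - 1]."""
    clamped = min(max(value, 0.0), 1.0)
    return min(int(clamped * SIZE), SIZE - 1)


def two_view_image(points: Sequence[Point3D]) -> List[List[int]]:
    """Rasterize normalized 3D points into a 50-row by 100-column image.

    Columns 0-49 hold the front view (x horizontal, y vertical).
    Columns 50-99 hold the side view (z horizontal, y vertical).
    """
    image = [[0] * (2 * SIZE) for _ in range(SIZE)]
    for x, y, z in points:
        row = SIZE - 1 - _to_pixel(y)        # image rows grow downward
        image[row][_to_pixel(x)] = 1         # front view (xy-plane)
        image[row][SIZE + _to_pixel(z)] = 1  # side view (zy-plane)
    return image


image = two_view_image([(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)])
height, width = len(image), len(image[0])  # 50 rows, 100 columns
```

Placing the two projections side by side lets an ordinary 2D CNN see all three coordinates in one input, since y is shared by both halves while x and z each get their own half.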
  Leap Motion is a capable camera device that offers both good hand-tracking technology and high stability, so it was selected for this work. In gesture setting, the system obtains the user's hand information from Leap Motion, normalizes it, transforms it into a two-view image, augments the data, and trains the model; each step is necessary. Finally, in unlocking mode, the user waves a hand in front of the proposed device, and if the gesture is similar to any of the correct gestures, the lock opens.
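One of the augmentation techniques named in the table of contents is rotation of the point coordinates about three axes. The sketch below shows how a single template gesture could be expanded into several slightly rotated copies. The rotation order (x, then y, then z), the 10-degree limit, the uniform sampling, and the function names are all assumptions made for illustration; the thesis's actual parameters are not given in this record.

```python
import math
import random
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def rotate_xyz(points: Sequence[Point3D],
               ax: float, ay: float, az: float) -> List[Point3D]:
    """Rotate points about the x, then y, then z axis (angles in radians)."""
    cx, sx = math.cos(ax), math.sin(ax)
    cy, sy = math.cos(ay), math.sin(ay)
    cz, sz = math.cos(az), math.sin(az)
    rotated = []
    for x, y, z in points:
        y, z = y * cx - z * sx, y * sx + z * cx    # rotation about x
        x, z = x * cy + z * sy, -x * sy + z * cy   # rotation about y
        x, y = x * cz - y * sz, x * sz + y * cz    # rotation about z
        rotated.append((x, y, z))
    return rotated


def augment_by_rotation(template: Sequence[Point3D], copies: int,
                        max_degrees: float = 10.0,
                        seed: int = 0) -> List[List[Point3D]]:
    """Create `copies` randomly rotated variants of one template gesture.

    max_degrees is an assumed bound on each axis angle, not a thesis value.
    """
    rng = random.Random(seed)
    limit = math.radians(max_degrees)
    return [
        rotate_xyz(template, *(rng.uniform(-limit, limit) for _ in range(3)))
        for _ in range(copies)
    ]


template = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.5)]
training_set = augment_by_rotation(template, copies=20)
```

Small rotations mimic the natural variation in how a user holds a hand relative to the sensor, which is one plausible reason a single recorded template can stand in for a larger training set.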
Keywords: HCI, 3D-gesture, data augmentation, Leap Motion, CNN
List of Tables
List of Figures
1.1 Research Motivation
1.2 Research Objectives
1.3 Thesis Outline
2.1 Hand Position Capture and Tracking
2.1.1 Real-Time Image Capture
2.1.2 Wearable Sensor Capture
2.2 Leap Motion Sensor
2.3 Douglas-Peucker Algorithm
2.4 Data Augmentation
3.1 Overall Architecture
3.2 Experimental Procedure
3.3 Palm Detection and Point Coordinate Collection
3.4 Point Coordinate Normalization
3.4.1 Boundary Normalization
3.5 Redundant Point Filtering and Trajectory Simplification
3.5.1 Noise Point Filtering
3.5.2 3D Douglas-Peucker Algorithm
3.6 Data Augmentation
3.6.1 Cross Pairing with the 3D Douglas-Peucker Algorithm
3.6.2 Difference-Correspondence Deformation Method
3.6.3 Three-Axis Rotation of Point Coordinates
3.8 CNN Model Construction, Training, and Prediction
3.8.1 CNN Prediction Result Stabilization Method
