臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.103) 您好！臺灣時間：2025/11/22 04:52

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
紙本論文
QR Code

本論文永久網址:

研究生:

林天翼

研究生(外文):

Tian-Yi Lin

論文名稱:

應用卷積類神經網路於漫畫角色分群

論文名稱(外文):

Manga Character Clustering using Convolutional Neural Network

指導教授:

莊永裕

指導教授(外文):

Yung-Yu Chuang

口試委員:

朱威達、朱宏國

口試委員(外文):

Wei-Ta Chu、Hung-Kuo Chu

口試日期:

2018-07-31

學位類別:

碩士

校院名稱:

國立臺灣大學

系所名稱:

資訊工程學研究所

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2018

畢業學年度:

106

語文別:

英文

論文頁數:

中文關鍵詞:

漫畫、人臉辨識、卷積類神經網路、聚類分析

相關次數:

被引用:0
點閱:237
評分:
下載:0
書目收藏:0

近年來隨著閱讀習慣改變，越來越多的漫畫被電子化儲存，但針對漫畫內容進行分析與自動標記的研究則尚未發展成熟。我們決定針對角色身份的識別著手研究，試著將不同畫格中出現的相同角色進行關聯，將整本漫畫中所有出現的人臉根據他們所屬的角色進行分群。在本論文中我們提出了一種基於卷積類神經網路與聚類算法的漫畫角色分群方法。我們設計並訓練了一個卷積類神經網路對漫畫中的人臉周邊做特徵的抽取，並將抽取出來的特徵結合臉的位置資訊利用聚類演算法進行分群。我們的方法在訓練集中未出現過的漫畫與角色上獲得了 76.92% 的分群準確率。

As reading habits change, more and more manga books are digitized and can be read on tablet or smartphone. However, most of them are just scanned from the printed version and treated as normal image files. There are only few studies focus on extracting information from manga pages automatically. In this thesis, we try to cluster faces in manga books based on their identities. We proposed a method using convolutional neural network and clustering algorithm. We designed and trained a CNN network to extract features from manga character faces, and used the extracted features and spatial relations of faces to cluster them. Our method achieved 76.92% accuracy on unseen manga books and characters.

誌謝 i
摘要 ii
Abstract iii
1 Introduction 1
1.1 Background and Motivation ........................ 1
1.2 Research Purpose.............................. 2
1.3 Thesis Organization............................. 3
2 Related Work 4
2.1 Computer Vision for Manga ........................ 4
2.2 Convolutional Neural Network....................... 4
2.3 Clustering Algorithm ............................ 5
3 Proposed Method 7
3.1 Overview .................................. 7
3.2 Input Image about Character ........................ 9
3.3 Feature Extraction Network......................... 10
3.3.1 Network Architecture........................ 10
3.3.2 Data Selection and Training Detail................. 11
3.4 Clustering Algorithm ............................ 12
3.4.1 Agglomerative Clustering ..................... 12
3.4.2 Distance Measurement (Feature Vectors) . . . . . . . . . . . . . 12
3.4.3 Distance Measurement (Spatial Relations) . . . . . . . . . . . . . 13
4 Evaluation and Results 15
4.1 Evaluation Method ............................. 15
4.2 Dataset ................................... 16
4.3 Empirical Analysis ............................. 16
4.3.1 The size of input region ...................... 16
4.3.2 SpatialConnectivity ........................ 17
4.3.3 Comparison with other methods.................. 18
5 Conclusion 19
Bibliography 20

[1] W.-T. Chu and W.-W. Li. Manga facenet: Face detection in manga based on deep neural network. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, pages 412–415. ACM, 2017.
[2] S. C. Johnson. Hierarchical clustering schemes. Psychometrika, 32(3):241–254, 1967.
[3] J. MacQueen et al. Some methods for classification and analysis of multivariate ob- servations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 281–297. Oakland, CA, USA, 1967.
[4] Y. Matsui, K. Ito, Y. Aramaki, A. Fujimoto, T. Ogawa, T. Yamasaki, and K. Aizawa. Sketch-based manga retrieval using manga109 dataset. Multimedia Tools and Appli- cations, 76(20):21811–21838, 2017.
[5] R. Narita, K. Tsubota, T. Yamasaki, and K. Aizawa. Sketch-based manga retrieval using deep features. In Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, volume 3, pages 49–53. IEEE, 2017.
[6] N.-V. Nguyen, C. Rigaud, and J.-C. Burie. Comic characters detection using deep learning. In Document Analysis and Recognition (ICDAR), 2017 14th IAPR Inter- national Conference on, volume 3, pages 41–46. IEEE, 2017.
[7] T. Ogawa, A. Otsubo, R. Narita, Y. Matsui, T. Yamasaki, and A. Kiyoharu. Object detection for comics using manga109 annotations. arXiv preprint arXiv:1803.08670, 2018.
[8] F. Schroff, D. Kalenichenko, and J. Philbin. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 815–823, 2015.
[9] H. Schütze, C. D. Manning, and P. Raghavan. Introduction to information retrieval, volume 39. Cambridge University Press, 2008.
[10] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna. Rethinking the incep- tion architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2818–2826, 2016.
[11] J. H. Ward Jr. Hierarchical grouping to optimize an objective function. Journal of the American statistical association, 58(301):236–244, 1963.
[12] L.Zhao,X.Li,J.Wang,andY.Zhuang.Deeply-learnedpart-alignedrepresentations for person re-identification. arXiv preprint arXiv:1707.07256, 2017.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

無相關論文

無相關期刊

1.	使用影像分割與單應性矩陣對齊之影片籃球場定位
2.	基於衣服轉換之深度虛擬試衣技術
3.	視覺問題生成搭配文字輔助的方法與比較
4.	基於跨領域映射達成非監督式內容保留風格導向的圖片轉換
5.	基於對齊、降噪與混合的深度多張低亮度影像增強
6.	透過深度特徵對應實現空拍影像之顏色轉換
7.	利用卷積類神經網路以色彩資訊及光流進行影片物體分割
8.	以知識迭代為規範的深度模型訓練
9.	基於卷積神經網絡的髮型消除與轉換
10.	弱監督式卷積神經網路學習於視覺顯著物體發掘
11.	學習高速且高效率的深度學習的迴歸應用
12.	基於拉普拉斯金字塔的深度人臉影像光影重建
13.	從移動的單一視角360相機生成虛擬實境立體視角影像
14.	基於軟性排名損失函數之圖像相似度估測
15.	類別無關部分監督之深度物件偵測網絡

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室