National Digital Library of Theses and Dissertations in Taiwan

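Author:

賈鎮東

Author (English):

Zhen-Dong Jia

Thesis title:

物件偵測在Android手機裝置上之應用

Thesis title (English):

The Object Detection Application on Android Mobile Phone

Advisor:

洪西進

Advisor (English):

Shi-Jinn Horng

Oral defense committee:

洪西進、范欽雄、顏成安、吳金雄

Oral defense committee (English):

Shi-Jinn Horng, Chin-Shyurng Fahn, Thompson Yen, Chin-Hsiung Wu

Oral defense date:

2012-06-28

Degree:

Master's

Institution:

National Taiwan University of Science and Technology

Department:

Department of Computer Science and Information Engineering

Discipline:

Engineering

Field:

Electrical Engineering and Computer Science

Thesis type:

Academic thesis

Publication year:

2019

Graduation academic year:

107 (ROC calendar, i.e. 2018-2019)

Language:

Chinese

Number of pages:

Keywords (Chinese):

物件偵測、深度學習、卷積神經網路、行動裝置

Keywords (English):

Object Detection, Deep Learning, Convolutional Neural Network, Mobile Device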


Android is a Linux-based, semi-open-source operating system used mainly on mobile devices. It is an open platform for development with a high degree of freedom and broad hardware compatibility; it provides a complete development environment and supports advanced graphics, networking, and camera processing. Its market share has risen steadily in recent years, and demand for Android applications has grown with it. This thesis applies object detection to the Android system. Because more and more devices adopt this operating system, the range of devices on which the work can be used widens accordingly, and because Android apps are highly compatible, the proposed system can run on any Android device.
This thesis designs and implements a complete object detection system for mobile devices, with a particular focus on shrinking deep neural networks and speeding up their computation. SSD, a current mainstream object detection algorithm, is modified so that it can run in real time on a phone. To reduce model size, the original VGG16 network is replaced with the compact network MobileNet, which greatly shrinks the whole model. To improve accuracy, multi-scale convolutional features are fused using an atrous (dilated) convolution pyramid, i.e. Atrous Spatial Pyramid Pooling. A demo application implemented on Android captures scenes from the camera and detects the objects in them in real time.
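The abstract names two techniques whose effect is easy to check with arithmetic: swapping VGG16 for MobileNet, which is built from depthwise separable convolutions, and using atrous convolution to widen the area each feature sees. The following is a minimal Python sketch of both, not the thesis's code. The layer shape (3x3 kernel, 512 input and output channels) and the dilation rates are illustrative assumptions, and the function names are hypothetical.

```python
def standard_conv_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weight count of a standard kernel x kernel convolution (bias ignored)."""
    return kernel * kernel * c_in * c_out


def separable_conv_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weight count of a depthwise separable convolution (bias ignored).

    One kernel x kernel filter per input channel (depthwise),
    followed by a 1x1 convolution that mixes channels (pointwise).
    """
    depthwise = kernel * kernel * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


def effective_kernel_size(kernel: int, rate: int) -> int:
    """Span covered by an atrous kernel: k + (k - 1) * (rate - 1)."""
    return kernel + (kernel - 1) * (rate - 1)


if __name__ == "__main__":
    # Illustrative layer shape, assumed for this example (not from the thesis).
    k, c_in, c_out = 3, 512, 512
    std = standard_conv_params(k, c_in, c_out)
    sep = separable_conv_params(k, c_in, c_out)
    print(f"standard: {std}  separable: {sep}  ratio: {sep / std:.3f}")

    # Illustrative dilation rates, assumed for this example.
    for rate in (1, 6, 12, 18):
        span = effective_kernel_size(3, rate)
        print(f"rate {rate}: a 3x3 kernel spans {span} pixels per side")
```

For this assumed layer the separable form needs roughly one ninth of the weights, which is the kind of saving that makes a MobileNet-based detector small enough for a phone. The second function shows why stacking atrous convolutions at several rates gathers multi-scale context without adding weights.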

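Chapter 1 Introduction
1.1 Research Motivation and Objectives
1.2 Thesis Organization
Chapter 2 Related Work
2.1 Literature Review
Chapter 3 Object Detection
3.1 The SSD Object Detection Algorithm
3.1.1 Prior Box Settings
3.1.2 Loss Function
3.1.3 Network Architecture
3.2 Network Model Miniaturization
3.2.1 The Special Network Structure MobileNet
Chapter 4 Design and Optimization of the Mobile Object Detection System
4.1 The Small Network MobileNet
4.1 Shortcomings of SSD
4.2 Feature Fusion Based on Atrous Convolution
4.3 Multi-Scale Feature Pyramid
4.4 Experiment Description
4.4.1 Runtime Environment
4.4.2 Introduction to the COCO Dataset
4.5 TensorFlow Lite (Android)
4.6 System Performance
4.6.1 Analysis of Experimental Results
Chapter 5 Conclusion
5.1 Research Results
5.2 Future Work
References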

