臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.50) 您好！臺灣時間：2026/03/15 04:19

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
紙本論文
論文連結
QR Code

本論文永久網址:

研究生:

鍾惠如

研究生(外文):

Hui-Ju Chung

論文名稱:

基於Cordic之Tiny Yolo V2物件區塊網路層的架構設計與實現

論文名稱(外文):

Cordic Based Hardware Implementation for Object Region Layer of Tiny Yolo V2

指導教授:

吳崇賓

指導教授(外文):

Chung-Bin Wu

口試委員:

陳春僥、湯雲欽

口試委員(外文):

Chun-Yao Chen、Yun-Chin Tang

口試日期:

2019-07-16

學位類別:

碩士

校院名稱:

國立中興大學

系所名稱:

電機工程學系所

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2019

畢業學年度:

107

語文別:

中文

論文頁數:

中文關鍵詞:

Tiny Yolo V2、Region Layer、Softmax、Bounding Box、物件分類與位置、硬體架構設計、座標旋轉數字計算方法

外文關鍵詞:

Tiny Yolo V2、Region Layer、Softmax、Bounding Box、Object Classification and Location、Hardware Architecture Design、Cordic

相關次數:

被引用:0
點閱:268
評分:
下載:0
書目收藏:0

本論文提出Tiny Yolo V2最後一層Region Layer之硬體架構設計與實現，利用Softmax與Bounding Box運算求得物件分類與位置，包含兩個主要步驟: Sigmoid運算單元及Softmax與Bounding Box運算單元的計算。首先，本層的輸入是前一層Layer 14的Output Feature Map中得到的資訊，分為座標資訊(Coordinate)、信心指數(Confidence Score)及分類資訊(Class)，先利用信心指數經由Sigmoid運算單元的計算，設定閥值以過濾沒有包含物件的分類及座標資訊，減少計算量，之後分類資訊與座標資訊再經由Softmax及Bounding Box運算單元，再次設定閥值過濾，並求得準確的物件分類與位置。此外，本論文在運算過程中會遇到Exponential及Natural Log的計算，因此在硬體架構的設計中，為使計算方便，利用座標旋轉數字計算方法(Cordic)降低運算複雜度，加速了本層的硬體運算速度。

This thesis proposed the hardware architecture design and implementation for region layer of Tiny Yolo V2 by calculating Softmax and Bounding Box to get the classification and the location of object. The proposed is composed of two steps: calculation of Sigmoid unit and Softmax unit, Bounding Box unit. First, input of region layer is the data from the Output Feature Map of layer 14, including Coordinate, Confidence Score and Class data. We use Confidence Score as the input of Sigmoid unit and set the threshold to filter the data without the object in order to reduce the calculation. Then, using Class and Coordinate as the input of Softmax unit and Bounding Box unit calculates the accurate classification and location of object. Furthermore, the process needs Exponential and Natural Log when calculating so we use Coordinate Rotation Digital Computer (Cordic) to reduce the complexity of operation and speed up the operation time in our design.

摘要 i
Abstract ii
目錄 iii
圖目錄 v
表目錄 vii
第一章緒論 1
1.1研究動機 1
1.2研究目的 2
1.3論文架構 2
第二章文獻探討 3
2.1 Tiny-Yolo V2 4
2.1.1分層處理 5
2.1.2 Region Layer設計概念 6
2.2分類函數 8
2.2.1 Sigmoid函數[25] 8
2.2.2 Softmax函數[25]-[26] 9
2.3 Bounding Box Regression[27]-[28] 11
2.4座標旋轉數字計算方法(Coordinate Rotation Digital Computer , Cordic) 12
2.4.1 Cordic旋轉模式(Rotation Mode) 13
2.4.2 Cordic向量模式(Vectoring Mode) 13
2.5非極大值抑制(Non-Maximum Suppression , NMS) [34] 14
第三章研究方法 15
3.1演算法架構 15
3.2 Region Layer介紹 15
3.2.1 Sigmoid運算單元 16
3.2.2 Softmax運算單元 17
3.2.3 Bounding Box運算單元 18
3.3資料提取 21
3.4硬體架構 22
3.4.1 SRAM0-8的記憶體配置 23
3.4.2硬體單元設計 24
第四章實驗結果與討論 32
4.1軟體模擬 32
4.2環境與效能分析 32
第五章結論與未來工作 35
5.1結論 35
5.2未來工作 35
參考文獻 36

[1] Y. LeCun, P. Haffner, L. Bottou, Y. Bengio, "Object Recognition with Gradient-
Based Learning," Feature Grouping, 1999.
[2] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, “Backpropagation Applied to Handwritten Zip Code Recognition,” Neural Computation, Vol. 1, no. 4, pp. 541-551, 1989.
[3] A. Mohamed, G.E. Dahl, and G. Hinton, “Acoustic modeling using deep belief networks,” Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14-22, Jan. 2012.
[4] Ossama Abdel-Hamid, Abdel-rahman Mohamed, H. Jiang, L. Deng, G. Penn, D. Yu, “Convolutional Neural Networks for Speech Recognition,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, no. 10, Oct. 2014.
[5] F. Schroff, D. Kalenichenko, and J. Philbin, ‘‘FaceNet: A unified embedding for face recognition and clustering,’’ in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit, pp. 815-823, Jun. 2015.
[6] Jiang, H. and Learned-Miller, E., “Face Detection with the Faster R-Cnn,” 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017). June. 2017.
[7] P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. LeCun, “Pedestrian detection with unsupervised multi-stage feature learning,” in CVPR, 2013.
[8] W. Ouyang, X. Wang, “Joint Deep Learning for Pedestrian Detection,” The IEEE International Conference on Computer Vision (ICCV), pp. 2056-2063, 2013.
[9] J. Li, X. Liang, S. Shen, T. Xu, J. Feng, S. Yan, “Scale-Aware Fast R-CNN for Pedestrian Detection,” IEEE Transactions on Multimedia, Vol. 20, no. 4, Apr. 2018.
[10] Shin, H.C., Roth, H.R., Gao, M., et al., “Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning,” IEEE Trans. Med. Imaging, 35(5), pp. 1285-1298, 2016.
[11] H.C. Shin, H.R. Roth, M. Gao, L. Lu, Z. Xu, I. Nogues, J. Yao, D. Mollura, R.M. Summers, “Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning,” IEEE Transactions on Medical Imaging, Vol. 35, May. 2016.
[12] M. Anthimopoulos, S. Christodoulidis, L. Ebner, A. Christe, and S. Mougiakakou, ‘‘Lung pattern classification for interstitial lung diseases using a deep convolutional neural network,’’ IEEE Trans. Med. Imag., vol. 35, no. 5, pp. 1207-1216, May. 2016.
[13] K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” CoRR, vol. abs/1409.1556, 2014.
[14] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. E. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” CoRR, vol. abs/1409.4842, 2014.
[15] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-time Object Detection,” In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779-788, 2016.
[16] J. Redmon, A. Farhadi, “YOLO9000: Better Faster Stronger,” Computer Vision and Pattern Recognition (CVPR), 2016.
[17] J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders, “Selective search for object recognition,” International Journal of Computer Vision, vol. 104, no. 2, pp. 154-171, 2013.
[18] P.F. Felzenszwalb, R.B. Girshick, D. McAllester, D. Ramanan, “Object Detection with Discriminatively Trained Part-Based Models,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1627-1645, 2010.
[19] P. Dollár, R. Appel, S. Belongie, and P. Perona, “Fast Feature Pyramids for Object Detection,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 8, pp. 1532-1545, July. 2014.
[20] R. Girshick, J. Donahue, T. Darrell, and J. Malik., “Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 142-158, 2016.
[21] V. Nair and G. E. Hinton, “Rectified linear units improve restricted boltzmann machines”, in ICML, pp. 807-814, 2010.
[22] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton, “Imagenet
classification with deep convolutional neural networks,” Advances in neural
information processing systems, 2012.
[23] Iliev, Anton and Kyurkchiev, Nikolay and Markov, Svetoslav, “A Note on the New Activation Function of Gompertz Type,” vol.4, pp. 1-20, Oct. 2017.
[24] R. Girshick, “Fast r-cnn,” Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
[25] I. Goodfellow, Y. Bengio, and A. Courville. “Deep learning,” Cambridge, Massachusetts: MIT Press, 2016.
[26] B. Yuan, “Efficient hardware architecture of softmax layer in deep neural network,” 29th IEEE International System-on-Chip Conference (SOCC), pp. 323-326, 2016.
[27] P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, “Object detection with discriminatively trained part based models”, TPAMI, 2010.
[28] P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, “OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks,” in ICLR, 2014.
[29] S. Ren, K. He, R. Girshick, J. Sun, “Faster r-cnn: Towards realtime object detection with region proposal networks,” Advances in neural information processing systems, pp. 91-99, 2015.
[30] J. E. Volder, “The CORDIC trigonometric computing technique,” IRE Transactions on Electronic Computers, vol. EC-8, pp. 330-334, Sept. 1959.
[31] Xia, Dunzhu, yu, Cheng, Wang, Yuliang, “A Digitalized Silicon Microgyroscope Based on Embedded FPGA,” Sensors (Basel, Switzerland), vol. 12, pp. 13150-13166, Dec. 2012.
[32] Rajkumar Tomar, Praveen Kumar Singh, Krishna Raj, “A Review of CORDIC Algorithms and Architectures with Applications for Efficient Designing,” International Journal of Scientific and Engineering Research (IJSER), 2013.
[33] P.K. Meher, S.Y. Park, “CORDIC designs for fixed angle of rotation,” IEEE Transaction on Very Large Scale Integration (VLSI) Systems, vol. 21, no. 2, pp. 217-228, 2013.
[34] A. Neubeck and L. Van Gool, “Efficient non-maximum suppression,” in Pattern Recognition, ICPR 2006. 18th International Conference on IEEE, vol. 3, pp. 850-855, 2006.
[35] CSDN.(2016).NMS(non maximum suppression).
Retrieved from https://blog.csdn.net/shuzfan/article/details/52711706 (June 15, 2019)
[36] Borui Jiang, Ruixuan Luo, Jiayuan Mao, Tete Xiao, Yuning Jiang, “Acquisition of Localization Confidence for Accurate Object Detection,” European Conference on Computer Vision (ECCV), 2018.
[37] Poernomo, Alvin , Kang, Dae-Ki, “Content-Aware Convolutional Neural Network for Object Recognition Task,” International journal of advanced smart convergence, vol.5, pp. 1-7, Sept. 2016.
[38] J. Sujitha, V. Ramohan Reddy, “Implementation of Log and Exponential Function in FPGA,” International Journal of Engineering Research & Technology (IJERT), vol. 3, pp. 1404-1407, Nov. 2014.
[39] Wang, Meiqi and Lu, Siyuan and Zhu, Danyang and Lin, Jun and Wang, Zhongfeng, “A High-Speed and Low-Complexity Architecture for Softmax Function in Deep Learning,” 2018 IEEE Asia Pacific Conference on Circuits and Systems, pp. 223-226,Oct. 2018.
[40] Z. Li, H. Li, X. Jiang, B. Chen, Y. Zhang, G. Du, “Efficient FPGA Implementation of Softmax Function for DNN Applications,” 2018 12th IEEE International Conference on Anti-counterfeiting, Security, and Identification (ASID), Nov. 2018.
[41] Q. Sun, Z. Di, Z. Lv, F. Song, Q. Xiang, Q. Feng, Y. Fan, X. Yu, W. Wang, “A High Speed SoftMax VLSI Architecture Based on Basic-Split,” 2018 14th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT), Nov. 2018.

國圖紙本論文

連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供，不一定有電子全文可供下載，若連結有誤，請點選上方之〝勘誤回報〞功能，我們會盡快修正，謝謝！

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	使用CORDIC演算法實現於深度學習Softmax分類函數之研究

無相關期刊

1.	可重組之AI深度學習硬體加速架構設計與實現
2.	高效能頻寬使用之AI深度學習硬體加速架構設計與實現
3.	一個基於YOLO模型進行即時物件辨識之研究
4.	訓練資料集自我擴增問題之分析-以YOLO分類器為例
5.	基於YOLO物體偵測進行即時的實例分割
6.	基於YOLO的車輛實時製造與模型識別
7.	適應性廣角影像變形校正與環景接合系統
8.	深度學習加速器硬體架構整合與實現
9.	應用於智慧型自走載具之基於YOLO深度學習網路的人群人臉方向辨識技術設計與實現
10.	使用YOLO演算法之水果品質分類系統實作
11.	光學遙測影像資料壓縮系統之硬體設計最佳化與FPGA晶片實現
12.	高吞吐量深度學習AI加速器之硬體實現
13.	基於YOLO深度學習應用於車牌辨識系統
14.	車牌辨識系統之FPGA硬體實現
15.	基於異質層融合架構之深度學習加速器設計與實現

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室