National Digital Library of Theses and Dissertations in Taiwan

Author: Yi-Kai Chen (陳奕愷)
Title: Architecture Design of Energy-Efficient Reconfigurable Deep Convolutional Neural Network Accelerator (節能與可重組化深度卷積神經網路架構設計)
Advisor: Liang-Gee Chen (陳良基)
Committee members: 蔡宗漢, 劉宗德, 楊佳玲
Oral defense date: 2017-11-22
Degree: Master's
Institution: National Taiwan University
Department: Graduate Institute of Electronics Engineering
Discipline: Engineering
Field: Electrical and Computer Engineering
Document type: Academic thesis
Publication year: 2018
Graduation academic year: 106 (2017-2018)
Language: English
Pages: 57
Keywords: deep learning; deep convolutional neural network; data reuse design; high-frame-rate architecture design; chip design
Cited by: 0
Views: 230
Downloads: 0
Bookmarked: 0
Research on deep convolutional neural networks has been under way for many years and has achieved striking results in computer vision, and the resulting applications have made our lives more convenient and intuitive. However, state-of-the-art deep convolutional neural networks typically contain millions of parameters and require billions of arithmetic operations, which prevents them from running efficiently on mobile devices and embedded systems.
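
The scale claimed above can be made concrete with a quick count. The sketch below uses assumed, VGG-16-style dimensions for one mid-network layer (56×56 feature map, 256 input and 256 output channels); these are illustrative figures, not numbers from the thesis:

```python
# Weight and MAC counts for a single 3x3 convolutional layer.
# Dimensions are an illustrative assumption (a mid-network
# VGG-16-style layer), not figures taken from the thesis.
def conv_layer_cost(h, w, cin, cout, k):
    params = cout * (cin * k * k + 1)   # k*k weights per input channel, plus a bias
    macs = h * w * cout * cin * k * k   # one MAC per weight per output position
    return params, macs

params, macs = conv_layer_cost(56, 56, 256, 256, 3)
# params == 590_080 for this one layer; a full network has many such layers
# macs == 1_849_688_064, i.e. about 1.85 billion MACs per input image
```

Summing such counts over every layer is what puts full networks at millions of parameters and billions of operations per inference.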

In this thesis, we present an application-specific integrated circuit (ASIC) architecture for deep convolutional neural networks, targeting applications that demand real-time computation and low power consumption. Such an architecture faces two main challenges. First, the computation requires a large number of memory accesses. Second, variations in data distribution and precision cause unnecessary power consumption during computation. Reducing memory accesses and performing arithmetic operations efficiently are therefore the focal points of the architecture design.
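
To see why memory access is the first-order concern, a simple access-count model helps. The sketch below is a rough illustration under assumed layer dimensions and an assumed on-chip buffering scheme; it is not the dataflow model actually used in the thesis:

```python
# Rough DRAM-access model for one conv layer under two schemes.
# All sizes and buffering assumptions are illustrative only.
def accesses_no_reuse(h, w, cin, cout, k):
    # Worst case: every MAC fetches its input and weight from DRAM and
    # reads/writes its partial sum there too (4 accesses per MAC).
    return 4 * h * w * cout * cin * k * k

def accesses_weight_stationary(h, w, cin, cout, k):
    # Weights loaded once and pinned on-chip; each input pixel is fetched
    # once per output channel (k*k window reuse happens on-chip); partial
    # sums accumulate on-chip and each output is written out once.
    weights = cout * cin * k * k
    inputs = h * w * cin * cout
    outputs = h * w * cout
    return weights + inputs + outputs

worst = accesses_no_reuse(56, 56, 256, 256, 3)            # 7_398_752_256
reused = accesses_weight_stationary(56, 56, 256, 256, 3)  # 206_913_536
# Keeping weights and partial sums on-chip cuts DRAM traffic by over 35x here.
```

Because a DRAM access costs orders of magnitude more energy than an on-chip arithmetic operation, gaps of this size in access counts translate directly into power.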

We use a systematic method to analyze which data reuse pattern best suits each layer of a deep convolutional neural network, and apply the optimal pattern per layer to reduce the number of memory accesses. We also propose a multiply-accumulate (MAC) implementation based on sign-and-magnitude arithmetic, and verify that this design achieves lower power consumption than a conventional two's-complement design.
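
The sign-and-magnitude idea can be sketched functionally. The model below assumes 8-bit operands and uses set-bit counts as a crude proxy for switching activity; both are assumptions for illustration, not the thesis's actual datapath:

```python
# Functional sketch of a sign-and-magnitude MAC, assuming 8-bit operands.
# The thesis's claim concerns switching power; this model only checks
# functional equivalence and uses set-bit counts as a crude toggle proxy.
def to_sign_mag(x, bits=8):
    sign = 1 if x < 0 else 0
    mag = abs(x) & ((1 << (bits - 1)) - 1)   # 7-bit magnitude
    return sign, mag

def sm_mac(acc, a, b):
    sa, ma = to_sign_mag(a)
    sb, mb = to_sign_mag(b)
    prod = ma * mb                                 # unsigned multiply on magnitudes
    return acc - prod if sa ^ sb else acc + prod   # sign handled separately

acc = 0
for a, b in [(3, -2), (-5, -4), (7, 1)]:
    acc = sm_mac(acc, a, b)        # accumulates 3*-2 + -5*-4 + 7*1

# Why this can save power: small negative values keep most bits at 0 in
# sign-magnitude, while two's complement sign-extends them to mostly 1s.
twos_bits = bin(-1 & 0xFF).count("1")   # -1 is 0b11111111: 8 set bits
sm_bits = 1 + bin(1).count("1")         # sign bit + magnitude 1: 2 set bits
```

Since CNN activations and weights cluster near zero in practice, a representation whose bit pattern stays sparse for small values tends to toggle fewer datapath wires per operation.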
Abstract ix
1 Introduction 1
1.1 The Applications of Deep Convolutional Neural Networks . . 1
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . 4
2 Background 5
2.1 Machine Learning Overview . . . . . . . . . . . . . . . . . . 5
2.2 Neural Networks . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2.1 Modeling a Neuron . . . . . . . . . . . . . . . . . . . 7
2.2.2 Backpropagation Algorithm . . . . . . . . . . . . . . 9
2.2.3 Deep Learning . . . . . . . . . . . . . . . . . . . . . . 9
3 Convolutional Neural Network 13
3.1 Overview and Core Concepts . . . . . . . . . . . . . . . . . . 13
3.2 Important Features . . . . . . . . . . . . . . . . . . . . . . . 13
3.3 Building Blocks . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.4 Popular Convolutional Neural Network Model . . . . . . . . 19
4 Architecture Design and Implementation of CNN Accelerator 23
4.1 Design Challenges and Considerations . . . . . . . . . . . . . 23
4.2 Proposed Hardware Architecture . . . . . . . . . . . . . . . 24
4.2.1 Reduce DRAM Access . . . . . . . . . . . . . . . . . 27
4.2.2 Filter Adapted Dataflow . . . . . . . . . . . . . . . . 35
4.2.3 Energy Efficient Multiplier-Accumulator . . . . . . . 42
4.2.4 Architecture and Design Features . . . . . . . . . . . 43
4.2.5 Synthesis Results and Comparison . . . . . . . . . . . 46
5 Conclusion 49
Bibliography 50