[1] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
[2] Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
[3] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. Technical report, arXiv:1409.4842, 2014.
[4] Jonathan Long, Evan Shelhamer, and Trevor Darrell. Fully convolutional networks for semantic segmentation. In CVPR, 2015.
[5] Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. Semantic image segmentation with deep convolutional nets and fully connected CRFs. In ICLR, 2015.
[6] Geoffrey Hinton, Li Deng, George E. Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara Sainath, and Brian Kingsbury. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine, 29(6):82–97, Nov. 2012.
[7] Tara Sainath, Abdel-rahman Mohamed, Brian Kingsbury, and Bhuvana Ramabhadran. Deep convolutional neural networks for LVCSR. In ICASSP, 2013.
[8] Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. Fast and robust neural network joint models for statistical machine translation. In ACL, 2014.
[9] Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
[10] Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015.
[11] Vincent Vanhoucke, Andrew Senior, and Mark Z. Mao. Improving the speed of neural networks on CPUs. In Proc. Deep Learning and Unsupervised Feature Learning NIPS Workshop, volume 1, 2011.
[12] Clément Farabet, Yann LeCun, Koray Kavukcuoglu, Eugenio Culurciello, Berin Martini, Polina Akselrod, and Selcuk Talay. Large-scale FPGA-based convolutional networks. In Scaling up Machine Learning: Parallel and Distributed Approaches, pp. 399–419, 2011.
[13] Phi-Hung Pham, Darko Jelaca, Clément Farabet, Berin Martini, Yann LeCun, and Eugenio Culurciello. NeuFlow: Dataflow vision processing system-on-a-chip. In Circuits and Systems (MWSCAS), 2012 IEEE 55th International Midwest Symposium on, pp. 1044–1047. IEEE, 2012.
[14] Yunji Chen, Tao Luo, Shaoli Liu, Shijin Zhang, Liqiang He, Jia Wang, Ling Li, Tianshi Chen, Zhiwei Xu, Ninghui Sun, et al. DaDianNao: A machine-learning supercomputer. In Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 609–622. IEEE Computer Society, 2014.
[15] Stephen J. Hanson and Lorien Y. Pratt. Comparing biases for minimal network construction with back-propagation. In Advances in Neural Information Processing Systems, pp. 177–185, 1989.
[16] Song Han, Jeff Pool, John Tran, and William Dally. Learning both weights and connections for efficient neural networks. In Advances in Neural Information Processing Systems, pp. 1135–1143, 2015.
[17] Yiwen Guo, Anbang Yao, and Yurong Chen. Dynamic network surgery for efficient DNNs. In NIPS, 2016.
[18] Min Lin, Qiang Chen, and Shuicheng Yan. Network in network. In ICLR, 2014.
[19] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Identity mappings in deep residual networks. In European Conference on Computer Vision, 2016.
[20] Forrest N. Iandola, Matthew W. Moskewicz, Khalid Ashraf, Song Han, William J. Dally, and Kurt Keutzer. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size. arXiv preprint arXiv:1602.07360, 2016.
[21] Yunchao Gong, Liu Liu, Ming Yang, and Lubomir Bourdev. Compressing deep convolutional networks using vector quantization. arXiv preprint arXiv:1412.6115, 2014.
[22] Song Han, Huizi Mao, and William J. Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv preprint arXiv:1510.00149, 2015.
[23] Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger, and Yixin Chen. Compressing neural networks with the hashing trick. In ICML, 2015.
[24] Suyog Gupta, Ankur Agrawal, Kailash Gopalakrishnan, and Pritish Narayanan. Deep learning with limited numerical precision. In ICML, 2015.
[25] Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. Quantized neural networks: Training neural networks with low precision weights and activations. arXiv preprint arXiv:1609.07061, 2016.
[26] Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. BinaryConnect: Training deep neural networks with binary weights during propagations. In Advances in Neural Information Processing Systems, 2015.
[27] Matthieu Courbariaux and Yoshua Bengio. BinaryNet: Training deep neural networks with weights and activations constrained to +1 or -1. CoRR, 2016.
[28] Yoshua Bengio. Estimating or propagating gradients through stochastic neurons. Technical Report arXiv:1305.2982, Université de Montréal, 2013.
[29] Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, and Ali Farhadi. XNOR-Net: ImageNet classification using binary convolutional neural networks. In European Conference on Computer Vision, pp. 525–542. Springer, 2016.
[30] Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton. CIFAR-10 (Canadian Institute for Advanced Research), 2012. URL http://www.cs.toronto.edu/~kriz/cifar.html.
[31] Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, and Andrew Y. Ng. Reading digits in natural images with unsupervised feature learning. In NIPS Workshop on Deep Learning and Unsupervised Feature Learning, 2011.
[32] Fengfu Li and Bin Liu. Ternary weight networks. arXiv preprint arXiv:1605.04711v1, 2016.
[33] Kumar Chellapilla, Sidd Puri, and Patrice Simard. High performance convolutional neural networks for document processing. In Tenth International Workshop on Frontiers in Handwriting Recognition, 2006.