(3.236.231.61) 您好!臺灣時間:2021/05/11 16:48
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果

詳目顯示:::

我願授權國圖
: 
twitterline
研究生:王永宗
研究生(外文):Yung-Tsung Wang
論文名稱:壓縮域中利用以歌找歌方式之內涵式搜尋應用於嵌入式系統設計
論文名稱(外文):Compression Domain Content-Based Retrieval of Audio Example for Embedded System Design
指導教授:蔡宗漢蔡宗漢引用關係
指導教授(外文):Tsung-Han Tsai
學位類別:碩士
校院名稱:國立中央大學
系所名稱:電機工程研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2005
畢業學年度:93
語文別:英文
論文頁數:73
中文關鍵詞:內涵式搜尋嵌入式系統壓縮域音樂索引法相似度比對
外文關鍵詞:embedded systemMP3compression domainaudio retrievalcontent-basedindexing
相關次數:
  • 被引用被引用:0
  • 點閱點閱:152
  • 評分評分:系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
近幾年來,多媒體資料的搜尋與索引的技術受到愈來愈多的注意,尤其是在內涵式搜尋這塊領域中。在音訊這個範圍裡,許多的研究皆專注在MIDI或WAV這些未壓縮的音樂格式上,很少會針對壓縮過的音樂格式這個領域來探討。然而,MP3在目前已是一個重要且受歡迎的音樂格式,許多網路或是個人電腦中所流通的都是以MP3為主。在本篇論文中,我們提出了一個利用以歌找歌方式並且以MP3為主的內涵式音樂搜尋系統。我們將MP3中的次頻帶值取出來當成特徵值,並且利用量化樹索引方法及旋律線擷取方法將所取出的特徵值建成我們要的索引資料,最後再利用這些索引資料作為輸入歌曲片段與資料庫歌曲之相似度比對的依據。
此外,我們將整個內涵式音樂搜尋系統移植到以ARM為主的嵌入式系統平台上來做實現,整個系統的評估實驗分別進行在PC及嵌入式平台上,實驗結果顯示,在整個搜尋的準確性方面,在第一首找到的機率大約在70%左右,大約有90%的機率可以在前三首歌中找到。
In recent years, the searching and indexing techniques for multimedia data are getting more attention in the area of multimedia databases. As many research works were done on the content-based retrieval of MIDI or waveform format music, less attention was received on the compressed domain audio data. As the explosive growth of internet, MP3 audio has become one of the most important and popular media. Most of the audio accessed from database of Internet are all compressed on MP3 domain, not raw data or even low quality MIDI audio. In this thesis, the content-based retrieval of audio example on MP3-based (MPEG 1 layer III) digital music archive is considered. In the proposed approach, the sub-band coefficients (SBC) in a MP3 frame are used. These values are extracted from the MP3 decoder to compute the MP3 features for indexing the MP3 objects. A quantization-tree (QT) and the melody-line tracking (MLT) method are also proposed for indexing MP3 objects. These indexes are used to measure the similarity between MP3 objects. Evaluations on a content-based MP3 retrieval system are performed on the pc version and platform-based embedded system version which indicate the proposed approach can achieve a good performance.
ABSTRACT
LIST OF FIGURES
LIST OF TABLES
CHAPTER 1 INTRODUCTION………………………………………………….1
1.1 Background………………………………………………………………...1
1.2 Motivation…………………………………………………………………4
1.3 Thesis Organization………………………………………………………..5
CHAPTER 2 RELATED WORKS……………………………………………….7
2.1 Audio Feature Analysis……………………………………………………7
2.1.1 Physical Features………………………………………………………...8
2.1.2 Perceptual Features……………………………………………………..10
2.2 Audio Representations……………………………………………………10
2.2.1 Digital Audio……………………………………………………………11
2.2.2 Meta-Data……………………………………………………………….12
2.2.3 Melodic Contours……………………………………………………….13
2.2.4 The Fourier Transform………………………………………………….14
2.3 Overview of MPEG-7 Audio……………………………………………..14
2.4 A General Content-Based Retrieval System……………………………...18
CHAPTER 3 PROPOSED APPROACH FOR CONTENT-BASED RETRIEVAL OF QUERY-BY-EXAMPLE..........……………....22
3.1 Algorithm Development………………………………………………….22
3.1.1 Feature Extraction………………………………………………………24
3.1.2 Tone Classification……………………………………………………...26
3.1.3 Slot Processing………………………………………………………….27
3.1.4 Index Construction……………………………………………………...29
3.1.5 Similarity Measurement………………………………………………...33
3.2 Platform-Based Design…………………………………………………...37
3.2.1 Introduction of ARM Integrator Family………………………………...38
3.2.2 Implementation with ARM Embedded System…………………………41
3.2.3 System Integration………………………………………………………44
3.3 Results of Implementation………………………………………………..45
3.3.1 Presentation of Practical System………………………………………..45
3.3.2 Experiment Results……………………………………………………..50
CHAPTER 4 CONCLUSIONS…………………………………………………..54
REFERENCES………………………………………………………………………56
[1]Foote, Jonathan, “Content-Based Retrieval of Music and Audio” in Proc. SPIE, Multimedia Storage and Archiving systems II, Vol.3229, Pages: 138-147.
[2]Wold, E., T. Blum, D. Keislar, and J. Wheaton, “Content-based Classification, Search, and Retrieval of audio”, IEEE Multimedia, Vol. 3, No. 3, Fall 1996, Pages: 27-36.
[3]Zhu Liu; Qian Huang, ”Content-based indexing and retrieval-by-example in audio”, IEEE International Conference on Multimedia and Expo, 2000. ICME 2000. Volume: 2,July-2 Aug. 2000, Pages: 877 -880.
[4]N. Kosugi, Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, “Music Retrieval by Humming”, In Proceedings of PACRIM'99,. IEEE, August 1999, Pages: 404-407.
[5]Tomonari Sonoda, Masataka Goto, Yoichi Muraoka,”A WWWbased Melody Retrieval System”, ICME’98 Proc.98, Pages: 349- 352.
[6]Cheng Yang, “Efficient acoustic index for music retrieval with various degrees of similarity”, Proceedings of the tenth ACM international conference on Multimedia, December 2002, Pages: 584 – 591.
[7]Chih-Chin Liu , Po-Jun Tsai, “Content-based retrieval of MP3 music objects”, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA, Pages: 506-511.
[8]Wen-Nung Lie, Chen-Kang Su, “Content-based of MP3 songs based on query by singing”, IEEE International Converence on Acoustics Speech and Signal Processing, 2004, vol 5, Pages: 929 – 932.
[9]Tong Zhang and C.-C. Jay Kuo, “Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing”.
[10]Faisal Bashir, Shashank Khanvilkar, Ashfaq Khokhar, and Dan Schonfeld, “Content Based Indexing and Retrieval”.
[11]Karydis, I.; Nanopoulos, A.; Papadopoulos, A.N.; Manolopoulos, Y.; “Audio Indexing for Efficient Music Information Retrieval”, Multimedia Modelling Conference, 2005. MMM 2005. Proceedings of the 11th International, 12-14 Jan. 2005, Pages: 22 – 29.
[12]Steven George Blackburn, “Content Based Retrieval and Navigation of Music Using Melodic Pitch Contours”.
[13]Rui Cai, Lie Lu, Hong-Jiang Zhang, “Using structure patterns of temporal and spectral feature in audio similarity measure”, Proceedings of the eleventh ACM international conference on Multimedia, Berkeley, CA, USA, Pages: 219 – 222.
[14]Quackenbush, S.; Lindsay, A.; “Overview of MPEG-7 audio”, IEEE Transactions on Circuits and Systems for Video Technology, Volume 11, Issue 6, June 2001, Pages: 725 – 729.
[15]Cheng Yang, “Peer-to-peer architecture for content-based music retrieval on acoustic data”, Proceedings of the 12th international conference on World Wide Web, Budapest, Hungary, Pages: 376 – 383.
[16]Piamsa-Nga, P.; Alexandridis, N.A.; Srakaew, S.; Blankenship, G.C., Jr.; Subramanya, S.R.; “In-clip search algorithm for content-based audio retrieval”, Third International Conference on Computational Intelligence and Multimedia Applications, 1999. ICCIMA '99. 23-26 Sept. 1999, Pages: 263 – 267.
[17]Dongmoon Park; Eenjun Hwang; “Popularity-adaptive index scheme for fast music retrieval”, IEEE International Conference on Multimedia and Expo, 2002. ICME '02. Proceedings. 2002, Volume 1, 26-29 Aug. 2002, Pages: 121 – 124.
[18]Ning Hu, Roger B. Dannenberg, “A comparison of melodic database retrieval techniques using sung queries”, Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, Portland, Oregon, USA, Pages: 301 – 307.
[19]Shih-Fu Chang; Sikora, T.; Purl, A.; “Overview of the MPEG-7 standard”, IEEE Transactions on Circuits and Systems for Video Technology, Volume 11, Issue 6, June 2001, Pages: 688 – 695.
[20]Jungmin Song; So-Young Bae; Kyoungro Yoon; “Query by humming: matching humming query to polyphonic audio”, IEEE International Conference on Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 Volume 1, 26-29 Aug. 2002, Pages: 329 – 332.
[21]Asif Ghias, Jonathan Logan, David Chamberlin, Brian C. Smith, “Query by humming: musical information retrieval in an audio database”, Proceedings of the third ACM international conference on Multimedia, San Francisco, California, United States, Pages: 231 – 236.
[22]Yongwei Zhu; Kankanhalli, M.; Qi Tian; “Similarity matching of continuous melody contours for humming querying of melody databases”, IEEE Workshop on Multimedia Signal Processing, 2002, 9-11 Dec. 2002, Pages: 249 – 252.
[23] Zhu, Y.; Kankanhalli, M.S.; “A robust music retrieval method for query-by-humming”, Proceedings. ITRE2003. International Conference on Information Technology: Research and Education, 2003, 11-13 Aug. 2003, Pages: 89 – 93.
[24]Ya-Dong Wu; Yang Li; Bao-Long Liu; “A new method for approximate melody matching”, International Conference on Machine Learning and Cybernetics, 2003, Volume 5, 2-5 Nov. 2003, Pages: 2687 – 2691.
[25]Hsuan-Huei Shih; Narayanan, S.S.; Kuo, C.-C.Jay.; “Multidimensional humming transcription using a statistical approach for query by humming systems”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 3, 6-9 July 2003, Pages: III - 385-8.
[26]Parker, C.; “A tree-based method for fast melodic retrieval”, Conference on Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE, 7-11 June 2004, Pages: 254 – 255.
[27] Yoshiharu Suga, Naoko Kosugi, Masashi Morimoto, “Real-time background music monitoring based on content-based retrieval”, Proceedings of the 12th annual ACM international conference on Multimedia, New York, NY, USA, Pages: 120 – 127.
[28]Chih-Chin Liu, Chuan-Sung Huang, “A singer identification technique for content-based classification of MP3 music objects”, Proceedings of the eleventh international conference on Information and knowledge management, McLean, Virginia, USA, Pages: 438 – 445.
[29]Jun Gao; Tzanetakis, G.; Steenkiste, P.; “Content-based retrieval of music in scalable peer-to-peer networks”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 1, 6-9 July 2003, Pages: I - 309-12.
[30]Keiichiro Hoashi, Kazunori Matsumoto, Naomi Inoue, “Personalization of user profiles for content-based music retrieval based on relevance feedback”, Proceedings of the eleventh ACM international conference on Multimedia, Berkeley, CA, USA, Pages: 110 – 119.
[31]Jyh-Shing Roger Jang, Hong-Ru Lee, “Hierarchical filtering method for content-based music retrieval via acoustic input”, Proceedings of the ninth ACM international conference on Multimedia, Ottawa, Canada, Pages: 401 – 410.
[32]Yuen-Hsien Tseng, “Content-based retrieval for music collections”, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, Berkeley, California, United States, Pages: 176 – 182.
[33]Tao Li, Mitsunori Ogihara, Qi Li, “A comparative study on content-based music genre classification”, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, Toronto, Canada, Pages: 282 – 289.
[34]Pierre-Yves Rolland, Gailius Raškinis, Jean-Gabriel Ganascia, “Musical content-based retrieval: an overview of the Melodiscov approach and system”, Proceedings of the seventh ACM international conference on Multimedia (Part 1), Orlando, Florida, United States, Pages: 81 – 84.
[35]Hsuan-Huei Shih; Narayanan, S.S.; Kuo, C.-C.Jay.; “A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 1, 6-9 July 2003, Pages: I - 61-4.
[36]ARM Integrator/LM-XCV600E+ Logic Module User Guide.
[37]ARM Integrator/CM946E-S Core Module User Guide.
[38]ARM Integrator/AP User Guide.
[39]Johnson, S.E.; Woodland, P.C.; “A method for direct audio search with applications to indexing and retrieval”, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000, Volume 3, 5-9 June 2000, Pages: 1427 – 1430.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
系統版面圖檔 系統版面圖檔