臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.20) 您好！臺灣時間：2026/07/15 16:22

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
目次
參考文獻
紙本論文
QR Code

本論文永久網址:

研究生:

鄭煒平

研究生(外文):

WeiPing Cheng

論文名稱:

MPEG電影音訊自動分段方法

論文名稱(外文):

A Method for Automatic Audio Segmentation in MPEG Movies

指導教授:

劉志俊

指導教授(外文):

C.C. Liu

學位類別:

碩士

校院名稱:

中華大學

系所名稱:

資訊工程學系碩士班

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2005

畢業學年度:

語文別:

中文

論文頁數:

中文關鍵詞:

MPEG-7、MPEG音訊、音效分段、音效資料庫、內涵式查詢

外文關鍵詞:

MPEG-7、MPEG audio、audio segmentation、sound effect database、content-based retrieval

相關次數:

被引用:1
點閱:210
評分:
下載:0
書目收藏:4

隨著資訊軟硬體與網際網路技術快速發展，今日網路上的數位多媒體類型相當多元化，數位多媒體資料的取得也相當容易，要如何將如此豐富的數位資料保存整理，已經成為相當重要的課題。其中電影是一種最有趣但也是最複雜的多媒體資料，要對網際網路電影資料庫，描述其中電影資料的內涵特徵值，是所有電影內涵分析研究的首要條件。
在先前的研究我們提出一種有效且創新的方法，可以對電影劇情類型進行識別。在該論文中利用MPEG-7音效特徵值，對電影中所有音效進行分類，建立出網際網路電影資料庫，並建構出電影音效輪廓。但是美中不足的是電影中所有的音效必須以人工方式分段。因此本篇論文中將探討解決此一問題，我們提出一種利用MPEG-7音效特徵值組對電影音訊進行自動化分段，以協助先前的電影輪廓分析系統完成電影音訊自動分段與檢索，便於對大量的數位電影進行自動化分析。

1. 序論 5
2. 相關研究 7
3. MPEG電影音訊介紹 10
3.1 MPEG音訊介紹 10
3.2 MPEG音訊解碼係數特性 11
3.3 MP3特徵值向量 11
4. 電影音效分段之系統架構 13
5. 音效特徵值 15
5.1 MPEG-7音效特徵值組 15
5.2 非MPEG-7音效特徵值組 15
6. 音效斷點方法 19
6.1 經驗法則斷點偵測法 19
6.2 分類法則斷點偵測法 20
6.2.1 kNN分類器 21
6.2.2 RCE類神經網路分類器 23
6.2.3 電影音效特徵值資料庫 26
7. 實驗 28
7.1 實驗環境說明 28
7.1.1 實驗軟硬體 28
7.1.2 實驗樣本 28
7.1.3 實驗效能評估計算方式 29
7.1.4 影響音效分段效能之因素 30
7.2 經驗法則分段實驗結果 30
7.2.1音效特徵值邊界差異值的臨界值常數選定 30
7.2.2音效特徵值的權重設定 35
7.2.3 音訊資料框架小節大小 37
7.3 分類法則分段實驗結果 37
7.4 兩種分類法之比較 38
9. 參考文獻 41
附錄A MPEG7特徵値公式 45

[1]F. Dick Bernard, “Anatomy of Film 4th Edition,” Palgrave Macmillan, Jan 2002.
[2]S. M. Bhandarkar and A.A. Khombhadia, “Motion-based parsing of compressed video,” in Proc. of IEEE Intl. Workshop on Multi-Media Database Management Systems, pp. 80 –87, 1998.
[3]J. S. Boreczky and L. D.Wilcox, “A hidden Markov model framework for video segmentation using audio and image features,” in Proceedings of the 1998 IEEE Internation Conference on Acoustics, Speech, and Signal Processing, vol. 6, pp. 3741-3744, May 1998.
[4]N. Brady, “MPEG-4 standardized methods for the compression of arbitrarily shaped video objects,” IEEE Trans. on Circuits and Systems for Video Technology, Vol. 9, No. 8, pp. 1170 –1189, Dec. 1999.
[5]N. Brady, F. Bossen and N. Murphy, “Context-based arithmetic encoding of 2D shape sequences,” in Proc. IEEE Intl. Conf. on Image Processing, Vol. 1, pp. 29–32, 1997.
[6]Albert S. Bregman, “Auditory Scene Analysis: The Perceptual Organization of Sound,” MIT press, 1994.
[7]Guy J.Brown and Martin Cooke, "Computational auditory scene analysis," Computer Speech and Language,vol. 8, pp.297-336, Oct. 1994.
[8]M. J. Carey, E. S. Parris and H. Lloyd-Thomas, “A Comparison of Features for Speech Music Discrimination,” in Proc. of 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 149-152, March 1999.
[9]Wu Chou and Liang Gu, “Robust Singing Detection in Speech/Music Discriminator Design,” in Proc. of 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 865-868, May 2001.
[10]W.A.C. Fernando, C.N. Canagarajah, and D.R. Bull, “A unified approach to scene change detection in uncompressed and compressed video,” IEEE Transactions on Consumer Electronics, Vol. 46, No. 3, pp. 769 –779, Aug. 2000.
[11]Louis Giannetti, “Understanding Movies,” Prentice Hall, 1990.
[12]T. Hain and et al., “Segment Generation And Clustering In The Htk Broadcast News Transcription System,” in Proc. of 1998 Broadcast News Transcription and Understanding Workshop, pp. 133-137, 1998.
[13]ISO/IEC 11172-3:1993, “Information Technology — Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s — Part 3: Audio.”
[14]Zhu Liu and Qian Huang, “Classification of Audio Events in Broadcast News,” in Proc. of IEEE Second Workshop on Multimedia Signal Processing, pp. 364-369, Dec. 1998.
[15]Zhu Liu, Jincheng Huang and Yao Wang, ” Classification TV Programs Based on Audio Information Using Hidden Markov Model,” in Proc. of IEEE Second Workshop on Multimedia Signal Processing, vol. , pp. 27-32, Dec. 1998.
[16]Zhu Liu and et al., “Audio Feature Extraction and Analysis for Scene Classification,” in Proc. of IEEE First Workshop on Multimedia Signal Processing, pp. 343-348, June 1997.
[17]Beth Logan, “Mel Frequency Cepstral Coefficients for Music Modeling,” in Proc. of International Symposium on Music Information Retrieval, 2000.
[18]Lie Lu and et al., “Content Analysis for Audio Classification and Segmentation,” IEEE Transactions on Audio Classification and Segmentation, vol. 10, pp. 504-516, October 2002.
[19]J.P. Marques de Sá, ” Pattern Recognition Concepts, Methods and Applications,” Springer, 2001.
[20]MPEG Requirements Group, “Information technology - Multimedia Content Description Interface - Part 2：Description Definition Language,” ISO/IEC JTC1/SC29/WG11 N4002, Singapore, Mar. 2001.
[21]MPEG Requirements Group, “Information technology - Multimedia Content Description Interface - Part 4：Audio,” ISO/IEC CD 15938-4, Oct. 2000.
[22]MPEG Requirements Group, “Information technology - Multimedia Content Description Interface - Part 5：Multimedia Description Schemes,” ISO/IEC JTC1/SC29/WG11 N3966, Singapore, Mar. 2001.
[23]MPEG Requirements Group, “Overview of MPEG-7 Standard(version 8.0),” ISO/IEC JTC1/SC29/WG11 N4980, Singapore, July. 2002.
[24]A. Nagasaka and Y. Tanaka, “Automatic video indexing and fullvideo search for objects appearances,” Visual Database Systems II, E. Knuth and L. M. Wegner, Eds. New York: Elsevier Science, pp.113–127, 1992.
[25]Y. Nakajima and et al., “A fast audio classification from MPEG coded data,” in Proc. of 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 6, pp. 3005-3008, March 1999.
[26]M.R. Naphade and et al., “Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems,” in Proc. of 1998 International Conference on Image Processing, vol. 3, pp. 536-540, Oct. 1998.
[27]N. Patel and I. Sethi, “Audio Characterization for Video Indexing,” in Proc. of SPIE Conf. Storage Retrieval Still Image Video Databases, vol. 2670, pp. 373-384, 1996.
[28]Silvia Pfeiffer, Stephan Fischer and Wolfgang Effelsberg, “Automatic audio content analysis,” in Proc. of the fourth ACM international conference on Multimedia, pp. 21-30, 1997.
[29]V. I. Pudovkin, “Film Technique, and Film Acting,” Grove Press, Jun 1970.
[30]J. Saunders, “Read-Time Discrimination of Broadcast Speech/Music,”in Proc. of IEEE ICASSP, pp. 993-996, 1996.
[31]E. Scheirer and M.Slaney, “Construction and Evaluation of a Robust Multifeature Speech/Music Discrimination,” in Proc. IEEE ICASSP, vol. 2, pp. 1331-1334, 1997.
[32]G.. Tzanetakis and P. Cook, “Multifeature Audio Segmentation for Browsing and Annotation,” IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 17-20, Oct. 1999.
[33]E. Wold and et al., “Content-based Classification, Search, and Retrieval of Audio,” IEEE Multimedia, vol. 3, pp. 27-36, Fall 1996.
[34]B. L. Yeo and B. Liu, “Rapid scene analysis on compressed videos,” IEEE Trans. Circuits Syst. Video Technol., Vol. 5, No. 6, pp. 533-544, Dec. 1995.
[35]H. J. Zhang , A. Kankanhalli, and S. W. Smoliar, “Automatic partitioning of full-motion video,” Multimedia Systems, Vol.1, No.1, pp.10-28, June 1993.
[36]Tong Zhang and C. C. Jay Kuo, “Audio Content Analysis for Online Audiovisual Data Segmentation and Classification,” IEEE Transactions on Speech and Audio Processing, vol. 9, pp. 441 - 457, MAY 2001.
[37]范世鎮、劉志俊, “利用特寫鏡頭偵測與主角辨識技術來自動建立電影摘要,” 第二屆數位典藏技術研討會, 2003.
[38]陳信修、劉志俊, “一種利用特寫鏡頭對數位電影資料進行自動化摘要合成之技術,” 第一屆數位典藏技術研討會, 2002.
[39]黃群菘、劉志俊, “MP3數位音樂資料的自動化分類,” 第一屆數位典藏技術研討會, 2002
[40]葉億真、劉志俊, “音效資料的內涵式分類及其在電影資料庫的應用,” 第二屆數位典藏技術研討會, 2003.
[41]劉志俊、傅佳源、王志浩、喻仲平, “一種利用物件形狀來進行MPEG-4鏡頭變化偵測之技術,” 第一屆數位典藏技術研討會, 2002.
[42]梅長齡, ”電影原理與製作,” 三民書局股份有限公司, 1978.
[43]鄭煒平、劉志俊, “MPEG電影音效自動分段系統,” 2004數位生活與網際網路科技研討會, 2004.
[44]鄭煒平、劉志俊, “網際網路電影資料庫之音效自動分段索引系統,” 網際網路應用與發展學術研討會,Vol. 6, 2005.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	運動電影之影像分類學：《衝浪季節》之影像分析
2.	以運動軌跡為主之棒球戰術資料庫
3.	MP3音樂物件之自動特徵值的擷取與時序上的分段
4.	音效資料的內涵式分類及其在電影資料庫的應用

1.	50.鄧盛東, “水下偵測專輯-淺談水下量測電聲換能器互易校正法,” 海下技術季刊, 第八卷, 第二期, pp.20-22, 六月, 1998.

1.	網路行銷工具之投資報酬率分析-以超級電池公司為例
2.	蛋白質表面結構模型及其搜尋演算法
3.	利用基因演算法建構演化樹之分析
4.	雪霸國家公園雪見地區景觀道路遊客美質偏好與生態工法應用之研究
5.	建築模板作業工率值預估模式之研究
6.	照相感應模組產業之良率研究
7.	從消費者行為探討高職學生升學選校之考量因素
8.	以新聞報導角度觀察政府危機處理能力-以SARS事件為例
9.	隨意無線電網路性質之量化分析與實驗
10.	利用資料分割來縮短不規則資料重新分配的傳輸時間
11.	開發SCP轉換DICOM12導程心電圖閘道伺服器
12.	微陣列玻片資料庫之建置:整合基因表現與生理調控路徑
13.	利用整合工具建立土雞濾泡發育的生理路徑模式
14.	銅活字中文辨識之序列比對演算法研究
15.	利用RF距離評估演化樹及合併演化樹演算法

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室