跳到主要內容

臺灣博碩士論文加值系統

(34.204.181.91) 您好!臺灣時間:2023/09/29 13:55
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:陳正偉
研究生(外文):Jeng-Wei Chen
論文名稱:感興趣區域視訊編碼之研究
論文名稱(外文):The Research of Region-of-Interest Video Coding
指導教授:陳美娟陳美娟引用關係
指導教授(外文):Mei-Juan Chen
學位類別:碩士
校院名稱:國立東華大學
系所名稱:電機工程學系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2002
畢業學年度:90
語文別:英文
論文頁數:96
中文關鍵詞:膚色偵測感興趣區域視訊編碼
外文關鍵詞:H.263+face detectROI
相關次數:
  • 被引用被引用:1
  • 點閱點閱:216
  • 評分評分:
  • 下載下載:32
  • 收藏至我的研究室書目清單書目收藏:0
給予感興趣區域較高的權重,是近來影像編碼常用到的技術。我們提出一個快速、簡單的方法來動態地偵測人臉,並將這區域視為感興趣區域。我們採用R, G, B 及 Cr的資訊來定義出屬於膚色的像素,因為RGB及YCbCr這兩個色彩系統,廣泛的被硬體或影像編碼標準所採用,這樣的好處是不需要其他的前處理且偵測效果更精確。在定義出感興趣區域後,我們使用低通濾波器來減少非感興趣區域所需使用的位元。
我們的系統架構在H.263+上,並配合其中的一個附加選項:Modified Quantiaztion,我們調整各區域巨集區塊(macroblock)的失真權重及變異數,並藉此來控制其所產生出的影像品質。在我們的實驗結果顯示,我們的方法有顯著的提升感興趣區域的品質。我們的方法非常適用於即時系統的應用上。
The ability to give higher priority to Region-of-Interest(ROI) is the emerging functionality for nowadays video coding. A simple and fast method of face detection is proposed to dynamically define ROI in real time application. We use the color information R,G,B and Cr to determine the skin-color pixels. We don’t need any preprocessing, because these two color spaces are used in most hardware and video codec standards. Then, we use low-pass filters for background to reduce used bits.
For video coding system, a region-based video codec based on the H.263+ with the option mode of modified quantization is set up. We adjust the distortion weight parameter and variance at macroblock layer to control the qualities at different regions. From experimental results, the proposed method can significantly improve quality at ROI. Our method is suitable for real time videoconferencing.
ABSTRACT
CHAPTER 1 INTRODUCTION
1.1 BACKGROUND
1.2 REGION OF INTEREST VIDEO CODING
1.3 ORGANIZATION OF THE THESIS
CHAPTER 2 OVERVIEW OF EXISTING METHODS
2.1 OVERVIEW OF ROI
2.2 OVERVIEW OF FACE DETECTION
2.2.1 Face Detection using Skin-color
CHAPTER 3 PROPOSED METHOD
3.1 FACE DETECTION
3.2 BLURRING
3.3 VIDEO CODING
CHAPTER 4 EXPERIMENTAL RESULTS
CHAPTER 5 CONCLUSION
BIBLIOGRAPHY
International Telecommunication Union, “Narrow-band visual telephone systems and terminal equipment”, ITU-T Recommendation H.320, May 1999.
International Telecommunication Union, “Terminal for low bit-rate multimedia communication”, ITU-T Recommendation H.324, February 1998.
International Telecommunication Union, “Packet-based multimedia communications systems”, ITU-T Recommendation H.323, September 1999.
International Telecommunication Union, “Video coding for low bit rate communication”, ITU-T Recommendation H.263, March 1996.
International Telecommunication Union, “Video codec for audiovisual services at p*64 Kbps”, ITU-T Recommendation H.261, March 1993.
International Telecommunication Union, “Video coding for low bit rate communication”, ITU-T Recommendation H.263 version 2, January 1998.
Ming-Ting Sun and Amy R. Reibman, “Compressed video over network”, P.4-5, Marcel Dekker, Inc., 2001.
ISO/IEC JTC1/SC29/WG11, “ISO/IEC CD 11172: Information Technology”, MPEG-1 Committee Draft, December 1991.
ISO/IEC JTC1/SC29/WG11, “ISO/IEC CD 11172: Information Technology”, MPEG-2 Committee Draft, December 1993.
D. Chai and K. N. Ngan, “Face segmentation using skin-color map in videophone applications”, IEEE Trans. On Circuits and Systems for Video Technology, vol. 9, pp.551-564, June 1999.
A. Eleftheriadis and A. Jacqin, “Automatic face location detection for model-assisted rate control in H.261-compatible coding of video”, Signal Processing: Image Communication, vol. 7, no. 4-6, pp. 435-455, November 1995.
J. Hartung, A. Jacquin, J. Paqlyk, J. Rosenberg, H. Okada and P. E. Crouch, “Object-oriented H.263 compatible video coding platform for conferencing applications”, IEEE Journal Selected Areas in Communications, vol. 16, no. 1, pp. 42-55, January 1998.
B. Moghaddam and A. Pentland, “Probabilistic visual learning for object recognition”, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 696-710, July 1997.
H.P. Fraf, T. Chen, E. Petajan and E. Cosatto, “Locating faces and facial parts”, Proc. 1st Int. Workshop Automatic Face and Gesture Recognition, pp. 41-46, 1995.
K. Sobottka and I. Pitas, “Face localization and facial feature extraction based on shape and color information”, IEEE ICIP’96, vol. 3, pp. 483-486, September 1996.
R. Kjeldsen and J. Kender, “Find skin in color images”, Proc. Of 2nd Int. Conf. On Automatic Face and Gesture Recognition, pp. 379-384, October 1996.
Richard P. Schumeyer, Edwin A. Heredia, and Kenneth E. Barner, “Region of Interest Priority Coding for Sign Language Videoconferencing,” IEEE on Multimedia Signal Processing, pp. 531-536, 1997.
G.Bedini, L.Favalli, A.Marazzi, A.Mecocci, and C.Zanardi, “An integrated approach for high-compression of videoconference sequences,” in Proc. IEEE Intl. Conf. Comm. 95, vol. 1, pp. 563-567, 1995.
A.Eleftheriadis and A.Jacquin, “Automatic face location detection and tracking for model-assisted coding of video teleconferencing sequences at low bit rates,” Signal Processing: Image Communication, vol. 7, no. 3, pp. 231-248, 1995.
J.Luo, C.W.Chen, and K.J.Parker, “Face location in wavelet-based video compression for high perceptual quality videoconferencing,” in Proc. ICIP 95, vol. 2, pp. 583-586, 1995.
A.Eleftheriadis and A.Jacquin, “Low bit rate model-assisted H.261-compatible coding of video,” in Proc. ICIP 95, vol. 2, pp. 418-421, 1995.
E.Badique, “Knowledge-based facial area recognition and improved coding in a CCITT-compatible low-bitrate video-codec,” in Picture Coding Symposium, 1990.
Hwangjun Song, C.-C. and Jay Kuo, ”Rate Control of a Region-Based H.263 Video Codec Under Time-Varying Channels,” IEEE Multimedia Signal Processing, pp. 339-344, 1999.
Li Ding,Kunnio Takaya, ”H.263 Based Facial Image Compression for Low Bitrate Communications,” IEEE Communications Power and Computing, pp.30-34, 1997.
D. chai, K. N. Ngan and A. Bouzerdoum, “Foreground/Background Bit Allocation for Region-Of-Interest Coding”, Proc. Int’l Conf. on Image Processing, vol.2, pp. 438 -441, 2000.
L. L. Yang and M. A. Robertson, “Multiple-face tracking system for general region-of-interest video coding”, Proc. Int’l Conf. on Image Processing, vol. 1, pp. 347-350, 2000.
C. H. Chen, L. G. Chen and H. C. Chang, “Using a Region-Based Blurring Method and Bits Reallocation to Enhance Quality on Face Region in Very Low Bitrate Video”, Proc. ISCAS 1998, vol. 4, pp. 134-137, 1998.
C. W. Lin, T. J. Liou and Y. C. Chen, “Dynamic rate control in multipoint video transcoding”, Proc. ISCAS 2000 Geneva, vol. 2, pp. 17-20, 2000.
M. T. Sun, T. D. Wu and J.N. Hwang, “Dynamic bit allocation in video combining for multipoint video conferencing”, IEEE Trans. On Circuit and Systems, vol. 45, No. 5, pp. 644-648, 1998.
J. R. Corbera and S. Lei, “Rate control in DCT video coding for low-delay communications”, IEEE Trans. On Circuit and Systems for Video Technology, vol. 9, no. 1, pp. 172-185, 1999.
G. Yang, and T. S. Huang, “Human Face Detection in Complex Background”, Pattern Recognition, vol. 27, no. 1, pp. 53-63, 1994.
C. Kotropouios and I. Pitas, “Rule-Based Face Detection in Frontal Views”, Proc. Int’l conf. Acoustics, Speech and Signal Processing, vol. 4, pp. 2537-2540, 1997.
I. Craw, H. Ellis, and J. Lishman, “Automatic Extraction of Face Features”, Pattern Recognition Latters, vol. 5, pp. 183-187, 1987.
I. Craw, D. Tock and Bennett, “Finding Face Features”, Proc. Second European Conf. Computer Vision, pp. 92-96, 1992.
A. Tsukamoto, C.-W. Lee, and S. Tsuji, “Detection and Tracking of Human Face with Synthesized Templates”, Proc. First Asian Conf. Computer Vision, pp. 183-186, 1993.
A. Samal and P.A. Iyengar, “Human Face Detection Using Silhouettes”, Int’l J. Pattern Recognition and Artificial Intelligence, vol. 9, no. 6, pp. 845-867, 1995.
Y. Sumi and Y. Ohta, “Detection of Face Orientation and Facial Components Using Distributed Appearance Modeling”, Proc. First Int’l Workshop Automatic Face and Gesture Recognition, pp. 254-259, 1995.
A. Dempster, “A Generalization of Bayesian Theory”, J. Royal Statistical Soc., vol. 30, pp. 205-247, 1978.
P. Sinha, “Object Recognition via Image Invariants: A case Study”, Investigative Ophthalmology and Visual Science, vol. 35, no. 5, pp. 1735-1740, 1994.
C. Papageorgiou and T. Poggio, “A Trainable System for Object Recognition”, Int’l J. Computer Vision, vol. 38, no. 1, pp. 15-33, 2000.
C. Breazeal and B. Scassellati, “A Context-Dependent Attention System for a Social Robot”, 16th Int’l Joint Conf. Artifical Intelligence, vol. 2, pp. 1146-1151, 1999.
M. Turk and A. Pentland, “Eigenfaces for Recognition”, J. Cognitive Neuroscience, vol. 3, no. 1, pp. 71-86, 1991.
H. Rowley, S. Baluja, and T. Kanade, “Neural Network-Based Face Detection”, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 1, pp. 23-38, January 1998
E. Osuna, R. Freund and F. Girosi, “Training Support Vector Machines: An Application to Face Detection”, Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 130-136, 1997.
S.A. Sirohey, “Human Face Segmentation and Identification”, Technical Report CS-TR-3176, Univ. of Maryland, 1993.
D. Chetverikov and A. Lerch, “Multiresolution Face Detection“, Theoretical Foundations of Computer Vision, vol. 69, pp. 131-140, 1993.
T.K. Leung, M.C. Burl and P. Perona, “Finding Faces in Cluttered Scenes Using Random Labeled Graph Matching”, Proc. 5th IEEE Int’l Conf. Computer Vision, pp. 637-644, 1995.
M.C. Burl, T.K. Leung and P. Perona, “Face Localization via Shape Statistics”, Proc. 1st Workshop Automatic Face and Gesture Recognition, pp. 154-159, 1995.
T.K. Leung, M.C. Burl and P. Perona, “Probabilistic Affine Invariants for Recognition”, Proc. IEEE Conf. Compute Vision and Pattern Recognition, pp. 678-684, 1998.
D.G. Kendall, “Shape Manifolds, Procrustean Metrics, an Complex Projective Shapes”, Bull. London Math. Soc., vol. 16, pp. 81-121, 1984.
K.V. Mardia and I.L. Dryden, “Shape Distributions for Landmark Data”, Advanced Applied Probability, vol. 21, pp. 742-755, 1989.
M.F. Augusteijn and T.L. Skujca, “Identification of Human Faces through Texture-Based Feature Recognition and Neural Network Technology”, Proc. IEEE Conf. Neural Networks, pp. 392-398, 1993.
S. Fahlman and C. Lebiere, “The Cascade-Correlation Learning Architecture”, Advances in Neural Information Processing System 2, D.S. Touretsky, ed., pp. 524-532, 1990.
J. Yang and A. Waibel, “A Real-Time Face Tracker”, Proc. 3rd Workshop Applications of Computer Vision, pp. 142-147, 1996.
T.S. Jebara, K. Russell, and A. Pentland, “Mixtures of Eigenfeatures for Real-Time Structure from Texture”, Proc. 6th IEEE Int’l Conf. Computer Vision, pp. 128-135, 1998.
S. Satoh, Y. Nakamura and T. Kanade, “Name-It: Naming and Detecting Faces in News Video”, IEEE Nultimedia, vol. 6, no. 1, pp. 22-35, 1999.
J.L. Crowley and F. Berard, “Multi-Model Tracking of Faces for Video Communications”, Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 640-645, 1997.
N. Oliver, A. Pentland and F.Berard, “LAFER: Lips and Face Real Time Tracker”, Proc. IEEE Conf. Cmputer Vision and Pattern Recognition, pp. 123-129, 1997.
Q.B. Sun, W.M. Huang and J.K. Wu, “Face Detection Based on Color and Local Symmetry Information”, Proc. 3rd Int’l Conf. Automatic Face and Gesture Recognition, pp. 130-135, 1998.
R.J. Qian, M.I. Sezan and K.E. Matthews, “A Robust Real-Time Face Tracking Algorithm”, Proc. IEEE Int’l Conf. Image Processing, pp. 131-135, 1998.
D. Saxe and R. Foulds, “Toward Robust Skin Identification in Video Images”, Proc. 2nd Int’l Conf. Automatic Face and Gesture Recognition, pp. 379-384, 1996.
K. Sobottka and I. Pitas, “Face Localization and Feature Extraction Based on Shape and Color Information”, Proc. IEEE Int’l Conf. Image Processing, pp. 483-486, 1996.
H. Wang and S.-F. Chang, “A Highly Efficient System for Automatic Face Region Detection in MPEG video”, IEEE Trans. Circuits and Systems for Video Technology, vol. 7, no. 4, pp. 615-628,1997.
D. Chai and K.N. Ngan, “Locating Facial Region of a Head-and-Shoulders Color Image”, Proc. 3rd Int’l Conf. Automatic Face and Gesture Recognition, pp. 124-129, 1998.
Y. Dai and Y. Nakano, “Face-Texture Model Based on SGLD and Its Application in Face Detection in a Color Scene”, Pattern Recognition, vol. 29, no. 6, pp. 1007-1017,1996.
M. J. T. Reinders, P. J. L. van Beek, B. Sankur and J. C. A. van der Lubbe, “Facial Feature Localization and Adaptation of a Generic Face Model for Model-Based Coding”, Signal Process. Image Commun., vol. 7, no. 1, pp. 57-74, March 1995.
Y.J. Zhang, Y.R. Yao and Y. He, “Automatic Face Segmentation Using Color Cues for Coding Typical Videophone scenes”, Proc. SPIE Visual Commun. And Image Processing, San Jose, CA, vol. 3024, pp. 468-479, February 1997.
L. Li and R. Forchheimer, “Location Of Face Using Color Cues”, Proc. Picture Coding Symp., Lausanne, Switzerland, paper 2.4, March 1993.
D. Chai and K.N. Ngan, “Automatic Face Location For Videophone Iimages”, Proc. IEEE TENCON’96, Perth, Australia, vol. 1, pp. 137-140, November 1996.
H. P. Graf, E. Cosatoo, D. Gibbon, M. Kocheisen and E. Petajan, “Multi-modal system for locating heads and faces”, Proc. Int. Conf. Automatic Face and Gesture Recognition, Killington, VT, pp. 88-93, October 1996.
C. E. Priebe, “Adaptive mixtures”, Journal of the American Statistical Association, vol. 89,no. 427, pp. 796-806, 1994.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
1. 嚴曼麗,〈從神秘到愛──與英國倫敦大學宗教哲學教授冉天恩談神秘主義〉,《當代》第三十六期,頁六十八至七十三,[台北:當代雜誌社,中華民國七十八年四月一日出版]。
2. 劉秋固,〈超個人心理學與宗教心理學對靈性問題研究〉,《宗教哲學》季刊第四卷第三期,頁一七三至一八八,[台北:宗教哲學雜誌社,中華民國八十七年七月一日出版]。
3. 黃克鑣,〈早期希臘教父神秘思想〉,《輔仁大學神學論集》第一一六期,頁二七一至二八七,[台北:光啟出版社,一九九八年七月]。
4. 張春申,〈隱修性的神秘與使徒性的神秘〉,《輔仁大學神學論集》第八十九期,頁三四九至三五七,[台北:光啟出版社,中華民國八十年十月]。
5. 張奉箴,〈神秘經驗與天主教〉,《輔仁大學神學論集》第九十三期,頁四二九至四五六,[台北:光啟出版社,中華民國八十三年十月]。
6. 高天恩,〈追索西洋文明裡的神秘主義〉,《當代》第三十六期,頁十八至三十八,[台北:當代雜誌社,中華民國七十八年四月一日出版]。
7. 沈清松,〈表象、交談與身體──論密契經驗的幾個哲學問題〉,《哲學與文化》第二七四期,[台北:輔仁大學,中華民國八十六年三月],頁二六二至二七四。
8. 關永中,〈神秘主義及其四大型態〉,《當代》第三十六期,頁三十九至四十八,[台北:當代雜誌社,中華民國七十八年四月一日出版]。
9. 談德義著,歐馨雲譯,〈神秘性‧神秘主義‧神秘化〉,《當代》第四十一期,頁九○至九十八,[台北:當代雜誌社,中華民國七十八年九月一日出版]。
10. 石朝穎,〈現代心理學與古典宗教意識的會通〉,《宗教哲學》季刊第四卷第四期,台北:中華民國八十七年十月一日出版。
11. 鄔昆如,〈宗教靈修的時空基礎〉,《宗教哲學》季刊第四卷第二期,頁一至八,[台北:宗教哲學雜誌社,中華民國八十七年四月一日出版]。