Research on motion recognition has received increasing attention in recent years because the need for computer vision is growing in many domains, such as surveillance systems, multimodal human-computer interfaces, and traffic control systems. Most existing approaches separate recognition into spatial feature extraction and time-domain recognition. However, we believe that the information of motion resides in the space-time domain and is not restricted to the time domain or the space domain alone. Consequently, it seems more reasonable to integrate feature extraction and classification in the space and time domains together. We propose a Space-Time Delay Neural Network (STDNN) that can deal with 3-D dynamic information, such as motion recognition. For the motion recognition problem that we focus on in this paper, the STDNN is a unified structure in which low-level spatiotemporal feature extraction and space-time recognition are embedded. It possesses the spatiotemporal shift-invariant recognition abilities inherited from the time delay neural network (TDNN) and the space displacement neural network (SDNN). Unlike the multilayer perceptron (MLP), TDNN, and SDNN, the STDNN is constructed from vector-type nodes and matrix-type links so that spatiotemporal information can be gracefully represented in a neural network. Experiments are conducted to evaluate the performance of the proposed STDNN. In the moving Arabic numerals (MAN) experiments, which simulate an object moving in the space-time domain with image sequences, the STDNN shows its generalization ability for spatiotemporal shift-invariant recognition. In the lipreading experiment, the STDNN recognizes lip motions from real image sequences. The results show that the STDNN outperforms the existing TDNN-based system, especially in generalization ability. Although lipreading is a specific application, the STDNN can be applied to other applications since no domain-dependent knowledge is used in the experiment.
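To make the idea of vector-type nodes and matrix-type links more concrete, the following is a minimal, hypothetical sketch (not the authors' implementation): each node carries a spatial feature vector for one time step, each link is a weight matrix, and weight sharing over a sliding time-delay window provides the TDNN-style shift invariance. All names, dimensions, and the choice of nonlinearity below are illustrative assumptions.

```python
import numpy as np

def stdnn_like_layer(inputs, link_matrices, bias):
    """Illustrative layer with vector-type nodes and matrix-type links.

    inputs:        (T, D_in) sequence of vector-type nodes, one per time step.
    link_matrices: (K, D_out, D_in) one matrix-type link per tap of a
                   length-K time-delay window (weights shared across time).
    bias:          (D_out,) bias vector.
    Returns:       (T - K + 1, D_out) sequence of output vector-type nodes.
    """
    T, D_in = inputs.shape
    K, D_out, _ = link_matrices.shape
    outputs = []
    for t in range(T - K + 1):
        # Accumulate matrix-link responses over the delay window, as in a
        # TDNN, but each tap maps a whole feature vector rather than a scalar.
        acc = bias.copy()
        for k in range(K):
            acc += link_matrices[k] @ inputs[t + k]
        outputs.append(np.tanh(acc))  # squashing nonlinearity (assumed)
    return np.stack(outputs)

# Toy usage: 10 time steps of 16-D spatial feature vectors, 3-tap delay window.
rng = np.random.default_rng(0)
x = rng.normal(size=(10, 16))
W = rng.normal(scale=0.1, size=(3, 8, 16))
b = np.zeros(8)
y = stdnn_like_layer(x, W, b)
print(y.shape)  # (8, 8): shift along time is absorbed by shared matrix links
```

A full STDNN would apply the same weight-sharing idea along the spatial dimensions as well (as in an SDNN), so that a pattern shifted in space or time activates the same matrix-type links; this sketch only shows the temporal direction.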