博碩士論文 88521044 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:24 、訪客IP:3.145.52.182
姓名 陳慶彰(Ching-Chang Chen )  查詢紙本館藏   畢業系所 電機工程研究所
論文名稱 運用G.729與G.723.1於多點會議系統之多聲道語音混合方法
(A G.729 and G.723.1 Based Multi-Channel Speech Mixing Method for Multi-Point Conferencing System)
相關論文
★ 寬頻網路之即時視訊品質控制系統★ 多層漸進式零樹小波分頻音訊壓縮技術
★ 資料隱藏技術應用於 H.263 視訊編碼之錯誤偵測★ 無線傳輸及網際網路環境下之G.729與G.723.1語音傳輸
★ 零樹小波視訊編碼之錯誤偵測與隱藏★ 雜訊通道中視訊編碼防錯技術之研究
★ 小波及離散餘弦域上使用子區塊分類之金匙數位影像浮水印系統★ 資料隱藏應用於零樹小波分頻壓縮系統之音訊封包遺失回復技術
★ IP網路之MPEG-4調適性視訊品質控制★ 區塊重排於小波封包之階層式集合分割影像壓縮技術
★ 動態調整緩衝區方法應用於網路延遲之VoIP語音品質改善★ 語音編碼G.729及MELP之資料隱藏方法
★ 金匙小波域音訊浮水印系統★ 快速多階連續消除移動預估演算法應用於H.26L視訊編碼標準
★ 視訊封包封裝與調適性自動重送於無線區域網路之研究★ 即時性視訊訊務預測與頻寬協商機制於具服務品質保證之網路
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   至系統瀏覽論文 ( 永不開放)
摘要(中) 摘要
音訊混合(audio mixing)是網路音訊會議中不可缺少的機制,其重要性除了提供與真實會議現場一樣的發言環境,讓每位與會者在網路上以全雙工(full-duplex)的方式進行交談,尚可應用在網路連線遊戲及語音聊天室等娛樂用途。
傳統上最佳的語音混合方法是使用完全解碼(full decoding)的方式,在過程上必需進行語音的壓縮及解壓縮處理,造成運算複雜度過高與時間延遲長的缺點。為此,本論文提出一套部份解碼(partial decoding)方式的語音混合方法,利用語音訊號的特性,針對多個待混合的已壓縮音訊訊號,以碼框(frame)為單位,分析代表各音訊所需的音訊參數,選出一組目標音訊參數,作為混合後的音訊輸出。該組目標音訊參數亦符合原壓縮方法之壓縮格式,同時,可混合多個輸入的音訊。可運用在G.729與G.723.1語音壓縮標準上,並有效地降低運算複雜度為完全解碼法的5%至8%,且可得到與完全解碼法相同的混音品質。
摘要(英) In a multi-point conference, users are offered a substitute for a face-to-face meeting within the economic constraints of the technology available. In this situation, an audio mixing scheme is needed to make the meeting successful. Audio mixing can create a full-duplex conversation environment that users can speak at any moment. Furthermore, it can be used in entertainment applications, such as audio chat rooms and online games.
Full decoding method is an intuitive and traditional audio mixing method, but it requires high computational complexity and long processing time. In this work, we propose a partial decoding method based on CELP coding architecture. This method selects a target frame as the mixed output from all incoming frames. There is no need for any encoding and decoding processes. Partial decoding method can be directly applied to CELP based speech coding, such as G.729 and G.723.1 speech standards. It achieves excellent voice quality as the full decoding method does while it only requires 5% to 8% computation loading.
關鍵字(中) ★ 多點會議系統
★  完全解碼
★  部份解碼
★  音訊混合
關鍵字(英) ★ Audio Mixing
★  Full Decoding
★  Multipoint Conference
★  Partial Decoding
論文目次 內容 頁碼
中文摘要 .………...……..……………………………………………. I
Abstract …………………………………………………….………… II
目錄 ……………………………………………………….………… III
附圖索引 ………………………………………………….………. VI
附表索引 ……………………………………………….………. IX
第一章 緒論 ………………………..……………….………… 1
1.1多點會議系統簡介 ………………………………………… 1
1.2研究動機與目的 ………………..………………………….. 4
1.3論文架構 ……………….…………………..………………... 7
第二章 語音編碼技術…………..…………………………… 9
2.1語音的特性 ………….…….……………..…………… 9
2.2線性預測編碼(Linear Prediction Coding) ….……… 11
2.3基頻估測(Pitch Prediction) ……………………………. 16
2.4 CELP編碼理論 ……………………………………...….…. 19
2.5語音編碼系統的特性 …………………………………….. 24
2.5.1位元率 ……………………………...……………...….…. 24
2.5.2時間延遲 ……………………………………………...…. 26
2.5.3運算複雜度 ……………….…………………………..…. 28
2.5.4語音品質 ………………….…………………………..…. 28
第三章 G.729與G.723.1差異性研究…….……….…… 31
3.1線性預測分析及量化(Linear Prediction Analysis
and Quantization) ………………………..…………… 33
3.2感觀權重濾波器(Perceptual Weighting Filter) ..….…. 38
3.3開迴路基頻分析(Open Loop Pitch Analysis) ……….. 40
3.4適應性碼簿搜尋(Adaptive-codebook Search) ………. 42
3.5固定性碼簿之結構與搜尋 …………………………....…. 47
3.6增益量化 ……………………………………….………...…. 53
3.7附錄介紹 …………………………………………….. 55
第四章 多聲道語音混合方法之探討 …………….……59
4.1高位元率語音壓縮的混合方法…………..…………...…. 61
4.1.1線性疊加………………..……………..……………..…… 61
4.1.2時間分割多工…………..……………..……………..…… 64
4.2低位元率語音壓縮的混合方法…………………….……..65
4.2.1完全解碼法……………..……………..……………..…… 65
4.2.2部份解碼法……………..……………..……………..…… 66
4.2.2.1雙聲道語音混合……………….…………..…………68
4.2.2.2多聲道語音混合……………….…………..…………72
4.2.3樹狀結構部份解碼法………………....……………..…… 73
4.3語音混合之品質評量方法……………….…………... 74
4.3.1客觀品質評量………………………....……………..…… 74
4.3.2主觀品質評量………………………....……………..…… 76
第五章 模擬實驗與品質評估…………………………….. 77
5.1模擬環境及語音資料…..………..……………………...… 77
5.2低位元率語音壓縮的混合品質 …………………..…..… 79
5.3主觀聽覺測試 ……………………………...…..………...… 95
5.4運算複雜度分析 ………………………………………...… 96
第六章 結論 ……………………………………….….……..… 98
參考文獻 ……………………………………………………...… 99
參考文獻 [1]ITU-T, "Narrow-band visual telephone systems and terminal equipment," Summary of ITU-T Rec. H.320, June 1997.
[2]ITU-T, "Adaptation of H.320 visual telephone terminals to B-ISDN environments," Summary of ITU-T Rec. H.321, Feb 1998.
[3]ITU-T, "Visual telephone systems and terminal equipment for LAN which provide a guaranteed quality of service," Summary of ITU-T Rec. H.322, Mar 1996.
[4]ITU-T, "Visual telephone systems and terminal equipment for LAN which provide a non-guaranteed quality of service," Summary of ITU-T Rec. H.323, Nov 1996.
[5]ITU-T, "Terminal for low bit rate multimedia communication," Summary of ITU-T Rec. H.324, Mar 1996.
[6]吳佳任、林盈達;『徹底了解VOIP』;網路通訊;90期,1999年 1 月。
[7]J. C. William, "Multipoint Multimedia Conferencing," IEEE Communication, May 1992, pp. 44-50
[8]ITU-T Recommendation G.723.1, "Dual Rate Speech Coder for Multimedia Comunications Transmitting at 5.3 and 6.3 Kbits," March 1996.
[9]ITU-T Recommendation G.729, "Coding of speech at 8 kbit/s using Conjugate-Structure Algebraic-Code-Excited Linear-Prediction (CS-ACELP)," March 1996.
[10]L. R. Rabiner, R. W. Schafer, Digital Prediction of Speech Signals, Prentice Hall, 1978.
[11]A. S. Spanias, "Speech Coding: A Tutorial Review," Proceedings of the IEEE, vol. 82, no. 10, October 1994, pp. 1541-82.
[12]A. M. Kondoz, Digital Speech Coding for Low Bit Rate Communications Systems, Wiley, 1994
[13]D. G. Rowe, "Techniques for Harmonic Sinusoidal Coding," Thesis ,1997
[14]M. R. Scheroeder, B. S. Atal, "Code-excited linear prediction (CELP): high quality speech at very low bit rate," ICASSP, March 1985, pp. 937-940
[15]B. Gold and C. Rader, "The channel vocoder," IEEE Trans. Audio , vol. AU-15, Dec. 1967, pp. 148-161.
[16]I. Gibson, "Vector sum excited linear prediction (VSCELP) speech coding for Japan digital cellular," presented at the Meeting of IEICE, paper RCS90-26, Nov. 1990.
[17]J. P. Campbell, T. E. Tremain and V. C. Welch, "The DOD 4.8 kbps Standard (Proposed Federal Standard 1016)," Advances in Speech Coding, Kluwer Academic Publishers, 1991, pp. 121-133.
[18]R. V. Cox, "Three New Speech Codecs from the ITU Cover a Range of Application," IEEE Comm. Magazine, Sept. 1997.
[19]A. Papoulis, Probability, Random Variables, and Stochastic Processes, third edition, McGraw-Hill, 1991.
[20]A. Gersho, "Advances in Speech and Audio Compression," Proceedings of the IEEE, vol. 82, no. 6, June 1994, pp. 900-918.
[21]R. V. Cox and P. Kroon, "Low bit-rate speech coders for multimedia communication," IEEE Comm. Magazine, vol.34, no.12, Dec. 1996, pp.34-41.
[22]Shu Lin and D. J. Constello, Error Control Coding Fundamentals and Applications, Prentice-Hall, 1983.
[23]T. Fingscheidt, P. Vary and J. A. Andonegui, "Robust speech decoding: can error concealment be better than error correction?," Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference, vol. 1, May 1998, pp. 373-376.
[24]廖瑞祥, 無線傳輸環境下G.723.1語音編碼之位元保護與錯誤隱藏處理, 碩士論文, 中央大學, 1998.
[25]朱復興, 無線傳輸及網際網路環境下之G.729與G.723.1語音傳輸, 碩士論文, 中央大學, 2000.
[26]S. Atungsiri et. al , "Effective lost speech frame reconstruction for CELP coders," Proc. of EUROSPEECH Conf., volume 2, September 1991.
[27]R. Salami et. al. , "ITU-T G.729 Annex A : Reduced Complexity 8 kb/s CS-ACELP Codec for Digital Simultaneous Voice and Data," IEEE Comm. Magazine Sep. 1997
[28]A. Benyassine et. al. , "ITU-T G.729 Annex B : A Silence Compression Scheme for Use with G.729 Optimized for V.70 Digital Simultaneous Voice and Data Applications," IEEE Comm. Magazine Sep. 1997.
[29]Xu Wei, "A solution with fixed point DSP to digital audio mixing in voice conferencing," Texas Instruments (China) Co.,Ltd.
[30]S. Yeldener, J.C. de Martin, and V. Viswanathan, "A Mixed Sinusoidally Excited Linear Prediction Coder at 4 KB/S and Below," Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 2, 1998, pp. 589-592.
[31]S. R. Quackenbush, T. P. Barnwell and M. A. Clements, Objective Measures of Speech Quality, Prentice Hall, 1988.
[32]http://www.sipro.com/about/g723c54x.htm
[33]http://www.adt-inc.com/products/adtg729_1.htm
指導教授 張寶基(Pao-Chi Chang) 審核日期 2001-6-13
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明