博碩士論文 91523046 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:24 、訪客IP:3.145.60.166
姓名 吳宗紘(Tsung-Hung Wu)  查詢紙本館藏   畢業系所 通訊工程學系
論文名稱 運用混合小波封包與離散餘弦轉換及 最佳位元配置之高音質音訊壓縮系統
(Hybrid Wavelet Packet and Discrete Cosine Transform with Optimum Bit Allocation Applied to High-Quality Audio Coding )
相關論文
★ 基於區域權重之衛星影像超解析技術★ 延伸曝光曲線線性特性之調適性高動態範圍影像融合演算法
★ 實現於RISC架構之H.264視訊編碼複雜度控制★ 基於卷積遞迴神經網路之構音異常評估技術
★ 具有元學習分類權重轉移網路生成遮罩於少樣本圖像分割技術★ 具有注意力機制之隱式表示於影像重建 三維人體模型
★ 使用對抗式圖形神經網路之物件偵測張榮★ 基於弱監督式學習可變形模型之三維人臉重建
★ 以非監督式表徵分離學習之邊緣運算裝置低延遲樂曲中人聲轉換架構★ 基於序列至序列模型之 FMCW雷達估計人體姿勢
★ 基於多層次注意力機制之單目相機語意場景補全技術★ 基於時序卷積網路之單FMCW雷達應用於非接觸式即時生命特徵監控
★ 視訊隨選網路上的視訊訊務描述與管理★ 基於線性預測編碼及音框基頻週期同步之高品質語音變換技術
★ 基於藉語音再取樣萃取共振峰變化之聲調調整技術★ 即時細緻可調性視訊在無線區域網路下之傳輸效率最佳化研究
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   [檢視]  [下載]
  1. 本電子論文使用權限為同意立即開放。
  2. 已達開放權限電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。
  3. 請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。

摘要(中) 以小波分頻的訊號壓縮技術已被廣泛地應用在音視訊編碼系統中,其優越的性能充分顯現在靜態影像之壓縮技術上。本論文提出混合小波與離散餘弦轉換之音訊壓縮系統,以小波封包分頻方式,將樂音訊號經由濾波器群組分成26個次頻帶,再根據時域與頻域之平坦程度,決定是否要進一步執行離散餘弦轉換。本系統並採用非理想合成濾波器之最佳位元配置演算法,將人耳聲學模型所得出的頻域最小遮蔽臨界值,轉換成小波域上的遮蔽臨界值,以提供精良的量化準則。其後以均勻量化器配合小波域的遮蔽臨界值,大幅降低資料量並仍保有極高的音質,最後再以算術編碼將量化後的係數做進一步的熵編碼並封裝成位元流。實驗結果顯示,本系統僅需52 kbps即可達到MP3 64 kbps的音質;另外,在同樣64 kbps之位元率下,本系統所提供的音質不但優於MP3、AAC低複雜度規格,更可超越AAC高效率規格。
摘要(英) The wavelet filter bank analysis-synthesis technique has been widely applied to many areas of digital signal processing, especially in image and video coding. In this thesis, we propose a hybrid Wavelet Packet and DCT audio compression system, which divides the audio signal into 26 subbands via Wavelet Packet analysis and selectively performs DCT in each subband according to the flatness measure of time and frequency of this subband. The proposed coder adopts optimum bit allocation with nonideal reconstruction filters to transform the minimum masking threshold in frequency domain obtained from psychoacoustic model into the masking threshold in Wavelet domain. The WP or DCT coefficients are then quantized with uniform quantizers according to masking threshold, so that we can reduce the data rate but still have high quality. Finally, the quantized coefficients are encoded with arithmetic coding and encapsulated with other side information. The experiments show that, only 52 kbps is needed for proposed audio coder to achieve MP3 64-kbps quality. At the same bit rate of 64 kbps, the proposed audio coding system can provide not only better quality than MP3 and AAC LC profile but also superior to AAC HE profile!
關鍵字(中) ★ 壓縮
★ 小波
★ 離散餘弦轉換
★ 音訊
關鍵字(英) ★ wavelet
★ DCT
★ audio
★ compression
論文目次 目錄
目錄 III
圖目 VI
表目 VIII
第一章 緒論 1
1.1 音訊壓縮簡介 1
1.2 研究動機與目的 3
1.3 系統架構 4
1.4 論文架構 5
第二章 小波分析技術 7
2.1 小波轉換 ( Wavelet Transform ) 7
2.1.1小波分解與離散小波轉換 8
2.1.2多重解析度分析 9
2.2小波轉換與數位訊號處理 11
2.2.1 小波濾波器 11
2.2.2 Daubechies緊密時間涵蓋小波 17
2.2.3 GBCW雙正交小波 19
2.3小波封包 ( Wavelet Packet ) 20
第三章 人耳聲學模型及其應用實例 24
3.1 一般音訊壓縮編解碼器結構 24
3.2 人耳聲學模型 26
3.2.1 基本原理與其應用 26
3.2.2 雜訊對單頻音的遮蔽效應 28
3.2.3頻音對單頻音的遮蔽效應 32
3.2.4 時間軸上的遮蔽效應 33
3.2.5模型公式 34
3.3 MPEG音訊編碼器家族 38
3.3.1 MPEG-1第三層(MP3) 40
3.3.2 先進音訊編碼(AAC) 43
3.4 算術編碼 46
第四章 小波音訊壓縮系統 49
4.1 轉換 50
4.1.1 Daubechies緊密時間涵蓋小波 50
4.1.2 GBCW雙正交小波 54
4.1.3小波封包 55
4.1.4離散餘弦轉換 57
4.2 最佳位元配置 58
4.3熵編碼 61
第五章 實驗結果與討論 65
5.1 客觀評量工具 – EAQUAL 65
5.2 CDF與GBCW雙正交小波之比較 68
5.3 調適性小波與非調適性小波之比較 69
5.4 正交小波與雙正交小波之比較 .72
5.5 混合小波轉換與離散餘弦轉換 73
5.6 複雜度分析 77
第六章 結論與未來展望 81
6.1 結論 81
6.2 未來展望 81
參考文獻 82
參考文獻 [1] ISO/IEC 11172-3 : “Information technology - Coding of moving pictures and associatedaudio for digital storage media at up to about 1.5 Mbit/s - Part 3: Audio".1992 (“MPEG-1”).
[2] ISO/IEC 13818-3 : “Information technology – Generic Coding of Moving Pictures and Associated Audio , Part 3: Audio".1994 (“MPEG-2 BC”).
[3] ISO/IEC, Final Draft International Standard 14496-3: MPEG-4 Audio, ISO/IEC JTC1/SC29/WG11 N2503, Oct. 1998. (“MPEG-4”)
[4] T. Painter and A. Spanias, “A review of algorithms for perceptual coding of digital audio signals,” Digital Signal Processing Proceedings, 1997. DSP 97., 1997 13th International Conference, VOL 1 , 1997 , pp 179 –208.
[5] M. Sablatash and T. Cooklev, “Compression of High-Quality Audio Signals, Including Recent Methods Using Wavelet Packets,” Digital Signal Processing, vol. 6, no. 10, 1996, pp. 96-107.
[6] D. Sinha and A. H. Tewfik, “Low Bit Rate Transparent Compression using Adapted Wavelets,” IEEE Trans. on Signal Processing, vol. 41, no. 12, pp. 3463-3479, Dec. 1993.
[7] P. Srinivasan and L. H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Trans. on Signal Processing, vol. 46, no. 4, pp. 1085-1093, April 1998.
[8] S. Boland and M. Deriche, “Audio Coding Using The Wavelet Packet Transform and A combined Scalar-Vector Quantization,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 1041-1044.
[9] X. Xiong and Z. Eryuan, “Digital Audio Codec Based on the Improved Optimization Algorithm of Adaptive Wavelets and Dynamic Bit Allocation Scheme,” proceeding of ICSP’96, pp. 1523-1526.
[10] P. Philippe, F. Moreau de Saint-Martin, M. Lever, and J. Soumagne, “Optimal Wavelet Packets for Low-Delay Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 550-553.
[11] D. Y. Pan, “A Tutorial on MPEG/Audio Compression,” IEEE Multimedia pp. 60-74, 1995.
[12] C. S. Burrus, R. A. Gopinath, and H. Guo, “Introdution to Wavelets and Wavelet Transforms,” 1998.
[13] P. E. Kudumakis and M. B. Sandler, “Wavelet Packet Based Scalable Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 41-44.
[14] W. K. Dobson, J. J. Yang, K. J. Smart, and F. K. Guo, “High Quality Low Complexity Scalable Wavelet Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1997, pp. 327-330.
[15] P. Philippe, F. Moreau de Saint-Martin, and L. Mainard, “On The Choice of Wavelet Filters for Audio Compression,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1995, pp. 1045-1048.
[16] P. E. Kudumakis and M. B. Sandler, “On The Performance of Wavelets for Low Bit Rate Coding of Audio Signals,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1995, pp. 3087-3090.
[17] I. Daubechies, "Ten Lectures on Wavelets," no. 61 in CBMS-NSF Series in Applied Mathematics, SIAM, Philadelphia, 1992.
[18] C Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
[19] E.F. Schroder and W. Voessing, “High Quality Digital Audio Encoding With 3.0 Bits/Sample Using Adaptive Transform Coding”, in Proc. 80th Conv. Aud. Eng. Soc., preprint #2321, Mar. 1986.
[20] G. Theile, et al., “Low-Bit Rate Coding of High Quality Audio Signals”, in Proc. 82nd Conv. Aud. Eng. Soc., preprint #2423, Mar. 1987.
[21] K. Brandenburg, “OCF – A New Coding Algorithm for High Quality Sound Signals”, in Proc. ICASSP-87, May 1987, pp. 5.1.1-5.1.4.
[22] J. Johnston, “Transform Coding of Audio Signals Using Perceptual Noise Criteria”, IEEE J. Sel. Areas in Comm., Feb. 1988, pp. 314-23.
[23] W-Y Chan and A. Gersho, “High Fidelity Audio Transform Coding With Vector Quantization”, in Proc. ICASSP-90, May 1990, pp. 1109-1112.
[24] K. Brandenburg and J.D. Johnston, “Second Generation Perceptual Audio Coding: The Hybrid Coder”, in Proc. 88th Conv. Aud. Eng. Soc., preprint #2937, Mar. 1990.
[25] K. Brandenburg, et al, “Adaptive Spectral Entropy Coding of High Quality Music Signals”, in Proc. 90th Conv. Aud. Eng. Soc., preprint #3011, Feb. 1991.
[26] Y.F. Dehery, et al, “A MUSICAM Source Codec for Digital Audio Broadcasting and Storage”, in proc. ICASSP-91, May 1991, pp. 3605-3608.
[27] M. Iwadare, et al., “A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding With Adaptiv Block Size MDCT”, IEEE J. Sel. Areas in Comm., Jan. 1992, pp. 138-144.
[28] ISO/IEC 13818-7 : “MPEG-2 Advanced Audio Coding, AAC,” 1997.
[29] D. Wei, J. Tian, R. O. Wells, Jr., and C. S. Burrus, “A New Class of Biorthogonal Wavelet System for Image Transform Coding,” IEEE Trans. on Image Processing, vol. 7, no. 7, pp. 1000-1013, July. 1998.
[30] C. Caini and A. V. Coralli, “Optimum Bit Allocation in Subband Coding with Nonideal Reconstruction Filters,” IEEE Signal Processing Latters, vol. 8, no. 6, pp. 157-159, June. 2001.
[31] V. Britanak and K. R. Rao, “An efficient implementation of the forward and inverse MDCT in MPEG audio coding,” IEEE Signal Processing Letter, vol. 8, pp. 48-51, Feb. 2001.
[32] I. Daubechies and W. Sweldens, “Factoring wavelet transforms into lifting steps,” J. Fourier Anal. Appl., 4 (no. 3), pp. 247-269, 1998.
[33] S.K. Tang, S.C. Chan, K.L. Ho, and F.K. Lam, “Implementation of fast cosine transform on the Motorola DSP 96002 digital signal processor,” IEEE International Sympoisum on Circuits and Systems, Vol. 1, pp.73-76, June 1991.
[34] K. Konstantinides, “Fast Subband Filtering in MPEG Audio Coding,” IEEE Signal Processing Letter, vol. 1, pp. 26-28, Feb. 1994.
指導教授 張寶基(Pao-Chi Chang) 審核日期 2004-6-20
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明