博碩士論文 92523026 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:18 、訪客IP:3.22.248.208
姓名 何恭岳(Kong-Yueh Ho)  查詢紙本館藏   畢業系所 通訊工程學系
論文名稱 高品質切換式離散餘弦與小波封包 轉換之音訊編碼技術
(High Quality Switched Discrete CosineTransform and Wavelet PacketAudio Coding Technique )
相關論文
★ 基於區域權重之衛星影像超解析技術★ 延伸曝光曲線線性特性之調適性高動態範圍影像融合演算法
★ 實現於RISC架構之H.264視訊編碼複雜度控制★ 基於卷積遞迴神經網路之構音異常評估技術
★ 具有元學習分類權重轉移網路生成遮罩於少樣本圖像分割技術★ 具有注意力機制之隱式表示於影像重建 三維人體模型
★ 使用對抗式圖形神經網路之物件偵測張榮★ 基於弱監督式學習可變形模型之三維人臉重建
★ 以非監督式表徵分離學習之邊緣運算裝置低延遲樂曲中人聲轉換架構★ 基於序列至序列模型之 FMCW雷達估計人體姿勢
★ 基於多層次注意力機制之單目相機語意場景補全技術★ 基於時序卷積網路之單FMCW雷達應用於非接觸式即時生命特徵監控
★ 視訊隨選網路上的視訊訊務描述與管理★ 基於線性預測編碼及音框基頻週期同步之高品質語音變換技術
★ 基於藉語音再取樣萃取共振峰變化之聲調調整技術★ 即時細緻可調性視訊在無線區域網路下之傳輸效率最佳化研究
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   [檢視]  [下載]
  1. 本電子論文使用權限為同意立即開放。
  2. 已達開放權限電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。
  3. 請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。

摘要(中) 轉換編碼應用於音訊處理系統中已行之有年,而近年來最熱門的分頻編碼法則莫過於小波轉換。其多層解析之特性,使得分頻處理之選擇架構更為多元。本論文提出一套混合式高音質壓縮系統,在處理每個音框資料時,首先即依據頻率域之平坦度量測以決定使用頻率解析度較佳之離散餘弦轉換或具豐富時間資訊之小波轉換作為此音框的主要轉換方式。對於採用離散餘弦轉換之音框,將依循人耳聲學模型所算出之頻域遮罩配合非線性量化器作量化。若是選擇運用小波封包做分頻,則樂音訊號通過小波濾波器組後即分成26個固定子頻帶,並在各子頻帶之後再依據小波域與頻率域平坦度之量測選擇性地使用離散餘弦轉換以提升頻率解析度。配合針對非理想濾波器組之最佳化位元分配演算法,將頻域上之人耳聽覺遮蔽曲線轉換為小波域之遮蔽曲線,以提供精良之量化準則並保有極高之音質。最後再以熵編碼,將量化後係數封裝成位元流。實驗結果顯示,在64k位元率的情況下,本系統所提供之音質,不僅優於MP3,更能超越AAC低複雜度規格。
摘要(英) We propose a hybrid coding system that utilizes both Wavelet Packet (WP) and DCT techniques. To process each audio frame, the system selects either WP or DCT to process based on the frame flatness measures in wavelet domain and frequency domain. If DCT is chosen, all DCT coefficients are quantized by a non-uniform quantizer according to the frequency masking curve. On the other hand, frame data are segmented into 26 fixed subbands when WP is chosen. Then, the system selectively utilizes DCT to promote frequency resolution of each subband based on the subband flatness measure. By quoting optimal bit-allocation for non-ideal filter bank, the masking threshold from psychoacoustic model can be translated into specific criteria in the wavelet domain for quantization. Experiment results show that the proposed system is superior to MP3 and AAC LC profile at 64k bps
關鍵字(中) ★ 最佳化位元分配
★ 離散餘弦轉換
★ 小波濾波器
關鍵字(英) ★ optimal bit-allocation
★ DCT
★ Wavelet Packet
論文目次 目錄 IV
圖目 VII
表目 X
第一章 緒論 1
1.1音訊壓縮簡介 1
1.2研究動機與目的 3
1.3系統架構 5
1.4論文架構 7
第二章 小波分析技術 8
2.1小波轉換(Wavelet Transform) 8
2.1.1小波分解(Wavelet Expansion)與離散小波轉換(Discrete Wavelet Transform) 9
2.1.2多重解析度分析 10
2.2小波轉換與數位訊號處理 12
2.2.1 小波濾波器 12
2.2.2 小波轉換 18
2.2.3 小波封包(Wavelet Packet) 19
2.3 算數編碼(Arithmetic coding) 21
第三章 人類聽覺心理學模型及其應用實例 24
3.1現代音訊編碼技術 24
3.2人耳聲學模型基本原理與其應用 25
3.2.1人耳最小感知音壓與臨界頻帶(Critical Band) 26
3.2.2頻率軸上的遮蔽效應 29
3.2.3時間軸上的遮蔽效應 36
3.2.4模型公式 37
3.3 MPEG – 2 AAC之簡介 40
3.3.1濾波器組(Filter Bank) 41
3.3.2時域雜訊塑型(Temporal Noise Shaping; TNS) 42
3.3.3預測(Prediction) 43
3.3.4 M/S立體聲與強度編碼(M/S Stereo and Intensity Encoding) 43
3.3.5量尺因子頻帶Scale Factor Bands 44
3.3.6量化(Quantization) 44
3.3.7無失真編碼(Noiseless Coding) 45
3.3.8位元流的格式(Bit-stream Format) 45
第四章 混合式音訊壓縮系統 47
4.1 時頻轉換(Time/Freq. Mapping) 48
4.1.1 小波封包轉換 49
4.1.2 離散餘弦轉換 50
4.2 平坦度量測(Flatness Measure) 51
4.3 最佳化位元配置 54
4.4 量化 58
第五章 實驗結果與討論 59
5.1客觀評量工具-EAQUAL簡介 59
5.2測試歌曲說明 61
5.3小波封包與ISO – Standard MP3之比較 63
5.4混合小波封包轉換與離散餘弦轉換 64
5.5混合式轉換與現代音訊編碼器之比較 68
第六章 結論與未來展望 71
6.1結論 71
6.2未來展望 71
參考文獻 73
參考文獻 [1] ISO/IEC 11172-3 : “Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 M bit/s - Part 3: Audio".1992 (“MPEG-1”).
[2] ISO/IEC 13818-3 : “Information technology – Generic Coding of Moving Pictures and Associated Audio , Part 3: Audio".1994 (“MPEG-2 BC”).
[3] ISO/IEC 13818-7, “Information Technology - Generic Coding of Moving Pictures and Associated Audio, Part 7: Advanced Audio Coding”, 1997.
[4] ISO/IEC, Final Draft International Standard 14496-3: MPEG-4 Audio, ISO/IEC JTC1/SC29/WG11 N2503, Oct. 1998. (“MPEG-4”)
[5] M. Sablatash and T. Cooklev, “Compression of High-Quality Audio Signals, Including Recent Methods Using Wavelet Packets,” Digital Signal Processing, vol. 6, no. 10, 1996, pp. 96-107.
[6] Y. Karelic and D. Malah, “Compression of High-Quality Audio Signals Using Adaptive Filterbanks and A Zero-Tree Coder,” Electrical and Electronics Engineers in Israel, 1995.
[7] P. Srinivasan and L. H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Trans. on Signal Processing, vol. 46, no. 4, pp. 1085-1093, April 1998.
[8] S. Boland and M. Deriche, “Audio Coding Using The Wavelet Packet Transform and A combined Scalar-Vector Quantization,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 1041-1044.
[9] X. Xiong and Z. Eryuan, “Digital Audio Codec Based on the Improved Optimization Algorithm of Adaptive Wavelets and Dynamic Bit Allocation Scheme,” proceeding of ICSP’96, pp. 1523-1526.
[10] P. Philippe, F. Moreau de Saint-Martin, M. Lever, and J. Soumagne, “Optimal Wavelet Packets for Low-Delay Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 550-553.
[11] D. Y. Pan, “A Tutorial on MPEG/Audio Compression,” IEEE Multimedia pp. 60-74, 1995.
[12] C. S. Burrus, R. A. Gopinath, and H. Guo, “Introdution to Wavelets and Wavelet Transforms,” 1998.
[13] P. E. Kudumakis and M. B. Sandler, “Wavelet Packet Based Scalable Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 41-44.
[14] W. K. Dobson, J. J. Yang, K. J. Smart, and F. K. Guo, “High Quality Low Complexity Scalable Wavelet Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1997, pp. 327-330.
[15] C Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
[16] C Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
[17] E. F. Schroder and W. Voessing, “High Quality Digital Audio Encoding With 3.0 Bits/Sample Using Adaptive Transform Coding”, in Proc. 80th Conv. Aud. Eng. Soc., preprint #2321, Mar. 1986.
[18] G. Theile, et al., “Low-Bit Rate Coding of High Quality Audio Signals”, in Proc. 82nd Conv. Aud. Eng. Soc., preprint #2423, Mar. 1987.
[19] K. Brandenburg, “OCF – A New Coding Algorithm for High Quality Sound Signals”, in Proc. ICASSP-87, May 1987, pp. 5.1.1-5.1.4.
[20] J. Johnston, “Transform Coding of Audio Signals Using Perceptual Noise Criteria”, IEEE J. Sel. Areas in Comm., Feb. 1988, pp. 314-23.
[21] W-Y Chan and A. Gersho, “High Fidelity Audio Transform Coding With Vector Quantization”, in Proc. ICASSP-90, May 1990, pp. 1109-1112.
[22] K. Brandenburg and J.D. Johnston, “Second Generation Perceptual Audio Coding: The Hybrid Coder”, in Proc. 88th Conv. Aud. Eng. Soc., preprint #2937, Mar. 1990.
[23] K. Brandenburg, et al, “Adaptive Spectral Entropy Coding of High Quality Music Signals”, in Proc. 90th Conv. Aud. Eng. Soc., preprint #3011, Feb. 1991.
[24] Y. F. Dehery, et al, “A MUSICAM Source Codec for Digital Audio Broadcasting and Storage”, in proc. ICASSP-91, May 1991, pp. 3605-3608.
[25] M. Iwadare, et al., “A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding With Adaptiv Block Size MDCT”, IEEE J. Sel. Areas in Comm., Jan. 1992, pp. 138-144.
[26] Huber, D. M., Runstein, R. E., 2001. Modern Recording Techniques 5th Edition, Focal Press.
[27] ISO/IEC 13818-7 : “MPEG-2 Advanced Audio Coding, AAC,” 1997.
[28] E. Zwicker and H. Fastl, Psychoacoustics, Facts and Models (Springer, Berlin, Heidelberg, 1990).
[29] D. Sinha and A. H. Tewfik, “Low Bit Rate Transparent Compression using Adapted Wavelets,” IEEE Trans. on Signal Processing, vol. 41, no. 12, pp. 3463-3479, Dec. 1993.
[30] I. Daubechies, "Ten Lectures on Wavelets," no. 61 in CBMS-NSF Series in Applied Mathematics, SIAM, Philadelphia, 1992.
[31] C. Caini and A. V. Coralli, “Optimum Bit Allocation in Subband Coding with Nonideal Reconstruction Filters,” IEEE Signal Processing Latters, vol. 8, no. 6, pp. 157-159, June. 2001.
[32] ITU-R Recommendation BS.1387, Method for Objective Measurements of Perceived Audio Quality,Dec. 1998.
[33] SQAM - Sound Quality Assessment Material, website
http://www.tnt.uni-hannover.de/project/mpeg/audio/sqam/
[34] http://psplab.csie.nctu.edu.tw/modules.php?op=modload&name=Web_Links&file=index
[35] J. D. Johnston, “Transform coding of audio signals using perceptual noise criteria,” IEEE Trans. Select. Areas Commun., to be published.
[36] T. H. Wu and P. C. Chang, “Hybrid Wavelet Pack and Discrete Cosine Transform with Optimum Bit allocation Applied to High-Quality Audio Coding”, 碩士論文, 中央大學, 2004.
指導教授 張寶基(Pao-Chi Chang) 審核日期 2005-6-24
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明