適合MPEG-2/4 AAC聲學模型之
設計與VLSI實現

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：50

、訪客IP：18.226.88.141

姓名

王怡雯(Yi-Wen Wang) 查詢紙本館藏

畢業系所

電機工程學系

論文名稱

適合MPEG-2/4 AAC聲學模型之設計與VLSI實現
(Design and VLSI Implementation for Psychoacoustic Model in MPEG-2/4 Advanced Audio Coding )

相關論文

★ 即時的SIFT特徵點擷取之低記憶體硬體設計	★ 即時的人臉偵測與人臉辨識之門禁系統
★ 具即時自動跟隨功能之自走車	★ 應用於多導程心電訊號之無損壓縮演算法與實現
★ 離線自定義語音語者喚醒詞系統與嵌入式開發實現	★ 晶圓圖缺陷分類與嵌入式系統實現
★ 語音密集連接卷積網路應用於小尺寸關鍵詞偵測	★ G2LGAN: 對不平衡資料集進行資料擴增應用於晶圓圖缺陷分類
★ 補償無乘法數位濾波器有限精準度之演算法設計技巧	★ 可規劃式維特比解碼器之設計與實現
★ 以擴展基本角度CORDIC為基礎之低成本向量旋轉器矽智產設計	★ JPEG2000靜態影像編碼系統之分析與架構設計
★ 適用於通訊系統之低功率渦輪碼解碼器	★ 應用於多媒體通訊之平台式設計
★ 適用MPEG 編碼器之數位浮水印系統設計與實現	★ 適用於視訊錯誤隱藏之演算法開發及其資料重複使用考量

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

資料壓縮技術對於聲音的系統是個必要的任務，它不只可以處理龐大的資料，但是也要求高品質的解析度。有一種聲音編碼的壓縮技術叫做MPEG，MPEG 是一個標準化對於聲音壓縮上是有效率的。它可以有意義的降低在頻寬傳送和資料儲存的需求上而且在失真率上也很低。
這篇論文所要呈現是一個新的低複雜度設計聲學模型，它的重要技術是應用於一個低功率的MPEG-2/4 AAC編碼器。在現今MPEG AAC的計算複雜很高，沒有辦法達到聲音即時播放，是攜帶式裝置的一個瓶頸。為了克服這個問題，必需對聲學模型做分析和最佳化的設計，所以，在演算法上，spreading function的計算方式是用查表方式來取代。除此之外，MDCT-Based 聲學模型也適合關於複雜度的降低和品質的保持，聲學模型的計算雜複度總共被降低到達到80%；在架構上，我們呈現一個專屬MDCT-Based聲學模型硬體設計，所以就可以實現一個即時播放在MPEG-2/4 AAC立體聲編碼器上，在位元速128kbit/sec下，頻率在20MHz都可以保持CD的品質。

摘要(英)

Data compression technique is an essential task for audio systems, which not only handles enormous amounts of data, but also requires the high quality resolution. One of theses audio coding techniques, Moving Pictures Experts Group (MPEG) is powerful audio compression standardization. It can significantly reduce the requirements of transmission bandwidth and data storage, but with low distortion.
The paper presents a new low complexity design of Psycho-Acoustic Model (PAM), which is the key technology for a low power MPEG-2/4 Advanced Audio Coding (AAC) encoding. The real-time constraint of MPEG AAC leads to a heavy computational bottleneck on today’s portable devices. To overcome this problem, design analysis and optimization of PAM are addressed. At algorithmic level, the calculation of spreading function was replaced with look-up tables. Besides, Modified-Discrete-Cosine-Transform-based (MDCT-based) PAM was referred to and adopted concerning reducing complexity and maintaining quality. The computational complexity of PAM could be reduced by more than 80% in total. At architectural level, we presented a dedicated hardware design of MDCT-based PAM. The proposed design could be implemented in a real-time MPEG-2/4 AAC stereo encoder at Low Complexity profile and at bitrate 128 kb/s below clock rate 20 MHz while maintaining CD quality.

關鍵字(中)

★ 聲學模型

關鍵字(英)

★ Psychoacoustic Model

論文目次

Chapter 1 Introduction 1
1.1 Motivation 1
1.2 Feature of Advanced Audio Coding Encoder 3
1.3 Audio Application 4
1.4 Thesis Organization 5
Chapter 2 Introduction to Psychoacoustic Model 6
2.1 Psychoacoustics 6
2.1.1 Human Auditory System 6
2.2 Psychoacoustic Principles 7
2.2.1 Absolute Threshold of Hearing 8
2.2.2 Frequency Masking 8
2.2.3 Temporal Masking 9
2.3 Critical Bands 10
2.4 The Psychoacoustic Model 11
Chapter 3 Algorithms of MDCT-Based Psychoacoustic Model 13
3.1 Overview of AAC Algorithms 13
3.2 Algorithms of Psychoacoustic Model 15
3.3 Optimization of Psychoacoustic Model 18
3.3.1 MDCT-Based Psychoacoustic Model 18
3.3.2 Look-up Table Method for Spreading Function 19
3.3.3 Optimization of Result 21
Chapter 4 Architecture Design of MDCT-Based Psychoacoustic Model 23
4.1 Overall architecture of MDCT-Based Psychoacoustic Model 23
4.2 MDCT buffer 26
4.3 MDCT 28
4.4 Spectral Flatness Measure (SFM) and tb 33
4.5 Spreading Function 38
4.6 Steps 7-13 40
4.6.1 Steps 7-9 40
4.6.2 Step 10 42
4.6.3 Steps 11-12 43
4.6.4 Step 13 45
Chapter 5 Implementation and Results 51
5.1 Design Flow 51
5.2 Implementation Result 52
Chapter 6 Conculsions 55
References 56

參考文獻

[1] MPEG. Coding of moving pictures and associated audio for digital storage media at up to 1.5 Mbit/s, part 3: Audio, International Standard IS 11172-3, ISO/IEC JTC1/SC29 WG11, 1992.
[2] MPEG. Information Technology – generic coding of moving pictures and associated audio, part 3: Audio, International Standard IS 13818-3, ISO/IEC JTC1/SC29 WG11, 1994.
[3] MPEG. MPEG-2 Advanced Audio Coding, AAC, International Standard IS 13818-7, ISO/IEC JTC1/SC29 WG11, 1997.
[4] Marina Bosi, Karlheinz Brandenburg, Schuyler Quackenbush, Louis Fielder, Kenzo Akagiri, Hendrik Fuchs, Martin Dietz, Jurgen Herre, Grant Davidson, Yoshiaki Oikawa, “ISO/IEC MPEG-2 Advanced Audio Coding,” J. Audio Eng. Soc., Vol. 45, No. 10, 1997 October.
[5] MPEG. Information technology – Coding of audio-visual objects – Part 3: Audio, International Standard IS 14496-3, ISO/IEC JTC1/SC29 WG11, 1999.
[6] Karlheinz Brandenburg, “MP3 and AAC explained,” AES 17th International Conference on High Quality Audio Coding, Italy, Sep. 2-5, 1999.
[7] ISO/IEC 14496-5 2001 Software Reference. Available: http://www.iso.ch/iso/en/ittf/PubliclyAvailableStandards/ISO_IEC_14496-5_2001_Software_Reference/
[8] Yuichiro Takamizawa, Toshiyuki Nomura, Masao Ikekawa, “High-quality and processor-efficient implementation of an MPEG-2 AAC encoder,” in Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 2, Page(s): 985 –988.
[9] Do Hyoung Kim, Dong Hyun Kim, Jae Ho Chung, “Optimization of MPEG-4 GA AAC on general PC,” in Proceedings of the 44th IEEE 2001 Midwest Symposium on Circuits and Systems, Vol. 2, pp. 923-925.
[10] Ivan Dimkoviae, Dragorad Milovanoviae, Zoran Bojkoviae, “Fast software implementation of MPEG advanced audio encoder,” 2002 14th International Conference on Digital Signal Processing, Vol. 2, Page(s): 839 –843.
[11] Dong-Yan Huang, Xuesong Gong, Daqing Zhou, Toshio Miki, Sanae Hotani, “Implementation of the MPEG-4 Advanced Audio Coding encoder on ADSP-21060 SHARC,” in Proceedings of the 1999 IEEE International Symposium on Circuits and Systems, Vol. 3, page(s): 544 –547.
[12] Yuichiro Takamizawa, Tsuyoshi Okumura, Toshiyuki Nomura, Masao Ikekawa, and Ichiro Kuroda, “20mW MPEG-2/4 AAC LC stereo encoder on a 16-bit DSP,” Workshop and Exhibition on MPEG-4, San Jose, California, June 25-27 2002.
[13] Marc Gayer, Markus Lohwasser, Manfred Lutzky, “Implementing MPEG Advanced Audio Coding and Layer-3 encoders on 32-bit and 16-bit fixed-point processors,” presented at the AES 115th Convention, New York, Oct. 10-13, 2003.
[14] Tsung-Han Tsai, Shih-Way Huang, Liang-Gee Chen, “Design of a low power psychoacoustic model co-processor for MPEG-2/4 AAC LC stereo encoder,” in Proceedings of the 2003 IEEE International Symposium on Circuits and Systems, Vol. 2, page(s): 552 –555, May 25-28, 2003.
[15] Chi-Min Liu, Chin-Ching Chen, Wen-Chieh Lee, Szu-Wei Lee, “A fast bit allocation method for MPEG layer III,” IEEE International Conference on Consumer Electronics, 1999, pages 22-23.
[16] Chih-Kai Yang, Sau-Gee Chen, “New static and dynamic search algorithms for fast MP3 bit allocations,” in 2003 IEEE International Conference on Multimedia and Expo, Vol. 1, pages 77-80.
[17] Hyen-O Oh, Joon-Seok Kim, Chang-Jun Song, Young-Cheol Park, Dae-Hee Youn, “Low power MPEG/audio encoders using simplified psychoacoustic model and fast bit allocation,” IEEE Transaction of Consumer Electronics, Volume: 47 Issue: 3, Page(s): 613 –621, Aug. 2001.
[18] Vasudev Bhaskaran, Konstantions Konstantinides, “Image and video compresson standards algorithms and architectures,” Hewlett-Packard Laboratories, Second Edition, 2000.
[19] Ted Painter, Andreas Spanias, “Perceptual coding of digital audio,” Proceedings of the IEEE,Vol.88,P.451-515,2000.
[20] Marina Bosi, Richard E. Goldberg, “Introduction to digital audio coding and standards,” Kluwer Academic Publishers,2003.
[21] Miroslava Raspopovic, “Design of Perception Based Audio Codec,” University of Massachusetts Lowell,2001.
[22] H. Fletcher, “Auditory Patterns,” Rev. Mod. Phys., pp. 47-65, Jan. 1940.
[23] Terhardt, E., “Calculating Virtual Pitch,” Hearing Research, pp. 155-182, 1, 1979.
[24] Fengduo Hu, “ITE Technology Incorporated,”2003.
[25] Srinivasan P., Jamieson L.H., “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Transactions on,Vol.46,P.1085-1093,1998.
[26] P. Duhamel, Y. Mahieux, J.P. Petit, “A fast algorithm for the implementation of filter banks based on time domain aliasing cancellation,” in Proceedings of the 1991 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 2209-2212.
[27] James D. Johnston, “Transform coding of audio signals using perceptual noise criteria,” IEEE Journal on Selected Area on Communications, Vol. 6, No 2, Feb. 1988.
[28] Mark Kahrs, Karlheinz Brandenburg, Applications of digital signal processing to audio and acoustics. Kluwer Academic Publishers, 1998, p.59.
[29] Winnie Lau, Alex Chwu, “A common transform engine for MPEG & AC3 audio encoder,”
[30] Chichyang Chen, Rui-Lin Chen, Chih-Huan Yang, ”Pipelined Computation of Very Large Word-Length LNS Addition/Subtraction with Polynomial Hardware Cost”, IEEE Transactions on Computers, Vol. 49 Issue: 7, Page(s): 716 -726, July 2000.
[31] ISO/IEC JTC1/SC29 WG11 No.1650 “IS 13818-7 (MPEG-2 Advanced Audio Coding , AAC)”, April 1997.

指導教授

蔡宗漢(Tsung-Han Tsai)

審核日期

2004-7-13

推文