Abstract (English) |
Searching for information on the Internet from handheld devices has become a trend. As the performance of handheld devices has improved, browsing has been shifting from desktop platforms to handheld ones. The ubiquitous "any-time" and "any-place" nature of handheld devices lets users get information instantly. However, the small screen size and limited network speed of these devices constrain the browsing experience, so the browsing model designed for desktop devices is unsuitable for handheld devices. In existing solutions, Web authors spend considerable effort preparing multiple versions of Web pages and resources for various platforms. Our solution instead provides an automatic parsing mechanism: it uses a block-extraction approach that exploits Web content structures to parse each news website and render its news service in a form suitable for handheld devices.
|