Thesis 107423030: Detailed Record




Name: WU, CHIA-CHEN (吳佳臻)    Graduate Department: Information Management
Thesis Title: 生成式對抗網路架構搜尋
(GANAS: Generative Adversarial Network Architecture Search)
Related Theses
★ Trend Analysis of the Taiwan 50 Index: Prediction Based on a Multiple LSTM Model Architecture
★ Gold Price Prediction Analysis Based on Multiple Recurrent Neural Network Models
★ Opinion Leader Discovery in Dynamic Social Networks
★ Virtual Metrology for Machine Tools in Industry 4.0 Using Deep Learning Models
★ A Novel NMF-Based Movie Recommendation with Time Decay
★ Category-Based Sequence-to-Sequence Model for POI Travel Itinerary Recommendation
★ A DQN-Based Reinforcement Learning Model for Neural Network Architecture Search
★ Neural Network Architecture Optimization Based on Virtual Reward Reinforcement Learning
★ Neural Architecture Search Optimization via a Progressive Genetic Algorithm
★ Enhanced Model Agnostic Meta Learning with Meta Gradient Memory
★ Stock Price Prediction Using Recurrent Neural Networks Combined with Leading Industrial Wastewater Indicators
★ A Novel Reinforcement Learning Model for Intelligent Investigation on the Supply Chain Market
★ A Chatbot Based on a Two-Layer Part-of-Speech Sequence-to-Sequence Model
Full text available in the system after 2023-07-31.
Abstract (Chinese): In recent years, machine learning has been widely applied across many fields and has achieved outstanding results. Among machine learning methods, deep learning has attracted the most attention: it can rapidly process massive amounts of information and has permeated and changed our daily lives. In deep learning, designing an excellent neural network architecture is crucial; however, doing so requires not only expertise in deep learning and related fields but also sufficient experience in the target task domain. Consequently, much research has been devoted to automatically generating neural network architectures, but most of these methods consume enormous computational resources. In this thesis, we therefore propose a new method, Generative Adversarial Network Architecture Search (GANAS), which extends conditional GANs to the NAS domain, with the ultimate goal of using a well-trained generator to produce neural network architectures. The key feature of this approach is that different neural network architectures are generated for different data, eliminating the time spent manually designing architectures while requiring only a small amount of computational resources.
Abstract (English): In recent years, machine learning has been widely applied across many fields and has achieved outstanding results. Among the various approaches in machine learning, deep learning has received the most attention; it can process vast amounts of information quickly and has permeated and changed our daily lives. In deep learning, designing an excellent neural network architecture is very important. However, doing so requires not only expertise in deep learning and related fields but also sufficient experience in the target task domain. Much research has therefore focused on automatically generating neural network architectures; however, such search methods consume substantial computing resources. In this thesis, we propose a new approach, called GANAS, that extends conditional GANs into the realm of NAS, with the ultimate goal of generating neural network architectures using a well-trained generator. The key feature of this method is that different neural network architectures are generated for different data, saving the time of manually designing network architectures while requiring only a small amount of computational resources.
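The abstract describes a conditional-GAN generator that, conditioned on properties of a dataset, emits a neural network architecture. As an illustrative sketch only (the thesis's actual search space, architecture encoding, and generator model are not given in this record, so the operation set `OPS` and the class `ToyConditionalGenerator` below are assumptions), the following toy generator shows the general shape of the idea: concatenate a noise vector with a dataset-condition vector and map it to a sequence of layer choices.

```python
import math
import random

# Hypothetical operation set: the thesis's actual search space is not
# specified in this record, so this list is an assumption for illustration.
OPS = ["conv3x3", "conv5x5", "maxpool3x3", "identity", "sep_conv3x3"]

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    total = sum(es)
    return [e / total for e in es]

class ToyConditionalGenerator:
    """Toy stand-in for a conditional-GAN generator: maps (noise, dataset
    condition) to a sequence of layer choices, i.e. an architecture encoding."""

    def __init__(self, noise_dim=4, cond_dim=2, num_layers=6, seed=0):
        rng = random.Random(seed)
        self.noise_dim = noise_dim
        self.num_layers = num_layers
        # One random linear scoring head per layer position over the op set
        # (a real generator would be a trained neural network).
        self.weights = [
            [[rng.uniform(-1, 1) for _ in range(noise_dim + cond_dim)]
             for _ in range(len(OPS))]
            for _ in range(num_layers)
        ]

    def generate(self, condition, rng):
        # Concatenate noise with the condition vector, as in conditional GANs.
        z = [rng.gauss(0, 1) for _ in range(self.noise_dim)]
        x = z + list(condition)
        arch = []
        for layer_w in self.weights:
            scores = [sum(w * v for w, v in zip(row, x)) for row in layer_w]
            # Sample one operation for this layer position.
            arch.append(rng.choices(OPS, weights=softmax(scores), k=1)[0])
        return arch

gen = ToyConditionalGenerator()
# The condition could encode dataset properties, e.g. (num_classes, channels);
# different conditions can yield different generated architectures.
arch = gen.generate(condition=(10.0, 3.0), rng=random.Random(42))
print(arch)  # a list of num_layers operation names
```

In GANAS as described, the generator would instead be trained adversarially against a discriminator, so that the sampled architectures are tailored to the conditioning data rather than drawn from random scoring heads as in this sketch.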
Keywords
★ Machine Learning
★ Deep Learning
★ Neural Architecture Search
★ Generative Adversarial Networks
Table of Contents
Chinese Abstract
Abstract
Acknowledgements
Table of Contents
List of Figures
List of Tables
1. Introduction
2. Related Work
2.1 Neural Architecture Search
2.2 Generative Adversarial Network
3. Proposed Method: GANAS
3.1 Data Sampling
3.2 Build Search Space
3.3 Architecture Search Process
3.4 Apply Phase
4. Experiments
4.1 Datasets
4.2 Benchmark Comparison
4.3 Discussion of Search Space Parameter Settings
4.4 Discussion of GANAS Parameter Settings
4.5 Discussion of Architectures Searched by GANAS
4.6 Benchmark Model Architecture Comparison
4.7 Search Space & GANAS Model Architecture Comparison
5. Conclusion
References
Advisor: Yi-Cheng Chen (陳以錚)    Approval Date: 2020-07-15