摘要(英) |
Due to the anonymity and decentralization of Bitcoin, many governments and regulatory agencies have been cautious about it. In the future, regulation may become stricter. The Financial Supervisory Commission of Taiwan also plans to establish a virtual currency regulatory authority in Taiwan in May 2023.
Compared with traditional currencies, financial institutions must comply with international regulations to ensure that they do not provide services to criminals and terrorists. They also need to continuously monitor financial transactions to detect suspicious activities. These financial institutions have many operational procedures to monitor and verify customer information to confirm the real identity of customers. Failure to detect illegal transactions will lead to serious consequences for financial institutions, warnings or fines will be given to relevant institutions depending on the severity of the situation. Therefore, most financial institutions use Anti-Money Laundering(AML)solutions for sanctions and watchlist filtering and screening to monitor every transaction within the financial network to ensure that no transaction can be used to do business with prohibited persons. Recently, the financial community and academia unanimously believe that machine learning may have a significant impact on monitoring.
Therefore, this study uses the Bitcoin abnormal transaction dataset on Kaggle to further explore various machine learning algorithms under the characteristics of Bitcoin anonymous transactions, including Random Forest, Logistic Regression, XGBoost, Gradient Boosting, and Support Vector Machine, etc., for the efficiency of abnormal transaction monitoring. At the same time, because the features of this dataset have been pre-processed, all feature names are anonymous, so it is hoped to select a feature set that has a more significant impact on abnormal transaction detection through data-driven methods.
The experimental results of this research show that the efficiency of the model established by the XGBoost algorithm is the best, followed by the Random Forest algorithm. In the feature selection experiment, the transaction features and aggregation features have the most significant impact on the efficiency of the model. |
參考文獻 |
[1] 動區, "台灣金管會「管定加密貨幣了」!黃天牧:我將是主要監管角色," ed, 2023, pp. https://www.blocktempo.com/taiwan-financial-supervisory-commission-crypto-currencies-authority/.
[2] Elliptic, "Kaggle Elliptic Data Set - Bitcoin Transaction Graph," ed, 2019, pp. https://www.kaggle.com/datasets/ellipticco/elliptic-data-set.
[3] Elliptic, "Kaggle Elliptic-emb-50d," ed, 2019, p. https://www.kaggle.com/datasets/dhruvrnaik/ellipticemb50d.
[4] 中本聰, "比特幣:對等網路電子現金系統," ed, 2008, p. https://web.archive.org/web/20140320135003/https://bitcoin.org/bitcoin.pdf.
[5] 中央通訊社, "中非共和國通過 以比特幣為法定貨幣," ed, 2022, p. https://www.cna.com.tw/news/aopl/202204270406.aspx.
[6] 財經M平台, "加密貨幣市值," ed, 2023, pp. https://www.macromicro.me/charts/33112/cryptocurrency-total-market-cap.
[7] A. Ng and M. Jordan, "On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes," Advances in neural information processing systems, vol. 14, 2001.
[8] L. Breiman, "Random forests," Machine learning, vol. 45, pp. 5-32, 2001.
[9] B. E. Boser, I. M. Guyon, and V. N. Vapnik, "A training algorithm for optimal margin classifiers," in Proceedings of the fifth annual workshop on Computational learning theory, 1992, pp. 144-152.
[10] J. H. Friedman, "Greedy function approximation: a gradient boosting machine," Annals of statistics, pp. 1189-1232, 2001.
[11] T. Chen and C. Guestrin, "Xgboost: A scalable tree boosting system," in Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785-794.
[12] M. Alkhalili, M. H. Qutqut, and F. Almasalha, "Investigation of applying machine learning for watch-list filtering in anti-money laundering," IEEE Access, vol. 9, pp. 18481-18496, 2021.
[13] M. Weber et al., "Anti-money laundering in bitcoin: Experimenting with graph convolutional networks for financial forensics," arXiv preprint arXiv:1908.02591, 2019.
[14] Y. Zeng, "Applications of Machine Learning in Bitcoin Anti-·Money Laundering," 2020.
[15] J. Lorenz, M. I. Silva, D. Aparício, J. T. Ascensão, and P. Bizarro, "Machine learning methods to detect money laundering in the bitcoin blockchain in the presence of label scarcity," in Proceedings of the First ACM International Conference on AI in Finance, 2020, pp. 1-8.
[16] Y. Boutellier, "Node embeddings for Beginners," ed, 2021, pp. https://towardsdatascience.com/node-embeddings-for-beginners-554ab1625d98. |