Detailed Record

Author: Chia-Hung Lin (林家弘)
Title: Statistical Learning in Universal Kriging (統計學習於克里金法的應用)
Advisor: Wei-Ying Wu (吳韋瑩)
Committee members: C. Andy Tsao (曹振海), Sheng-Li Tzeng (曾聖澧)
Degree: Master's
Institution: National Dong Hwa University (國立東華大學)
Department: Department of Applied Mathematics (應用數學系)
Student ID: 610911106
Year of publication: 2023 (ROC year 112)
Academic year of graduation: 111 (2022-2023)
Language: English
Number of pages: 47
Keywords: ensemble learning, geographical data, variogram approach, kriging, PM 2.5
Abstract: Ensemble learning combines multiple algorithms to enhance predictive performance. However, this advantage may be lost for dependent data such as geographical data. Combining ensemble learning with the variogram approach, a learning procedure is proposed to select an adequate model. Kriging based on the selected model is then applied to the forecasting problem. A simulation study demonstrates that the proposed procedure performs well for forecasting, and the analysis of PM 2.5 data, with a comparison against an existing method, shows the better performance of the proposed method.
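To make the pipeline in the abstract concrete, below is a minimal sketch in Python: fit candidate learners, compute the empirical variogram of each learner's residuals, select the candidate whose residual variogram is best described by a parametric model, then krige that candidate's residuals for prediction. The candidate learners (gradient boosting and a small neural network standing in for the thesis's XGBoost/CatBoost/neural-network candidates), the exponential variogram, and the selection-by-variogram-fit criterion are illustrative assumptions, not the thesis's exact specification; the thesis also works in a spatio-temporal setting, while this sketch is purely spatial.

```python
# Minimal sketch of the abstract's procedure, under the assumptions stated above.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.neural_network import MLPRegressor

def empirical_variogram(coords, resid, n_bins=12):
    # Classical (Matheron) estimator: half the mean squared increment per lag bin.
    d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    g = 0.5 * (resid[:, None] - resid[None, :]) ** 2
    i, j = np.triu_indices(len(resid), k=1)
    d, g = d[i, j], g[i, j]
    edges = np.linspace(0.0, d.max(), n_bins + 1)
    lags, gammas = [], []
    for lo, hi in zip(edges[:-1], edges[1:]):
        m = (d >= lo) & (d < hi)
        if m.any():
            lags.append(d[m].mean())
            gammas.append(g[m].mean())
    return np.array(lags), np.array(gammas)

def fit_exponential(lags, gammas):
    # Grid-search gamma(h) = nugget + psill * (1 - exp(-h / r));
    # returns the fit's sum of squared errors and the best parameters.
    best = (np.inf, None)
    for nug in np.linspace(0.0, gammas.max(), 10):
        for psill in np.linspace(1e-6, gammas.max(), 10):
            for r in np.linspace(lags.max() / 20, lags.max(), 10):
                sse = np.sum((nug + psill * (1 - np.exp(-lags / r)) - gammas) ** 2)
                if sse < best[0]:
                    best = (sse, (nug, psill, r))
    return best

def ordinary_krige(coords, resid, new_coords, params):
    # Ordinary kriging of the residuals under the fitted exponential variogram.
    nug, psill, r = params
    gamma = lambda h: nug + psill * (1 - np.exp(-h / r))
    n = len(resid)
    A = np.ones((n + 1, n + 1))
    A[:n, :n] = gamma(np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1))
    np.fill_diagonal(A[:n, :n], 0.0)   # gamma(0) = 0 by definition
    A[n, n] = 0.0                      # Lagrange-multiplier block
    preds = []
    for s0 in new_coords:
        b = np.append(gamma(np.linalg.norm(coords - s0, axis=1)), 1.0)
        w = np.linalg.solve(A, b)[:n]
        preds.append(w @ resid)
    return np.array(preds)

# Toy spatial data (hypothetical; stands in for the PM 2.5 records).
rng = np.random.default_rng(0)
coords = rng.uniform(0, 10, size=(200, 2))
X = np.c_[coords, np.sin(coords[:, 0])]                  # location + one covariate
y = 2 * coords[:, 0] + np.sin(3 * coords[:, 1]) + rng.normal(0, 0.3, 200)

candidates = {
    "boosting": GradientBoostingRegressor(random_state=0),
    "neural_net": MLPRegressor(hidden_layer_sizes=(32,), max_iter=2000, random_state=0),
}
scores = {}
for name, model in candidates.items():
    resid = y - model.fit(X, y).predict(X)
    lags, gammas = empirical_variogram(coords, resid)
    scores[name] = (fit_exponential(lags, gammas), model, resid)

# Select by variogram-fit SSE (an illustrative criterion), then
# predict as fitted trend + kriged residual.
best_name = min(scores, key=lambda k: scores[k][0][0])
(sse, params), model, resid = scores[best_name]
new_coords = rng.uniform(0, 10, size=(5, 2))
X_new = np.c_[new_coords, np.sin(new_coords[:, 0])]
y_hat = model.predict(X_new) + ordinary_krige(coords, resid, new_coords, params)
print(best_name, y_hat)
```

In the thesis the final step is the spatio-temporal universal kriging of Sections 2.4 and 3.3; the ordinary kriging of residuals above is a simplified purely spatial analogue of that step.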
Table of Contents

1 Introduction 1
2 Literature Review 3
2.1 Stationary random process 3
2.2 Metric Model 4
2.3 Model Assumptions 5
2.4 Spatio-Temporal Universal Kriging 6
2.4.1 Introduction to the Kriging method 6
2.4.2 Unbiasedness condition 8
2.4.3 The prediction variance 8
2.4.4 Minimal Prediction Variance 9
2.5 Neural Network 10
2.5.1 What is a neural network 10
2.5.2 How the Neural Network Works 10
2.6 Boosting 14
2.6.1 Introduction to boosting 14
2.6.2 Boosting algorithm 15
2.6.3 The first boosting method: AdaBoost 16
2.6.4 XGBoost 17
2.6.5 CatBoost 18
2.7 Ensemble Learning 19
3 Methodology 21
3.1 Ensemble method for estimation 21
3.2 Empirical spatio-temporal covariogram 22
3.3 Spatio-Temporal Universal Kriging 22
4 Simulation studies 25
4.1 Experimental assumption 25
5 Real data 31
5.1 Environmental Protection Data 31
5.1.1 Data introduction 31
5.2 Experimental Design 35
5.3 Evaluation Metrics 36
5.4 Experimental Results 36
5.4.1 Taipei 37
5.4.2 Taoyuan 39
5.4.3 Nantou 40
5.4.4 Kaohsiung 41
6 Conclusion 43
References 45

(Full text will be open to external access after 2028-08-12.)