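Academic Articles Recommendation System Based on Different Semantic Models | National Dong Hwa University Theses and Dissertations Full-Text System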

Author:	Bo-Heng Chen (陳柏亨)
Thesis title:	Academic Articles Recommendation System Based on Different Semantic Models (基於不同語義模型的推薦系統)
Advisor:	Lin-Chih Chen (陳林志)
Committee members:	Ming-Feng Lai (賴明豐), Kuo-Hui Yeh (葉國暉)
Degree:	Master's
Institution:	National Dong Hwa University
Department:	Master's Program in Information Management
Student ID:	610539005
Year of publication (ROC calendar):	107 (2018)
Academic year of graduation:	106
Language:	English
Number of pages:	42
Keywords:	Recommendation system, Semantic analysis, Classification algorithms

Advances in science and technology have changed the way people live. Most people rely on the Internet to work, learn, and communicate, which makes it an integral part of daily life. Finding information through search engines has become the norm. However, the number of web pages has grown significantly over the past few years, and it is increasingly difficult for users to find the information they want in a short time. Recommendation systems emerged in response. They are applied in many domains, such as news, movies, and music, but there is no recommendation system suited to academic articles. In this thesis, in addition to analyzing the similarity between articles, we use different semantic models to analyze the latent relationships between terms and articles. Classification algorithms are then used to group similar articles into the same category and recommend them to users.
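The pipeline outlined in the abstract (measure similarity between articles, classify them, recommend articles from the same category) can be illustrated with a minimal sketch. It is not the thesis implementation. It assumes raw term-frequency vectors with cosine similarity and a K-NN majority vote, and it leaves out stemming and the semantic-model step (LSA, PLSA, LDA) entirely. The stop-word list, the function names, and the toy article titles and labels are all hypothetical, made up only for illustration.

```python
import math
from collections import Counter

# Hypothetical stop-word list; a real pipeline would use a fuller one.
STOP_WORDS = {"a", "an", "the", "of", "for", "and", "in", "on", "with", "to"}


def tokenize(text):
    """Lowercase, keep purely alphabetic tokens, and drop stop words."""
    words = (w.strip(".,;:()").lower() for w in text.split())
    return [w for w in words if w.isalpha() and w not in STOP_WORDS]


def tf_vector(text):
    """Term-frequency vector stored sparsely as a Counter."""
    return Counter(tokenize(text))


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def knn_category(query, labeled, k=3):
    """Majority vote among the k labeled articles most similar to the query."""
    q = tf_vector(query)
    ranked = sorted(labeled, key=lambda item: cosine(q, tf_vector(item[0])),
                    reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def recommend(query, labeled, k=3):
    """Return titles from the predicted category, most similar first."""
    category = knn_category(query, labeled, k)
    q = tf_vector(query)
    same = [title for title, label in labeled if label == category]
    return sorted(same, key=lambda t: cosine(q, tf_vector(t)), reverse=True)


# Toy data for illustration only (not from the thesis dataset).
ARTICLES = [
    ("topic models for text mining", "nlp"),
    ("latent topic analysis of text documents", "nlp"),
    ("text classification with topic features", "nlp"),
    ("image classification with support vectors", "vision"),
    ("object detection in image data", "vision"),
]

if __name__ == "__main__":
    print(recommend("latent topic models for text", ARTICLES))
```

In a fuller version, a semantic model would replace the raw term-frequency vectors with lower-dimensional topic representations before the similarity and classification steps.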

Chapter I. Introduction
Chapter II. Related Work
2.1 Different kinds of semantic models
2.2 Different kinds of classification algorithms
Chapter III. The Proposed Methodology
3.1 Natural Language Processing
3.1.1 Stemming
3.1.2 Stop-words
3.1.3 Non-word tokens
3.2 Matrix step
3.3 Semantic models
3.3.1 LSA
3.3.2 PLSA
3.3.3 LDA
3.4 Classification algorithms
3.4.1 K-nearest neighbors (K-NN)
3.4.2 Support vector machine
Chapter IV. Experimental analysis and results
4.1 The Dataset and Natural Language Processing
4.2 Experiment results
Chapter V. Conclusion
References

(The full text is not authorized for public access.)