Author (English): Chun-Tung Li
Thesis Title (English): U-shaped Siamese Variational Autoencoder: A Network for Repairing Damaged Chinese Characters
Advisor (English): Cheng-Chin Chiang
Oral Examination Committee (English): Der-Lor Way
Shin-Feng Lin
Keywords (English): deep learning, Variational Autoencoder, generative models
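This study uses deep learning to propose a method for repairing damaged characters that restores the missing parts of a damaged character to their original form. Building on this technique, it also attempts experiments in automatically generating new Chinese fonts through learning. Among the better-known deep learning generative models of recent years, the Generative Adversarial Network (GAN) and the Autoencoder (AE) are the two mainstream approaches. This study takes the Variational Autoencoder (VAE) as its core and integrates the Siamese Neural Network and U-Net structures into the VAE to develop a new type of neural network, using skip connections to improve the quality of the reconstructed image. The study applies the VAE to filling complete and damaged outline fonts (hollow fonts) of Chinese characters, because once the outline of a hollow font is damaged, repairing the character is more challenging than it is for a solid font. In the experiments, we train the network to perform filling on the outline fonts of part of the Chinese character set and then test the filling on the outline fonts of the remaining characters, which were not used in training. The experiments with our network architecture show that training time can be shortened and that the font repair results are very good.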
Before documents were digitized, classical texts and records were transcribed or recorded on paper. Over time, text on paper is often damaged or partly lost through improper preservation. When the lost or damaged content is significant, restoration becomes necessary. Restoring all of the content by hand is time-consuming and labor-intensive, and the results are uncertain. With advances in image processing, particularly with the aid of artificial intelligence, automated restoration has become comparable in quality to manual methods. Using computers as tools can significantly improve restoration efficiency and reduce costs, and the restored text can then be passed to text recognition, which improves recognition accuracy. This process greatly benefits the digital preservation of books and other written documents.
In addition, modern computer typesetting offers a wide variety of well-designed fonts, and designing them requires considerable labor and time. If text restoration techniques can be extended to generate fonts of different styles automatically, individuals might be able to design their own unique fonts more quickly.
This study uses deep learning to propose a method for restoring damaged text. It aims to restore the missing portions of damaged characters and experimentally explores generating new Chinese fonts through learning. Among recent deep learning generative models, Generative Adversarial Networks (GANs) and Autoencoders (AEs) are the two mainstream approaches. This research takes the Variational Autoencoder (VAE) as its core and integrates the Siamese Neural Network and U-Net structures into the VAE to develop a novel neural network. Skip connections are used to improve the quality of the reconstructed images.
The study applies the Variational Autoencoder to filling complete and damaged outline fonts (hollow fonts) of Chinese characters. Restoration is more challenging for outline fonts than for solid fonts once the outline itself is damaged. In the experiments, the outline fonts of one portion of the Chinese character set were used to train the network to perform filling, and the outline fonts of the remaining characters, which were not used in training, were used to test the filling effect. The experimental results showed that our network architecture reduced training time and achieved very good font restoration.
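As a rough illustration of the Variational Autoencoder at the core of the network, the sketch below shows the standard VAE training objective: a reconstruction term plus a KL-divergence term, with latent samples drawn through the reparameterization trick. This is the textbook formulation, not necessarily the loss used in the thesis. The function names, the squared-error reconstruction term, and the `beta` weight are assumptions made for illustration only.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), per latent dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_divergence(mu, log_var):
    """Closed-form KL(N(mu, sigma^2) || N(0, 1)), summed over latent dimensions."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv)
                      for m, lv in zip(mu, log_var))


def reconstruction_loss(target, output):
    """Sum of squared pixel errors (an assumed choice of reconstruction term)."""
    return sum((t - o) ** 2 for t, o in zip(target, output))


def vae_loss(target, output, mu, log_var, beta=1.0):
    """Reconstruction term plus beta-weighted KL term (beta is hypothetical)."""
    return reconstruction_loss(target, output) + beta * kl_divergence(mu, log_var)


if __name__ == "__main__":
    rng = random.Random(0)
    mu, log_var = [0.2, -0.1], [0.0, -1.0]   # toy encoder outputs
    z = reparameterize(mu, log_var, rng)     # latent sample fed to a decoder
    clean = [1.0, 0.0, 1.0, 1.0]             # toy "intact glyph" pixels
    repaired = [0.9, 0.1, 0.8, 1.0]          # toy decoder output
    print(len(z), vae_loss(clean, repaired, mu, log_var))
```

In a repair setting like the one described, the reconstruction term would compare the network output for a damaged glyph against the intact glyph, so the model is rewarded for filling in the missing strokes rather than for copying its input.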
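Approval Certificate
Acknowledgments
Abstract (in Chinese)
Abstract
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Research Motivation
1.2 Research Objectives
Chapter 2 Related Work and Literature Review
2.1 Related Work
2.2 Deep Learning Techniques
2.2.1 Convolutional Neural Networks
2.2.2 Variational Autoencoder (VAE)
2.2.3 Siamese Neural Network
2.2.4 U-Net
2.3 Summary
Chapter 3 Repairing Damaged Glyphs with the U-shaped Siamese Variational Autoencoder
3.1 Requirements Analysis
3.2 Training Architecture
3.3 Design of the U-shaped Siamese Variational Autoencoder
3.3.1 Siamese VAE Glyph Repair Network
3.3.2 U-shaped Siamese VAE Glyph Repair Network
3.3.3 Training and Test Datasets
Chapter 4 Experimental Results and Discussion
4.1 Experimental Procedure
4.1.1 Experimental Method
4.1.2 Evaluation Metrics
4.2 Experimental Results
4.2.1 Damaged Font Test
4.2.2 Skewed Font Test
4.2.3 Damaged Skewed Font Test
4.2.4 Test on Damaged Characters Segmented from Images of Chinese Text
4.2.5 Loss Function
4.3 Summary of Results
Chapter 5 Conclusions and Future Directions
5.1 Conclusions
5.2 Future Work
References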
