|
[1] 謝昀哲, “影像辨識技術運用於藥物檢索系統之研究,” 碩士論文, 義守大學電子工程學系, 2009. [2] 李哲宇, “應用於行動裝置之藥品外觀影像辨識演算法研究,” 碩士論文, 南臺科技大學資訊工程系, 2017. [3] Z. Huang and J. Leng, “Analysis of Hu’s moment invariants on image scaling and rotation,” in 2010 2nd International Conference on Computer Engineering and Technology, Apr. 2010, pp. V7-476-V7-480. doi: 10.1109/ICCET.2010.5485542. [4] K. O’Shea and R. Nash, “An Introduction to Convolutional Neural Networks.” arXiv, Dec. 02, 2015. doi: 10.48550/arXiv.1511.08458. [5] N. Usuyama, N. L. Delgado, A. K. Hall, and J. Lundin, “ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA: IEEE, Jun. 2020, pp. 3971–3977. doi: 10.1109/CVPRW50498.2020.00463. [6] Lu Tan, Tianran Huangfu, Liyao Wu, and Wenying Chen, “Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification.,” BMC Med. Inform. Decis. Mak., vol. 21, no. 1, p. 324, Nov. 2021, doi: 10.1186/s12911-021-01691-8. [7] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, “Focal Loss for Dense Object Detection.” arXiv, Feb. 07, 2018. Accessed: Jul. 15, 2023. [Online]. Available: http://arxiv.org/abs/1708.02002 [8] W. Liu et al., “SSD: Single Shot MultiBox Detector,” 2016, pp. 21–37. doi: 10.1007/978-3-319-46448-0_2. [9] J. Redmon and A. Farhadi, “YOLOv3: An Incremental Improvement.” arXiv, Apr. 08, 2018. doi: 10.48550/arXiv.1804.02767. [10] A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “YOLOv4: Optimal Speed and Accuracy of Object Detection,” ArXiv200410934 Cs Eess, Apr. 2020, Accessed: May 08, 2022. [Online]. Available: http://arxiv.org/abs/2004.10934 [11] H.-J. Kwon, H.-G. Kim, and S.-H. Lee, “Pill Detection Model for Medicine Inspection Based on Deep Learning,” Chemosensors, vol. 10, no. 1, Art. no. 1, Jan. 2022, doi: 10.3390/chemosensors10010004. [12] G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, “Densely Connected Convolutional Networks,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI: IEEE, Jul. 2017, pp. 2261–2269. doi: 10.1109/CVPR.2017.243. [13] D. E. Rumelhart and J. L. McClelland, “Learning Internal Representations by Error Propagation,” in Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations, MIT Press, 1987, pp. 318–362. Accessed: Jul. 15, 2023. [Online]. Available: https://ieeexplore.ieee.org/document/6302929 [14] “[1409.1556] Very Deep Convolutional Networks for Large-Scale Image Recognition.” https://arxiv.org/abs/1409.1556 (accessed Jul. 15, 2023). [15] F. Rosenblatt, “The perceptron: A probabilistic model for information storage and organization in the brain,” Psychol. Rev., vol. 65, no. 6, pp. 386–408, 1958, doi: 10.1037/h0042519. [16] 山口達輝、松田洋之, 圖解AI:機器學習和深度學習的技術與原理. 碁峰, 2020. [17] R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation”. [18] S. J. Pan and Q. Yang, “A Survey on Transfer Learning,” IEEE Trans. Knowl. Data Eng., vol. 22, no. 10, pp. 1345–1359, Oct. 2010, doi: 10.1109/TKDE.2009.191. [19] “[1504.08083] Fast R-CNN.” https://arxiv.org/abs/1504.08083 (accessed Jul. 15, 2023). [20] O. Corcoll, “Semantic Image Cropping,” arXiv.org, Jul. 15, 2021. https://arxiv.org/abs/2107.07153v1 (accessed Jul. 15, 2023). [21] “A Comprehensive Guide to Convolutional Neural Networks — the ELI5 way | Saturn Cloud Blog,” Dec. 15, 2018. https://saturncloud.io/blog/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way/ (accessed Jul. 15, 2023). [22] “#013 CNN VGG 16 and VGG 19 - Master Data Science.” https://datahacker.rs/deep-learning-vgg-16-vs-vgg-19/ (accessed Jul. 15, 2023). [23] G. Koch, R. Zemel, and R. Salakhutdinov, “Siamese Neural Networks for One-shot Image Recognition”. [24] “Siamese networks with Keras, TensorFlow, and Deep Learning - PyImageSearch.” https://pyimagesearch.com/2020/11/30/siamese-networks-with-keras-tensorflow-and-deep-learning/ (accessed Jul. 15, 2023). [25] R. Hadsell, S. Chopra, and Y. LeCun, “Dimensionality Reduction by Learning an Invariant Mapping,” in 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), Jun. 2006, pp. 1735–1742. doi: 10.1109/CVPR.2006.100. [26] P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, “Object Detection with Discriminatively Trained Part-Based Models,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 9, pp. 1627–1645, Sep. 2010, doi: 10.1109/TPAMI.2009.167. [27] P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” in Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Dec. 2001, p. I–I. doi: 10.1109/CVPR.2001.990517. [28] “機器/深度學習: 物件偵測 Non-Maximum Suppression (NMS) | by Tommy Huang | Medium.” https://chih-sheng-huang821.medium.com/機器-深度學習-物件偵測-non-maximum-suppression-nms-aa70c45adffa (accessed Jul. 15, 2023). [29] “Bilinear — PyTorch 2.0 documentation.” https://pytorch.org/docs/stable/generated/torch.nn.Bilinear.html (accessed Jul. 15, 2023). [30] T.-Y. Lin, A. RoyChowdhury, and S. Maji, “Bilinear CNN Models for Fine-Grained Visual Recognition,” in 2015 IEEE International Conference on Computer Vision (ICCV), Dec. 2015, pp. 1449–1457. doi: 10.1109/ICCV.2015.170. [31] “阿達瑪乘積 (矩陣),” 維基百科,自由的百科全書. Jul. 28, 2022. Accessed: Jul. 15, 2023. [Online]. Available: https://zh.wikipedia.org/w/index.php?title=阿達瑪乘積_(矩陣)&oldid=72935996 [32] Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin, “A Neural Probabilistic Language Model”. [33] “ImageNet Large Scale Visual Recognition Challenge | SpringerLink.” https://link.springer.com/article/10.1007/s11263-015-0816-y (accessed Jul. 15, 2023). |