FEGAN: A High-Performance Font Enhancement Network for Text CAPTCHA Preprocessing

This study aims to address performance deficiencies in CAPTCHA preprocessing methods that impede the accurate recognition of text CAPTCHAs, which are crucial for identifying security vulnerabilities. To improve CAPTCHA preprocessing methods, a similar font is initially searched and acquired by manua...

全面介紹

書目詳細資料
發表在:INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION
Main Authors: Wan, Xing; Ruslan, Fazlina Ahmat; Johari, Juliana
格式: Article; Early Access
語言:English
出版: TAIWAN ASSOC ENGINEERING & TECHNOLOGY INNOVATION 2025
主題:
在線閱讀:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001447757700001
實物特徵
總結:This study aims to address performance deficiencies in CAPTCHA preprocessing methods that impede the accurate recognition of text CAPTCHAs, which are crucial for identifying security vulnerabilities. To improve CAPTCHA preprocessing methods, a similar font is initially searched and acquired by manually removing obstructing pixels from a target CAPTCHA and retaining the font part. Using the found font, a pseudo-dataset is generated containing a large number of clean and dirty pairs to train to the proposed supervised Font Enhancement Generative Adversarial Network (FEGAN), which is designed to effectively eliminate non-font-related interferences and preserve the font outlines. Test results show that FEGAN can improve the recognizer's accuracy by approximately 16% to 50% on the M-CAPTCHA dataset (a publicly available dataset on Kaggle) and 5% to 35% on the P-CAPTCHA dataset (generated using the Python ImageCaptcha package), substantially outperforming the Multiview-filtering-based preprocessing approach.
ISSN:2223-5329
2226-809X
DOI:10.46604/ijeti.2024.13977