FEGAN: A High-Performance Font Enhancement Network for Text CAPTCHA Preprocessing

This study aims to address performance deficiencies in CAPTCHA preprocessing methods that impede the accurate recognition of text CAPTCHAs, which are crucial for identifying security vulnerabilities. To improve CAPTCHA preprocessing methods, a similar font is initially searched and acquired by manua...

Full description

Bibliographic Details
Published in:INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION
Main Authors: Wan, Xing; Ruslan, Fazlina Ahmat; Johari, Juliana
Format: Article; Early Access
Language:English
Published: TAIWAN ASSOC ENGINEERING & TECHNOLOGY INNOVATION 2025
Subjects:
Online Access:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001447757700001
Description
Summary:This study aims to address performance deficiencies in CAPTCHA preprocessing methods that impede the accurate recognition of text CAPTCHAs, which are crucial for identifying security vulnerabilities. To improve CAPTCHA preprocessing methods, a similar font is initially searched and acquired by manually removing obstructing pixels from a target CAPTCHA and retaining the font part. Using the found font, a pseudo-dataset is generated containing a large number of clean and dirty pairs to train to the proposed supervised Font Enhancement Generative Adversarial Network (FEGAN), which is designed to effectively eliminate non-font-related interferences and preserve the font outlines. Test results show that FEGAN can improve the recognizer's accuracy by approximately 16% to 50% on the M-CAPTCHA dataset (a publicly available dataset on Kaggle) and 5% to 35% on the P-CAPTCHA dataset (generated using the Python ImageCaptcha package), substantially outperforming the Multiview-filtering-based preprocessing approach.
ISSN:2223-5329
2226-809X
DOI:10.46604/ijeti.2024.13977