
PhishFind: A Machine Learning-Based System for Real-Time Phishing Detection

Research output: Chapter in Book/Report/Conference proceeding › Paper (Conference contribution) › peer-review

2 Scopus citations

Abstract

Phishing attacks continue to exploit user trust, posing significant risks to individuals and organizations through increasingly sophisticated tactics. Existing detection tools often lack real-time analysis or transparent explanations, leaving a gap in effective browser-based protection. This work introduces PhishFind, a browser widget designed to address these limitations by integrating advanced machine learning and explainable AI. As part of this contribution, we developed PhishingLong, a continuously updated dataset of phishing and legitimate websites. Leveraging this dataset, the system applies a Gradient Boosting classifier to analyze URLs and web content, while a semantic module based on the ChatGPT API provides users with clear, human-readable explanations for suspicious sites. In evaluation, Gradient Boosting achieved an F1-score of 97.34%, and user testing demonstrated high acceptance, particularly for usability and alert clarity. Overall, PhishFind demonstrates the potential of combining robust detection with explainable feedback to enhance user protection against phishing in real time.
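To make the detection and evaluation steps in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It shows the kind of lexical URL features a classifier such as Gradient Boosting could consume, and how an F1-score is derived from true positives, false positives, and false negatives. The function names (`url_features`, `f1_score`) and the specific features are assumptions chosen for illustration; the abstract does not list PhishFind's actual feature set.

```python
from urllib.parse import urlparse


def url_features(url: str) -> dict:
    """Extract a few illustrative lexical features from a URL.

    These features are hypothetical examples; the abstract does not
    specify which features PhishFind uses.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "num_dots_in_host": host.count("."),
        "has_at_symbol": "@" in url,
        "uses_https": parsed.scheme == "https",
        # Rough IPv4 check: the host consists only of digits and dots.
        "host_is_ip": host.replace(".", "").isdigit(),
    }


def f1_score(tp: int, fp: int, fn: int) -> float:
    """Compute F1 as the harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

In a full pipeline, feature dictionaries like these would be converted to numeric vectors and passed to a trained classifier, and the F1-score would be computed on a held-out test set.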

Original language: English
Title of host publication: Proceedings of 8th International Conference on Systems Engineering - Cybersecurity and AI
Subtitle of host publication: Building a reliable digital future, CIIS 2025
Publisher: Association for Computing Machinery, Inc
Pages: 31-39
Number of pages: 9
ISBN (Electronic): 9798400718809
State: Published - 22 Nov 2025
Event: 8th International Conference on Systems Engineering, CIIS 2025 - Hybrid, Lima, Peru
Duration: 1 Oct 2025 - 3 Oct 2025

Publication series

Name: Proceedings of 8th International Conference on Systems Engineering - Cybersecurity and AI: Building a reliable digital future, CIIS 2025

Conference

Conference: 8th International Conference on Systems Engineering, CIIS 2025
Country/Territory: Peru
City: Hybrid, Lima
Period: 1/10/25 - 3/10/25
