📞A.I Voice Phishing Detection Solution Utilizing NLP Algorithms📞

Final.Video.mp4

🐟 Project Organizations Link

🐟 Contents_1) Introduction to Project Background

🐟 Contents_2) Hardware Design and Implementation

🐟 Contents_3) Service Architecture & User Scenarios

🐟 Contents_4) Used Algorithms and Models

--> Return 1 if phishing, and 0 if not phishing.

--> red LED(🔴) for phishing, green LED(🟢) for non-phishing
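The label-to-notification mapping above can be sketched as follows. This is a minimal illustration, not the project's actual code: the `Notification` type, the `notify` function, and the message strings are hypothetical names introduced here, and the real device would drive GPIO pins rather than return a string.

```python
from dataclasses import dataclass

# Labels as described in the README: 1 = phishing, 0 = not phishing.
PHISHING = 1
NOT_PHISHING = 0


@dataclass(frozen=True)
class Notification:
    """Hypothetical container for what the device should show."""
    led: str      # "red" or "green"
    message: str  # text notification (wording is an assumption)


def notify(label: int) -> Notification:
    """Map a classifier label to an LED colour and a text message."""
    if label == PHISHING:
        return Notification(led="red", message="Warning: possible voice phishing detected.")
    if label == NOT_PHISHING:
        return Notification(led="green", message="No voice phishing detected.")
    raise ValueError(f"unexpected label: {label!r}")


print(notify(1).led)  # red
print(notify(0).led)  # green
```

Rejecting any label other than 0 or 1 keeps a malformed model output from silently lighting the green LED.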

🐟 Contents_5) Collected Data & Methodology for Utilization

➖ Loan fraud type: 185 instances
➖ Financial fraud type: 227 instances

➖ Financial/Insurance, Transfer, Withdrawal, Loan Service Type: 48,476 instances


alimhanhan/AI_Voice_Phishing_Detection_Solution_Utilizing_NLP_Algorithms



🐟 Contents_1) Introduction to Project Background

📑Analysis of Market & Tech Trends

👁️‍🗨 The Evolving Techniques of Voice Phishing & Increasing Risks

📑Points of Differentiation from Existing Ideas & Products

👁️‍🗨️ Competitiveness of the Idea in terms of Functionality & Usability

🐟 Contents_2) Hardware Design and Implementation

📑Sensors & Components Used

📑Detailed Hardware Design

🐟 Contents_3) Service Architecture & User Scenarios

📑Detailed Service Flow

👁️‍🗨️ Detailed UX

📑Explanation of the Service Flow

👁️‍🗨️ Additional Explanations for Each Step

🐟 Contents_4) Used Algorithms and Models

📑Algorithm Specifications

👁️‍🗨️ Voice Phishing Detection Algorithm Through Voice Data Processing

4️⃣ Based on the result, perform text notification and LED notification services.

5️⃣ Terminate all services upon completion.

📑Model Selection

👁️‍🗨️ Model Training Performance Evaluation

➡️ Consequently, the KoBIGBIRD model was selected for voice phishing detection, and the rest of the solution was built around it.

📑Model Concatenation

👁️‍🗨️ Considered & Utilized Models

⏩ In this project, we concatenated and customized the KoBIGBIRD, R-BERT, and KR-BERT models.

#️⃣ KoBIGBIRD

KoBIGBIRD is a Transformer-based model developed for Korean natural language processing. It handles longer sequences than conventional BERT: up to 4096 tokens, eight times BERT's 512-token limit.
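To make the sequence-length difference concrete, the sketch below counts how many non-overlapping windows a tokenized call transcript would need under each limit. The function name and the 3000-token transcript length are illustrative assumptions; only the 512 and 4096 limits come from the text above.

```python
import math

BERT_MAX_TOKENS = 512       # conventional BERT limit
BIGBIRD_MAX_TOKENS = 4096   # KoBIGBIRD limit


def windows_needed(n_tokens: int, max_tokens: int) -> int:
    """Number of non-overlapping windows needed to cover n_tokens."""
    if n_tokens <= 0:
        return 0
    return math.ceil(n_tokens / max_tokens)


transcript_tokens = 3000  # hypothetical length of one transcribed call

print(BIGBIRD_MAX_TOKENS // BERT_MAX_TOKENS)                   # 8
print(windows_needed(transcript_tokens, BERT_MAX_TOKENS))      # 6
print(windows_needed(transcript_tokens, BIGBIRD_MAX_TOKENS))   # 1
```

A transcript that fits in a single window can be classified in one pass, without splitting the conversation and merging per-chunk predictions.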

#️⃣ KR-BERT

#️⃣ R-BERT

📑Model Customization

👁️‍🗨️ The Process of Model Customization

🐟 Contents_5) Collected Data & Methodology for Utilization

📑 The Types of Collected Data

👉ㅤIn this project, the collected data is unstructured and is categorized into phishing and non-phishing data. To address class imbalance, only the phishing data was augmented.
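The README does not say which augmentation technique was used. As a stand-in, the sketch below uses plain random oversampling (duplicating minority samples) to show what augmenting only the phishing class means for class balance. The function name, the `target_ratio` parameter, and the toy data are all assumptions made for illustration.

```python
import random


def oversample_minority(phishing, legit, target_ratio=1.0, seed=0):
    """Grow the phishing class by random duplication (a stand-in for real augmentation).

    Returns a new phishing list whose size is at least target_ratio * len(legit).
    The legit class is left untouched, mirroring "augment only the phishing data".
    """
    if not phishing:
        raise ValueError("need at least one phishing sample to augment")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    target = int(len(legit) * target_ratio)
    extra = max(0, target - len(phishing))
    return list(phishing) + [rng.choice(phishing) for _ in range(extra)]


# Toy data only; not the project's dataset.
phishing = ["p1", "p2"]
legit = ["l"] * 10

balanced = oversample_minority(phishing, legit)
print(len(phishing), "->", len(balanced))  # 2 -> 10
```

Text augmentation for real training data would normally produce varied samples rather than exact duplicates; the duplication here only illustrates the balancing arithmetic.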

👉 Financial Supervisory Service Voice Phishing Voice Data:

👉 AI Hub Complaints Query-Response Data:

📑 Data Collection Methodology

1️⃣ Collecting phishing voice data through dynamic web crawling.

2️⃣ Downloading legitimate data.

3️⃣ Transforming the data using AWS Transcribe and uploading it as JSON files.

4️⃣ Creating TXT files through data parsing.

5️⃣ Converting the generated TXT files into CSV files.
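The parsing and conversion steps (3 to 5) can be sketched as below. This is an illustration under stated assumptions, not the project's script: it relies on the standard AWS Transcribe output layout, where the full text sits under `results.transcripts[].transcript`, and the function names, the `text`/`label` column names, and the sample sentence are hypothetical. Everything runs in memory so the example stays self-contained.

```python
import csv
import io
import json


def transcript_from_transcribe_json(raw: str) -> str:
    """Extract the plain transcript text from an AWS Transcribe result document."""
    doc = json.loads(raw)
    transcripts = doc["results"]["transcripts"]
    return " ".join(t["transcript"].strip() for t in transcripts)


def rows_to_csv(rows) -> str:
    """Serialize (text, label) pairs as CSV; column names are an assumption."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["text", "label"])
    for text, label in rows:
        writer.writerow([text, label])
    return buf.getvalue()


# Hypothetical Transcribe output for one call recording.
sample = json.dumps({
    "jobName": "demo",
    "results": {
        "transcripts": [{"transcript": "Hello, this is the bank."}],
        "items": [],
    },
})

text = transcript_from_transcribe_json(sample)   # step 4: JSON -> plain text
csv_text = rows_to_csv([(text, 1)])              # step 5: text -> CSV, 1 = phishing
print(csv_text)
```

Using the `csv` module rather than joining strings by hand ensures that commas and quotes inside a transcript are escaped correctly.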
