As per industry report Fraud Detection & Prevention Market was approx. $20Billion in 2018 and will be $63.5Billion by 2023, at a Compound Annual Growth Rate (CAGR) of 26.7%
-
Imbalanced Class Data – On an average, only 1-2% of total transactions is identified as fraud which one of the biggest challenges for data scientist to build robust model to identify transactions.
-
Labelled Data – Non availability of labelled data as legitimate or fraud. Most of time data at organization level is not labelled or small amount of labelled data and labelling them is tedious and cost.
-
Sometimes the fraud actives are mixed up with the normal activities, hard to identify using general algorithms
-
Fraudlent activities (in case of few reported incidents) tend to change their mode of transaction and process
https://www.kaggle.com/mlg-ulb/creditcardfraud
https://towardsdatascience.com/anomaly-detection-with-isolation-forest-visualization-23cd75c281e2