1 min readfrom Machine Learning

[R]GNN Model For Fraud Detection Isn't Performing Well[R]

We're writing a research paper on explainable fraud detection GNN model and in the first step we're creating a basic Graph Neural Network for that. We're using the most famous dataset available on this topic i.e IEEE CIS Fraud Detection Dataset and implemented all necessary feature engineering on that data (although majority of feature engineering is already performed in the dataset). Then we constructed a heterogeneous graph on that dataset. Various transaction features like device, transaction id, amount are embedded as nodes and connected with transaction nodes. But the issue is after training the model isn't performing well. It is producing average AUC of 0.87, PR-AUC of 0.52, recall@5% around 0.57 and precision@5% around 0.37 (We tried GCN, GraphSAGE and GAT, all performs almost same for rest data)

Whereas the SOTA models in this topic produce much better metrics. Can anyone tell where potentially we're doing things wrong?

submitted by /u/LiveAccident5312
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#generative AI for data analysis
#Excel alternatives for data analysis
#large dataset processing
#natural language processing for spreadsheets
#automated anomaly detection
#big data management in spreadsheets
#conversational data analysis
#real-time data collaboration
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data analysis tools
#data cleaning solutions
#rows.com
#financial modeling with spreadsheets
#GNN
#fraud detection
#Graph Neural Network
#IEEE CIS Fraud Detection Dataset