E-COMMERCE
FRAUD-MACHINE
LEARNING
MODELS
Ximena Bustamante
INTRODUCTION
According to Statista, “e-commerce losses to online payment fraud were
estimated at 41 billion U.S. dollars globally in 2022, up from the previous
year. The figure is expected to grow further to 48 billion U.S. dollars by
2023” (Statista, “Value of e-commerce losses to online payment fraud
worldwide from 2020 to 2023”).
Machine learning algorithms are often used to identify potentially
fraudulent transactions.
Explore with me two models, logistic regression and decision trees,
that were used to identify variables significantly correlated with fraud.
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
2
DATASET
Variables
customerEmail
Multiple
Duplicated
customerPhone
customerDevice
customerIPAddress
customerBillingAddress
No_Transactions
No_Orders
No_Payments
transactionId
orderId
paymentMethodId
paymentMethodRegistrationFailure
paymentMethodType
paymentMethodProvider
transactionAmount
transactionFailed
orderState
Fraud
KEY INSIGHTS
SIGNIFICANT VARIABLES
• The dataset consisted of 19 variables
• Of these, 1 was the dependent variable and 18 were independent variables; only 7 of the independent variables were found to be significant, and the algorithms were run on those 7
LOGISTIC REGRESSION
• A logistic regression model was created with one dependent variable (fraud: Y/N) and 7 independent variables
• The confusion matrix used to evaluate it showed a highly accurate model
• As seen in the image on the right, it achieved 88% accuracy, 85% sensitivity, 91% specificity, 90% precision and 87% negative predictive value
• Out of 65 non-fraud transactions in the test data, it correctly identified 59
• Out of 64 fraud transactions in the test data, it correctly identified 55
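The percentages on this slide can be re-derived from the two counts it reports. The following is a minimal Python sketch, not the author's R code; the function name `confusion_metrics` is hypothetical, and treating fraud as the positive class is an assumption made for illustration.

```python
def confusion_metrics(tp, fn, tn, fp):
    """Return standard binary-classification metrics from confusion-matrix counts."""
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "sensitivity": tp / (tp + fn),   # share of positives that were caught
        "specificity": tn / (tn + fp),   # share of negatives that were cleared
        "precision": tp / (tp + fp),     # share of positive predictions that were right
        "npv": tn / (tn + fn),           # share of negative predictions that were right
    }

# Counts from the slide, assuming fraud is the positive class:
# 55 of 64 fraud cases identified, 59 of 65 non-fraud cases identified.
logit = confusion_metrics(tp=55, fn=64 - 55, tn=59, fp=65 - 59)
for name, value in logit.items():
    print(f"{name}: {value:.1%}")
```

Under that assumption the counts give 88.4% accuracy, 85.9% sensitivity, 90.8% specificity, 90.2% precision and 86.8% negative predictive value, which agrees with the slide after rounding (the reported 85% sensitivity appears to be truncated rather than rounded).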
DECISION TREES
• A decision tree model was also created with the same dependent and independent variables
• The confusion matrix used to evaluate it also showed a highly accurate model
• This model achieved 96% sensitivity, 83% specificity, 85% positive predictive value and 95% negative predictive value
• Out of a total of 260 non-fraud transactions, it correctly identified 249
• Out of a total of 257 fraud transactions, it correctly identified 213
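When reading these figures it helps to know which class the metrics treat as "positive". The Python sketch below (not the author's R code; the helper `rates` is hypothetical) computes the four rates from the slide's counts under both readings.

```python
def rates(correct_pos, total_pos, correct_neg, total_neg):
    """Sensitivity, specificity, PPV and NPV for a chosen positive class."""
    fn = total_pos - correct_pos   # positives the model missed
    fp = total_neg - correct_neg   # negatives wrongly labeled positive
    return {
        "sensitivity": correct_pos / total_pos,
        "specificity": correct_neg / total_neg,
        "ppv": correct_pos / (correct_pos + fp),
        "npv": correct_neg / (correct_neg + fn),
    }

# Slide counts: 249 of 260 non-fraud and 213 of 257 fraud classified correctly.
non_fraud_positive = rates(249, 260, 213, 257)
fraud_positive = rates(213, 257, 249, 260)

print({k: round(v, 2) for k, v in non_fraud_positive.items()})
print({k: round(v, 2) for k, v in fraud_positive.items()})
```

With these counts, only the reading in which non-fraud is the positive class reproduces the slide's 96% / 83% / 85% / 95%. Read with fraud as the positive class, the same counts mean the tree identified about 83% of fraud transactions.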
DATA PROCESS-ACQUISITION,
PREPARATION, ANALYSIS AND
VISUALIZATION
DATA ACQUISITION, PREPARATION AND
ANALYSIS
EXCEL & ACCESS
• Data was acquired from Kaggle, and the analysis was inspired by Professor Hudson of the University of Illinois Urbana-Champaign (Machine Learning Algorithms with R in Business Analytics)
• Tables with transaction data and customer data were initially joined in Access and then explored in Excel
• Initial exploration of the data revealed customers associated with multiple e-mail addresses
• This led to the creation of a new binary variable (MULTIPLE) that flags transactions from customers with multiple e-mails
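The MULTIPLE flag described above could be derived along the following lines. This is a Python sketch, not the Access/Excel steps actually used; the `customer_id` key and the sample rows are hypothetical, since the slides do not say which field identifies a customer.

```python
from collections import defaultdict

def add_multiple_flag(rows, customer_key="customer_id", email_key="customerEmail"):
    """Add Multiple=1 to rows whose customer appears with more than one e-mail."""
    emails_per_customer = defaultdict(set)
    for row in rows:
        # Normalize case and whitespace so the same address is not counted twice.
        emails_per_customer[row[customer_key]].add(row[email_key].strip().lower())
    return [
        {**row, "Multiple": int(len(emails_per_customer[row[customer_key]]) > 1)}
        for row in rows
    ]

# Hypothetical sample rows, for illustration only.
transactions = [
    {"customer_id": "c1", "customerEmail": "a@example.com"},
    {"customer_id": "c1", "customerEmail": "b@example.com"},
    {"customer_id": "c2", "customerEmail": "c@example.com"},
]
flagged = add_multiple_flag(transactions)
print([row["Multiple"] for row in flagged])  # [1, 1, 0]
```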
POWER BI
• Power Query in Power BI was used for a more in-depth analysis of the variables
• The “Column Distribution” view showed that some IP addresses, devices and billing addresses were being used by multiple customers (DUPLICATED)
• Thus, a new “Duplicated” column was created to flag these transactions
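One way to compute such a “Duplicated” column outside Power Query is sketched below in Python. It is an illustration under assumptions: the `customer_id` key and the sample rows are hypothetical, and the rule "shared with at least one other customer on any of the three fields" is my reading of the slide rather than the author's documented logic.

```python
from collections import defaultdict

SHARED_FIELDS = ("customerIPAddress", "customerDevice", "customerBillingAddress")

def add_duplicated_flag(rows, customer_key="customer_id", fields=SHARED_FIELDS):
    """Add Duplicated=1 to rows sharing any listed field value with another customer."""
    customers_by_value = defaultdict(set)
    for row in rows:
        for field in fields:
            customers_by_value[(field, row[field])].add(row[customer_key])
    return [
        {**row, "Duplicated": int(any(
            len(customers_by_value[(field, row[field])]) > 1 for field in fields
        ))}
        for row in rows
    ]

# Hypothetical sample rows: c1 and c2 share an IP address, c3 shares nothing.
rows = [
    {"customer_id": "c1", "customerIPAddress": "10.0.0.1",
     "customerDevice": "d1", "customerBillingAddress": "1 Main St"},
    {"customer_id": "c2", "customerIPAddress": "10.0.0.1",
     "customerDevice": "d2", "customerBillingAddress": "2 Oak Ave"},
    {"customer_id": "c3", "customerIPAddress": "10.0.0.9",
     "customerDevice": "d3", "customerBillingAddress": "3 Elm Rd"},
]
print([r["Duplicated"] for r in add_duplicated_flag(rows)])  # [1, 1, 0]
```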
RSTUDIO
• RStudio was used to create the 2 machine learning (ML) models
• For the complete code, please visit my GitHub repository
• To create both ML models, I loaded the necessary libraries, converted strings to factors, created a confusion matrix, visualized the balance of the dataset, split the data into training and testing sets, trained the models, evaluated them on the test data, made predictions, and finally used a confusion matrix to measure accuracy
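The train/test split step in this workflow can be illustrated with a small stratified split. The sketch is in Python rather than the R used in the project; the 25% test fraction, the seed and the toy data are assumptions, as the slides do not state the split ratio.

```python
import random
from collections import defaultdict

def stratified_split(rows, label_key="Fraud", test_fraction=0.25, seed=42):
    """Split rows into train/test sets, keeping each class's share roughly equal."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)
    train, test = [], []
    for group in by_label.values():
        shuffled = group[:]          # copy so the caller's list is not reordered
        rng.shuffle(shuffled)
        n_test = round(len(shuffled) * test_fraction)
        test.extend(shuffled[:n_test])
        train.extend(shuffled[n_test:])
    return train, test

# Toy data: 20 fraud and 20 non-fraud rows.
data = [{"id": i, "Fraud": int(i % 2 == 0)} for i in range(40)]
train, test = stratified_split(data)
print(len(train), len(test))  # 30 10
```

Splitting per class keeps the fraud rate in the test set close to that of the training set, so the confusion matrix is computed on data with a similar class balance.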
DATA VISUALIZATION
POWER BI
• Power BI was used to create a map showing the geographical location of all transactions, color-coded by fraud and non-fraud
CHALLENGES AND COOL TECHNIQUES
• Challenge: High number of correlated variables
• Cool Technique: Feature engineering: created two columns (with binary values) to flag transactions that had duplicated/multiple addresses, phone numbers and IP addresses, instead of creating one column for each
• Challenge: Unbalanced dataset
• Cool Technique: Balanced it using RUS (random undersampling) to create a dataset with roughly the same number of fraud and non-fraud transactions
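Random undersampling can be sketched in a few lines of Python (the project used R; the function name, seed and toy data here are hypothetical). This version trims every class to exactly the size of the smallest one, whereas the slide describes a result that is only roughly balanced.

```python
import random

def random_undersample(rows, label_key="Fraud", seed=0):
    """Randomly drop rows from larger classes until every class matches the smallest."""
    rng = random.Random(seed)
    by_label = {}
    for row in rows:
        by_label.setdefault(row[label_key], []).append(row)
    target = min(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, target))  # sample without replacement
    rng.shuffle(balanced)
    return balanced

# Hypothetical imbalanced data: 10 fraud rows and 90 non-fraud rows.
data = [{"id": i, "Fraud": int(i < 10)} for i in range(100)]
balanced = random_undersample(data)
print(sum(r["Fraud"] for r in balanced), len(balanced))  # 10 20
```

The trade-off is that undersampling discards majority-class rows, which is simple and fast but reduces the amount of data the models can learn from.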
What If I Had More Time?
• If I had more time, I would have used social network analysis to see how transactions may be associated with one another
THANK YOU FOR
CHECKING OUT MY
PROJECT!
• Follow me for more project ideas
• If you have any questions, comments, feedback or JOB OFFERS, feel free to DM me

