Claim Pattern Anomalies
Making a Mole Hill Out of a Mountain
Predictive Analytics World for Business
San Francisco
May 17, 2017
CAS Analytics & Data Provisioning Team v01
Darryl Humphrey, PhD, PMP
linkedin.com/in/dghumphrey1
Provider and member claiming behavior is
affected by many factors.
2CAS ADP Team
Fraud Analytics
Member and Provider Claiming Patterns are Dynamic
– Economic Conditions
– Plan Design
– Policies and Processes
– Compliance Verification
– Industry Realities
Analyzing the equivalent of 87,000,000 claim lines monthly, encompassing 17,000 providers and 1.6 million members.
– Nine (9) practice areas across health, dental, and pharmacy benefits
– 70 measures of claiming behavior
– Six (6) algorithms
– Look for converging results
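One way to read "look for converging results" is to count, for each provider, how many algorithms flag it, and to review first the providers flagged by several. The sketch below assumes that reading; the algorithm names, provider IDs, and the threshold of two agreeing algorithms are hypothetical and do not come from the talk.

```python
from collections import Counter

def converging_flags(flags_by_algorithm, min_agreeing=2):
    """Return providers flagged by at least `min_agreeing` algorithms."""
    counts = Counter()
    for flagged in flags_by_algorithm.values():
        counts.update(set(flagged))  # each algorithm votes once per provider
    return {p: n for p, n in counts.items() if n >= min_agreeing}

# Hypothetical output of three detection algorithms (illustrative IDs only).
flags = {
    "mcd_outlier": ["P01", "P07", "P12"],
    "kmeans_small_cluster": ["P07", "P12", "P30"],
    "network": ["P12", "P44"],
}
converged = converging_flags(flags)
print(converged)
```

Raising `min_agreeing` trades a shorter review list for a higher chance of missing a provider that only one method can see.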
A multivariate distance measure identifies providers
whose claiming patterns differ from the population.
[Figure: all providers reviewed, November analytic run. Scatter plot of the proportion of total $ associated with risky claims (y-axis, 0 to 1) against DrugRD (x-axis, 0 to 150); points are marked as non-outliers or MCD outliers. Annotation: 4.18.]
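A multivariate distance of this kind can be sketched with the Mahalanobis distance. "MCD" on the slide presumably refers to the Minimum Covariance Determinant, a robust estimate of location and scatter; the sketch below swaps in the classical sample mean and covariance instead, so it is not the robust method. The two features and all data values are made up for illustration.

```python
import math

def mahalanobis_2d(points):
    """Classical (non-robust) Mahalanobis distance of each 2-D point from the sample mean."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    # Sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    det = sxx * syy - sxy * sxy
    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # d^2 = [dx dy] * inverse(cov) * [dx dy]^T, written out for the 2x2 case.
        d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
        distances.append(math.sqrt(d2))
    return distances

# Hypothetical providers: five follow a common pattern, the last one does not.
providers = [(1, 1), (2, 2), (3, 3.1), (4, 3.9), (5, 5), (3, 8)]
distances = mahalanobis_2d(providers)
most_unusual = distances.index(max(distances))
print(most_unusual, [round(d, 2) for d in distances])
```

Because the outlier itself inflates the classical covariance, its distance is understated here; that masking effect is the usual motivation for a robust estimator.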
Clustering algorithm sharpens the focus on the riskiest providers.
[Figure: k-means results for all providers reviewed, November analytic run. Scatter plot of the proportion of total $ at risk (y-axis, 0 to 1) against DrugRD (x-axis, 0 to 150), colored by cluster (Clusters 1 to 8). Cluster sizes labeled on the plot: 5, 24, 34, and 54. Annotation: 4.18.]
Providers that cluster together have similar claiming patterns.
Small clusters with high RD scores are of most interest.
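The clustering step can be illustrated with a bare-bones k-means (Lloyd's algorithm). This is a sketch under stated assumptions, not the team's pipeline: the six providers, their two features (a DrugRD-like score and a proportion at risk), and the starting centroids are all hypothetical.

```python
import math

def kmeans(points, centroids, iterations=20):
    """Lloyd's algorithm: alternate the assignment and centroid-update steps."""
    centroids = [tuple(c) for c in centroids]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids

# Hypothetical (score, proportion at risk) pairs: a large low-score group
# and a small high-score group.
points = [(1.5, 0.05), (2.0, 0.10), (2.5, 0.08), (3.0, 0.12), (80.0, 0.85), (90.0, 0.90)]
labels, centroids = kmeans(points, centroids=[points[0], points[-1]])
print(labels, centroids)
```

The raw score dominates the distance here because the features are on different scales; in practice the inputs would normally be standardized first.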
Reviewing the cluster characteristics gives insight into what
claiming patterns are driving the outlier scores.
Mean Z Scores

Cluster  # Providers  Avg DrugRD  Var1  Var2  Var3  Var4  Var5  Var6  Var7  Var8  Var9
4                  5       24      2.0  29.7  -0.5  -0.1   0.1  -0.4  -0.2  -0.2  -0.5
2                 24       14     -0.1   0.8  10.3   1.7   1.7  -0.3   0.3   0.6   0.7
3                 34       84     -1.3   0.8  -0.4  36.8   8.4   8.9   4.3   0.5   1.1
5                 54       46     -0.4   0.9   0.9  17.1   3.4   1.9   2.5   0.9   0.5
…
1                682     2.15      0.2   0.0   0.2  -0.3   0.0  -0.2  -0.1  -0.2   0.2
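A cluster profile of mean z-scores can be computed by standardizing each variable against all providers and then averaging within each cluster. The sketch below shows this for a single variable; the values and cluster labels are invented, and the use of the population standard deviation is an assumption.

```python
from statistics import mean, pstdev

def cluster_mean_z(values, labels):
    """Mean z-score of one variable within each cluster, relative to all providers."""
    mu, sigma = mean(values), pstdev(values)
    z = [(v - mu) / sigma for v in values]
    profile = {}
    for lab in sorted(set(labels)):
        profile[lab] = mean([zi for zi, l in zip(z, labels) if l == lab])
    return profile

# Hypothetical variable for six providers in two clusters.
values = [10, 12, 11, 9, 50, 54]
labels = [1, 1, 1, 1, 4, 4]
profile = cluster_mean_z(values, labels)
print(profile)
```

A large positive mean z-score for a small cluster points to the variable that sets that cluster apart from the rest of the population.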
Claim-specific risk is estimated for the variables highlighted in
the K-means and MCD analyses.
\[
\mathrm{Risk}_{MA}(i,j) = \frac{\exp\!\left(-\dfrac{MA(i,j)}{\mathrm{Max}_{MA}(i)}\left(1-\dfrac{d_i(j)}{r}\right)\right) - e^{-1}}{1 - e^{-1}}
\]
Limited investigation resources are targeted on the specific claims
most likely to be an issue.
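The slide's risk formula can be evaluated directly. The sketch assumes the formula reads as exp(-(MA(i,j) / MaxMA(i)) * (1 - d_i(j) / r)), shifted by e^-1 and divided by (1 - e^-1); the extracted slide text is ambiguous about the exponent, so this reading is an assumption. The slide does not define its symbols, so the argument names below are neutral stand-ins rather than interpretations.

```python
import math

def risk_ma(ma, max_ma, d, r):
    """Evaluate the slide's risk expression under the reading stated above.

    ma, max_ma, d, r stand in for MA(i,j), MaxMA(i), d_i(j), and r.
    """
    exponent = -(ma / max_ma) * (1.0 - d / r)
    return (math.exp(exponent) - math.exp(-1.0)) / (1.0 - math.exp(-1.0))

# Boundary behavior under this reading (inputs are illustrative only).
low = risk_ma(ma=10.0, max_ma=10.0, d=0.0, r=5.0)   # exponent is -1
high = risk_ma(ma=10.0, max_ma=10.0, d=5.0, r=5.0)  # exponent is 0
print(low, high)
```

Under this reading the score stays between 0 and 1 whenever the ratio MA/MaxMA and the ratio d/r both lie between 0 and 1, which is consistent with the normalizing terms in the formula.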
Network analysis can reveal relationships that warrant
further investigation.
Collusion between members
and suspect providers?
Problematic providers tend to
have customers in common.
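Customers in common can be counted from claim records by projecting the member-provider network onto providers. A minimal sketch follows; the (member, provider) pairs are hypothetical, and the real analysis presumably uses richer claim data.

```python
from collections import defaultdict
from itertools import combinations

def shared_member_counts(claims):
    """Count members in common for every pair of providers.

    `claims` is an iterable of (member_id, provider_id) pairs.
    """
    providers_by_member = defaultdict(set)
    for member, provider in claims:
        providers_by_member[member].add(provider)
    shared = defaultdict(int)
    for providers in providers_by_member.values():
        # Every pair of providers visited by the same member shares that member.
        for a, b in combinations(sorted(providers), 2):
            shared[(a, b)] += 1
    return dict(shared)

# Hypothetical claims: members m1 and m2 both use providers A and B.
claims = [("m1", "A"), ("m1", "B"), ("m2", "A"), ("m2", "B"),
          ("m3", "A"), ("m3", "C"), ("m4", "C")]
shared = shared_member_counts(claims)
print(shared)
```

Pairs with unusually high counts relative to each provider's total membership are the relationships that would warrant a closer look.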
Claiming patterns for narcotics are of particular
interest.
Highly concentrated business
relationships are flagged.
Are members seeking narcotics from
multiple doctors and pharmacies?
Machine learning (ML) = architectures for building
algorithms that learn.
[Figure: diagram of machine learning methods. Labels: Machine Learning, SVM, Random Forest, K-NN, Neural Network (NN), Deep Learning, CNN, DBN, RBM, RNN.]
Random Forest algorithm classifies observations based
on the majority vote of many decision trees.
[Figure: a data set of 1200 observations and 7 variables is sampled with replacement several times; each sample feeds a decision tree, and the trees together produce the risk classification.]
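The sample-with-replacement and majority-vote idea can be shown in a few lines. This is a simplified stand-in, not a full random forest: each "tree" is a one-level decision stump on one randomly chosen feature. The data, the tree count, and the seed are hypothetical.

```python
import random
from collections import Counter

def fit_stump(rows, labels, feature):
    """Best single-threshold split on one feature (a one-level decision tree)."""
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        for above in (0, 1):  # class predicted when the value exceeds the threshold
            errors = sum(
                (above if row[feature] > threshold else 1 - above) != y
                for row, y in zip(rows, labels)
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, above)
    return feature, best[1], best[2]

def predict_stump(stump, row):
    feature, threshold, above = stump
    return above if row[feature] > threshold else 1 - above

def fit_forest(rows, labels, n_trees=25, seed=0):
    rng = random.Random(seed)
    n, n_features = len(rows), len(rows[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        feature = rng.randrange(n_features)         # random feature for this tree
        forest.append(fit_stump([rows[i] for i in idx], [labels[i] for i in idx], feature))
    return forest

def predict_forest(forest, row):
    """Majority vote across all trees."""
    votes = Counter(predict_stump(stump, row) for stump in forest)
    return votes.most_common(1)[0][0]

# Hypothetical two-variable data: class 0 has low values, class 1 has high values.
rows = [(1, 2), (2, 1), (3, 3), (1, 3), (2, 2), (3, 1),
        (8, 9), (9, 8), (10, 10), (8, 10), (9, 9), (10, 8)]
labels = [0] * 6 + [1] * 6
forest = fit_forest(rows, labels)
print(predict_forest(forest, (0, 0)), predict_forest(forest, (11, 11)))
```

An odd number of trees avoids tied votes in a two-class problem.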
Random Forest technique shows promise in predicting
which investigations will yield findings of note.
              Predicted 1   Predicted 0
Actual 1           25             9
Actual 0            1            50
True Positive Rate: 74%
True Negative Rate: 98%
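The two rates follow from the four counts on the slide (25, 9, 1, 50). The sketch reads the rows as actual classes and the columns as predicted classes, which is the only reading that reproduces the stated 74% and 98%; the function and argument names are illustrative.

```python
def rates(tp, fn, fp, tn):
    """True positive rate and true negative rate from confusion-matrix counts."""
    tpr = tp / (tp + fn)  # share of actual positives that were predicted positive
    tnr = tn / (tn + fp)  # share of actual negatives that were predicted negative
    return tpr, tnr

tpr, tnr = rates(tp=25, fn=9, fp=1, tn=50)
print(f"TPR {tpr:.0%}, TNR {tnr:.0%}")  # TPR 74%, TNR 98%
```

The same counts give an overall accuracy of 75 correct out of 85 cases, or about 88%.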
\[
\mathrm{Risk}_{MA}(i,j) = \frac{\exp\!\left(-\dfrac{MA(i,j)}{\mathrm{Max}_{MA}(i)}\left(1-\dfrac{d_i(j)}{r}\right)\right) - e^{-1}}{1 - e^{-1}}
\]
Random Forest provides a measure of a variable’s
importance to classification success.
[Figure: variable importance chart listing, from top to bottom, Var 6, Var 2, Var 3, Var 4, Var 1, Var 5, Var 7.]
Automated review of receipts provides early detection
of potential issues.
A machine learning algorithm is being used to
determine whether the document is a valid receipt.
Data lift technology extracts
the information.
Analytics is one input used to match cost-to-investigate
with the anticipated ROI.
There are many paths to generating ROI from
fraud detection analytics.
– Business knowledge and a willingness to learn are more important than the tool set
– Analytics are tools; keep them sharp
– Verify that the analyses are:
  – Relevant
  – Reliable
  – Responsible
– Tailor audit investigations to the nature and magnitude of the risk
Jil Tanguay, BSc (Spec), CFI, CRMA
Manager
Claims Assurance Services
Alberta Blue Cross
jtanguay@ab.bluecross.ca
Darryl Humphrey, PhD, PMP
Senior Data Scientist
Claims Assurance Services
Alberta Blue Cross
dhumphrey@ab.bluecross.ca
Yemi Dare-Ode, BSc
Nazanin Tahmasebi, PhD
Wesley Wood, BSc
Random forest classification accuracy stabilizes at
approximately 220 trees.
Many data sets contain nonlinear relationships which can
reduce the effectiveness of some detection methods.
– Datasets that are linearly separable with some noise work out great
– Some data sets aren’t linear in their initial state
– The data can be mapped to a higher-dimensional space
[Figure: two plots of data along a single x axis and one plot of x² against x.]
Map feature space to one of higher dimensionality
where the training set is linearly separable.
Φ: x → φ(x)
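A small example makes the mapping concrete. The sketch assumes the map φ(x) = (x, x²), matching the x and x² axes mentioned on the earlier slide; the seven data points and their labels are made up.

```python
def phi(x):
    """Map a 1-D value into 2-D: x -> (x, x^2)."""
    return (x, x * x)

def separable_by_threshold(values, labels):
    """True if one cut point puts all of one class below it and the other above."""
    zeros = [v for v, y in zip(values, labels) if y == 0]
    ones = [v for v, y in zip(values, labels) if y == 1]
    return max(zeros) < min(ones) or max(ones) < min(zeros)

xs = [-3, -2, -1, 0, 1, 2, 3]
labels = [1, 1, 0, 0, 0, 1, 1]  # class 1 sits on both sides of class 0

before = separable_by_threshold(xs, labels)                       # original axis
after = separable_by_threshold([phi(x)[1] for x in xs], labels)   # x^2 coordinate
print(before, after)  # False True
```

In the mapped space any horizontal line between x² = 1 and x² = 4 separates the two classes, even though no single cut on the original axis can.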
Support Vector Machines find the optimal surface that separates the groups.
– Maximizes the distance between the hyperplane and the “difficult points” close to the decision boundary
– If there are no points near the decision surface, then there will be fewer false positives and false negatives
– Support vectors are the observations near the decision boundary that contribute to determining the boundary
– Implies that only support vectors matter; other training examples are ignorable
Ch. 15
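The margin and the support vectors can be illustrated for a fixed separating line. The sketch does not train an SVM; it takes a hand-chosen line, computes each point's distance to it, and picks out the closest points. The six points, their labels, and the line x1 + x2 = 4 are hypothetical.

```python
import math

def signed_distance(w, b, x):
    """Signed distance from point x to the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm

points = [(1, 1), (2, 0), (0, 2), (4, 4), (5, 3), (3, 3)]
labels = [-1, -1, -1, 1, 1, 1]
w, b = (1.0, 1.0), -4.0  # hand-chosen separating line x1 + x2 = 4

# A positive value means the point lies on the correct side of the line.
margins = [y * signed_distance(w, b, p) for p, y in zip(points, labels)]
margin = min(margins)
support = [p for p, m in zip(points, margins) if math.isclose(m, margin)]
print(round(margin, 3), support)
```

Moving or deleting (4, 4) or (5, 3) leaves the margin unchanged, which is the sense in which only the points nearest the boundary matter.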
Random Forest Confusion Matrix

RD Quintile   Accuracy
0-0.20        0.80
0.20-0.40     0.67
0.40-0.60     0.50
0.60-0.80     0.69
0.80-1        0.91

SVM Confusion Matrix

RD Quintile   Accuracy
0-0.20        0.85
0.20-0.40     0.54
0.40-0.60     0.54
0.60-0.80     0.67
0.80-1        0.94
Neural networks are well-suited to detection tasks.
– Artificial neural networks are composed of multiple nodes which imitate neurons of the human brain.
– Neurons are connected by links and they interact with each other. Each link is associated with a weight
– Artificial neural networks learn by modifying the weights in response to feedback
– Deep learning = lots of hidden layers
– Most often used for images
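Learning by modifying weights in response to feedback can be shown with a single artificial neuron. The sketch uses the classic perceptron learning rule on the logical OR function; it is a toy illustration, not the network used by the team, and the learning rate and epoch count are arbitrary choices.

```python
def predict(weights, bias, x):
    """Step activation: fire (1) when the weighted sum is positive."""
    s = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if s > 0 else 0

def train(samples, epochs=10, lr=1.0):
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in samples:
            error = target - predict(weights, bias, x)  # feedback signal
            # Nudge each weight in the direction that reduces the error.
            weights = [w + lr * error * xi for w, xi in zip(weights, x)]
            bias += lr * error
    return weights, bias

# Logical OR: output is 1 when either input is 1.
samples = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
weights, bias = train(samples)
predictions = [predict(weights, bias, x) for x, _ in samples]
print(weights, bias, predictions)
```

Once every sample is classified correctly the error is zero, so the weights stop changing; deep networks apply the same idea across many layers with gradient-based updates.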
Eye movement research indicates that we recognize
objects by extracting features.
The series of layers between input & output do
feature extraction and processing in stages, just as our
brains do.
[Figure labels: Learning; Variables]
Network analysis is used to show the effect of
ownership on a pharmacy’s claiming behavior.
– The assertion is that company policy or implicit guidelines can drive claiming behavior across the pharmacies owned by a single corporate entity
– Network defined by pharmacies registered with the same legal name
– Red = high total $ from risky claiming relative to other pharmacies
– Large = high proportion of a pharmacy’s $ from risky claiming
– Close together = similar high total $ at risk

1530 track2 humphrey

  • 1. Claim Pattern Anomalies Making a Mole Hill Out of a Mountain Predictive Analytics World for Business San Francisco May 17, 2017 CAS Analytics & Data Provisioning Team v01 Darryl Humphrey, PhD, PMP linkedin.com/in/dghumphrey1
  • 2. Provider and member claiming behavior is affected by many factors. 2CAS ADP Team FraudAnalytics Member and Provider Claiming Patterns are Dynamic Economic Conditions Plan Design Policies and Processes Compliance Verification Industry Realities
  • 3. Analyzing equivalent of 87,000,000 claim lines monthly encompassing 17,000 providers and 1.6 million members. –Nine (9) practice areas across health, dental, and pharmacy benefits –70 measures of claiming behavior –Six (6) algorithms –Look for converging results 3CAS ADP Team
  • 4. Multi-variate distance measure identifies providers whose claiming patterns differ from the population. ProportionofTotal$ AssociatedwithRiskyClaims 0 .2.4.6.8 1 0 50 100 150 DrugRD Non Outlier MCD Outlier November Analytic Run All providers reviewed 4.18 4CAS ADP Team
  • 5. 0 .2.4.6.8 1 0 50 100 150 DrugRD Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 November Analytic Run All Providers reviewed kmeans results Clustering algorithm sharpens the focus on the riskiest providers. 4.18 Providers that cluster together have similar claiming patterns. 24 5 54 n=34 Small clusters with high RD scores are of most interest. ProportionofTotal$ thatareatRisk 5CAS ADP Team
  • 6. CAS ADP Team 6 Reviewing the cluster characteristics gives insight into what claiming patterns are driving the outlier scores. Mean Z Scores Cluster # Prvr Avg DrugRD Var1 Var2 Var3 Var4 Var5 Var6 Var7 Var8 Var9 4 5 24 2.0 29.7 -0.5 -0.1 0.1 -0.4 -0.2 -0.2 -0.5 2 24 14 -0.1 0.8 10.3 1.7 1.7 -0.3 0.3 0.6 0.7 3 34 84 -1.3 0.8 -0.4 36.8 8.4 8.9 4.3 0.5 1.1 5 54 46 -0.4 0.9 0.9 17.1 3.4 1.9 2.5 0.9 0.5 … 1 682 2.15 0.2 0.0 0.2 -0.3 0.0 -0.2 -0.1 -0.2 0.2
  • 7. CAS ADP Team 7 Claim-specific risk is estimated for the variables highlighted in the K-means and MCD analyses. RiskMA(i,j) = (e-(MA(i,j)/Max MA (i)) * (1-di(j)/r))-e-1)/(1-e-1) Limited investigation resources are targeted on the specific claims most likely to be an issue.
  • 8. Network analysis can reveal relationships that warrant further investigation. Collusion between members and suspect providers? CAS ADP Team 8 Problematic providers tend to have customers in common.
  • 9. Claiming patterns for narcotics are of particular interest. CAS ADP Team 9 Highly concentrated business relationships are flagged. Are members seeking narcotics from multiple doctors and pharmacies?
  • 10. Machine learning (ML) = architectures for building algorithms that learn. CAS ADP Team 10 mA SVM Random Forest NN Neural Network CNNDBN Deep Learning RBM K-NN RNN Machine Learning
  • 11. Random Forest algorithm classifies observations based on the majority vote of many decision trees. Risk classification … 1200 obs 7 vars Sample with replacement Sample with replacement Sample with replacement 11CAS ADP Team
  • 12. Random Forest technique shows promise in predicting which investigations will yield findings of note. True Positive Rate: 74%. True Negative Rate: 98%.
      Confusion matrix (counts as shown on the slide):
               1     0
         1    25     9
         0     1    50
      $$\mathrm{Risk}_{MA}(i,j) = \frac{e^{-\frac{MA(i,j)}{\mathrm{Max}\,MA(i)}\left(1-\frac{d_i(j)}{r}\right)} - e^{-1}}{1 - e^{-1}}$$
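The two rates on this slide can be reproduced from the four counts it shows (25, 9, 1, 50). The slide does not label which axis is actual and which is predicted; the assignment below (25 true positives, 9 false negatives, 1 false positive, 50 true negatives) is an assumption, chosen because it is the one that reproduces the reported 74% and 98%.

```python
def rates(tp, fn, fp, tn):
    """True positive rate (sensitivity) and true negative rate (specificity)."""
    tpr = tp / (tp + fn)
    tnr = tn / (tn + fp)
    return tpr, tnr

# Counts from the slide, under the assumed row/column assignment.
tpr, tnr = rates(tp=25, fn=9, fp=1, tn=50)
print(f"TPR = {tpr:.0%}, TNR = {tnr:.0%}")  # prints: TPR = 74%, TNR = 98%
```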
  • 13. Random Forest provides a measure of a variable’s importance to classification success. [Figure: variable importance chart; variables in chart order: Var 6, Var 2, Var 3, Var 4, Var 1, Var 5, Var 7]
  • 14. Automated review of receipts provides early detection of potential issues. A machine learning algorithm is used to determine whether a document is a valid receipt. Data lift technology extracts the information.
  • 15. Analytics is one input used to match cost-to-investigate with the anticipated ROI.
  • 16. There are many paths to generating ROI from fraud detection analytics.
      – Business knowledge and a willingness to learn are more important than the tool set
      – Analytics are tools; keep them sharp
      – Verify that the analyses are:
          – Relevant
          – Reliable
          – Responsible
      – Tailor audit investigations to the nature and magnitude of the risk
  • 17. Jil Tanguay, BSc (Spec), CFI, CRMA, Manager, Claims Assurance Services, Alberta Blue Cross, jtanguay@ab.bluecross.ca
      Darryl Humphrey, PhD, PMP, Senior Data Scientist, Claims Assurance Services, Alberta Blue Cross, dhumphrey@ab.bluecross.ca
      Yemi Dare-Ode, BSc; Nazanin Tahmasebi, PhD; Wesley Wood, BSc
  • 18. Random forest classification accuracy stabilizes at approximately 220 trees.
  • 19. Many data sets contain nonlinear relationships which can reduce the effectiveness of some detection methods.
      – Datasets that are linearly separable with some noise work out great
      – Some data sets aren’t linear in their initial state
      – The data can be mapped to a higher-dimensional space
      [Figures: points along an x axis with origin 0; the same points plotted as x² against x]
  • 20. Map feature space to one of higher dimensionality where the training set is linearly separable. Φ: x → φ(x)
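A small worked example of such a mapping, assuming the map φ(x) = (x, x²); the slide gives only the general form Φ: x → φ(x), so this particular map and the toy data are illustrative. On the original axis no single cut point separates the classes, but on the new x² coordinate one cut does. A cut on one coordinate is a special case of a separating hyperplane.

```python
def phi(x):
    """Assumed feature map: lift a 1-D point to (x, x^2)."""
    return (x, x * x)

def threshold_separable(values, labels):
    """True if a single cut point along one axis splits the two classes."""
    ordered = [label for _, label in sorted(zip(values, labels))]
    changes = sum(1 for a, b in zip(ordered, ordered[1:]) if a != b)
    return changes <= 1

# Hypothetical 1-D data: the positive class sits on both sides of the negative class.
xs = [-3, -2, -1, 0, 1, 2, 3]
labels = [1, 1, 0, 0, 0, 1, 1]

separable_before = threshold_separable(xs, labels)
separable_after = threshold_separable([phi(x)[1] for x in xs], labels)
```

Kernel methods such as SVMs obtain the same effect without computing φ(x) explicitly, which matters when the target space is very high-dimensional.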
  • 21. Support Vector Machines find the optimal surface that separates the groups.
      – Maximizes the distance between the hyperplane and the “difficult points” close to the decision boundary
      – If there are no points near the decision surface, then there will be fewer false positives and false negatives
      – Support vectors are the observations near the decision boundary that contribute to determining the boundary
      – Implies that only support vectors matter; other training examples are ignorable
  • 22. Accuracy by RD quintile (panels titled "Random Forest Confusion Matrix" and "SVM Confusion Matrix"; the two tables are listed in source order):
      Quintile     First table   Second table
      0-0.20           0.80          0.85
      0.20-0.40        0.67          0.54
      0.40-0.60        0.50          0.54
      0.60-0.80        0.69          0.67
      0.80-1           0.91          0.94
  • 23. Neural networks are well-suited to detection tasks.
      – Artificial neural networks are composed of multiple nodes which imitate neurons of the human brain
      – Neurons are connected by links and they interact with each other. Each link is associated with a weight
      – Artificial neural networks learn by modifying the weights in response to feedback
      – Deep learning = lots of hidden layers
      – Most often used for images
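"Learning by modifying the weights in response to feedback" can be shown with the smallest possible network: a single sigmoid neuron trained by gradient descent. This is a generic textbook sketch, not the presenters' model; the task (logical OR), the learning rate, and the epoch count are arbitrary choices for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_neuron(data, epochs=2000, lr=0.5):
    """Train one neuron with two weighted input links and a bias."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for x, target in data:
            out = sigmoid(w[0] * x[0] + w[1] * x[1] + b)
            error = out - target          # feedback: how far off the output was
            w[0] -= lr * error * x[0]     # nudge each weight against the error
            w[1] -= lr * error * x[1]
            b -= lr * error
    return w, b

def predict(w, b, x):
    return 1 if sigmoid(w[0] * x[0] + w[1] * x[1] + b) >= 0.5 else 0

# Logical OR as a toy detection task.
data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
w, b = train_neuron(data)
predictions = [predict(w, b, x) for x, _ in data]
```

Deep networks stack many such units in layers and propagate the same kind of error signal backwards through every layer.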
  • 24. Eye movement research indicates that we recognize objects by extracting features.
  • 25. The series of layers between input and output performs feature extraction and processing in stages, just as our brains do. [Diagram label: Learning Variables]
  • 26. Network analysis is used to show the effect of ownership on a pharmacy’s claiming behavior.
      – Assertion is that company policy / implicit guidelines can drive claiming behavior across the pharmacies owned by a single corporate entity
      – Network defined by pharmacies registered with the same legal name
      – Red = high total $ from risky claiming relative to other pharmacies
      – Large = high proportion of a pharmacy’s $ from risky claiming
      – Close together = similar high total $ at risk