Using the power of OLTP and data transformation in SQL 2016 and advanced analytics in Microsoft R Server, various industries really push the boundary of processing higher number of transaction per second (tps) for different use cases. In this talk, we walked through the use case of predicting loan charge off (loan default) rate, architecture configuration that enable this use case, and rich visual dashboard that allow customer to do what-if analysis. Find out how SQL + R allows you to build an “intelligent datawarehouse”.
4. CIS Tiger
Jack Henry
A leading provider for banking solutions for credit unions across Americas
In-memory OLTPColumnStore
Age, Original Balance, Interest
Rate, Loan Remaining
Months, Credit Score
20M Vehicle Loans
PowerBI DashboardIn-Database
Analytics at Scale
R
Business User
Prepare for
analytics
Store
Predictions
Visualize
5. CIS Tiger
Using SQL Server R Services
Bringing Analytics to the Data
• Data already in SQL
• Use T-SQL know-hows to do ETL
• Use the power of in-memory OLTP and column store indexing to enhance speed of
ETL
• RevoScaleR package to provide parallelism and scale
Making the data travel
• Data sources not in SQL
• Data sinks not in SQL
• Complex ETL needed
• Long running R script
12. CIS Tiger
SQL Server as Scoring Engine
Deployment Using:
• Triggers
• Powershell scripts
• SQL agent jobs
13. CIS Tiger
DEMO
• Using public dataset of Lending Club
• Using G5 instance of Azure Data
Science VM (DSVM)
• Following Data Science Process using
SQL Server 2016 R Services
14. CIS Tiger
References
Loan Classification using SQL Server 2016 R Services
A walkthrough of Loan Classification using SQL Server 2016 R
Services
Using MicrosoftML in SQL-Server
GitHub SQL Server Samples
15. Microsoft Data Amp
WHERE DATA GETS TO WORK
Put data, analytics and artificial intelligence into
the heart of your solutions. Get the latest on big
data and machine learning innovations.
Join us online April 19, 2017 at 8AM PT
microsoft.com/data-amp
Editor's Notes
Use Case: Predict vehicle loan charge off (default) based on attributes like interest rate, credit scores etc
Input: A subset of 8 million row of vehicle loan data in SQL Server - columns including branch location, customer profiles, interest rate, loan age etc..
Expected Result: Probability score of loans get charged off (Higher the score, higher the probability of loan get charged off)
Build PowerBI report using probability score to show healthiness of vehicle loans across different branches
Build what if scenario in business application