SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Apereo Webinar: Learning What Works When
Scaling Analytics Infrastructure
LOU HARRISON
DIRECTOR OF EDUCATIONAL TECHNOLOGY SERVICES
DELTA
NORTH CAROLINA STATE UNIVERSITY LOU@NCSU.EDU
GARY GILBERT
SOFTWARE ARCHITECT
UNICON GGILBERT@UNICON.NET
● Brief History: Open Academic Analytics Initiative (OAAI)
● The research
● Flashback to last year
● From Pilot to Enterprise efforts
● Slice and dice, including examples of ways to segment
the population
● Results
● Infrastructure overview
● Next steps / Q&A
If you’d like to follow along: https://goo.gl/g2MTCa
INTRODUCTION/OVERVIEW
● Open Academic Analytics Initiative (OAAI)
○ EDUCAUSE Next Gen Learning Challenge (NGLC)
○ Funded by Bill & Melinda Gates Foundation
● Leverage SIS and LMS data to create an open source
academic early alert system (and interventions)
● OAAI led to the Learning Analytics Processor (LAP)
project, which is part of the Apereo Learning Analytics
Initiative
● Exciting results; however, all LMS data was based on
Sakai Models
● NC State partnered with Unicon and
Marist College to bring LAP to NC State, applying it to
their Moodle LMS
BRIEF HISTORY
Predictive Model worked well and was quite portable to other schools (with
some tuning).
For more info, see JAYAPRAKASH, S. M., MOODY, E. W., LAURÍA, E. J.,
REGAN, J. R., & BARON, J. D. (2014). EARLY ALERT OF
ACADEMICALLY AT-RISK STUDENTS: AN OPEN SOURCE ANALYTICS
INITIATIVE.
JOURNAL OF LEARNING ANALYTICS, 1(1), 6-47.
THE RESEARCH
● Our Phase 1 Proof of Concept showed a 75% accuracy in
predicting at-risk students.* Recall rates were 88-90%, but
with high false positives (25%)
● Phase 2 (FY 15-16)
○ Make the LAP more automated, bigger, and badder
○ More Enterprise, more nimble
○ Similar results with much larger datasets
*in a small dataset, of incomplete historical data
FLASHBACK TO FY 16-17
Phase 3 work
● Cohorts (different models for different type classes)
○ Maybe, if incremental improvement outweighs cost
○ Tested ways to slice & dice into smaller cohorts to
improve accuracy
■ By LMS usage (no,light,med,heavy)
■ By Enrollment size (small,med,large)
■ By Student Level (FR, SO, JR, SR, GR)
● We learned splitting by courses is better than by people
● Splitting by LMS usage shows real promise
SLICE AND DICE, SEGMENT POPULATION
SOME PRELIMINARY RESULTS
Precision
Single Model 0.180994092 18.1%
Low LMS Usage 0.168674699 16.9%
Medium LMS Usage 0.184461986 18.4%
High LMS Usage 0.20375 20.4%
No LMS Usage 0.12540366 12.5%
Recall (“Accuracy for At-Risk Students”)
Single Model 0.639668826 64.0%
Low LMS Usage 0.612326044 61.2%
Medium LMS Usage 0.674772036 67.5%
High LMS Usage 0.75990676 76.0%
No LMS Usage 0.405217391 40.5%
Accuracy
Single Model 0.808493064 80.8%
Low LMS Usage 0.810299003 81.0%
Medium LMS Usage 0.758821249 75.9%
High LMS Usage 0.772434308 77.2%
No LMS Usage 0.863060429 86.3%
Testing Error
Single Model 0.191506936 19.2%
Low LMS Usage 0.189700997 19.0%
Medium LMS Usage 0.241178751 24.1%
High LMS Usage 0.227565692 22.8%
No LMS Usage 0.136939571 13.7%
● Numbers guy added to the team
● Learning how to set up cohorts and run the models
● There is a steep learning curve
● Numbers guy added to the team
● Learning how to set up cohorts and run the models
● There is a steep learning curve
SOME PRELIMINARY RESULTS
Precision
Single Model 0.180994092 18.1%
Low LMS Usage 0.168674699 16.9%
Medium LMS Usage 0.184461986 18.4%
High LMS Usage 0.20375 20.4%
No LMS Usage 0.12540366 12.5%
Recall (“Accuracy for At-Risk Students”)
Single Model 0.639668826 64.0%
Low LMS Usage 0.612326044 61.2%
Medium LMS Usage 0.674772036 67.5%
High LMS Usage 0.75990676 76.0%
No LMS Usage 0.405217391 40.5%
Accuracy
Single Model 0.808493064 80.8%
Low LMS Usage 0.810299003 81.0%
Medium LMS Usage 0.758821249 75.9%
High LMS Usage 0.772434308 77.2%
No LMS Usage 0.863060429 86.3%
Testing Error
Single Model 0.191506936 19.2%
Low LMS Usage 0.189700997 19.0%
Medium LMS Usage 0.241178751 24.1%
High LMS Usage 0.227565692 22.8%
No LMS Usage 0.136939571 13.7%
● Phase 3 - Learning Record Warehouse (LRW)
○ Currently only using Moodle logs (+ demo data)
○ Plans to incorporate data from other tools
■ BB Collaborate, Mediasite, etc.
○ All data input streams feed into LRW
○ Pull from LRW into predictive modeler
■ It's important to note that if we think we may have a need to
use certain data, it’s beneficial to have 3-5 years of historical
data to train from. So, if we think we might use it, we should
save it in the LRW.
● Implement OpenDashboard
○ To expose activity heatmap and possibly predictions
ENTERPRISE EFFORTS
Infrastructure Overview
Open Analytics Infrastructure
An Open Analytics Infrastructure
should support:
● Collection and Storage of a variety
of data
● Usage of data for analytics,
reporting and visualization
● Interoperability through Open
Standards
● Use of Open Software, Models
and Processes where appropriate
Open Analytics Infrastructure
NCSU Realization
OpenLRW
● Supports xAPI, IMS Caliper, and IMS OneRoster
● Java / Spring Boot
○ Heavy use of streams, MapReduce features of Java 8
○ Follows Spring-Boot conventions and best practices
○ LRW is packaged as an executable JAR file
■ Tomcat embedded
● MongoDB
OpenLRW: Performance & Scalability
● Stateless
● Horizontally Scalable
OpenLRW: Storage
● MongoDB
○ Sharding
○ Replica Sets
OpenLRW: Security
● API Security
○ JWT
● Authorization
○ Tenancy
○ Organization
● Data at Rest
○ Follow MongoDB best practices
IMS Caliper / xAPI in OpenLRW
● Caliper Messages are stored ~ as is
● xAPI Messages are converted to Caliper prior to storage
○ Current transformation is based on work done by the
Korean Ministry of Ed
○ More transformation options coming
■ IMS / ADL (this will be the default when available)
Other Entities in OpenLRW
● Tenants
● Organizations
● Events
○ xAPI & Caliper
● Supporting Data (OneRoster)
○ Users
○ Classes
○ Enrollments
○ Line Items
○ UserMapping & ClassMapping
OpenDashboard
● Originally developed to provide a widget-based framework for visualizations
● Evolved into a faculty / staff facing tool for monitoring student activity
● Java 8 / Spring-Boot
○ Heavy use of streams, MapReduce features of Java 8
○ Follows Spring-Boot conventions and best practices
○ Dashboard is packaged as an executable JAR file
■ Tomcat embedded
OpenDashboard
High Level View
● Ultimately the Dashboard may split
into two separate deployable
components: client and server
OpenDashboard: Session Storage
● Sessions stored in MongoDB
● Allows for horizontal scalability
● Essentially stateless client side
OpenDashboard For Students
● Dashboard is currently only intended for
faculty/staff
● To allow student access:
○ APIs would need to be apply finer grain authorization
controls
○ UI would need to be adapted for a single user view
Data Loader
● How do we get supporting (and maybe event) data into the LRW?
● Java application
● Run as cron job (or similar) daily or even more often
● Phase 4 Needs: FY17-18
● Plan for integrating dashboard
● Start incorporating data from other tools into
LRW
● Possibly add other tool data to predictions
● Start running the modeler regularly
(if we work out a way to share data)
WHAT’S HAPPENING NOW
● Disenfranchised by big, outrageously expensive, commercial black
box analytics systems?
● Can’t afford big, outrageously expensive, commercial black box
analytics systems?
● Overwhelmed by all this analytics talk and complicated math?
● Want to get your feet wet without betting the farm?
● Want to join a group of like-minded schools where every new
development benefits us all?
● This is not free, but your $$$ goes farther, and you benefit from
others’ work
● If you’re interested, contact us
Lou Harrison Gary Gilbert
lou@ncsu.edu ggilbert@unicon.net
WHERE DO YOU FIT IN?
About Unicon
TECHNOLOGY CONSULTING, SERVICES, & SUPPORT FOR THE EDUCATION
INDUSTRY
● Services, strategy, and support focused on the education industry
● Deep domain-specific expertise
● Open source software foundations
● Learn more at www.unicon.net
UNICON CONTRIBUTES TO THE APEREO LEARNING ANALYTICS INITIATIVE
● Unicon has been involved since 2015
● Developed standards-based integrations for open analytics technologies
● Provides services for open analytics technologies (OpenLRW,
OpenDashboard, SSP)
● Learn more at www.apereo.org/communities/learning-analytics-initiative

Contenu connexe

Similaire à Apereo Webinar: Learning What Works When Scaling Analytics Infrastructure (January 24, 2018)

Similaire à Apereo Webinar: Learning What Works When Scaling Analytics Infrastructure (January 24, 2018) (20)

Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)
 
Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)
 
C2_W1---.pdf
C2_W1---.pdfC2_W1---.pdf
C2_W1---.pdf
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 
SWAD Timeline 4:3
SWAD Timeline 4:3SWAD Timeline 4:3
SWAD Timeline 4:3
 
SWAD timeline
SWAD timelineSWAD timeline
SWAD timeline
 
Lak2018: Scaling Nationally: Seven Lesson Learned
Lak2018:  Scaling Nationally: Seven Lesson LearnedLak2018:  Scaling Nationally: Seven Lesson Learned
Lak2018: Scaling Nationally: Seven Lesson Learned
 
Swad Timeline
Swad TimelineSwad Timeline
Swad Timeline
 
End to end MLworkflows
End to end MLworkflowsEnd to end MLworkflows
End to end MLworkflows
 
Grand Challenges Learning Analytics
Grand Challenges Learning AnalyticsGrand Challenges Learning Analytics
Grand Challenges Learning Analytics
 
Enterprise Frameworks: Java & .NET
Enterprise Frameworks: Java & .NETEnterprise Frameworks: Java & .NET
Enterprise Frameworks: Java & .NET
 
Adrian Colyer - Keynote: NoSQL matters - NoSQL matters Dublin 2015
Adrian Colyer - Keynote: NoSQL matters - NoSQL matters Dublin 2015Adrian Colyer - Keynote: NoSQL matters - NoSQL matters Dublin 2015
Adrian Colyer - Keynote: NoSQL matters - NoSQL matters Dublin 2015
 
Drinking the xAPI Kool Aid
Drinking the xAPI Kool AidDrinking the xAPI Kool Aid
Drinking the xAPI Kool Aid
 
(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport Meeting(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport Meeting
 
Open Learning Analytics Strategy for Student Success: The North Carolina Stat...
Open Learning Analytics Strategy for Student Success: The North Carolina Stat...Open Learning Analytics Strategy for Student Success: The North Carolina Stat...
Open Learning Analytics Strategy for Student Success: The North Carolina Stat...
 
Make Sense Out of Data with Feature Engineering
Make Sense Out of Data with Feature EngineeringMake Sense Out of Data with Feature Engineering
Make Sense Out of Data with Feature Engineering
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems
 
Limits of Machine Learning
Limits of Machine LearningLimits of Machine Learning
Limits of Machine Learning
 
Fms invited talk_2018 v5
Fms invited talk_2018 v5Fms invited talk_2018 v5
Fms invited talk_2018 v5
 
Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Dernier (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Apereo Webinar: Learning What Works When Scaling Analytics Infrastructure (January 24, 2018)

  • 1. Apereo Webinar: Learning What Works When Scaling Analytics Infrastructure LOU HARRISON DIRECTOR OF EDUCATIONAL TECHNOLOGY SERVICES DELTA NORTH CAROLINA STATE UNIVERSITY LOU@NCSU.EDU GARY GILBERT SOFTWARE ARCHITECT UNICON GGILBERT@UNICON.NET
  • 2. ● Brief History: Open Academic Analytics Initiative (OAAI) ● The research ● Flashback to last year ● From Pilot to Enterprise efforts ● Slice and dice, including examples of ways to segment the population ● Results ● Infrastructure overview ● Next steps / Q&A If you’d like to follow along: https://goo.gl/g2MTCa INTRODUCTION/OVERVIEW
  • 3. ● Open Academic Analytics Initiative (OAAI) ○ EDUCAUSE Next Gen Learning Challenge (NGLC) ○ Funded by Bill & Melinda Gates Foundation ● Leverage SIS and LMS data to create an open source academic early alert system (and interventions) ● OAAI led to the Learning Analytics Processor (LAP) project, which is part of the Apereo Learning Analytics Initiative ● Exciting results; however, all LMS data was based on Sakai Models ● NC State partnered with Unicon and Marist College to bring LAP to NC State, applying it to their Moodle LMS BRIEF HISTORY
  • 4. Predictive Model worked well and was quite portable to other schools (with some tuning). For more info, see JAYAPRAKASH, S. M., MOODY, E. W., LAURÍA, E. J., REGAN, J. R., & BARON, J. D. (2014). EARLY ALERT OF ACADEMICALLY AT-RISK STUDENTS: AN OPEN SOURCE ANALYTICS INITIATIVE. JOURNAL OF LEARNING ANALYTICS, 1(1), 6-47. THE RESEARCH
  • 5. ● Our Phase 1 Proof of Concept showed a 75% accuracy in predicting at-risk students.* Recall rates were 88-90%, but with high false positives (25%) ● Phase 2 (FY 15-16) ○ Make the LAP more automated, bigger, and badder ○ More Enterprise, more nimble ○ Similar results with much larger datasets *in a small dataset, of incomplete historical data FLASHBACK TO FY 16-17
  • 6. Phase 3 work ● Cohorts (different models for different type classes) ○ Maybe, if incremental improvement outweighs cost ○ Tested ways to slice & dice into smaller cohorts to improve accuracy ■ By LMS usage (no,light,med,heavy) ■ By Enrollment size (small,med,large) ■ By Student Level (FR, SO, JR, SR, GR) ● We learned splitting by courses is better than by people ● Splitting by LMS usage shows real promise SLICE AND DICE, SEGMENT POPULATION
  • 7. SOME PRELIMINARY RESULTS Precision Single Model 0.180994092 18.1% Low LMS Usage 0.168674699 16.9% Medium LMS Usage 0.184461986 18.4% High LMS Usage 0.20375 20.4% No LMS Usage 0.12540366 12.5% Recall (“Accuracy for At-Risk Students”) Single Model 0.639668826 64.0% Low LMS Usage 0.612326044 61.2% Medium LMS Usage 0.674772036 67.5% High LMS Usage 0.75990676 76.0% No LMS Usage 0.405217391 40.5% Accuracy Single Model 0.808493064 80.8% Low LMS Usage 0.810299003 81.0% Medium LMS Usage 0.758821249 75.9% High LMS Usage 0.772434308 77.2% No LMS Usage 0.863060429 86.3% Testing Error Single Model 0.191506936 19.2% Low LMS Usage 0.189700997 19.0% Medium LMS Usage 0.241178751 24.1% High LMS Usage 0.227565692 22.8% No LMS Usage 0.136939571 13.7% ● Numbers guy added to the team ● Learning how to set up cohorts and run the models ● There is a steep learning curve
  • 8. ● Numbers guy added to the team ● Learning how to set up cohorts and run the models ● There is a steep learning curve SOME PRELIMINARY RESULTS Precision Single Model 0.180994092 18.1% Low LMS Usage 0.168674699 16.9% Medium LMS Usage 0.184461986 18.4% High LMS Usage 0.20375 20.4% No LMS Usage 0.12540366 12.5% Recall (“Accuracy for At-Risk Students”) Single Model 0.639668826 64.0% Low LMS Usage 0.612326044 61.2% Medium LMS Usage 0.674772036 67.5% High LMS Usage 0.75990676 76.0% No LMS Usage 0.405217391 40.5% Accuracy Single Model 0.808493064 80.8% Low LMS Usage 0.810299003 81.0% Medium LMS Usage 0.758821249 75.9% High LMS Usage 0.772434308 77.2% No LMS Usage 0.863060429 86.3% Testing Error Single Model 0.191506936 19.2% Low LMS Usage 0.189700997 19.0% Medium LMS Usage 0.241178751 24.1% High LMS Usage 0.227565692 22.8% No LMS Usage 0.136939571 13.7%
  • 9. ● Phase 3 - Learning Record Warehouse (LRW) ○ Currently only using Moodle logs (+ demo data) ○ Plans to incorporate data from other tools ■ BB Collaborate, Mediasite, etc. ○ All data input streams feed into LRW ○ Pull from LRW into predictive modeler ■ It's important to note that if we think we may have a need to use certain data, it’s beneficial to have 3-5 years of historical data to train from. So, if we think we might use it, we should save it in the LRW. ● Implement OpenDashboard ○ To expose activity heatmap and possibly predictions ENTERPRISE EFFORTS
  • 11. Open Analytics Infrastructure An Open Analytics Infrastructure should support: ● Collection and Storage of a variety of data ● Usage of data for analytics, reporting and visualization ● Interoperability through Open Standards ● Use of Open Software, Models and Processes where appropriate
  • 14.
  • 15. OpenLRW ● Supports xAPI, IMS Caliper, and IMS OneRoster ● Java / Spring Boot ○ Heavy use of streams, MapReduce features of Java 8 ○ Follows Spring-Boot conventions and best practices ○ LRW is packaged as an executable JAR file ■ Tomcat embedded ● MongoDB
  • 16. OpenLRW: Performance & Scalability ● Stateless ● Horizontally Scalable
  • 17. OpenLRW: Storage ● MongoDB ○ Sharding ○ Replica Sets
  • 18. OpenLRW: Security ● API Security ○ JWT ● Authorization ○ Tenancy ○ Organization ● Data at Rest ○ Follow MongoDB best practices
  • 19. IMS Caliper / xAPI in OpenLRW ● Caliper Messages are stored ~ as is ● xAPI Messages are converted to Caliper prior to storage ○ Current transformation is based on work done by the Korean Ministry of Ed ○ More transformation options coming ■ IMS / ADL (this will be the default when available)
  • 20. Other Entities in OpenLRW ● Tenants ● Organizations ● Events ○ xAPI & Caliper ● Supporting Data (OneRoster) ○ Users ○ Classes ○ Enrollments ○ Line Items ○ UserMapping & ClassMapping
  • 21. OpenDashboard ● Originally developed to provide a widget-based framework for visualizations ● Evolved into a faculty / staff facing tool for monitoring student activity ● Java 8 / Spring-Boot ○ Heavy use of streams, MapReduce features of Java 8 ○ Follows Spring-Boot conventions and best practices ○ Dashboard is packaged as an executable JAR file ■ Tomcat embedded
  • 23. High Level View ● Ultimately the Dashboard may split into two separate deployable components: client and server
  • 24. OpenDashboard: Session Storage ● Sessions stored in MongoDB ● Allows for horizontal scalability ● Essentially stateless client side
  • 25. OpenDashboard For Students ● Dashboard is currently only intended for faculty/staff ● To allow student access: ○ APIs would need to be apply finer grain authorization controls ○ UI would need to be adapted for a single user view
  • 26. Data Loader ● How do we get supporting (and maybe event) data into the LRW? ● Java application ● Run as cron job (or similar) daily or even more often
  • 27.
  • 28. ● Phase 4 Needs: FY17-18 ● Plan for integrating dashboard ● Start incorporating data from other tools into LRW ● Possibly add other tool data to predictions ● Start running the modeler regularly (if we work out a way to share data) WHAT’S HAPPENING NOW
  • 29. ● Disenfranchised by big, outrageously expensive, commercial black box analytics systems? ● Can’t afford big, outrageously expensive, commercial black box analytics systems? ● Overwhelmed by all this analytics talk and complicated math? ● Want to get your feet wet without betting the farm? ● Want to join a group of like-minded schools where every new development benefits us all? ● This is not free, but your $$$ goes farther, and you benefit from others’ work ● If you’re interested, contact us Lou Harrison Gary Gilbert lou@ncsu.edu ggilbert@unicon.net WHERE DO YOU FIT IN?
  • 30. About Unicon TECHNOLOGY CONSULTING, SERVICES, & SUPPORT FOR THE EDUCATION INDUSTRY ● Services, strategy, and support focused on the education industry ● Deep domain-specific expertise ● Open source software foundations ● Learn more at www.unicon.net UNICON CONTRIBUTES TO THE APEREO LEARNING ANALYTICS INITIATIVE ● Unicon has been involved since 2015 ● Developed standards-based integrations for open analytics technologies ● Provides services for open analytics technologies (OpenLRW, OpenDashboard, SSP) ● Learn more at www.apereo.org/communities/learning-analytics-initiative