Learning Analytics –
Opportunities for ISO/IEC JTC 1/SC36
standardisation
Tore Hoel
Oslo and Akershus University College of Applied Sciences
Norway
ISO/IEC JTC 1/SC36 WG8 meeting, 29 November 2015
Hangzhou, China
2
The Learning Analytics
Landscape
What standards are needed?
Characteristics of Educational Big Data
• Grain size of recordable and analysable data has become smaller
– every pen stroke, every keystroke is recorded
• Sources of evidence are (more) varied
– tests, essay scoring, learning games, social interactions, affects,
body sensors, intelligent tutors, simulations, semantic mapping,
LMS data…
– Unstructured (e.g., log files, clicks, timestamps)
– When structured, different schemas are used
• How do we bring these data together to form an overall view of an
individual learner or a cohort of learners?
3
(Cope, B., & Kalantzis, M., 2015)
What data practices are emerging?
• Multi-scalar Data Collection
– Embedded, simultaneous collection of data that can be used
for different purposes at different scales
– Semantically legible datapoint (learner-actionable feedback):
«teachable moment»
• Self-describing, structured data ➔ meanings immediately evident
to learners, teachers, others
• Sample size n= all
• Data and interventions are not separate: Recursive micro
intervention ➔ result➔ redesign cycles
• More widely distributed data collection roles
4
(Cope, B., & Kalantzis, M., 2015)
Need for new Education Data Standards
supporting Learning Analytics
• Harmonization of Activity Stream Specifications (ADL xAPI, IMS
Caliper, W3C Activity Streams)
• Building Vocabularies – Profiles – Recipes – Communities of
Practice
• Storage designs – centralised data warehouses or distributed
Learning Record/Event Stores
• Extract, Transform and Load (ETL) tools for data storage
• Privacy and Data Protection – how to do Privacy-by-design in this
field?
• Sharing of Algorithms and Predictive Models
5
Harmonization of Activity Streams
6
Activity Streams
• Work started around 2009 by a group from IBM, Google,
Microsoft, MySpace, Facebook, VMware and others
• First version published in 2011
• In 2014 the W3C Social Web Working Group took over the specification
• Working draft of version 2.0 published in October 2015
7
In its simplest form, an activity consists of an actor, a verb, an
object, and a target. It tells the story of a person performing an action
on or with an object -- "Geraldine posted a photo to her album" or
"John shared a video". In most cases these components will be explicit,
but they may also be implied.
(Activity Streams Working Group, 2011)
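The actor / verb / object / target pattern quoted above can be sketched as a small data structure. This is a minimal illustration only: the helper names `make_activity` and `tell_story` are hypothetical, the verb strings are chosen to read like the quoted sentences, and the dictionaries are not validated against any version of the Activity Streams specification.

```python
def make_activity(actor, verb, obj, target=None):
    """Build a minimal activity as a plain dict.

    The target is optional, mirroring the remark that some
    components may be implied rather than explicit.
    """
    activity = {"actor": actor, "verb": verb, "object": obj}
    if target is not None:
        activity["target"] = target
    return activity


def tell_story(activity):
    """Render an activity as the short sentence it stands for."""
    parts = [
        activity["actor"]["displayName"],
        activity["verb"],
        activity["object"]["displayName"],
    ]
    if "target" in activity:
        parts += ["to", activity["target"]["displayName"]]
    return " ".join(parts)


# "Geraldine posted a photo to her album": all four components explicit.
posted = make_activity(
    {"objectType": "person", "displayName": "Geraldine"},
    "posted",
    {"objectType": "photo", "displayName": "a photo"},
    {"objectType": "collection", "displayName": "her album"},
)

# "John shared a video": no target given.
shared = make_activity(
    {"objectType": "person", "displayName": "John"},
    "shared",
    {"objectType": "video", "displayName": "a video"},
)

print(tell_story(posted))
print(tell_story(shared))
```

Keeping the target optional is a design choice of this sketch; it lets the same structure carry both example sentences from the quote.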
Experience API (xAPI)
• 1st version 2013 (component of ADL Training and Learning
Architecture)
• A Statement consists of an <actor (learner)>, a <verb>, an
<object>, with a <result>, in a <context>. There is no constraint on
what these objects should be.
• Learning Record Store (LRS): a system that stores learning information
• xAPI depends on the presence of an LRS to function
• Offered for standardisation in IEEE in August 2014 – “it wasn’t the
slam dunk [they were] naively hoping it would be” (Silvers, 2014)
• At the end of 2015, a new Data Interoperability Standards Consortium
(a not-for-profit organization in the State of Pennsylvania, USA) is to
become the steward of the Experience API
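The statement structure and the role of the LRS described above can be sketched as follows. This is a hedged illustration, not the xAPI interface itself: `make_statement` and `InMemoryLRS` are hypothetical names, the store keeps statements in a Python list instead of exposing the HTTP API a real LRS would offer, and the example.org identifiers are placeholders.

```python
import uuid
from datetime import datetime, timezone


def make_statement(actor, verb, obj, result=None, context=None):
    """Assemble a statement: an actor, a verb, an object,
    optionally with a result and a context."""
    statement = {
        "id": str(uuid.uuid4()),
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "actor": actor,
        "verb": verb,
        "object": obj,
    }
    if result is not None:
        statement["result"] = result
    if context is not None:
        statement["context"] = context
    return statement


class InMemoryLRS:
    """Hypothetical stand-in for a Learning Record Store."""

    def __init__(self):
        self._statements = []

    def store(self, statement):
        # Only the three core parts are checked in this sketch.
        for key in ("actor", "verb", "object"):
            if key not in statement:
                raise ValueError(f"statement is missing '{key}'")
        self._statements.append(statement)
        return statement["id"]

    def query(self, verb_id=None):
        """Return stored statements, optionally filtered by verb id."""
        return [
            s for s in self._statements
            if verb_id is None or s["verb"]["id"] == verb_id
        ]


COMPLETED = "http://adlnet.gov/expapi/verbs/completed"

lrs = InMemoryLRS()
statement = make_statement(
    actor={"objectType": "Agent", "name": "Learner One",
           "mbox": "mailto:learner.one@example.org"},
    verb={"id": COMPLETED, "display": {"en-US": "completed"}},
    obj={"objectType": "Activity",
         "id": "http://example.org/activities/quiz-1"},
    result={"success": True, "score": {"scaled": 0.9}},
    context={"platform": "example-lms"},
)
stored_id = lrs.store(statement)
print(len(lrs.query(verb_id=COMPLETED)))
```

Because nothing constrains what the verbs and objects may be, any analysis across statements depends on communities agreeing on shared identifiers.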
8
IMS Caliper Analytics
• White paper 2013
• Public release v 1.0 October 2015
• Information model buried in Sensor APIs
• Metric Profiles
• Base Metric Profile, Session, Annotation, Assignable,
Assessment, Outcome, Reading, Media
• IMS Learning Sensor API: defines basic learning events gathered
as learning metrics across learning environments
• Leveraging of IMS LTI/LIS/QTI
9
IMS Caliper
10
11
Source: Yong-Sang Cho
Vocabularies
12
Talking about learning activities
• More loosely coupled systems and diverse Communities of Practice lead to
more diverse schemas and data models
• Interoperability could be promoted by more efficient sharing of
vocabularies
• Encourage smaller vocabularies / ontologies
13
IMS Caliper
xAPI Communities
How to promote more interoperable
vocabularies for education?
• "Document standards" for vocabularies have severe limitations!
• Communities of Practice (ref xAPI) are part of the solution…
• … but serious stewardship issues
• What could ISO offer in terms of dynamic vocabulary
management?
14
Storage
15
Apereo Diamond model
16
Search architecture middle layer
17
(Hoel & Chen, 2015)
MIT Open Personal Data Store / SafeAnswers
• openPDS allows users to collect, store, and give fine-grained access to
their data in the cloud.
• openPDS also protects users’ privacy by only sharing anonymous
answers, not raw data.
• openPDS can also engage in privacy-preserving group computations to
aggregate data across users without the need to share sensitive data
with an intermediate entity.
18
http://openpds.media.mit.edu/#architecture
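The idea of sharing answers instead of raw data can be sketched in a few lines. This is an assumption-laden illustration and not the openPDS API: the class `PersonalDataStore`, its methods, and the `studied_enough` question are all hypothetical, and the privacy-preserving group computation mentioned above is not covered.

```python
from statistics import mean


class PersonalDataStore:
    """Hypothetical store: raw data stays inside, only answers leave."""

    def __init__(self, owner, raw_minutes):
        self.owner = owner
        self._raw = list(raw_minutes)   # never returned to callers
        self._approved = {}             # question name -> function

    def approve(self, name, question):
        """The owner grants access to one specific question."""
        self._approved[name] = question

    def answer(self, name):
        """Run an approved question against the raw data and
        return only its result."""
        if name not in self._approved:
            raise PermissionError(f"question '{name}' is not approved")
        return self._approved[name](self._raw)


def studied_enough(raw_minutes):
    # Returns a yes/no answer instead of the underlying minutes.
    return mean(raw_minutes) >= 30


store = PersonalDataStore("learner-1", [20, 45, 40])
store.approve("studied_enough", studied_enough)
answer = store.answer("studied_enough")
print(answer)
```

The caller learns whether the learner studied at least 30 minutes on average, but never sees the individual session lengths.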
Extract - Transform - Load tools
• When data come from different sources in different structures, one
needs tools to extract, transform and load the data into data stores
• There are Open Source tools (e.g., Pentaho Kettle and Talend), but
most are commercial software
• Are ETL tools a possible hot spot for standards efforts?
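The three ETL steps can be sketched with the Python standard library alone. This is a minimal sketch under stated assumptions: the two sources, their field names, and the target `events` table are invented for illustration and do not reflect how Pentaho Kettle, Talend, or any particular learning system structures its data.

```python
import csv
import io
import json
import sqlite3

# Two hypothetical sources that describe similar events
# with different structures and field names.
LMS_CSV = "user,action,when\nalice,login,2015-11-29T09:00:00\n"
TUTOR_JSON = (
    '[{"learner": "bob", "event": "hint_requested",'
    ' "ts": "2015-11-29T09:05:00"}]'
)


def extract():
    """Read each source in its native format."""
    lms_rows = list(csv.DictReader(io.StringIO(LMS_CSV)))
    tutor_rows = json.loads(TUTOR_JSON)
    return lms_rows, tutor_rows


def transform(lms_rows, tutor_rows):
    """Map both schemas onto one (learner, verb, timestamp, source) tuple."""
    events = [(r["user"], r["action"], r["when"], "lms") for r in lms_rows]
    events += [
        (r["learner"], r["event"], r["ts"], "tutor") for r in tutor_rows
    ]
    return sorted(events, key=lambda e: e[2])


def load(events, conn):
    """Write the harmonized events into a single table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events "
        "(learner TEXT, verb TEXT, timestamp TEXT, source TEXT)"
    )
    conn.executemany("INSERT INTO events VALUES (?, ?, ?, ?)", events)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(*extract()), conn)
rows = conn.execute(
    "SELECT learner, verb, source FROM events ORDER BY timestamp"
).fetchall()
print(rows)
```

The field mapping in `transform` is the part a shared vocabulary or standard schema would make unnecessary to rewrite for every pair of systems.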
19
SC36 20748-1 Data Storing & Processing
20
Challenges for standardisation
• Privacy and Data Ownership issues – how to turn these «soft»
requirements into «hard» ones?
• The role of Personal Data Stores in Learning Analytics
• Harmonization of data schemes prior to analysis
• Import / export facilities with ontology building (and automatic
reasoning technologies) as part of the storage solutions
• Publishing and Sharing of data for research and comparison and
testing of predictive models, student models, etc.
21
Privacy
22
Implications for designs when Surveillance
turns into Sousveillance?
23
Image credit: http://commons.wikimedia.org/wiki/File:SurSousVeillanceByStephanieMannAge6.png
When Privacy is affecting all LA processes
• Privacy-By-Design is the overall design principle. What does it
mean for the LA processes?
• Data Sharing
• Search
• Storing
• Analysing
• Visualising
24
Sharing Algorithms &
Predictive Models
25
How to support sharing?
• Exemplar predictive models are needed to advance learning
analytics
• Besides a Culture for sharing data, algorithms and predictive
models, what else is needed?
• Parallel data streams from production systems to support
development and research
• How to deal with anonymization?
• How to get data for R&D from cloud-based systems?
• How do we talk about these algorithms and models (create a
vocabulary for tagging)
• Where to host the resources (stewardship, openness policies,
open repositories)
26
References
• Cho, Y.-S. (2015). Quick review xAPI and IMS Caliper - Principle of
both data capturing technologies. Online at
http://www.slideshare.net/zzosang/quick-review-xapi-and-ims-caliper-principle-of-both-data-capturing-technologies
• Cope, B., & Kalantzis, M. (2015). Sources of Evidence-of-Learning:
Learning and assessment in the era of big data. Open Review of
Educational Research, 2(1)
• Hoel, T. & Chen, W. (2015). Privacy in Learning Analytics –
Implications for System Architecture. In Watanabe, T. and Seta, K.
(Eds.) Proceedings of the 11th International Conference on
Knowledge Management. Online at
http://hoel.nu/publications/Hoel_Chen_ICKM15_final_preprint.pdf
27

Contenu connexe

En vedette

Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...
Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...
Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...Baltasar Fernández-Manjón
 
Cryptocurrency & Blockchain Regulation
Cryptocurrency & Blockchain RegulationCryptocurrency & Blockchain Regulation
Cryptocurrency & Blockchain RegulationEmily Hunt
 
Docker:- Application Delivery Platform Towards Edge Computing
Docker:- Application Delivery Platform Towards Edge ComputingDocker:- Application Delivery Platform Towards Edge Computing
Docker:- Application Delivery Platform Towards Edge ComputingBukhary Ikhwan Ismail
 
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...Digiday
 
Live migration in Mobile Edge Computing (MEC)
Live migration in Mobile Edge Computing (MEC)Live migration in Mobile Edge Computing (MEC)
Live migration in Mobile Edge Computing (MEC)Andy Jones
 
Hub and spokes network ppt slides presentation diagrams templates
Hub and spokes network ppt slides presentation diagrams templatesHub and spokes network ppt slides presentation diagrams templates
Hub and spokes network ppt slides presentation diagrams templatesSlideTeam.net
 
Personal data protection in the EU
Personal data protection in the EUPersonal data protection in the EU
Personal data protection in the EUArete-Zoe, LLC
 
Consideration of fixed mobile convergence in 5G
Consideration of fixed mobile convergence in 5GConsideration of fixed mobile convergence in 5G
Consideration of fixed mobile convergence in 5GITU
 
Hub and Spoke
Hub and SpokeHub and Spoke
Hub and SpokeWei Min
 
E3: Edge and Cloud Connectivity (Predix Transform 2016)
E3: Edge and Cloud Connectivity (Predix Transform 2016)E3: Edge and Cloud Connectivity (Predix Transform 2016)
E3: Edge and Cloud Connectivity (Predix Transform 2016)Predix
 
Why IoT needs Fog Computing ?
Why IoT needs Fog Computing ?Why IoT needs Fog Computing ?
Why IoT needs Fog Computing ?Ahmed Banafa
 
E1: Building the Digital Twin (Predix Transform 2016)
E1: Building the Digital Twin (Predix Transform 2016)E1: Building the Digital Twin (Predix Transform 2016)
E1: Building the Digital Twin (Predix Transform 2016)Predix
 
Security and Virtualization in the Data Center
Security and Virtualization in the Data CenterSecurity and Virtualization in the Data Center
Security and Virtualization in the Data CenterCisco Canada
 
Predix Builder Roadshow
Predix Builder RoadshowPredix Builder Roadshow
Predix Builder RoadshowPredix
 
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store -
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store - Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store -
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store - Hendrik Drachsler
 

En vedette (19)

Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...
Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...
Learning analytics - Analíticas de aprendizaje: tecnología, profesores, entor...
 
Cryptocurrency & Blockchain Regulation
Cryptocurrency & Blockchain RegulationCryptocurrency & Blockchain Regulation
Cryptocurrency & Blockchain Regulation
 
Personal Data Store Project
Personal Data Store ProjectPersonal Data Store Project
Personal Data Store Project
 
Docker:- Application Delivery Platform Towards Edge Computing
Docker:- Application Delivery Platform Towards Edge ComputingDocker:- Application Delivery Platform Towards Edge Computing
Docker:- Application Delivery Platform Towards Edge Computing
 
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...
Trust :: Data: Beyond blockchain hype, Tradestreaming Money Conference, Novem...
 
Live migration in Mobile Edge Computing (MEC)
Live migration in Mobile Edge Computing (MEC)Live migration in Mobile Edge Computing (MEC)
Live migration in Mobile Edge Computing (MEC)
 
Hub and spokes network ppt slides presentation diagrams templates
Hub and spokes network ppt slides presentation diagrams templatesHub and spokes network ppt slides presentation diagrams templates
Hub and spokes network ppt slides presentation diagrams templates
 
Personal data protection in the EU
Personal data protection in the EUPersonal data protection in the EU
Personal data protection in the EU
 
Consideration of fixed mobile convergence in 5G
Consideration of fixed mobile convergence in 5GConsideration of fixed mobile convergence in 5G
Consideration of fixed mobile convergence in 5G
 
Hub and Spoke
Hub and SpokeHub and Spoke
Hub and Spoke
 
Cloud, Fog & Edge Computing
Cloud, Fog & Edge ComputingCloud, Fog & Edge Computing
Cloud, Fog & Edge Computing
 
E3: Edge and Cloud Connectivity (Predix Transform 2016)
E3: Edge and Cloud Connectivity (Predix Transform 2016)E3: Edge and Cloud Connectivity (Predix Transform 2016)
E3: Edge and Cloud Connectivity (Predix Transform 2016)
 
Why IoT needs Fog Computing ?
Why IoT needs Fog Computing ?Why IoT needs Fog Computing ?
Why IoT needs Fog Computing ?
 
E1: Building the Digital Twin (Predix Transform 2016)
E1: Building the Digital Twin (Predix Transform 2016)E1: Building the Digital Twin (Predix Transform 2016)
E1: Building the Digital Twin (Predix Transform 2016)
 
Security and Virtualization in the Data Center
Security and Virtualization in the Data CenterSecurity and Virtualization in the Data Center
Security and Virtualization in the Data Center
 
Predix Builder Roadshow
Predix Builder RoadshowPredix Builder Roadshow
Predix Builder Roadshow
 
Fog computing
Fog computingFog computing
Fog computing
 
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store -
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store - Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store -
Learning Analytics Metadata Standards, xAPI recipes & Learning Record Store -
 
FOG COMPUTING
FOG COMPUTINGFOG COMPUTING
FOG COMPUTING
 

Similaire à Learning Analytics – Opportunities for ISO/IEC JTC 1/SC36 standardisation

Towards Open Architectures and Interoperability for Learning Analytics
Towards Open Architectures and Interoperability for Learning Analytics Towards Open Architectures and Interoperability for Learning Analytics
Towards Open Architectures and Interoperability for Learning Analytics Tore Hoel
 
Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...LACE Project
 
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...Sarah Currier
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning StrategyJessie Chuang
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...Sebastian Dennerlein
 
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...IstvanKoren
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning EnvironmentsTore Hoel
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference modelsJulie Allinson
 
EdMedia 2017 Outstanding Paper Award
EdMedia 2017 Outstanding Paper AwardEdMedia 2017 Outstanding Paper Award
EdMedia 2017 Outstanding Paper AwardAlan Amory
 
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Tore Hoel
 
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...eMadrid network
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
LOR Characteristics and Considerations
LOR Characteristics and ConsiderationsLOR Characteristics and Considerations
LOR Characteristics and ConsiderationsScott Leslie
 
Mootnz13 Moodle Analytics
Mootnz13 Moodle AnalyticsMootnz13 Moodle Analytics
Mootnz13 Moodle AnalyticsNetSpot Pty Ltd
 
On data-driven systems analyzing, supporting and enhancing users’ interaction...
On data-driven systems analyzing, supporting and enhancing users’ interaction...On data-driven systems analyzing, supporting and enhancing users’ interaction...
On data-driven systems analyzing, supporting and enhancing users’ interaction...Grial - University of Salamanca
 
Open learning analytics overview (lasi) v1
Open learning analytics overview (lasi) v1Open learning analytics overview (lasi) v1
Open learning analytics overview (lasi) v1Joshua
 
Dr. Gábor Kismihók: Labour Market driven Learning Analytics
Dr. Gábor Kismihók: Labour Market driven Learning AnalyticsDr. Gábor Kismihók: Labour Market driven Learning Analytics
Dr. Gábor Kismihók: Labour Market driven Learning AnalyticsTextkernel
 
xAPI (Experience API):Potential for Open Educational Resources
xAPI (Experience API):Potential for Open Educational Resources xAPI (Experience API):Potential for Open Educational Resources
xAPI (Experience API):Potential for Open Educational Resources Ramesh C. Sharma
 
L yuan alt c 3
L yuan alt c 3L yuan alt c 3
L yuan alt c 3cetisli
 

Similaire à Learning Analytics – Opportunities for ISO/IEC JTC 1/SC36 standardisation (20)

Towards Open Architectures and Interoperability for Learning Analytics
Towards Open Architectures and Interoperability for Learning Analytics Towards Open Architectures and Interoperability for Learning Analytics
Towards Open Architectures and Interoperability for Learning Analytics
 
Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...
 
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...
SCONUL Conference 2009: Workshop on Repositories for Teaching & Learning Mate...
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning Strategy
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
 
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...
Immersive Community Analytics for Wearable Enhanced Learning (HCI Internation...
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning Environments
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference models
 
EdMedia 2017 Outstanding Paper Award
EdMedia 2017 Outstanding Paper AwardEdMedia 2017 Outstanding Paper Award
EdMedia 2017 Outstanding Paper Award
 
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
 
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
LOR Characteristics and Considerations
LOR Characteristics and ConsiderationsLOR Characteristics and Considerations
LOR Characteristics and Considerations
 
Learning Analytics for MOOCs: EMMA case
Learning Analytics for MOOCs: EMMA caseLearning Analytics for MOOCs: EMMA case
Learning Analytics for MOOCs: EMMA case
 
Mootnz13 Moodle Analytics
Mootnz13 Moodle AnalyticsMootnz13 Moodle Analytics
Mootnz13 Moodle Analytics
 
On data-driven systems analyzing, supporting and enhancing users’ interaction...
On data-driven systems analyzing, supporting and enhancing users’ interaction...On data-driven systems analyzing, supporting and enhancing users’ interaction...
On data-driven systems analyzing, supporting and enhancing users’ interaction...
 
Open learning analytics overview (lasi) v1
Open learning analytics overview (lasi) v1Open learning analytics overview (lasi) v1
Open learning analytics overview (lasi) v1
 
Dr. Gábor Kismihók: Labour Market driven Learning Analytics
Dr. Gábor Kismihók: Labour Market driven Learning AnalyticsDr. Gábor Kismihók: Labour Market driven Learning Analytics
Dr. Gábor Kismihók: Labour Market driven Learning Analytics
 
xAPI (Experience API):Potential for Open Educational Resources
xAPI (Experience API):Potential for Open Educational Resources xAPI (Experience API):Potential for Open Educational Resources
xAPI (Experience API):Potential for Open Educational Resources
 
L yuan alt c 3
L yuan alt c 3L yuan alt c 3
L yuan alt c 3
 

Plus de Tore Hoel

Læringsanalyse - hva er det?
Læringsanalyse - hva er det?Læringsanalyse - hva er det?
Læringsanalyse - hva er det?Tore Hoel
 
Smart Learning Environments - a framework for standardisation?
Smart Learning Environments - a framework for standardisation?Smart Learning Environments - a framework for standardisation?
Smart Learning Environments - a framework for standardisation?Tore Hoel
 
Learning analytics in a standardisation context
Learning analytics in a standardisation contextLearning analytics in a standardisation context
Learning analytics in a standardisation contextTore Hoel
 
Deling av data fra UH-bibliotek
Deling av data fra UH-bibliotekDeling av data fra UH-bibliotek
Deling av data fra UH-bibliotekTore Hoel
 
Data protection and privacy framework in the design of learning analytics sys...
Data protection and privacy framework in the design of learning analytics sys...Data protection and privacy framework in the design of learning analytics sys...
Data protection and privacy framework in the design of learning analytics sys...Tore Hoel
 
Data Protection by Design and Default for Learning Analytics
Data Protection by Design and Default for Learning AnalyticsData Protection by Design and Default for Learning Analytics
Data Protection by Design and Default for Learning AnalyticsTore Hoel
 
Scaling up learning analytics solutions: Is privacy a show-stopper?
Scaling up learning analytics solutions:  Is privacy a show-stopper?Scaling up learning analytics solutions:  Is privacy a show-stopper?
Scaling up learning analytics solutions: Is privacy a show-stopper?Tore Hoel
 
Implications of the European Data Protection Regulations for Learning Analyti...
Implications of the European Data Protection Regulations for Learning Analyti...Implications of the European Data Protection Regulations for Learning Analyti...
Implications of the European Data Protection Regulations for Learning Analyti...Tore Hoel
 
Privacy and Data Protection - principles for design of a new part of an ISO s...
Privacy and Data Protection - principles for design of a new part of an ISO s...Privacy and Data Protection - principles for design of a new part of an ISO s...
Privacy and Data Protection - principles for design of a new part of an ISO s...Tore Hoel
 
Ethics & Privacy for Learning Analytics
Ethics & Privacy for Learning AnalyticsEthics & Privacy for Learning Analytics
Ethics & Privacy for Learning AnalyticsTore Hoel
 
Learning Analytics - Vision of the Future
Learning Analytics - Vision of the FutureLearning Analytics - Vision of the Future
Learning Analytics - Vision of the FutureTore Hoel
 
Privacy in Learning Analytics – Implications for System Architecture
Privacy in Learning Analytics – Implications for System ArchitecturePrivacy in Learning Analytics – Implications for System Architecture
Privacy in Learning Analytics – Implications for System ArchitectureTore Hoel
 
NordicOER wraps up 2 years of activiteis
NordicOER wraps up 2 years of activiteisNordicOER wraps up 2 years of activiteis
NordicOER wraps up 2 years of activiteisTore Hoel
 
Workshop on Learning Analytics @ EDEN15 in Barcelona - June 2015
Workshop on Learning Analytics @ EDEN15 in Barcelona - June 2015Workshop on Learning Analytics @ EDEN15 in Barcelona - June 2015
Workshop on Learning Analytics @ EDEN15 in Barcelona - June 2015Tore Hoel
 
Data security issues, ethical issues and challenges to privacy in knowledge-i...
Data security issues, ethical issues and challenges to privacy in knowledge-i...Data security issues, ethical issues and challenges to privacy in knowledge-i...
Data security issues, ethical issues and challenges to privacy in knowledge-i...Tore Hoel
 
Privacy-driven design of Learning Analytics applications – exploring the desi...
Privacy-driven design of Learning Analytics applications – exploring the desi...Privacy-driven design of Learning Analytics applications – exploring the desi...
Privacy-driven design of Learning Analytics applications – exploring the desi...Tore Hoel
 
Requirements for Learning Analytics
Requirements for Learning AnalyticsRequirements for Learning Analytics
Requirements for Learning AnalyticsTore Hoel
 
Introduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsIntroduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsTore Hoel
 
Learning Analytics – Ethical questions and dilemmas
Learning Analytics  – Ethical questions and dilemmasLearning Analytics  – Ethical questions and dilemmas
Learning Analytics – Ethical questions and dilemmasTore Hoel
 
Strategies for Dealing with Privacy in the context of Learning Analytics
Strategies for Dealing with Privacy in the context of Learning AnalyticsStrategies for Dealing with Privacy in the context of Learning Analytics
Strategies for Dealing with Privacy in the context of Learning AnalyticsTore Hoel
 

Plus de Tore Hoel (20)

Læringsanalyse - hva er det?
Læringsanalyse - hva er det?Læringsanalyse - hva er det?
Læringsanalyse - hva er det?
 
Smart Learning Environments - a framework for standardisation?
Smart Learning Environments - a framework for standardisation?Smart Learning Environments - a framework for standardisation?
Smart Learning Environments - a framework for standardisation?
 
Learning analytics in a standardisation context
Learning analytics in a standardisation contextLearning analytics in a standardisation context
Learning analytics in a standardisation context
 
Deling av data fra UH-bibliotek
Deling av data fra UH-bibliotekDeling av data fra UH-bibliotek
Deling av data fra UH-bibliotek
 
Data protection and privacy framework in the design of learning analytics sys...
Data protection and privacy framework in the design of learning analytics sys...Data protection and privacy framework in the design of learning analytics sys...
Data protection and privacy framework in the design of learning analytics sys...
 
Data Protection by Design and Default for Learning Analytics
Data Protection by Design and Default for Learning AnalyticsData Protection by Design and Default for Learning Analytics
Data Protection by Design and Default for Learning Analytics
 
Scaling up learning analytics solutions: Is privacy a show-stopper?
Scaling up learning analytics solutions:  Is privacy a show-stopper?Scaling up learning analytics solutions:  Is privacy a show-stopper?
Scaling up learning analytics solutions: Is privacy a show-stopper?
 
Implications of the European Data Protection Regulations for Learning Analyti...
Implications of the European Data Protection Regulations for Learning Analyti...Implications of the European Data Protection Regulations for Learning Analyti...



Learning Analytics – Opportunities for ISO/IEC JTC 1/SC36 standardisation

  • 1. Learning Analytics – Opportunities for ISO/IEC JTC 1/SC36 standardisation Tore Hoel Oslo and Akershus University College of Applied Sciences Norway ISO/IEC JTC 1/SC36 WG8 meeting, 29 November 2015 Hangzhou, China
  • 3. Characteristics of Educational Big Data • Grain size of recordable and analysable data has become smaller – every pen stroke, every keystroke is recorded • Sources of evidence are (more) varied – tests, essay scoring, learning games, social interactions, affects, body sensors, intelligent tutors, simulations, semantic mapping, LMS data… – Unstructured (e.g., log files, clicks, timestamps) – When structured, different schemas are used • How do we bring these data together to form an overall view of an individual learner or a cohort of learners? (Cope, B., & Kalantzis, M., 2015)
  • 4. What data practices are emerging? • Multi-scalar Data Collection – Embedded, simultaneous collection of data that can be used for different purposes at different scales – Semantically legible datapoint (learner-actionable feedback): «teachable moment» • Self-describing, structured data ➔ meanings immediately evident to learners, teachers, others • Sample size n = all • Data and interventions are not separate: Recursive micro intervention ➔ result ➔ redesign cycles • More widely distributed data collection roles (Cope, B., & Kalantzis, M., 2015)
  • 5. Need for new Education Data Standards supporting Learning Analytics • Harmonization of Activity Stream Specifications (ADL xAPI, IMS Caliper, W3C Activity Streams) • Building Vocabularies – Profiles – Recipes – Communities of Practice • Storage designs – centralised data warehouses or distributed Learning Record/Event Stores • Extract, Transform and Load (ETL) tools for data storage • Privacy and Data Protection – how to do Privacy-by-design in this field? • Sharing of Algorithms and Predictive Models
  • 7. Activity Streams • Work started around 2009 by a group from IBM, Google, Microsoft, MySpace, Facebook, VMware and others • First version published in 2011 • In 2014 the W3C Social Web Working Group took over the specification • Working draft version 2.0 published October 2015 • In its simplest form, an activity consists of an actor, a verb, an object, and a target. It tells the story of a person performing an action on or with an object -- "Geraldine posted a photo to her album" or "John shared a video". In most cases these components will be explicit, but they may also be implied. (Activity Streams Working Group, 2011)
  • 8. Experience API (xAPI) • 1st version 2013 (component of ADL Training and Learning Architecture) • A Statement consists of an <actor (learner)>, a <verb>, an <object>, with a <result>, in a <context>. There is no constraint on what these objects should be. • Learning Record Store (LRS): a system that stores learning information • xAPI depends on the presence of an LRS to function • Offered for standardisation in IEEE August 2014 – “it wasn’t the slam dunk [they were] naively hoping it would be” (Silvers, 2014) • End of 2015 a new Data Interoperability Standards Consortium (not-for-profit organization in the State of Pennsylvania, USA) to be the steward of Experience API
  • 9. IMS Caliper Analytics • White paper 2013 • Public release v 1.0 October 2015 • Information model buried in Sensor APIs • Metric Profiles • Base Metric Profile, Session, Annotation, Assignable, Assessment, Outcome, Reading, Media • IMS Learning Sensor API: defines basic learning events gathered as learning metrics across learning environments • Leveraging of IMS LTI/LIS/QTI
  • 13. Talking about learning activities • Looser coupled systems, diverse Communities of Practice lead to more diverse schemas and data models • Interoperability could be promoted by more efficient sharing of vocabularies • Encourage smaller vocabularies / ontologies [Figure labels: IMS Caliper, xAPI, Communities]
  • 14. How to promote more interoperable vocabularies for education? • "Document standards" for vocabularies have severe limitations! • Communities of Practice (ref xAPI) are part of the solution… • … but serious stewardship issues • What could ISO offer in terms of dynamic vocabulary management?
  • 17. Search architecture middle layer (Hoel & Chen, 2015)
  • 18. MIT Open Personal Data Store / Safe Answers • openPDS allows users to collect, store, and give fine-grained access to their data in the cloud. • openPDS also protects users’ privacy by only sharing anonymous answers, not raw data. • openPDS can also engage in privacy-preserving group computations to aggregate data across users without the need to share sensitive data with an intermediate entity. http://openpds.media.mit.edu/#architecture
  • 19. Extract - Transform - Load tools • When data come from different sources in different structures, one needs tools to extract, transform and load data into data stores • There are open-source tools (e.g., Pentaho Kettle and Talend), but most are commercial software • Are ETL tools a possible hot spot for standards efforts?
  • 20. SC36 20748-1 Data Storing & Processing
  • 21. Challenges for standardisation • Privacy and Data Ownership issues – how to turn these «soft» requirements into «hard» ones? • The role of Personal Data Stores in Learning Analytics • Harmonization of data schemes prior to analysis • Import / export facilities with ontology building (and automatic reasoning technologies) as part of the storage solutions • Publishing and Sharing of data for research and comparison and testing of predictive models, student models, etc.
  • 23. Implications for designs when Surveillance turns into Sousveillance? Image credit: http://commons.wikimedia.org/wiki/File:SurSousVeillanceByStephanieMannAge6.png
  • 24. When Privacy is affecting all LA processes • Privacy-By-Design is the overall design principle. What does it mean for the LA processes? • Data Sharing • Search • Storing • Analysing • Visualising
  • 26. How to support sharing? • Exemplar predictive models are needed to advance learning analytics • Besides a culture of sharing data, algorithms and predictive models, what else is needed? • Parallel data streams from production systems to support development and research • How to deal with anonymization? • How to get data for R&D from cloud-based systems? • How do we talk about these algorithms and models (create a vocabulary for tagging)? • Where to host the resources (stewardship, openness policies, open repositories)?
  • 27. References • Cho, Yong-Sang (2015). Quick review xAPI and IMS Caliper - Principle of both data capturing technologies. Online at http://www.slideshare.net/zzosang/quick-review-xapi-and-ims-caliper-principle-of-both-data-capturing-technologies • Cope, B., & Kalantzis, M. (2015). Sources of Evidence-of-Learning: Learning and assessment in the era of big data. Open Review of Educational Research, 2(1) • Hoel, T. & Chen, W. (2015). Privacy in Learning Analytics – Implications for System Architecture. In Watanabe, T. and Seta, K. (Eds.) Proceedings of the 11th International Conference on Knowledge Management. Online at http://hoel.nu/publications/Hoel_Chen_ICKM15_final_preprint.pdf
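
The actor / verb / object pattern described for Activity Streams and xAPI in the transcript above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the reference implementation of either specification: `make_statement` is a hypothetical helper, and the learner name, e-mail address and activity URL are invented. The field names follow the general shape of an xAPI Statement (actor, verb, object); the optional result and context parts are left out.

```python
import json


def make_statement(actor_name, actor_mbox, verb_id, verb_label, object_id):
    """Hypothetical helper: build a dict shaped like an xAPI Statement."""
    return {
        # Who performed the action (the learner)
        "actor": {"name": actor_name, "mbox": actor_mbox},
        # What was done: an identifier plus a human-readable label
        "verb": {"id": verb_id, "display": {"en-US": verb_label}},
        # What the action was performed on
        "object": {"id": object_id},
    }


# All values below are made-up examples.
statement = make_statement(
    actor_name="Geraldine",
    actor_mbox="mailto:geraldine@example.org",
    verb_id="http://adlnet.gov/expapi/verbs/completed",
    verb_label="completed",
    object_id="http://example.org/activities/quiz-1",
)

# Serialise to JSON, the form in which such a statement would be sent to a store.
print(json.dumps(statement, indent=2))
```

Because every statement has the same three top-level parts, events from different tools can be stored and queried together; what still differs between communities is the vocabulary used for verbs and objects.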

Editor's notes

  1. ETL tools combine three important functions (extract, transform, load) required to get data from one big data environment and put it into another data environment. Traditionally, ETL has been used with batch processing in data warehouse environments. Data warehouses provide business users with a way to consolidate information to analyze and report on data relevant to their business focus. ETL tools are used to transform data into the format required by data warehouses. The transformation is actually done in an intermediate location before the data is loaded into the data warehouse. Many software vendors, including IBM, Informatica, Pervasive, Talend, and Pentaho, provide ETL software tools.

     ETL provides the underlying infrastructure for integration by performing three important functions:
     • Extract: Read data from the source database.
     • Transform: Convert the format of the extracted data so that it conforms to the requirements of the target database. Transformation is done by using rules or merging data with other data.
     • Load: Write data to the target database.

     However, ETL is evolving to support integration across much more than traditional data warehouses. ETL can support integration across transactional systems, operational data stores, BI platforms, MDM hubs, the cloud, and Hadoop platforms. ETL software vendors are extending their solutions to provide big data extraction, transformation, and loading between Hadoop and traditional data management platforms. ETL and software tools for other data integration processes like data cleansing, profiling, and auditing all work on different aspects of the data to ensure that the data will be deemed trustworthy. ETL tools integrate with data quality tools, and many incorporate tools for data cleansing, data mapping, and identifying data lineage. With ETL, you only extract the data you will need for the integration. ETL tools are needed for the loading and conversion of structured and unstructured data into Hadoop. Advanced ETL tools can read and write multiple files in parallel from and to Hadoop to simplify how data is merged into a common transformation process. Some solutions incorporate libraries of prebuilt ETL transformations for both the transaction and interaction data that run on Hadoop or a traditional grid infrastructure.

     Data transformation is the process of changing the format of data so that it can be used by different applications. This may mean a change from the format the data is stored in into the format needed by the application that will use the data. This process also includes mapping instructions so that applications are told how to get the data they need to process. The process of data transformation is made far more complex because of the staggering growth in the amount of unstructured data. A business application such as a customer relationship management (CRM) system has specific requirements for how data should be stored. The data is likely to be structured in the organized rows and columns of a relational database. Data is semi-structured or unstructured if it does not follow rigid format requirements. The information contained in an e-mail message is considered unstructured, for example. Some of a company's most important information is in unstructured and semi-structured forms such as documents, e-mail messages, complex messaging formats, customer support interactions, transactions, and information coming from packaged applications like ERP and CRM.

     Data transformation tools are not designed to work well with unstructured data. As a result, companies needing to incorporate unstructured information into their business process decision making have been faced with a significant amount of manual coding to accomplish the required data integration. Given the growth and importance of unstructured data to decision making, ETL solutions from major vendors are beginning to offer standardized approaches to transforming unstructured data so that it can be more easily integrated with operational structured data.
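
The three ETL functions described in this note can be sketched with nothing but the Python standard library. This is a minimal sketch under assumptions, not a description of any vendor's tool: the two sources (`LMS_CSV`, `EVENT_JSON`), their field names and the `events` table are all hypothetical, chosen to show two differently structured inputs being brought into one common schema.

```python
import csv
import io
import json
import sqlite3

# Two hypothetical sources holding the same kind of event in different structures.
LMS_CSV = "student,action,item,time\nanna,viewed,quiz-1,2015-11-29T09:00:00\n"
EVENT_JSON = (
    '[{"actor": "ben", "verb": "completed", '
    '"object": "quiz-1", "timestamp": "2015-11-29T09:05:00"}]'
)


def extract():
    """Extract: read raw records from each source."""
    csv_rows = list(csv.DictReader(io.StringIO(LMS_CSV)))
    json_rows = json.loads(EVENT_JSON)
    return csv_rows, json_rows


def transform(csv_rows, json_rows):
    """Transform: map both source schemas onto (actor, verb, object, ts) tuples."""
    records = [(r["student"], r["action"], r["item"], r["time"]) for r in csv_rows]
    records += [(e["actor"], e["verb"], e["object"], e["timestamp"]) for e in json_rows]
    return records


def load(records, conn):
    """Load: write the harmonised records to the target database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events (actor TEXT, verb TEXT, object TEXT, ts TEXT)"
    )
    conn.executemany("INSERT INTO events VALUES (?, ?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # in-memory target store for the example
load(transform(*extract()), conn)
rows = conn.execute("SELECT actor, verb FROM events ORDER BY ts").fetchall()
print(rows)
```

The transform step is where schema harmonisation happens: each source needs its own mapping onto the shared columns, which is the part that a common vocabulary or data standard would make simpler.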