SlideShare une entreprise Scribd logo
1  sur  12
User Centric Integration of Activity Data Mathieu d’Aquin Knowledge Media Institute The Open University
Consumer/user centric data
Challenges in user centric activity data Activity data that sit in logs are  Heterogeneous – different models for different sites/systems Raw – uninterpreted Horribly big – thousands of pieces of information generated every minute  Hard to exploit, understand, analyze
User Centric Activity Data Activity analysis for and by individual users Consolidation Integration Interpretation Ontologies Logs 2 Logs 4 Logs 1 Logs 3 Website 2 Website 4 Website 1 Website 3 Organisation Users
Technical infrastructure Semantic Triple Store Scheduler/Manager Daily RDF traces Daily RDF traces Parser/RDF renderer Parser/RDF renderer Daily RDF traces Daily RDF traces Daily RDF traces Log Log Parser/RDF renderer Parser/RDF renderer Parser/RDF renderer Application Log Log Log Application Server1 Server2 Server3
Ontologies Formal conceptual models of a domain: online user activity  Semantic Web technologies Standard languages for expressing ontologies and ontological data (RDF, OWL) Tools to manipulate and work with ontologies and semantic data (NeOn Toolkit, OWLIM) Many ontologies to reuse Adhere to a logical formalism inferences
User support PREFIX tr:<http://uciad.info/ontology/trace/> PREFIX actor:<http://uciad.info/ontology/actor/> construct {   ?trace ?p ?x.   ?x ?p2 ?x2.   ?x2 ?p3 ?x3.   ?x3 ?p4 ?x4 } where{   <http://uciad.info/actor/mathieu> actor:knownSetting ?set.   ?trace tr:hasSetting ?set.   ?trace ?p ?x.   ?x ?p2 ?x2.   ?x2 ?p3 ?x3.   ?x3 ?p4 ?x4 } Please Login User Logging or register Detect setting (agent+IP) User name: Password: mathieu ****** unknown setting It is the first time you log into UCIAD with this setting (detail) do you want to attach it to your account? Check setting non-ambiguous non-ambiguous Your current setting is: Computer IP:137.108.2x.1xx User Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) AppleWebKit/525.13 (KHTML, like Gecko) Chrome/0.A.B.C Safari/525.13 This setting is not currently attached to a user, so it will be added to your known settings as you log into the system ambiguous known setting for user Add setting to known setting Register setting as ambiguous Display Activity Data related to all known settings of the user yes no
User support Export my data <rdf:RDF> <rdf:Descriptionrdf:about="http://uciad.info/trace/kmi-web13/ede2ab38da27695eec1e0b375f9b20da">   <rdf:typerdf:resource="http://uciad.info/ontology/trace/Trace"/>    <hasActionrdf:resource="http://uciad.info/action/GET"/>    <hasPageInvolvedrdf:resource="http://uciad.info/page/0b9abc62fcf90afc53797b938af435dd"/>   <hasResponserdf:resource="http://uciad.info/response/ea95add1414aba134ff9e0482b921a33"/>   <hasSettingrdf:resource="http://uciad.info/actorsetting/119696ec92c5acec29397dc7ef98817f"/>   <hasTimerdf:datatype="http://www.w3.org/2001/XMLSchema#string">13/Jun/2011:01:37:23+0100</hasTime> </rdf:Description> </rdf:RDF> <rdf:Descriptionrdf:about="http://uciad.info/page/0b9abc62fcf90afc53797b938af435dd">   <rdf:typerdf:resource="http://uciad.info/ontology/sitemap/WebPage"/>   <isPartOfrdf:resource="http://uciad.info/ontology/test1/dataopenacuk"/>   <onServerrdf:resource="http://kmi-web13.open.ac.uk"/>   <urlrdf:datatype="http://www.w3.org/2001/XMLSchema#string"> /resource/person/ext-718a372e10788bb58d562a8bf6fb864e   </url> </rdf:Description> <rdf:Descriptionrdf:about="http://uciad.info/ontology/test1/dataopenacuk">   <rdf:typerdf:resource="http://uciad.info/ontology/sitemap/Website"/>   <rdf:typerdf:resource="http://uciad.info/ontology/test1/LinkedDataPlatform"/>   <onServerrdf:resource="http://kmi-web13.open.ac.uk"/>   <urlPatternrdf:datatype="http://www.w3.org/2001/XMLSchema#string">/*</urlPattern> </rdf:Description>   <rdf:Descriptionrdf:about="http://uciad.info/response/ea95add1414aba134ff9e0482b921a33">   <rdf:typerdf:resource="http://uciad.info/ontology/trace/HTTPResponse"/>   <hasResponseCoderdf:resource="http://uciad.info/ontology/trace/200"/>   <hasSizeInBytesrdf:datatype="http://www.w3.org/2001/XMLSchema#int">1085</hasSizeInBytes> </rdf:Description> for graph http://uciad.info/users/mathieu User Logging or register Detect setting (agent+IP) unknown setting It is the first time you log into UCIAD with this setting (detail) do you want to attach it to your account? Check setting non-ambiguous non-ambiguous ambiguous known setting for user Add setting to known setting Register setting as ambiguous Display Activity Data related to all known settings of the user yes no
Example In the ontology: UCIAD-Blog and LUCERO-Blog are Blogs (Website) A BlogPage is a page which is part of a Blog An activity onBlog is an activity happening on a Blog Page Result: Can look specifically at activities happening on a Blog and specialize them (same applies to Wikis, and other types of websites)
Issues left to resolve Scalability OWLIM triple store can handle billions of triples But struggle with millions when inference is “on”  1 repository without inference with all historical data, 1 with inference with 1 week of data only, and 1 with inference for registered users User management and privacy Ensuring that the user who logs in from a particular setting is the one having the activity is difficult (e.g., in the case of shared computers) Is this really a problem? Check ambiguity – ask verification questions – moderate? Licensing Overall data: privacy issues (is k-anonymity actually applicable? Would it work?) Overall data: institutional issues (can we show the traffic on our websites to everybody) User data export: what license?
More info UCIAD Blog: http://uciad.info Code base: http://github.com/uciad Twitter: #uciad @mdaquin
Team Dr Mathieu d’Aquin– Research fellow, KMi – project director Stuart Brown – Web developments and online communities, communication services – member of the steering group, liaison with online services SalmanElahi– Resarch assistant and PhD student, KMi – developer/researcher  Prof Enrico Motta – Professor of knowledge technologies, KMi – Chair of the steering group

Contenu connexe

En vedette

Consuming Linked Data in Universities
Consuming Linked Data in UniversitiesConsuming Linked Data in Universities
Consuming Linked Data in Universities
Mathieu d'Aquin
 
Using the Semantic Web, and Contributing to it
Using the Semantic Web, and Contributing to itUsing the Semantic Web, and Contributing to it
Using the Semantic Web, and Contributing to it
Mathieu d'Aquin
 

En vedette (18)

LUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked DataLUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked Data
 
Consuming Linked Data in Universities
Consuming Linked Data in UniversitiesConsuming Linked Data in Universities
Consuming Linked Data in Universities
 
De l'intelligence dans les données ouvertes - Problèmes et solutions dans les...
De l'intelligence dans les données ouvertes - Problèmes et solutions dans les...De l'intelligence dans les données ouvertes - Problèmes et solutions dans les...
De l'intelligence dans les données ouvertes - Problèmes et solutions dans les...
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked Data
 
eBay Mobile Summit
eBay Mobile SummiteBay Mobile Summit
eBay Mobile Summit
 
Using the Semantic Web, and Contributing to it
Using the Semantic Web, and Contributing to itUsing the Semantic Web, and Contributing to it
Using the Semantic Web, and Contributing to it
 
Working with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityWorking with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open University
 
State and future of linked data in learning analytics
State and future of linked data in learning analyticsState and future of linked data in learning analytics
State and future of linked data in learning analytics
 
Open Data, Linked Data, Semantic Web
Open Data, Linked Data, Semantic WebOpen Data, Linked Data, Semantic Web
Open Data, Linked Data, Semantic Web
 
Linked Data in Learning Analytics Tools
Linked Data in Learning Analytics ToolsLinked Data in Learning Analytics Tools
Linked Data in Learning Analytics Tools
 
Open Web Data for Education - Linked Data technologies for connecting open ed...
Open Web Data for Education - Linked Data technologies for connecting open ed...Open Web Data for Education - Linked Data technologies for connecting open ed...
Open Web Data for Education - Linked Data technologies for connecting open ed...
 
Introduction to Linked Data
Introduction to Linked DataIntroduction to Linked Data
Introduction to Linked Data
 
Semantic Web / Linked Data Technologies
Semantic Web / Linked Data TechnologiesSemantic Web / Linked Data Technologies
Semantic Web / Linked Data Technologies
 
Moving from Device Centric to a User Centric Management
Moving from Device Centric to a User Centric Management Moving from Device Centric to a User Centric Management
Moving from Device Centric to a User Centric Management
 
Semantic Web Technologies for Social Translucence and Privacy Mirrors on the Web
Semantic Web Technologies for Social Translucence and Privacy Mirrors on the WebSemantic Web Technologies for Social Translucence and Privacy Mirrors on the Web
Semantic Web Technologies for Social Translucence and Privacy Mirrors on the Web
 
Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?
 
Combining Data Mining and Ontology Engineering to enrich Ontologies and Linke...
Combining Data Mining and Ontology Engineering to enrich Ontologies and Linke...Combining Data Mining and Ontology Engineering to enrich Ontologies and Linke...
Combining Data Mining and Ontology Engineering to enrich Ontologies and Linke...
 
Designing a user-centric strategy using digital
Designing a user-centric strategy using digitalDesigning a user-centric strategy using digital
Designing a user-centric strategy using digital
 

Similaire à UCIAD - quick overview

Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007
Neil Matatall
 
Information On Line Transaction Processing
Information On Line Transaction ProcessingInformation On Line Transaction Processing
Information On Line Transaction Processing
Stefanie Yang
 

Similaire à UCIAD - quick overview (20)

UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Monitoring as an entry point for collaboration
Monitoring as an entry point for collaborationMonitoring as an entry point for collaboration
Monitoring as an entry point for collaboration
 
Prometheus for the traditional datacenter
Prometheus for the traditional datacenterPrometheus for the traditional datacenter
Prometheus for the traditional datacenter
 
01 - Introduction to Distributed Systems
01 - Introduction to Distributed Systems01 - Introduction to Distributed Systems
01 - Introduction to Distributed Systems
 
IEEE ISM 2008: Kalman Graffi: A Distributed Platform for Multimedia Communities
IEEE ISM 2008: Kalman Graffi: A Distributed Platform for Multimedia CommunitiesIEEE ISM 2008: Kalman Graffi: A Distributed Platform for Multimedia Communities
IEEE ISM 2008: Kalman Graffi: A Distributed Platform for Multimedia Communities
 
ReflectInsight - Let your application speak volume
ReflectInsight - Let your application speak volumeReflectInsight - Let your application speak volume
ReflectInsight - Let your application speak volume
 
SAP Technology Services Conference 2013: Big Data and The Cloud at Yahoo!
SAP Technology Services Conference 2013: Big Data and The Cloud at Yahoo! SAP Technology Services Conference 2013: Big Data and The Cloud at Yahoo!
SAP Technology Services Conference 2013: Big Data and The Cloud at Yahoo!
 
Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007
 
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
 
Personium - Open Source PDS envisioning the Web of MyData
Personium - Open Source PDS envisioning the Web of MyDataPersonium - Open Source PDS envisioning the Web of MyData
Personium - Open Source PDS envisioning the Web of MyData
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
The Ball Launch on 2013 Microsoft TechDays Part 1/2
The Ball Launch on 2013 Microsoft TechDays Part 1/2The Ball Launch on 2013 Microsoft TechDays Part 1/2
The Ball Launch on 2013 Microsoft TechDays Part 1/2
 
Information On Line Transaction Processing
Information On Line Transaction ProcessingInformation On Line Transaction Processing
Information On Line Transaction Processing
 
Biocatalogue Talk Slides
Biocatalogue Talk SlidesBiocatalogue Talk Slides
Biocatalogue Talk Slides
 
Open source: Making connections by Sunny Pai
Open source: Making connections by Sunny PaiOpen source: Making connections by Sunny Pai
Open source: Making connections by Sunny Pai
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
Single sign on with TYPO3
Single sign on with TYPO3Single sign on with TYPO3
Single sign on with TYPO3
 
Log Management For e-Discovery, Database Monitoring and Other Unusual Uses
Log Management For e-Discovery, Database Monitoring and Other Unusual UsesLog Management For e-Discovery, Database Monitoring and Other Unusual Uses
Log Management For e-Discovery, Database Monitoring and Other Unusual Uses
 
data-mesh-101.pptx
data-mesh-101.pptxdata-mesh-101.pptx
data-mesh-101.pptx
 
Laboratory Information Management System
Laboratory Information Management SystemLaboratory Information Management System
Laboratory Information Management System
 

Plus de Mathieu d'Aquin

Plus de Mathieu d'Aquin (20)

A factorial study of neural network learning from differences for regression
A factorial study of neural network learning from  differences for regressionA factorial study of neural network learning from  differences for regression
A factorial study of neural network learning from differences for regression
 
Recentrer l'intelligence artificielle sur les connaissances
Recentrer l'intelligence artificielle sur les connaissancesRecentrer l'intelligence artificielle sur les connaissances
Recentrer l'intelligence artificielle sur les connaissances
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as Commodities
 
Unsupervised learning approach for identifying sub-genres in music scores
Unsupervised learning approach for identifying sub-genres in music scoresUnsupervised learning approach for identifying sub-genres in music scores
Unsupervised learning approach for identifying sub-genres in music scores
 
Is knowledge engineering still relevant?
Is knowledge engineering still relevant?Is knowledge engineering still relevant?
Is knowledge engineering still relevant?
 
A data view of the data science process
A data view of the data science processA data view of the data science process
A data view of the data science process
 
Dealing with Open Domain Data
Dealing with Open Domain DataDealing with Open Domain Data
Dealing with Open Domain Data
 
Web Analytics for Everyday Learning
Web Analytics for  Everyday LearningWeb Analytics for  Everyday Learning
Web Analytics for Everyday Learning
 
Presentation a in ovive montpellier - 26%2 f06%2f2018 (1)
Presentation a in ovive   montpellier - 26%2 f06%2f2018 (1)Presentation a in ovive   montpellier - 26%2 f06%2f2018 (1)
Presentation a in ovive montpellier - 26%2 f06%2f2018 (1)
 
Learning Analytics: understand learning and support the learner
Learning Analytics: understand learning and support the learnerLearning Analytics: understand learning and support the learner
Learning Analytics: understand learning and support the learner
 
The AFEL Project
The AFEL ProjectThe AFEL Project
The AFEL Project
 
Assessing the Readability of Policy Documents: The Case of Terms of Use of On...
Assessing the Readability of Policy Documents: The Case of Terms of Use of On...Assessing the Readability of Policy Documents: The Case of Terms of Use of On...
Assessing the Readability of Policy Documents: The Case of Terms of Use of On...
 
Data ethics
Data ethicsData ethics
Data ethics
 
Data for Learning and Learning with Data
Data for Learning and Learning with DataData for Learning and Learning with Data
Data for Learning and Learning with Data
 
Towards an “Ethics in Design” methodology for AI research projects
Towards an “Ethics in Design” methodology  for AI research projects Towards an “Ethics in Design” methodology  for AI research projects
Towards an “Ethics in Design” methodology for AI research projects
 
AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...
AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...
AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...
 
Profiling information sources and services for discovery
Profiling information sources and services for discoveryProfiling information sources and services for discovery
Profiling information sources and services for discovery
 
Analyse de données et de réseaux sociaux pour l’aide à l’apprentissage infor...
Analyse de données et de réseaux sociaux pour  l’aide à l’apprentissage infor...Analyse de données et de réseaux sociaux pour  l’aide à l’apprentissage infor...
Analyse de données et de réseaux sociaux pour l’aide à l’apprentissage infor...
 
From Knowledge Bases to Knowledge Infrastructures for Intelligent Systems
From Knowledge Bases to Knowledge Infrastructures for Intelligent SystemsFrom Knowledge Bases to Knowledge Infrastructures for Intelligent Systems
From Knowledge Bases to Knowledge Infrastructures for Intelligent Systems
 
Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

UCIAD - quick overview

  • 1. User Centric Integration of Activity Data Mathieu d’Aquin Knowledge Media Institute The Open University
  • 3. Challenges in user centric activity data Activity data that sit in logs are Heterogeneous – different models for different sites/systems Raw – uninterpreted Horribly big – thousands of pieces of information generated every minute Hard to exploit, understand, analyze
  • 4. User Centric Activity Data Activity analysis for and by individual users Consolidation Integration Interpretation Ontologies Logs 2 Logs 4 Logs 1 Logs 3 Website 2 Website 4 Website 1 Website 3 Organisation Users
  • 5. Technical infrastructure Semantic Triple Store Scheduler/Manager Daily RDF traces Daily RDF traces Parser/RDF renderer Parser/RDF renderer Daily RDF traces Daily RDF traces Daily RDF traces Log Log Parser/RDF renderer Parser/RDF renderer Parser/RDF renderer Application Log Log Log Application Server1 Server2 Server3
  • 6. Ontologies Formal conceptual models of a domain: online user activity Semantic Web technologies Standard languages for expressing ontologies and ontological data (RDF, OWL) Tools to manipulate and work with ontologies and semantic data (NeOn Toolkit, OWLIM) Many ontologies to reuse Adhere to a logical formalism inferences
  • 7. User support PREFIX tr:<http://uciad.info/ontology/trace/> PREFIX actor:<http://uciad.info/ontology/actor/> construct { ?trace ?p ?x. ?x ?p2 ?x2. ?x2 ?p3 ?x3. ?x3 ?p4 ?x4 } where{ <http://uciad.info/actor/mathieu> actor:knownSetting ?set. ?trace tr:hasSetting ?set. ?trace ?p ?x. ?x ?p2 ?x2. ?x2 ?p3 ?x3. ?x3 ?p4 ?x4 } Please Login User Logging or register Detect setting (agent+IP) User name: Password: mathieu ****** unknown setting It is the first time you log into UCIAD with this setting (detail) do you want to attach it to your account? Check setting non-ambiguous non-ambiguous Your current setting is: Computer IP:137.108.2x.1xx User Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) AppleWebKit/525.13 (KHTML, like Gecko) Chrome/0.A.B.C Safari/525.13 This setting is not currently attached to a user, so it will be added to your known settings as you log into the system ambiguous known setting for user Add setting to known setting Register setting as ambiguous Display Activity Data related to all known settings of the user yes no
  • 8. User support Export my data <rdf:RDF> <rdf:Descriptionrdf:about="http://uciad.info/trace/kmi-web13/ede2ab38da27695eec1e0b375f9b20da"> <rdf:typerdf:resource="http://uciad.info/ontology/trace/Trace"/> <hasActionrdf:resource="http://uciad.info/action/GET"/> <hasPageInvolvedrdf:resource="http://uciad.info/page/0b9abc62fcf90afc53797b938af435dd"/> <hasResponserdf:resource="http://uciad.info/response/ea95add1414aba134ff9e0482b921a33"/> <hasSettingrdf:resource="http://uciad.info/actorsetting/119696ec92c5acec29397dc7ef98817f"/> <hasTimerdf:datatype="http://www.w3.org/2001/XMLSchema#string">13/Jun/2011:01:37:23+0100</hasTime> </rdf:Description> </rdf:RDF> <rdf:Descriptionrdf:about="http://uciad.info/page/0b9abc62fcf90afc53797b938af435dd"> <rdf:typerdf:resource="http://uciad.info/ontology/sitemap/WebPage"/> <isPartOfrdf:resource="http://uciad.info/ontology/test1/dataopenacuk"/> <onServerrdf:resource="http://kmi-web13.open.ac.uk"/> <urlrdf:datatype="http://www.w3.org/2001/XMLSchema#string"> /resource/person/ext-718a372e10788bb58d562a8bf6fb864e </url> </rdf:Description> <rdf:Descriptionrdf:about="http://uciad.info/ontology/test1/dataopenacuk"> <rdf:typerdf:resource="http://uciad.info/ontology/sitemap/Website"/> <rdf:typerdf:resource="http://uciad.info/ontology/test1/LinkedDataPlatform"/> <onServerrdf:resource="http://kmi-web13.open.ac.uk"/> <urlPatternrdf:datatype="http://www.w3.org/2001/XMLSchema#string">/*</urlPattern> </rdf:Description> <rdf:Descriptionrdf:about="http://uciad.info/response/ea95add1414aba134ff9e0482b921a33"> <rdf:typerdf:resource="http://uciad.info/ontology/trace/HTTPResponse"/> <hasResponseCoderdf:resource="http://uciad.info/ontology/trace/200"/> <hasSizeInBytesrdf:datatype="http://www.w3.org/2001/XMLSchema#int">1085</hasSizeInBytes> </rdf:Description> for graph http://uciad.info/users/mathieu User Logging or register Detect setting (agent+IP) unknown setting It is the first time you log into UCIAD with this setting (detail) do you want to attach it to your account? Check setting non-ambiguous non-ambiguous ambiguous known setting for user Add setting to known setting Register setting as ambiguous Display Activity Data related to all known settings of the user yes no
  • 9. Example In the ontology: UCIAD-Blog and LUCERO-Blog are Blogs (Website) A BlogPage is a page which is part of a Blog An activity onBlog is an activity happening on a Blog Page Result: Can look specifically at activities happening on a Blog and specialize them (same applies to Wikis, and other types of websites)
  • 10. Issues left to resolve Scalability OWLIM triple store can handle billions of triples But struggle with millions when inference is “on”  1 repository without inference with all historical data, 1 with inference with 1 week of data only, and 1 with inference for registered users User management and privacy Ensuring that the user who logs in from a particular setting is the one having the activity is difficult (e.g., in the case of shared computers) Is this really a problem? Check ambiguity – ask verification questions – moderate? Licensing Overall data: privacy issues (is k-anonymity actually applicable? Would it work?) Overall data: institutional issues (can we show the traffic on our websites to everybody) User data export: what license?
  • 11. More info UCIAD Blog: http://uciad.info Code base: http://github.com/uciad Twitter: #uciad @mdaquin
  • 12. Team Dr Mathieu d’Aquin– Research fellow, KMi – project director Stuart Brown – Web developments and online communities, communication services – member of the steering group, liaison with online services SalmanElahi– Resarch assistant and PhD student, KMi – developer/researcher Prof Enrico Motta – Professor of knowledge technologies, KMi – Chair of the steering group