Personal Information Systems
and Personal Semantics
Gregory Grefenstette
CLEF 2015
September 8, 2015
Information is moving from the Web to Apps
Each person generates a lot of data
Two communities use it now
Search in one’s own data is the future
Four ways to search
We need personal facets
2015CLEF 2015 Grefenstette - 3
http://www.statista.com/statistics/263795/number-of-available-apps-in-the-apple-app-store/
Apple announced that 100 billion apps had been downloaded from its App
Store (June 2015)
2014
Another trend
Smart Glasses
http://en.wikipedia.org/wiki/File:A_Google_Glass_wearer.jpg
http://en.wikipedia.org/wiki/File:Aimoneyetap.jpg
http://en.wikipedia.org/wiki/File:Golden-i_3.8_Headset_Computer.png
Sony US Patent Application 20130069850
Microsoft US Patent Application 20120293548
https://www.youtube.com/watch?v=b7I7JuQXttw
Okay …
Apps,
Quantified Self,
Smart Glasses
Step back to NOW
Personal
Big Data
Email sent
Email received
Social network posts
IP address location
SMS, chats
Search history
Web pages visited
Media viewed
Credit card purchases
Call data
GPS locations
Vital signs
Activity/inactivity
Lifestyle
Conversations
Reading
People seen
Noises heard
Who uses this data today?
Surely, each person should have the same
access to their own data
Impediments to using our own data
•  Data Silos
•  Ownership
•  Privacy
•  Big Data Problems
•  Variety
•  Volume
•  Merging -- Semantics
Supposing we could get all our data back into
our own hands, how could we search it?
Short course on 4 types of search
Search Engines – Cranfield/SMART Model
ftp://ftp.cs.cornell.edu/pub/smart/cran
queries:
.I 6
.W
ventricular septal defect
occurring in association
with aortic regurgitation
.I 7
.W
radioisotopes in heart scanning.
mainly used in diagnosis of
pericardial effusions. also used
to study tumors, heart enlargement,
aneurysms and pericardial thickening.
technetium, rihsa, radioactive
hippurate, cholegraffin are used.
.I 8
.W
the effects of drugs on the bone
marrow of man and animals,
…
qrels (query number, document number):
5 332
5 333
6 112
6 115
6 116
6 118
6 122
6 238
6 239
6 242
6 260
6 309
6 320
6 321
6 323
7 92
7 121
7 189
7 389
7 390
7 391
7 392
7 393
8 52
8 60
documents:
conditions .
.I 237
cisternal fluid oxygen ...
using a beckman micro-oxyg..
tension simultaneously in the..
and in arterial blood under..
that the cisternal oxygen..
oxygen tension of the surroun.
the available free oxygen...
duration in the cerebral...
.I 238
ventricular septal defect
obstruction .
a case of ventricular...
lesion and infundibular...
coronary cusp of the aortic..
septal defect, was demonstra..
as a polyp-like mass in the...
catheterization and angiocard
ventricular outflow obstr...
.I 239
functional adaptations of the
congenital heart disease ....
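As a rough illustration of how the three files in a Cranfield/SMART-style collection fit together, the sketch below scores one ranked result list against the qrels. The qrels pairs are copied from the sample above; the ranked list for query 6 and the helper names `load_qrels` and `precision_recall` are invented for this example and are not part of the SMART distribution.

```python
from collections import defaultdict

# A few (query number, relevant document number) pairs from the qrels sample.
QRELS_TEXT = """\
6 112
6 115
6 116
6 118
7 92
7 121
"""

def load_qrels(text):
    """Map each query number to the set of documents judged relevant."""
    relevant = defaultdict(set)
    for line in text.splitlines():
        if line.strip():
            query_id, doc_id = line.split()
            relevant[int(query_id)].add(int(doc_id))
    return relevant

def precision_recall(ranked_docs, relevant_docs):
    """Precision and recall of one result list against the judged set."""
    hits = sum(1 for doc in ranked_docs if doc in relevant_docs)
    precision = hits / len(ranked_docs) if ranked_docs else 0.0
    recall = hits / len(relevant_docs) if relevant_docs else 0.0
    return precision, recall

qrels = load_qrels(QRELS_TEXT)
# Hypothetical output of some search engine for query 6.
ranked = [112, 40, 116, 7]
precision, recall = precision_recall(ranked, qrels[6])
print(f"query 6: precision={precision:.2f} recall={recall:.2f}")
```

The point of the model is that the judgments are fixed in advance, so any engine can be compared on the same queries.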
Schedules:
3 Economics, Education, Society
33 Economics and Management
338 Industries, Products
338.1 – 338.4 Specific kinds of industries
338.4 Secondary Industries and Services
338.47 Goods and Services
Built from Schedules:
338.471 – 338.479 Subdivisions for Goods and Services
338.476 Technology
338.4767 Manufacturing
338.47677 Textiles
338.476772 Textiles of Seed hair fibres
338.4767721 Cotton
Built from Table 1:
338.47677210 Facet Indicator for Standard Subdivision
338.476772109 Historical, geographic, persons treatment
Built from Table 2:
338.4767721094 Europe, Western Europe
338.47677210942 England and Wales
338.476772109427 Northwestern England and Isle of Man
338.4767721094276 Lancashire
“The Lancashire cotton industry : a study in economic development”
Assigned DDC Code: 338.4767721094276
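The build-up above can be read as a chain of ever longer prefixes of one code, each prefix being a broader facet. The sketch below, a minimal illustration only, expands the assigned code into its prefixes and looks up the captions listed on the slide. The `CAPTIONS` table holds a subset of those captions, and the function names are invented here.

```python
# Captions copied from the slide; only a subset is listed.
CAPTIONS = {
    "3": "Economics, Education, Society",
    "33": "Economics and Management",
    "338": "Industries, Products",
    "338.4": "Secondary Industries and Services",
    "338.47": "Goods and Services",
    "338.476": "Technology",
    "338.4767": "Manufacturing",
    "338.47677": "Textiles",
    "338.4767721": "Cotton",
    "338.47677210942": "England and Wales",
    "338.4767721094276": "Lancashire",
}

def ddc_prefixes(code):
    """Yield every prefix of a DDC code, from broadest to narrowest."""
    digits = code.replace(".", "")
    for length in range(1, len(digits) + 1):
        prefix = digits[:length]
        # Re-insert the decimal point after the third digit.
        yield prefix if length <= 3 else prefix[:3] + "." + prefix[3:]

def facet_path(code):
    """Return the (prefix, caption) pairs known for a code."""
    return [(p, CAPTIONS[p]) for p in ddc_prefixes(code) if p in CAPTIONS]

path = facet_path("338.4767721094276")
for prefix, caption in path:
    print(prefix, caption)
```

Searching by any prefix then retrieves everything filed beneath it, which is what makes the scheme usable as a faceted search model.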
Search Engines – Dewey Decimal Faceted Model
2 Other Search Models: Maps, Time Intervals
Past Attempts
MyLifeBits
Gemmell, Jim, Gordon Bell, and Roger
Lueder. "MyLifeBits: a personal database
for everything." Communications of the
ACM 49.1 (2006): 88-95.
"But even with convenient
classifications and labels
ready to apply, we are still
asking the user to become
a filing clerk – manually
annotating every
document, email, photo, or
conversation."
LifeLog
"…The user can order the life-log agent
to add retrieval keys (annotation) with
an arbitrary name by simple operations
on his cellular phone while the agent is
capturing a life-log video. This enables
the agent to identify a scene that the
user wants to remember throughout his
life, and thus the user can access easily
to the videos that were captured during
precious experiences"
Aizawa, Kiyoharu, Tetsuro Hori, Shinya
Kawasaki, and Takayuki Ishikawa.
"Capture and efficient retrieval of life log."
In Pervasive 2004 Workshop on Memory
and Sharing Experiences, pp. 15-20. 2004.
Stuff I’ve Seen
…Research in cognitive psychology has
found that people remember
information, particularly older
information, not in terms of exact time,
but in terms of key episodes, such as a
child’s birthday, exotic travel,…
Cutrell, Edward, Susan T. Dumais, and Jaime
Teevan. "Searching to eliminate personal
information management." Communications
of the ACM 49.1 (2006): 58-64
PERSON
…we define the general category for
user’s activity in advance, such as
ordinary activity and extra-ordinary
activity. In ordinary activity is related to
the activity in home or office. Generally,
the activities occurred outside of those
area, they are classified as
extraordinary activities. In addition to
these pre-defined activities, users can
add their own activity through our
learning based structure… For some
duration, we record whole activities of
user. For the repeated activities at
same time, in same place with similar
objects, our activity engine will register
as user defined activities by asking in
which category those can be included.
Kim, Ig-Jae, et al. "PERSON:
personalized experience recoding
and searching on networked
environment." Proceedings of the
3rd ACM workshop on Continuous
archival and retrieval of personal
experiences. ACM, 2006.
Personal Data Prototype
…Landmarks of tags are defined by the
frequency of tags that are assigned to
each item of personal data. A tag that has
been in heavy use during a period of time
is a candidate for a landmark. A tag that
has rarely been used during a long period
of time is also a candidate for a landmark.
Outliers are candidates for landmarks in
time-series data, such as home energy
use, the number of steps walked, and
histories of body weight. Data that
exceed pre-defined or user-defined
thresholds are also candidates.
Other landmarks are public landmarks,
which include shocking public news,
bestsellers, blockbuster films, and annual
rankings of top Web-search words. We
can recall our own experiences on those
days from these landmarks.
Teraoka, Teruhiko. "Organization and
exploration of heterogeneous personal
data collected in daily life." Human-
Centric Computing and Information
Sciences 2.1 (2012): 1-15.
Dublin City University
Qiu, Zhengwei. "A lifelogging system supporting multimodal
access." PhD diss., Dublin City University, 2013.
Wang, Peng, and Alan F. Smeaton. "Aggregating semantic concepts
for event representation in lifelogging." Proceedings of the
International Workshop on Semantic Web Information Management.
ACM, 2011.
Okay,
we’ve seen
-- Apps / QS
-- Personal Big Data
-- Some early attempts
Everyone says
Time is important
Maps are important
String search is important
but…
Facets, what are our personal facets?
How can we automate them?
2015PTraces Grefenstette - 27
swimming
•  (my) people involved in something about swimming
•  things I’ve bought involving swimming
•  (my) photos and Facebook posts related to swimming
•  emails about swimming things
•  places I’ve been involving swimming
•  days involving swimming things
•  phone calls about swimming things…
Rather Self-Centred, no?
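One way to picture the swimming slides is as a single query whose results are grouped by the kind of personal record they come from. The sketch below is only an illustration under that reading: the items, the facet names and the function `facet_search` are all invented, and simple substring matching stands in for whatever retrieval a real system would use.

```python
from collections import defaultdict

# Invented personal items: (facet, description). Not real data.
ITEMS = [
    ("people", "Anna, who coaches the swimming club"),
    ("purchases", "swimming goggles bought online"),
    ("photos", "photo at the pool after swimming"),
    ("emails", "email about swimming lesson schedule"),
    ("emails", "email about the tax return"),
    ("places", "South Hills Pool, weekly swimming"),
    ("calls", "phone call to book a tennis court"),
]

def facet_search(items, term):
    """Group the items that mention `term` by facet."""
    groups = defaultdict(list)
    for facet, description in items:
        if term.lower() in description.lower():
            groups[facet].append(description)
    return dict(groups)

results = facet_search(ITEMS, "swimming")
for facet, matches in results.items():
    print(f"{facet} ({len(matches)})")
```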
Personal Information System
•  Personal archives
•  Induction of semantic dimensions
•  Personal semantic hierarchies
•  Crowdsourced semantic hierarchies (e.g. Wikipedia)
•  Expert semantic hierarchies (e.g. MeSH)
•  Ingest/Annotate/Merge
swimming
Knitting
poker
Painting
...
Expert Ontology >>> Crowdsourcing Folksonomy >>> Personal Models
Knitting>Knitting_methods_for_shaping>Short_ro
Knitting>Knitting_stitches
Knitting>Knitting_stitches>List_of_knitting_stitche
Knitting>Knitting_stitches>Basic_knitted_fabrics
Knitting>Knitting_stitches>Decrease_(knitting)
Knitting>Knitting_stitches>Dip_stitch
Knitting>Knitting_stitches>Drop-stitch_knitting
Knitting>Knitting_stitches>Elongated_stitch
Knitting>Knitting_stitches>Fair_Isle_(technique)
Knitting>Knitting_stitches>Grafting_(knitting)
Knitting>Knitting_stitches>Loop_knitting
Knitting>Knitting_stitches>Pick_up_stitches_(kni
Knitting>Knitting_stitches>Plaited_stitch_(knitting
Knitting>Knitting_stitches>Slip-stitch_knitting
Knitting>Knitting_stitches>Yarn_over
Knitting>Knitting_tools_and_materials
Knitting>Knitting_tools_and_materials>Eisaku_N
Knitting>Knitting_tools_and_materials>Hank_(te
Knitting>Knitting_tools_and_materials>Knitting_m
Knitting>Knitting_tools_and_materials>Knitting_N
Knitting>Knitting_tools_and_materials>Knitting_n
Knitting>Knitting_tools_and_materials>Knitting_n
Knitting>Knitting_tools_and_materials>Lazy_Kat
Knitting>Knitting_tools_and_materials>Liaghra
Knitting>Knitting_tools_and_materials>Nostepinn
Knitting>Knitting_tools_and_materials>Row_cou
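Paths written with `>` separators, as in the listing above, can be folded into a nested structure that is easy to browse one level at a time. The sketch below uses only paths that appear untruncated in the listing; `build_tree` and `children` are names made up for this illustration.

```python
# Complete (untruncated) paths copied from the listing above.
PATHS = [
    "Knitting>Knitting_stitches",
    "Knitting>Knitting_stitches>Dip_stitch",
    "Knitting>Knitting_stitches>Yarn_over",
    "Knitting>Knitting_tools_and_materials",
    "Knitting>Knitting_tools_and_materials>Liaghra",
]

def build_tree(paths, separator=">"):
    """Turn separator-delimited paths into nested dictionaries."""
    root = {}
    for path in paths:
        node = root
        for part in path.split(separator):
            node = node.setdefault(part, {})
    return root

def children(tree, path, separator=">"):
    """List the sorted child labels found below a given path."""
    node = tree
    for part in path.split(separator):
        node = node[part]
    return sorted(node)

tree = build_tree(PATHS)
print(children(tree, "Knitting>Knitting_stitches"))
```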
Well, no….
Tweet
Less than 12 hours until I am in the pool
crying... thankful for mirrored goggles
Swimming>pool
Swimming>goggles
facets
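The tweet never contains the word "swimming", yet it should land under the swimming facets. A minimal sketch of that idea follows, assuming a small hand-made vocabulary that maps terms to facet paths. The vocabulary and the function `annotate` are hypothetical; building such vocabularies is exactly the open problem the talk points at.

```python
import re

# Tiny invented facet vocabulary: term -> facet path.
FACET_TERMS = {
    "pool": "Swimming>pool",
    "goggles": "Swimming>goggles",
    "laps": "Swimming>laps",
}

def annotate(text, facet_terms):
    """Return the facet paths whose term occurs as a word in the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return sorted(path for term, path in facet_terms.items() if term in words)

tweet = ("Less than 12 hours until I am in the pool "
         "crying... thankful for mirrored goggles")
facets = annotate(tweet, FACET_TERMS)
print(facets)
```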
I’d want this …
swimming -- weightlifting, cycling, gymnastics, judo,
table, volleyball, archery, rowing, badminton, track,
water, taekwondo, tennis, field, diving, handball,
boxing, softball, karate, pentathlon, fencing, athletics,
triathlon, wrestling, soccer
http://webdocs.cs.ualberta.ca/~lindek/downloads.htm
Distributional Semantics 1.5 billion words
Wordnet
Existing taxonomies are for societal
exchanges
Do you want to buy this?
What famous person did this when?
What can we make for this?
We are missing a description of what is
related to us, doing something…
specific vocabularies
loose taxonomies
… facets
Something like…
Sports/swimming/backstroke
Sports/swimming/on my back
Sports/swimming/breaststroke
Sports/swimming/fins
Sports/swimming/goggles
Sports/swimming/fast lane
Sports/swimming/slow lane
Sports/swimming/laps
Sports/swimming/lifeguard
Sports/swimming/pool
Sports/swimming/lake
Sports/swimming/ocean
Sports/swimming/Neuilly Nautic Centre
Sports/swimming/South Hills Pool
Sports/swimming/towel
Sports/swimming/25m
Sports/swimming/cap
Sports/swimming/swim suit
Not just swimming!
http://www.notsoboringlife.com/list-of-hobbies/
Conclusion on Personal facets
There is a lot of work to do
•  for predictable needs (hobbies, pastimes, sports), we do not
have the basic facets we need
•  for personal information (family, friends, familiar places), we
have very little
•  And this should be multilingual, too
Conclusion: Searching Personal Big Data
•  Information is moving from the Web into Apps
•  People are generating information in these siloed Apps
•  People generate more digital information every day
•  Wearable computing will create even more
•  At some point, people will want their information back
•  When you have too much information, you need facets
•  The facets for organizing personal information will be
needed and do not yet exist
•  There are billions of cell phone users. They will all
want this. You should start working on it.
Thank you !
www.inria.fr
Gurrin, Cathal and Smeaton, Alan F. and Doherty, Aiden R. (2014) LifeLogging:
personal big data. Foundations and Trends in Information Retrieval, 8 (1). pp. 1-125.
ISSN 1554-0677
Content type             Per day            Volume per day   Volume per year
Video                    16 hours           90 GB            33 TB
Autographer Camera       3000 images        1.3 GB           480 GB
Audio                    16 hours           630 MB           230 GB
Microsoft Sensecam       4500 images        82 MB            30 GB
Accelerometer            58,000 readings    138 KB           50 MB
Locations                10,000 readings    27 KB            10 MB
Bluetooth Interactions   400 (estimated)    5 MB             2 GB
Words heard or read      100,000            700 KB           255 MB
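The yearly column follows from the daily one by simple multiplication. The sketch below checks a few rows, assuming decimal units (1 GB = 1000 MB) and a 365-day year; both are assumptions made here, not something the table states.

```python
# Volume per day in KB (decimal units), copied from the table.
KB, MB, GB = 1, 1_000, 1_000_000
PER_DAY_KB = {
    "Video": 90 * GB,
    "Audio": 630 * MB,
    "Microsoft Sensecam": 82 * MB,
    "Accelerometer": 138 * KB,
    "Locations": 27 * KB,
}

def per_year_kb(per_day_kb, days=365):
    """Scale a daily volume to a yearly one."""
    return per_day_kb * days

def human(kb):
    """Format a KB count with the largest decimal unit that fits."""
    for unit, size in (("TB", 1_000_000_000), ("GB", GB), ("MB", MB)):
        if kb >= size:
            return f"{kb / size:.1f} {unit}"
    return f"{kb} KB"

yearly = {name: per_year_kb(kb) for name, kb in PER_DAY_KB.items()}
for name, kb in yearly.items():
    print(f"{name}: {human(kb)} per year")
```

Under these assumptions the products round to the figures in the table, for example about 33 TB per year for video.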

 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
Role of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptxRole of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptx
 
3.-Acknowledgment-Dedication-Abstract.docx
3.-Acknowledgment-Dedication-Abstract.docx3.-Acknowledgment-Dedication-Abstract.docx
3.-Acknowledgment-Dedication-Abstract.docx
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's SurvivalHarry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and Pitfalls
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
 

Clef 2015 Keynote Grefenstette September 8, 2015, Toulouse

  • 1. Personal Information Systems and Personal Semantics. Gregory Grefenstette, CLEF 2015, September 8, 2015
  • 2. Information is moving from the Web to Apps Each person generates a lot of data Two communities use it now Search in one’s own data is the future Four ways to search We need personal facets
  • 3. 2015CLEF 2015 Grefenstette - 3 http://www.statista.com/statistics/263795/number-of-available-apps-in-the-apple-app-store/ Apple announced that 100 billion apps had been downloaded from its App Store (June 2015)
  • 8. Okay … Apps, Quantified Self, Smart Glasses Step back to NOW
  • 10. Personal Big Data: Email sent; Email received; Social network posts; IP address location; SMS, chats; Search history; Web pages visited; Media viewed; Credit card purchases; Call data; GPS locations; Vital signs; Activity/inactivity; Lifestyle; Conversations; Reading; People seen; Noises heard
  • 11. Who uses this data today? Surely, each person should have the same access to their own data
  • 12. Impediments to using our own data •  Data Silos •  Ownership •  Privacy •  Big Data Problems •  Variety •  Volume •  Merging -- Semantics
  • 13. Supposing we could get all our data back into our own hands, how could we search it? Short course on 4 types of search
  • 14. Search Engines – Cranfield/SMART Model (ftp://ftp.cs.cornell.edu/pub/smart/cran)
      Queries:
        .I 6 .W ventricular septal defect occurring in association with aortic regurgitation
        .I 7 .W radioisotopes in heart scanning. mainly used in diagnosis of pericardial effusions. also used to study tumors, heart enlargement, aneurysms and pericardial thickening. technetium, rihsa, radioactive hippurate, cholegraffin are used.
        .I 8 .W the effects of drugs on the bone marrow of man and animals, …
      Qrels (query, document pairs):
        5 332, 5 333, 6 112, 6 115, 6 116, 6 118, 6 122, 6 238, 6 239, 6 242, 6 260, 6 309, 6 320, 6 321, 6 323, 7 92, 7 121, 7 189, 7 389, 7 390, 7 391, 7 392, 7 393, 8 52, 8 60
      Documents (lines cut off on the slide):
        … conditions .
        .I 237 cisternal fluid oxygen ... using a beckman micro-oxyg.. tension simultaneously in the.. and in arterial blood under.. that the cisternal oxygen.. oxygen tension of the surroun. the available free oxygen... duration in the cerebral...
        .I 238 ventricular septal defect obstruction . a case of ventricular... lesion and infundibular... coronary cusp of the aortic.. septal defect, was demonstra.. as a polyp-like mass in the... catheterization and angiocard ventricular outflow obstr...
        .I 239 functional adaptations of the congenital heart disease ....
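The test-collection layout on slide 14 (queries, relevance judgments, documents) can be read with a few lines of Python. The sketch below is only an illustration: it assumes records start with `.I <number>`, that the text follows a `.W` line, and that qrels are two-column `query document` pairs as they appear on the slide. The actual files on the FTP server may carry extra fields or columns, and the function names `parse_smart` and `parse_qrels` are made up for this example.

```python
from collections import defaultdict


def parse_smart(text):
    """Parse SMART-style records: '.I <id>' opens a record, '.W' opens its text.

    Assumption: any other two-character '.X' line ends the text field.
    """
    records = {}
    current_id = None
    in_text = False
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith(".I "):
            current_id = int(line.split()[1])
            records[current_id] = []
            in_text = False
        elif line == ".W":
            in_text = True
        elif len(line) == 2 and line.startswith("."):
            in_text = False  # some other field marker (hypothetical)
        elif in_text and current_id is not None and line:
            records[current_id].append(line)
    return {rid: " ".join(lines) for rid, lines in records.items()}


def parse_qrels(text):
    """Parse 'query_id doc_id' pairs (layout assumed from the slide)."""
    qrels = defaultdict(set)
    for line in text.splitlines():
        parts = line.split()
        if len(parts) >= 2:
            qrels[int(parts[0])].add(int(parts[1]))
    return dict(qrels)


# Sample data copied from the slide (only a fragment of it).
SAMPLE_QUERIES = """.I 6
.W
ventricular septal defect
occurring in association
with aortic regurgitation
.I 7
.W
radioisotopes in heart scanning.
"""
SAMPLE_QRELS = """6 112
6 238
7 92
"""

queries = parse_smart(SAMPLE_QUERIES)
qrels = parse_qrels(SAMPLE_QRELS)
print(queries[6])
print(sorted(qrels[6]))
```

With the judgments in a dictionary, a ranked list returned for query 6 can be scored by checking which of its document numbers fall in `qrels[6]`.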
  • 15. Search Engines – Cranfield/SMART Model
  • 16. Search Engines – Dewey Decimal Faceted Model
      [Schedules]
        3 Economics, Education, Society
        33 Economics and Management
        338 Industries, Products
        338.1 – 338.4 Specific kinds of industries
        338.4 Secondary Industries and Services
        338.47 Goods and Services
      [Built from]
        338.471 – 338.479 Subdivisions for Goods and Services
      [Schedules]
        338.476 Technology
        338.4767 Manufacturing
        338.47677 Textiles
        338.476772 Textiles of Seed hair fibres
        338.4767721 Cotton
      [Built from]
        338.47677210 Facet Indicator for Standard Subdivision
      [Table 1]
        338.476772109 Historical, geographic, persons treatment
      [Built from]
        338.4767721094 Europe Western Europe
      [Table 2]
        338.47677210942 England and Wales
        338.476772109427 Northwestern England and Isle of Man
        338.4767721094276 Lancashire
      “The Lancashire cotton industry : a study in economic development”
      Assigned DDC Code: 338.4767721094276
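The Dewey example on slide 16 works because every longer code extends a shorter one, so each prefix of the assigned code is itself a facet. The following sketch, which is only an illustration and not an official DDC tool, copies the labels from the slide into a dictionary and walks the prefixes of the assigned code. The names `DDC_LABELS`, `facet_chain` and `in_facet` are hypothetical.

```python
# Labels copied from the slide; range entries such as "338.1 - 338.4" are left out.
DDC_LABELS = {
    "3": "Economics, Education, Society",
    "33": "Economics and Management",
    "338": "Industries, Products",
    "338.4": "Secondary Industries and Services",
    "338.47": "Goods and Services",
    "338.476": "Technology",
    "338.4767": "Manufacturing",
    "338.47677": "Textiles",
    "338.476772": "Textiles of Seed hair fibres",
    "338.4767721": "Cotton",
    "338.47677210": "Facet Indicator for Standard Subdivision",
    "338.476772109": "Historical, geographic, persons treatment",
    "338.4767721094": "Europe Western Europe",
    "338.47677210942": "England and Wales",
    "338.476772109427": "Northwestern England and Isle of Man",
    "338.4767721094276": "Lancashire",
}


def facet_chain(code, labels):
    """Return (prefix, label) for every labelled prefix of the code, shortest first."""
    chain = []
    for end in range(1, len(code) + 1):
        prefix = code[:end]
        if prefix in labels:
            chain.append((prefix, labels[prefix]))
    return chain


def in_facet(code, facet_prefix):
    """A book belongs to a facet when its code starts with the facet's prefix."""
    return code.startswith(facet_prefix)


book_code = "338.4767721094276"  # "The Lancashire cotton industry"
for prefix, label in facet_chain(book_code, DDC_LABELS):
    print(f"{prefix:<20} {label}")
```

Faceted search then reduces to a prefix test: `in_facet(book_code, "338.47677")` is true, so the book would show up under Textiles as well as under Lancashire.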
  • 17. Search Engines – Dewey Decimal Faceted Model
  • 18. 2 Other Search Models: Maps, Time Intervals
  • 19. Past Attempts
  • 20. MyLifeBits. Gemmell, Jim, Gordon Bell, and Roger Lueder. "MyLifeBits: a personal database for everything." Communications of the ACM 49.1 (2006): 88-95. "But even with convenient classifications and labels ready to apply, we are still asking the user to become a filing clerk – manually annotating every document, email, photo, or conversation."
  • 21. LifeLog. "…The user can order the life-log agent to add retrieval keys (annotation) with an arbitrary name by simple operations on his cellular phone while the agent is capturing a life-log video. This enables the agent to identify a scene that the user wants to remember throughout his life, and thus the user can access easily to the videos that were captured during precious experiences" Aizawa, Kiyoharu, Tetsuro Hori, Shinya Kawasaki, and Takayuki Ishikawa. "Capture and efficient retrieval of life log." In Pervasive 2004 Workshop on Memory and Sharing Experiences, pp. 15-20. 2004.
  • 22. Stuff I’ve Seen. …Research in cognitive psychology has found that people remember information, particularly older information, not in terms of exact time, but in terms of key episodes, such as a child’s birthday, exotic travel,… Cutrell, Edward, Susan T. Dumais, and Jaime Teevan. "Searching to eliminate personal information management." Communications of the ACM 49.1 (2006): 58-64.
  • 23. PERSON. …we define the general category for user’s activity in advance, such as ordinary activity and extra-ordinary activity. In ordinary activity is related to the activity in home or office. Generally, the activities occurred outside of those area, they are classified as extraordinary activities. In addition to these pre-defined activities, users can add their own activity through our learning based structure… For some duration, we record whole activities of user. For the repeated activities at same time, in same place with similar objects, our activity engine will register as user defined activities by asking in which category those can be included. Kim, Ig-Jae, et al. "PERSON: personalized experience recoding and searching on networked environment." Proceedings of the 3rd ACM workshop on Continuous archival and retrieval of personal experiences. ACM, 2006.
  • 24. Personal Data Prototype. …Landmarks of tags are defined by the frequency of tags that are assigned to each item of personal data. A tag that has been in heavy use during a period of time is a candidate for a landmark. A tag that has rarely been used during a long period of time is also a candidate for a landmark. Outliers are candidates for landmarks in time-series data, such as home energy use, the number of steps walked, and histories of body weight. Data that exceed pre-defined or user-defined thresholds are also candidates. Other landmarks are public landmarks, which include shocking public news, bestsellers, blockbuster films, and annual rankings of top Web-search words. We can recall our own experiences on those days from these landmarks. Teraoka, Teruhiko. "Organization and exploration of heterogeneous personal data collected in daily life." Human-centric Computing and Information Sciences 2.1 (2012): 1-15.
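The passage quoted on slide 24 treats outliers in a time series (steps walked, energy use, body weight) as landmark candidates. The quote does not say how outliers are detected, so the sketch below swaps in a plain z-score test as a stand-in. The threshold, the function name `landmark_candidates` and the step counts are all invented for illustration.

```python
from statistics import mean, pstdev


def landmark_candidates(series, z_threshold=2.0):
    """Return the keys whose value lies more than z_threshold standard
    deviations from the mean (a stand-in outlier test, not Teraoka's method)."""
    values = list(series.values())
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return []  # a flat series has no outliers
    return [day for day, value in series.items()
            if abs(value - mu) / sigma > z_threshold]


# Hypothetical daily step counts; one day stands out (a long hike, say).
steps = {
    "2015-09-01": 5200,
    "2015-09-02": 4800,
    "2015-09-03": 5100,
    "2015-09-04": 4900,
    "2015-09-05": 5000,
    "2015-09-06": 21000,
    "2015-09-07": 5050,
    "2015-09-08": 4950,
}

print(landmark_candidates(steps))
```

A user-defined threshold, which the quoted passage also mentions, would replace the z-score comparison with a direct `value > limit` check.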
  • 25. Dublin City University. "…The user can order the life-log agent to add retrieval keys (annotation) with an arbitrary name by simple operations on his cellular phone while the agent is capturing a life-log video. This enables the agent to identify a scene that the user wants to remember throughout his life, and thus the user can access easily to the videos that were captured during precious experiences" Qiu, Zhengwei. "A lifelogging system supporting multimodal access." PhD diss., Dublin City University, 2013. Wang, Peng, and Alan F. Smeaton. "Aggregating semantic concepts for event representation in lifelogging." Proceedings of the International Workshop on Semantic Web Information Management. ACM, 2011.
  • 26. Okay, we’ve seen -- Apps / QS -- Personal Big Data -- Some early attempts. Everyone says: Time is important, Maps are important, String search is important, but… Facets: what are our personal facets? How can we automate them?
  • 28. 2015PTraces Grefenstette - 28 swimming (my) people involved in something about swimming
  • 29. swimming: things I’ve bought involving swimming
  • 30. swimming: (my) photos and facebook posts related to swimming
  • 31. swimming: emails about swimming things
  • 32. swimming: places I’ve been involving swimming
  • 33. swimming: days involving swimming things
  • 34. swimming: phone calls about swimming things…
  • 36. Rather Self-Centred, no?
  • 37. Personal Information System: Personal archives; Induction semantic dimensions; Personal Semantic hierarchies; Crowdsourced semantic Hierarchies (e.g. Wikipedia); Expert semantic Hierarchies (e.g. MeSH); Ingest/Annotate/Merge
  • 38. swimming, Knitting, poker, Painting, ...
  • 39. Expert >>> Crowdsourcing >>> Personal Ontology Folksonomy Models
  • 40. Expert >>> Crowdsourcing >>> Personal Models Folksonomy Models
  • 41. Expert >>> Crowdsourcing >>> Personal Models Folksonomy Models
  • 42. Expert >>> Crowdsourcing >>> Personal Models Folksonomy Models
      Knitting>Knitting_methods_for_shaping>Short_ro
      Knitting>Knitting_stitches
      Knitting>Knitting_stitches>List_of_knitting_stitche
      Knitting>Knitting_stitches>Basic_knitted_fabrics
      Knitting>Knitting_stitches>Decrease_(knitting)
      Knitting>Knitting_stitches>Dip_stitch
      Knitting>Knitting_stitches>Drop-stitch_knitting
      Knitting>Knitting_stitches>Elongated_stitch
      Knitting>Knitting_stitches>Fair_Isle_(technique)
      Knitting>Knitting_stitches>Grafting_(knitting)
      Knitting>Knitting_stitches>Loop_knitting
      Knitting>Knitting_stitches>Pick_up_stitches_(kni
      Knitting>Knitting_stitches>Plaited_stitch_(knitting
      Knitting>Knitting_stitches>Slip-stitch_knitting
      Knitting>Knitting_stitches>Yarn_over
      Knitting>Knitting_tools_and_materials
      Knitting>Knitting_tools_and_materials>Eisaku_N
      Knitting>Knitting_tools_and_materials>Hank_(te
      Knitting>Knitting_tools_and_materials>Knitting_m
      Knitting>Knitting_tools_and_materials>Knitting_N
      Knitting>Knitting_tools_and_materials>Knitting_n
      Knitting>Knitting_tools_and_materials>Knitting_n
      Knitting>Knitting_tools_and_materials>Lazy_Kat
      Knitting>Knitting_tools_and_materials>Liaghra
      Knitting>Knitting_tools_and_materials>Nostepinn
      Knitting>Knitting_tools_and_materials>Row_cou
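The category paths on slide 42 use `>` to separate levels, so they can be folded into a nested hierarchy that a faceted interface could browse. This is a minimal sketch using only paths that appear in full on the slide. The helper names `build_tree` and `render` are made up here, and nothing in it queries Wikipedia.

```python
def build_tree(paths, sep=">"):
    """Fold 'A>B>C' style paths into nested dictionaries."""
    tree = {}
    for path in paths:
        node = tree
        for part in path.split(sep):
            node = node.setdefault(part, {})
    return tree


def render(node, depth=0):
    """Return the tree as indented lines, children sorted by name."""
    lines = []
    for name in sorted(node):
        lines.append("  " * depth + name)
        lines.extend(render(node[name], depth + 1))
    return lines


# A few complete paths taken from the slide.
paths = [
    "Knitting>Knitting_stitches",
    "Knitting>Knitting_stitches>Dip_stitch",
    "Knitting>Knitting_stitches>Yarn_over",
    "Knitting>Knitting_tools_and_materials",
    "Knitting>Knitting_tools_and_materials>Liaghra",
]

tree = build_tree(paths)
print("\n".join(render(tree)))
```

Nested dictionaries keep the sketch short; a real system would probably attach document counts or identifiers to each node.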
  • 43. Expert >>> Crowdsourcing >>> Personal Models Folksonomy Models
  • 46. Well, no….
  • 47. Tweet: "Less than 12 hours until I am in the pool crying... thankful for mirrored goggles". I’d want this … facets: Swimming>pool, Swimming>goggles
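Slide 47 shows the desired outcome: a tweet that never mentions swimming still ends up under Swimming>pool and Swimming>goggles. One naive way to get there, assuming a personal vocabulary that maps trigger words to facets already exists, is a dictionary lookup over the tokens. The vocabulary `FACET_TERMS` and the function `tag_facets` below are hypothetical. The talk's point is precisely that such vocabularies are still missing.

```python
import re

# Hypothetical personal vocabulary: trigger word -> facet path.
FACET_TERMS = {
    "pool": "Swimming>pool",
    "goggles": "Swimming>goggles",
}


def tag_facets(text, facet_terms):
    """Return the facets whose trigger word occurs in the text, in order of appearance."""
    tokens = re.findall(r"[a-z']+", text.lower())
    found = []
    for token in tokens:
        facet = facet_terms.get(token)
        if facet and facet not in found:
            found.append(facet)
    return found


tweet = ("Less than 12 hours until I am in the pool crying... "
         "thankful for mirrored goggles")
print(tag_facets(tweet, FACET_TERMS))
```

Exact word matching is deliberately crude: it ignores word sense, so "pool" in a tweet about billiards would be tagged as swimming too.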
  • 48. swimming -- weightlifting, cycling, gymnastics, judo, table, volleyball, archery, rowing, badminton, track, water, taekwondo, tennis, field, diving, handball, boxing, softball, karate, pentathlon, fencing, athletics, triathlon, wrestling, soccer. http://webdocs.cs.ualberta.ca/~lindek/downloads.htm Distributional Semantics, 1.5 billion words. Wordnet
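The neighbours listed on slide 48 illustrate what distributional semantics returns: words that occur in similar contexts (other sports), not the things one associates with going swimming. The slide does not describe how the list was computed, so the sketch below is a toy stand-in: it counts sentence-level co-occurrences in three invented sentences and compares words by cosine similarity.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(sentences):
    """For each word, count the other words appearing in the same sentence."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for i, word in enumerate(words):
            for j, other in enumerate(words):
                if i != j:
                    vectors[word][other] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a if key in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Invented toy corpus, far from the 1.5 billion words mentioned on the slide.
corpus = [
    "she enjoys swimming every morning",
    "she enjoys cycling every morning",
    "he ate a banana for lunch",
]

vectors = context_vectors(corpus)
print(cosine(vectors["swimming"], vectors["cycling"]))
print(cosine(vectors["swimming"], vectors["banana"]))
```

Because "swimming" and "cycling" share all their contexts in this corpus, they come out as near-synonyms. That is the same effect that puts judo and archery next to swimming on the slide.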
  • 49. Existing taxonomies are for societal exchanges: Do you want to buy this? What famous person did this when? What can we make for this? We are missing a description of what is related to us, doing something… specific vocabularies, loose taxonomies … facets
  • 50. Something like….
      Sports/swimming/backstroke
      Sports/swimming/on my back
      Sports/swimming/breaststroke
      Sports/swimming/fins
      Sports/swimming/goggles
      Sports/swimming/fast lane
      Sports/swimming/slow lane
      Sports/swimming/laps
      Sports/swimming/lifeguard
      Sports/swimming/pool
      Sports/swimming/lake
      Sports/swimming/ocean
      Sports/swimming/Neuilly Nautic Centre
      Sports/swimming/South Hills Pool
      Sports/swimming/towel
      Sports/swimming/25m
      Sports/swimming/goggles
      Sports/swimming/cap
      Sports/swimming/swim suit
  • 51. Not just swimming! http://www.notsoboringlife.com/list-of-hobbies/
  • 52. Conclusion on Personal facets: There is a lot of work to do •  for predictable needs (hobbies, pastimes, sports), we do not have the basic facets we need •  for personal information (family, friends, familiar places), we have very little •  And this should be multilingual, too
  • 53. Conclusion: Searching Personal Big Data •  Information is moving from the Web into Apps •  People are generating information in these siloed Apps •  People generate more digital information every day •  Wearable computing will create even more
  • 54. Conclusion: Searching Personal Big Data •  Information is moving from the Web into Apps •  People are generating information in these siloed Apps •  People generate more digital information every day •  Wearable computing will create even more •  At some point, people will want their information back
  • 55. Conclusion: Searching Personal Big Data •  Information is moving from the Web into Apps •  People are generating information in these siloed Apps •  People generate more digital information every day •  Wearable computing will create even more •  At some point, people will want their information back •  When you have too much information, you need facets •  The facets for organizing personal information will be needed and do not yet exist
  • 56. Conclusion: Searching Personal Big Data •  Information is moving from the Web into Apps •  People are generating information in these siloed Apps •  People generate more digital information every day •  Wearable computing will create even more •  At some point, people will want their information back •  When you have too much information, you need facets •  The facets for organizing personal information will be needed and do not yet exist •  There are billions of cell phone users. They will all want this. You should start working on it.
  • 57. Thank you! www.inria.fr
  • 58. Gurrin, Cathal and Smeaton, Alan F. and Doherty, Aiden R. (2014) LifeLogging: personal big data. Foundations and Trends in Information Retrieval, 8 (1). pp. 1-125. ISSN 1554-0677
      Content type | Per day | Volume per day | Volume per year
      Video | 16 hours | 90 GB | 33 TB
      Autographer Camera | 3000 images | 1.3 GB | 480 GB
      Audio | 16 hours | 630 MB | 230 GB
      Microsoft Sensecam | 4500 images | 82 MB | 30 GB
      Accelerometer | 58,000 readings | 138 KB | 50 MB
      Locations | 10,000 readings | 27 KB | 10 MB
      Bluetooth Interactions | 400 (estimated) | 5 MB | 2 GB
      Words heard or read | 100,000 | 700 KB | 255 MB
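The yearly figures in the table on slide 58 follow from the daily ones by multiplying by 365 and rounding. The sketch below redoes that arithmetic as a sanity check. It assumes decimal units (1 GB = 10^9 bytes), which the slide does not state, and the helper names are made up for this example.

```python
# Assumption: decimal units, since the slide does not say which convention is used.
UNIT_BYTES = {"KB": 10**3, "MB": 10**6, "GB": 10**9, "TB": 10**12}

# Daily volumes copied from the table.
PER_DAY = {
    "Video": (90, "GB"),
    "Autographer Camera": (1.3, "GB"),
    "Audio": (630, "MB"),
    "Microsoft Sensecam": (82, "MB"),
    "Accelerometer": (138, "KB"),
    "Locations": (27, "KB"),
    "Bluetooth Interactions": (5, "MB"),
    "Words heard or read": (700, "KB"),
}


def yearly_bytes(amount, unit, days=365):
    """Daily volume times the number of days, in bytes."""
    return amount * UNIT_BYTES[unit] * days


def humanize(n_bytes):
    """Express a byte count in the largest unit that keeps the value >= 1."""
    for unit in ("TB", "GB", "MB", "KB"):
        if n_bytes >= UNIT_BYTES[unit]:
            return f"{n_bytes / UNIT_BYTES[unit]:.1f} {unit}"
    return f"{n_bytes} B"


for content_type, (amount, unit) in PER_DAY.items():
    print(f"{content_type:<24} {humanize(yearly_bytes(amount, unit))} per year")
```

Video, for instance, comes to 90 GB × 365 = 32,850 GB, which the table rounds to 33 TB. The other rows round the same way, with Bluetooth (about 1.8 GB) rounded up to 2 GB.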