SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
TRECVID 2016
AD-HOC VIDEO SEARCH TASK : OVERVIEW
Georges Quénot
Laboratoire d'Informatique de Grenoble
George Awad
Dakota Consulting, Inc
National Institute of Standards and Technology
Ad-hoc Video Search Task Definition
•  Goal: promote progress in content-based retrieval based on end
user ad-hoc queries that include persons, objects, locations,
activities and their combinations.
•  Task: Given a test collection, a query, and a master shot
boundary reference, return a ranked list of at most 1000 shots
(out of 335 944) which best satisfy the need.
•  New testing data: 4593 Internet Archive videos (IACC.3), 600
total hours with video durations between 6.5 min to 9.5 min.
•  Development data: ≈1400 hours of previous IACC data used
between 2010-2015 with concept annotations.
TRECVID 2016 23/9/17
Query Development
•  Test videos were viewed by 10 human assessors hired by the
National Institute of Standards and Technology (NIST).
•  4 facet description of different scenes were used (if
applicable):
•  Who : concrete objects and being (kind of persons, animals, things)
•  What : are the objects and/or beings doing ? (generic actions,
conditions/state)
•  Where : locale, site, place, geographic, architectural
•  When : time of day, season
•  In total assessors watched ≈35% of the IACC.3 videos
•  90 Candidate queries chosen from human written descriptions
to be used between 2016-2018.
TRECVID 2016 33/9/17
TV2016 Queries samples by complexity
•  Person + Action + Object + Location
Find shots of a person playing guitar outdoors.
Find shots of a man indoors looking at camera where a bookcase is behind him.
Find shots of a person playing drums indoors.
Find shots of a diver wearing diving suit and swimming under water.
•  Person + Action + Location
Find shots of the 43rd president George W. Bush sitting down talking with people indoors.
Find shots of a choir or orchestra and conductor performing on stage.
Find shots of one or more people walking or bicycling on a bridge during daytime.
3/9/17 TRECVID 2016 4
TV2016 Queries by complexity
•  Person + Action/state + Object
Find shots of a person sitting down with a laptop visible.
Find shots of a man with beard talking or singing into a microphone.
Find shots of one or more people opening a door and exiting through it.
Find shots of a person holding a knife.
Find shots of a woman wearing glasses.
Find shots of a person drinking from a cup, mug, bottle, or other container.
Find shots of a person wearing a helmet.
Find shots of a person lighting a candle.
•  Person + Action
Find shots of people shopping.
Find shots of soldiers performing training or other military maneuvers.
Find shots of a person jumping.
Find shots of a man shake hands with a woman.
3/9/17 TRECVID 2016 5
TV2016 Queries by complexity
•  Person + Location
Find shots of one or more people at train station platform.
Find shots of two or more men at a beach scene.
•  Person + Object
Find shots of a policeman where a police car is visible.
•  Object + Location
Find shots of any type of fountains outdoors.
•  Object
Find shots of a sewing machine.
Find shots of destroyed buildings.
Find shots of palm trees.
3/9/17 TRECVID 2016 6
3/9/17 TRECVID 2016 7
Training and run types
Four training data types:
ü  A – used only IACC training data (4 runs)
ü  D – used any other training data (42 runs)
ü  E – used only training data collected automatically using
only the query text (6 runs)
ü  F – used only training data collected automatically using
a query built manually from the given query text (0 runs)
Two run submission types:
ü  Manually-assisted (M) – Query built manually
ü  Fully automatic (F) – System uses official query directly
3/9/17 TRECVID 2016 8
Evaluation
Each query assumed to be binary: absent or present for each
master reference shot.
NIST sampled ranked pools and judged top results from all
submissions.
Metrics: inferred average precision per query.
Compared runs in terms of mean inferred average precision
across the 30 queries.
3/9/17 TRECVID 2016 9
mean extended Inferred average precision
(xinfAP)
2 pools were created for each query and sampled as:
ü  Top pool (ranks 1 to 200) sampled at 100 %
ü  Bottom pool (ranks 201 to 1000) sampled at 11.1 %
ü  % of sampled and judged clips from rank 201 to 1000 across all runs
(min= 10.5 %, max = 76 %, mean = 35 %)
Judgment process: one assessor per query, watched complete
shot while listening to the audio. infAP was calculated using the
judged and unjudged pool by sample_eval
30 queries
187 918 total judgments
7448 total hits
4642 hits at ranks (1 to100)
2080 hits at ranks (101 to200)
726 hits at ranks (201 to 2000)
3/9/17 TRECVID 2016 10
Finishers : 13 out of 29
M	 F	
INF 	
CMU; Beijing University of Posts and Telecommunication;
University Autonoma de Madrid; Shandong University;
Xian JiaoTong University Singapore	
- 	 4	
kobe_nict_siegen 	
Kobe University, Japan; National Institute of Information
and Communications Technology, Japan; University of
Siegen, Germany	
3	 - 	
UEC 	
Dept. of Informatics, The University of Electro-
Communications, Tokyo 	
2	 - 	
ITI_CERTH 	
Inf. Tech. Inst., Centre for Research and Technology
Hellas	
4	 4	
ITEC_UNIKLU 	Klagenfurt University	 - 	 3	
NII_Hitachi_UIT 	
Natl. Inst. Of Info.; Hitachi Ltd; University of Inf. Tech.
(HCM-UIT)	
- 	 4	
IMOTION 	
University of Basel, Switzerland; University of Mons,
Belgium; Koc University, Turkey	
2	 2	
MediaMill 	 University of Amsterdam Qualcomm	 - 	 4	
Vitrivr 	 University of Basel	 2	 2	
Waseda 	 Waseda University	 4	 - 	
VIREO 	 City University of Hong Kong 	 3	 3	
EURECOM 	EURECOM	 - 	 4	
FIU_UM 	 Florida International University, University of Miami	 2	 -
3/9/17 TRECVID 2016 11
Inferred frequency of hits varies by query
0
500
1000
1500
2000
2500
501 503 505 507 509 511 513 515 517 519 521 523 525 527 529
Inf. Hits / query
0.5 % of test shots
Topics
Inf.hits
3/9/17 TRECVID 2016 12
Total true shots contributed uniquely by team
0
20
40
60
80
100
120
140
Numberoftrueshots
3/9/17 TRECVID 2016 13
2016 run submissions scores
(22 Manually-assisted runs)
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.2 M_D_Waseda.16_2
M_D_Waseda.16_1
M_D_Waseda.16_4
M_D_Waseda.16_3
M_D_kobe_nict_siegen.
M_D_IMOTION.16_1
M_D_kobe_nict_siegen.
M_D_IMOTION.16_2
M_D_vitrivr.16_1
M_D_VIREO.16_5
M_D_vitrivr.16_2
M_D_VIREO.16_1
M_D_ITI_CERTH.16_4
M_D_ITI_CERTH.16_1
M_D_kobe_nict_siegen.
M_D_ITI_CERTH.16_3
M_D_ITI_CERTH.16_2
M_D_FIU_UM.16_2
M_D_FIU_UM.16_1
M_A_VIREO.16_3
M_A_UEC.16_2
M_A_UEC.16_1
MeanInf.AP
Median = 0.043
Gap due to
searcher or
interface ?!
3/9/17 TRECVID 2016 14
2016 run submissions scores
(30 Fully automatic runs)
0
0.01
0.02
0.03
0.04
0.05
0.06
F_D_NII_Hitachi_UIT.
F_D_ITI_CERTH.16_4
F_D_ITI_CERTH.16_3
F_D_ITI_CERTH.16_1
F_D_NII_Hitachi_UIT.
F_D_NII_Hitachi_UIT.
F_D_NII_Hitachi_UIT.
F_D_ITI_CERTH.16_2
F_E_INF.16_1
F_D_VIREO.16_6
F_D_VIREO.16_2
F_D_MediaMill.16_4
F_D_MediaMill.16_2
F_D_MediaMill.16_1
F_E_INF.16_2
F_D_MediaMill.16_3
F_D_EURECOM.16_2
F_E_INF.16_3
F_D_IMOTION.16_3
F_D_IMOTION.16_4
F_D_EURECOM.16_1
F_A_VIREO.16_4
F_D_EURECOM.16_4
F_D_INF.16_4
F_D_vitrivr.16_4
F_D_vitrivr.16_3
F_E_ITEC_UNIKLU.16_1
F_D_EURECOM.16_3
F_E_ITEC_UNIKLU.16_2
F_E_ITEC_UNIKLU.16_3
MeanInf.AP
Median = 0.024
3/9/17 TRECVID 2016 15
Top 10 infAP scores by query
(Manually-assisted)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29
Inf.AP
10
9
8
7
6
5
4
3
2
1
Median
Topics
3/9/17 TRECVID 2016 16
Top 10 infAP scores by query
(Fully automatic)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29
Inf.AP
10
9
8
7
6
5
4
3
2
1
Median
Topics
3/9/17 TRECVID 2016 17
Statistical significant differences among top 10 “M” runs
(using randomization test, p < 0.05)
D_Waseda.16_2
Ø  D_Waseda.16_3
Ø  D_kobe_nict_siegen.16_3
Ø  D_kobe_nict_siegen.16_1
Ø  D_IMOTION.16_1
Ø  D_IMOTION.16_2
Ø  D_vitrivr.16_1
Ø  D_VIREO.16_5
Ø  D_Waseda.16_4
Ø  D_kobe_nict_siegen.16_3
Ø  D_kobe_nict_siegen.16_1
Ø  D_IMOTION.16_1
Ø  D_IMOTION.16_2
Ø  D_vitrivr.16_1
Ø  D_VIREO.16_5
D_Waseda.16_1
Ø  D_Waseda.16_3
Ø  D_kobe_nict_siegen.16_3
Ø  D_kobe_nict_siegen.16_1
Ø  D_IMOTION.16_1
Ø  D_IMOTION.16_2
Ø  D_vitrivr.16_1
Ø  D_VIREO.16_5
Run Inf. AP score
D_Waseda.16_2 0.177 *
D_Waseda.16_1 0.169 *
D_Waseda.16_4 0.164 #
D_Waseda.16_3 0.156 #
D_kobe_nict_siegen.16_3 0.047 ^
D_IMOTION.16_1 0.047 ^
D_kobe_nict_siegen.16_1 0.046 ^
D_IMOTION.16_2 0.046 ^
D_vitrivr.16_1 0.044 ^
D_VIREO.16_5 0.044 ^
3/9/17 TRECVID 2016 18
Statistical significant differences among top 10 “F” runs
(using randomization test, p < 0.05)
Run Inf. AP score
D_NII_Hitachi_UIT.16_4 0.054
D_ITI_CERTH.16_4 0.051
D_ITI_CERTH.16_3 0.051
D_ITI_CERTH.16_1 0.051
D_NII_Hitachi_UIT.16_3 0.046
D_NII_Hitachi_UIT.16_2 0.043
D_NII_Hitachi_UIT.16_1 0.043
D_ITI_CERTH.16_2 0.042
E_INF.16_1 0.040
D_VIREO.16_6 0.038
No statistical
significant
differences among
the top 10 runs
3/9/17 TRECVID 2016 19
Processing time vs Inf. AP (“M” runs)
1
10
100
1000
10000
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Time(s)
Inf. AP
3/9/17 TRECVID 2016 20
Processing time vs Inf. AP (“F” runs)
1
10
100
1000
10000
0 0.2 0.4 0.6 0.8
Time(s)
Inf. AP
Not fast
enough?!
3/9/17 TRECVID 2016 21
2016 Observations / Questions
•  Most teams relied on intensive visual concept indexing, leveraging on
past Semantic Indexing (SIN) task and similar like ImageNet, Scenes …
•  Combined with manual or automatic query transformation
•  Clever combination of concept scores (e.g., Waseda)
•  Ad-hoc search is more difficult than simple concept-based tagging.
•  Big gap between SIN best performance and AVS: maybe performance
should be better compared with the “concept pair” task within SIN
•  Manually-assisted runs performed better than fully-automatic.
•  Most systems are not real-time (slower systems were not necessarily
effective).
•  Some systems reported 0 time!!!
•  E and F runs are still rare compared to A and D
•  Was the task/queries realistic enough?!
•  Do we need to change/add/remove anything from the task in 2017 ?
3/9/17 TRECVID 2016 22
Continued at MMM2017
•  10 Ad-Hoc Video Search (AVS) tasks, 5 of which are a random subset
of the 30 AVS tasks of TRECVID 2016 and 5 will be chosen directly by
human judges as a surprise. Each AVS task has several/many target
shots that should be found.
•  10 Known-Item Search (KIS) tasks, which are selected completely
random on site. Each KIS task has only one single 20 s long target
segment.
•  Registration for the task is now closed
3/9/17 TRECVID 2016 23
9:20 - 12:00 : Ad-hoc Video Search
•  9:20 - 9:40, Task Overview
•  9:40 - 10:00, NII_Hitachi_UIT (National Institute of Informatics; Hitachi;
U. of Inf. Tech.)
•  10:00 - 10:20, ITI_CERTH (Centre for Research and Technology
Hellas)
•  10:20 - 10:40, Break with refreshments
•  10:40 - 11:00, Waseda (Waseda University)
•  11:00 - 11:20, kobe_nict_siegen (Kobe U.; Japan National Institute of
Inf. and Communications Tech.;U. of Siegen)
•  11:20 - 11:40, INF (Carnegie Mellon University, University of
Technology Sydney, Renmin University of China, Shandong University)
•  11:40 - 12:00, AVS discussion

Contenu connexe

En vedette

TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshopGeorge Awad
 
La observación- Marcela Rojas, Viviana Arévalo
La observación- Marcela Rojas, Viviana ArévaloLa observación- Marcela Rojas, Viviana Arévalo
La observación- Marcela Rojas, Viviana ArévaloURIELYMARCELA
 
David Walker's Appeal
David Walker's AppealDavid Walker's Appeal
David Walker's AppealBro Cadence
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionGeorge Awad
 
Boletin digital 03
Boletin digital 03Boletin digital 03
Boletin digital 03chemadelaf
 
Website management model
Website management modelWebsite management model
Website management modeladgclarke
 
диаграмм 5
диаграмм  5диаграмм  5
диаграмм 5ttuya
 
Inv. Dramatización
Inv. DramatizaciónInv. Dramatización
Inv. DramatizaciónFrisu
 
Metodo De Observacion
Metodo De ObservacionMetodo De Observacion
Metodo De Observacionaustinmi
 

En vedette (15)

TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshop
 
La observación- Marcela Rojas, Viviana Arévalo
La observación- Marcela Rojas, Viviana ArévaloLa observación- Marcela Rojas, Viviana Arévalo
La observación- Marcela Rojas, Viviana Arévalo
 
Tertulia dialógica
Tertulia dialógicaTertulia dialógica
Tertulia dialógica
 
David Walker's Appeal
David Walker's AppealDavid Walker's Appeal
David Walker's Appeal
 
Spratne montazne kuce
Spratne montazne kuceSpratne montazne kuce
Spratne montazne kuce
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event Detection
 
Boletin digital 03
Boletin digital 03Boletin digital 03
Boletin digital 03
 
Eko montazne kuce
Eko montazne kuceEko montazne kuce
Eko montazne kuce
 
Website management model
Website management modelWebsite management model
Website management model
 
диаграмм 5
диаграмм  5диаграмм  5
диаграмм 5
 
Kabinet kerja
Kabinet kerjaKabinet kerja
Kabinet kerja
 
Inv. Dramatización
Inv. DramatizaciónInv. Dramatización
Inv. Dramatización
 
Ejemplo de Fichas de Salud Mental
Ejemplo de Fichas de Salud MentalEjemplo de Fichas de Salud Mental
Ejemplo de Fichas de Salud Mental
 
Metodo De Observacion
Metodo De ObservacionMetodo De Observacion
Metodo De Observacion
 
Listas de Chequeo
Listas de ChequeoListas de Chequeo
Listas de Chequeo
 

Similaire à TRECVID 2016 : Ad-hoc Video Search

TRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchTRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchGeorge Awad
 
[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video SearchGeorge Awad
 
Collecting and Coding Twitter Data in DiscoverText
Collecting and Coding Twitter Data in DiscoverTextCollecting and Coding Twitter Data in DiscoverText
Collecting and Coding Twitter Data in DiscoverTextJill Hopke
 
Video Search at TRECVID 2022
Video Search at TRECVID 2022Video Search at TRECVID 2022
Video Search at TRECVID 2022George Awad
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014multimediaeval
 
PH-HISTO-PPT.FINAL DEF.pptx
PH-HISTO-PPT.FINAL DEF.pptxPH-HISTO-PPT.FINAL DEF.pptx
PH-HISTO-PPT.FINAL DEF.pptxJcModelo
 
Ib Extended Essay Politics. Online assignment writing service.
Ib Extended Essay Politics. Online assignment writing service.Ib Extended Essay Politics. Online assignment writing service.
Ib Extended Essay Politics. Online assignment writing service.Lucy Jensen
 
Let's understand Data Science
Let's understand Data Science Let's understand Data Science
Let's understand Data Science Sachin Rastogi
 
Thesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallThesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallYuhui Wang
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis PresentationWajdi Khattel
 
Eduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks Network
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusDr Rajnikant Dodiya
 
Realtime Face mask Detector using YoloV4
Realtime Face mask Detector using YoloV4Realtime Face mask Detector using YoloV4
Realtime Face mask Detector using YoloV4IRJET Journal
 
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)MediaEval 2017 Retrieving Diverse Social Images Task (Overview)
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)multimediaeval
 
Preparing Testimony about Cellebrite UFED In a Daubert or Frye Hearing
Preparing Testimony about Cellebrite UFED In a Daubert or Frye HearingPreparing Testimony about Cellebrite UFED In a Daubert or Frye Hearing
Preparing Testimony about Cellebrite UFED In a Daubert or Frye HearingCellebrite
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015fridolin.wild
 

Similaire à TRECVID 2016 : Ad-hoc Video Search (20)

TRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchTRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video Search
 
[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search
 
Collecting and Coding Twitter Data in DiscoverText
Collecting and Coding Twitter Data in DiscoverTextCollecting and Coding Twitter Data in DiscoverText
Collecting and Coding Twitter Data in DiscoverText
 
Video Search at TRECVID 2022
Video Search at TRECVID 2022Video Search at TRECVID 2022
Video Search at TRECVID 2022
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014
 
PH-HISTO-PPT.FINAL DEF.pptx
PH-HISTO-PPT.FINAL DEF.pptxPH-HISTO-PPT.FINAL DEF.pptx
PH-HISTO-PPT.FINAL DEF.pptx
 
Ib Extended Essay Politics. Online assignment writing service.
Ib Extended Essay Politics. Online assignment writing service.Ib Extended Essay Politics. Online assignment writing service.
Ib Extended Essay Politics. Online assignment writing service.
 
II-SDV 2016 Deep SEARCH 9
II-SDV 2016 Deep SEARCH 9II-SDV 2016 Deep SEARCH 9
II-SDV 2016 Deep SEARCH 9
 
Let's understand Data Science
Let's understand Data Science Let's understand Data Science
Let's understand Data Science
 
Thesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallThesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-small
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis Presentation
 
Eduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks kick-off presentation: USAL
Eduworks kick-off presentation: USAL
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New Syllabus
 
Fehlmann and Kranich - Measuring tests using cosmic
Fehlmann and Kranich - Measuring tests using cosmicFehlmann and Kranich - Measuring tests using cosmic
Fehlmann and Kranich - Measuring tests using cosmic
 
An overview on diversity and Software Testing
An overview on diversity and Software TestingAn overview on diversity and Software Testing
An overview on diversity and Software Testing
 
Realtime Face mask Detector using YoloV4
Realtime Face mask Detector using YoloV4Realtime Face mask Detector using YoloV4
Realtime Face mask Detector using YoloV4
 
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)MediaEval 2017 Retrieving Diverse Social Images Task (Overview)
MediaEval 2017 Retrieving Diverse Social Images Task (Overview)
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
Preparing Testimony about Cellebrite UFED In a Daubert or Frye Hearing
Preparing Testimony about Cellebrite UFED In a Daubert or Frye HearingPreparing Testimony about Cellebrite UFED In a Daubert or Frye Hearing
Preparing Testimony about Cellebrite UFED In a Daubert or Frye Hearing
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
 

Plus de George Awad

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022George Awad
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022George Awad
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022George Awad
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022George Awad
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022George Awad
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to TextGeorge Awad
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slidesGeorge Awad
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance SearchGeorge Awad
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to TextGeorge Awad
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance SearchGeorge Awad
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDGeorge Awad
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVIDGeorge Awad
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDGeorge Awad
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)George Awad
 
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)George Awad
 

Plus de George Awad (15)

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to Text
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slides
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance Search
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVID
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVID
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVID
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)
 
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
 

Dernier

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Dernier (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

TRECVID 2016 : Ad-hoc Video Search

  • 1. TRECVID 2016 AD-HOC VIDEO SEARCH TASK : OVERVIEW Georges Quénot Laboratoire d'Informatique de Grenoble George Awad Dakota Consulting, Inc National Institute of Standards and Technology
  • 2. Ad-hoc Video Search Task Definition •  Goal: promote progress in content-based retrieval based on end user ad-hoc queries that include persons, objects, locations, activities and their combinations. •  Task: Given a test collection, a query, and a master shot boundary reference, return a ranked list of at most 1000 shots (out of 335 944) which best satisfy the need. •  New testing data: 4593 Internet Archive videos (IACC.3), 600 total hours with video durations between 6.5 min to 9.5 min. •  Development data: ≈1400 hours of previous IACC data used between 2010-2015 with concept annotations. TRECVID 2016 23/9/17
  • 3. Query Development •  Test videos were viewed by 10 human assessors hired by the National Institute of Standards and Technology (NIST). •  4 facet description of different scenes were used (if applicable): •  Who : concrete objects and being (kind of persons, animals, things) •  What : are the objects and/or beings doing ? (generic actions, conditions/state) •  Where : locale, site, place, geographic, architectural •  When : time of day, season •  In total assessors watched ≈35% of the IACC.3 videos •  90 Candidate queries chosen from human written descriptions to be used between 2016-2018. TRECVID 2016 33/9/17
  • 4. TV2016 Queries samples by complexity •  Person + Action + Object + Location Find shots of a person playing guitar outdoors. Find shots of a man indoors looking at camera where a bookcase is behind him. Find shots of a person playing drums indoors. Find shots of a diver wearing diving suit and swimming under water. •  Person + Action + Location Find shots of the 43rd president George W. Bush sitting down talking with people indoors. Find shots of a choir or orchestra and conductor performing on stage. Find shots of one or more people walking or bicycling on a bridge during daytime. 3/9/17 TRECVID 2016 4
  • 5. TV2016 Queries by complexity •  Person + Action/state + Object Find shots of a person sitting down with a laptop visible. Find shots of a man with beard talking or singing into a microphone. Find shots of one or more people opening a door and exiting through it. Find shots of a person holding a knife. Find shots of a woman wearing glasses. Find shots of a person drinking from a cup, mug, bottle, or other container. Find shots of a person wearing a helmet. Find shots of a person lighting a candle. •  Person + Action Find shots of people shopping. Find shots of soldiers performing training or other military maneuvers. Find shots of a person jumping. Find shots of a man shake hands with a woman. 3/9/17 TRECVID 2016 5
  • 6. TV2016 Queries by complexity •  Person + Location Find shots of one or more people at train station platform. Find shots of two or more men at a beach scene. •  Person + Object Find shots of a policeman where a police car is visible. •  Object + Location Find shots of any type of fountains outdoors. •  Object Find shots of a sewing machine. Find shots of destroyed buildings. Find shots of palm trees. 3/9/17 TRECVID 2016 6
  • 7. 3/9/17 TRECVID 2016 7 Training and run types Four training data types: ü  A – used only IACC training data (4 runs) ü  D – used any other training data (42 runs) ü  E – used only training data collected automatically using only the query text (6 runs) ü  F – used only training data collected automatically using a query built manually from the given query text (0 runs) Two run submission types: ü  Manually-assisted (M) – Query built manually ü  Fully automatic (F) – System uses official query directly
  • 8. 3/9/17 TRECVID 2016 8 Evaluation Each query assumed to be binary: absent or present for each master reference shot. NIST sampled ranked pools and judged top results from all submissions. Metrics: inferred average precision per query. Compared runs in terms of mean inferred average precision across the 30 queries.
  • 9. 3/9/17 TRECVID 2016 9 mean extended Inferred average precision (xinfAP) 2 pools were created for each query and sampled as: ü  Top pool (ranks 1 to 200) sampled at 100 % ü  Bottom pool (ranks 201 to 1000) sampled at 11.1 % ü  % of sampled and judged clips from rank 201 to 1000 across all runs (min= 10.5 %, max = 76 %, mean = 35 %) Judgment process: one assessor per query, watched complete shot while listening to the audio. infAP was calculated using the judged and unjudged pool by sample_eval 30 queries 187 918 total judgments 7448 total hits 4642 hits at ranks (1 to100) 2080 hits at ranks (101 to200) 726 hits at ranks (201 to 2000)
  • 10. 3/9/17 TRECVID 2016 10 Finishers : 13 out of 29 M F INF CMU; Beijing University of Posts and Telecommunication; University Autonoma de Madrid; Shandong University; Xian JiaoTong University Singapore - 4 kobe_nict_siegen Kobe University, Japan; National Institute of Information and Communications Technology, Japan; University of Siegen, Germany 3 - UEC Dept. of Informatics, The University of Electro- Communications, Tokyo 2 - ITI_CERTH Inf. Tech. Inst., Centre for Research and Technology Hellas 4 4 ITEC_UNIKLU Klagenfurt University - 3 NII_Hitachi_UIT Natl. Inst. Of Info.; Hitachi Ltd; University of Inf. Tech. (HCM-UIT) - 4 IMOTION University of Basel, Switzerland; University of Mons, Belgium; Koc University, Turkey 2 2 MediaMill University of Amsterdam Qualcomm - 4 Vitrivr University of Basel 2 2 Waseda Waseda University 4 - VIREO City University of Hong Kong 3 3 EURECOM EURECOM - 4 FIU_UM Florida International University, University of Miami 2 -
  • 11. 3/9/17 TRECVID 2016 11 Inferred frequency of hits varies by query 0 500 1000 1500 2000 2500 501 503 505 507 509 511 513 515 517 519 521 523 525 527 529 Inf. Hits / query 0.5 % of test shots Topics Inf.hits
  • 12. 3/9/17 TRECVID 2016 12 Total true shots contributed uniquely by team 0 20 40 60 80 100 120 140 Numberoftrueshots
  • 13. 3/9/17 TRECVID 2016 13 2016 run submissions scores (22 Manually-assisted runs) 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 M_D_Waseda.16_2 M_D_Waseda.16_1 M_D_Waseda.16_4 M_D_Waseda.16_3 M_D_kobe_nict_siegen. M_D_IMOTION.16_1 M_D_kobe_nict_siegen. M_D_IMOTION.16_2 M_D_vitrivr.16_1 M_D_VIREO.16_5 M_D_vitrivr.16_2 M_D_VIREO.16_1 M_D_ITI_CERTH.16_4 M_D_ITI_CERTH.16_1 M_D_kobe_nict_siegen. M_D_ITI_CERTH.16_3 M_D_ITI_CERTH.16_2 M_D_FIU_UM.16_2 M_D_FIU_UM.16_1 M_A_VIREO.16_3 M_A_UEC.16_2 M_A_UEC.16_1 MeanInf.AP Median = 0.043 Gap due to searcher or interface ?!
  • 14. 3/9/17 TRECVID 2016 14 2016 run submissions scores (30 Fully automatic runs) 0 0.01 0.02 0.03 0.04 0.05 0.06 F_D_NII_Hitachi_UIT. F_D_ITI_CERTH.16_4 F_D_ITI_CERTH.16_3 F_D_ITI_CERTH.16_1 F_D_NII_Hitachi_UIT. F_D_NII_Hitachi_UIT. F_D_NII_Hitachi_UIT. F_D_ITI_CERTH.16_2 F_E_INF.16_1 F_D_VIREO.16_6 F_D_VIREO.16_2 F_D_MediaMill.16_4 F_D_MediaMill.16_2 F_D_MediaMill.16_1 F_E_INF.16_2 F_D_MediaMill.16_3 F_D_EURECOM.16_2 F_E_INF.16_3 F_D_IMOTION.16_3 F_D_IMOTION.16_4 F_D_EURECOM.16_1 F_A_VIREO.16_4 F_D_EURECOM.16_4 F_D_INF.16_4 F_D_vitrivr.16_4 F_D_vitrivr.16_3 F_E_ITEC_UNIKLU.16_1 F_D_EURECOM.16_3 F_E_ITEC_UNIKLU.16_2 F_E_ITEC_UNIKLU.16_3 MeanInf.AP Median = 0.024
  • 15. 3/9/17 TRECVID 2016 15 Top 10 infAP scores by query (Manually-assisted) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 Inf.AP 10 9 8 7 6 5 4 3 2 1 Median Topics
  • 16. 3/9/17 TRECVID 2016 16 Top 10 infAP scores by query (Fully automatic) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 Inf.AP 10 9 8 7 6 5 4 3 2 1 Median Topics
  • 17. 3/9/17 TRECVID 2016 17 Statistical significant differences among top 10 “M” runs (using randomization test, p < 0.05) D_Waseda.16_2 Ø  D_Waseda.16_3 Ø  D_kobe_nict_siegen.16_3 Ø  D_kobe_nict_siegen.16_1 Ø  D_IMOTION.16_1 Ø  D_IMOTION.16_2 Ø  D_vitrivr.16_1 Ø  D_VIREO.16_5 Ø  D_Waseda.16_4 Ø  D_kobe_nict_siegen.16_3 Ø  D_kobe_nict_siegen.16_1 Ø  D_IMOTION.16_1 Ø  D_IMOTION.16_2 Ø  D_vitrivr.16_1 Ø  D_VIREO.16_5 D_Waseda.16_1 Ø  D_Waseda.16_3 Ø  D_kobe_nict_siegen.16_3 Ø  D_kobe_nict_siegen.16_1 Ø  D_IMOTION.16_1 Ø  D_IMOTION.16_2 Ø  D_vitrivr.16_1 Ø  D_VIREO.16_5 Run Inf. AP score D_Waseda.16_2 0.177 * D_Waseda.16_1 0.169 * D_Waseda.16_4 0.164 # D_Waseda.16_3 0.156 # D_kobe_nict_siegen.16_3 0.047 ^ D_IMOTION.16_1 0.047 ^ D_kobe_nict_siegen.16_1 0.046 ^ D_IMOTION.16_2 0.046 ^ D_vitrivr.16_1 0.044 ^ D_VIREO.16_5 0.044 ^
  • 18. 3/9/17 TRECVID 2016 18 Statistical significant differences among top 10 “F” runs (using randomization test, p < 0.05) Run Inf. AP score D_NII_Hitachi_UIT.16_4 0.054 D_ITI_CERTH.16_4 0.051 D_ITI_CERTH.16_3 0.051 D_ITI_CERTH.16_1 0.051 D_NII_Hitachi_UIT.16_3 0.046 D_NII_Hitachi_UIT.16_2 0.043 D_NII_Hitachi_UIT.16_1 0.043 D_ITI_CERTH.16_2 0.042 E_INF.16_1 0.040 D_VIREO.16_6 0.038 No statistical significant differences among the top 10 runs
  • 19. 3/9/17 TRECVID 2016 19 Processing time vs Inf. AP (“M” runs) 1 10 100 1000 10000 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Time(s) Inf. AP
  • 20. 3/9/17 TRECVID 2016 20 Processing time vs Inf. AP (“F” runs) 1 10 100 1000 10000 0 0.2 0.4 0.6 0.8 Time(s) Inf. AP Not fast enough?!
  • 21. 3/9/17 TRECVID 2016 21 2016 Observations / Questions •  Most teams relied on intensive visual concept indexing, leveraging on past Semantic Indexing (SIN) task and similar like ImageNet, Scenes … •  Combined with manual or automatic query transformation •  Clever combination of concept scores (e.g., Waseda) •  Ad-hoc search is more difficult than simple concept-based tagging. •  Big gap between SIN best performance and AVS: maybe performance should be better compared with the “concept pair” task within SIN •  Manually-assisted runs performed better than fully-automatic. •  Most systems are not real-time (slower systems were not necessarily effective). •  Some systems reported 0 time!!! •  E and F runs are still rare compared to A and D •  Was the task/queries realistic enough?! •  Do we need to change/add/remove anything from the task in 2017 ?
  • 22. 3/9/17 TRECVID 2016 22 Continued at MMM2017 •  10 Ad-Hoc Video Search (AVS) tasks, 5 of which are a random subset of the 30 AVS tasks of TRECVID 2016 and 5 will be chosen directly by human judges as a surprise. Each AVS task has several/many target shots that should be found. •  10 Known-Item Search (KIS) tasks, which are selected completely random on site. Each KIS task has only one single 20 s long target segment. •  Registration for the task is now closed
  • 23. 3/9/17 TRECVID 2016 23 9:20 - 12:00 : Ad-hoc Video Search •  9:20 - 9:40, Task Overview •  9:40 - 10:00, NII_Hitachi_UIT (National Institute of Informatics; Hitachi; U. of Inf. Tech.) •  10:00 - 10:20, ITI_CERTH (Centre for Research and Technology Hellas) •  10:20 - 10:40, Break with refreshments •  10:40 - 11:00, Waseda (Waseda University) •  11:00 - 11:20, kobe_nict_siegen (Kobe U.; Japan National Institute of Inf. and Communications Tech.;U. of Siegen) •  11:20 - 11:40, INF (Carnegie Mellon University, University of Technology Sydney, Renmin University of China, Shandong University) •  11:40 - 12:00, AVS discussion