SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
VCode and VData: Illustrating a new Framework for
              Supporting the Video Annotation Workflow

                                    Joey Hagedorn, Joshua Hailpern, Karrie G. Karahalios
                                                         Department of Computer Science
                                                     University of Illinois at Urbana Champaign
                                                               Urbana, IL 61801 USA
                                                {hagedorn, jhailpe2, kkarahal}@uiuc.edu

ABSTRACT
Digital tools for annotation of video have the promise to
provide immense value to researchers in disciplines rang-
ing from psychology to ethnography to computer science.
With traditional methods for annotation being cumbersome,
time-consuming, and frustrating, technological solutions are
situated to aid in video annotation by increasing reliabil-
ity, repeatability, and workflow optimizations. Three no-
table limitations of existing video annotation tools are lack
of support for the annotation workflow, poor representation
of data on a timeline, and poor interaction techniques with
video, data, and annotations. This paper details a set of
design requirements intended to enhance video annotation.
Our framework is grounded in existing literature, interviews
with experienced coders, and ongoing discussions with re-
searchers in multiple disciplines. Our model is demonstrated
                                                                                    Figure 1: The VCode application graphically repre-
in a new system called VCode and VData. The benefit of
                                                                                    sents the behaviors being coded as marks on a time-
our system is that is directly addresses the workflow and
                                                                                    line. It is easy to see correlation between the marks
needs of both researchers and video coders.
                                                                                    on the timeline and sensor data displayed below.

Categories and Subject Descriptors                                                  human centered domains can now rely on video annotation
                                                                                    to provide them with measures on which to draw conclu-
H.5.1 [Multimedia Information Systems]:
                                                                                    sions. Unlike transcription, which is akin to what a court
Evaluation/methodology Video
                                                                                    stenographer does, annotation is the marking of movements,
; H.5.2 [User Interfaces]: Graphical user interfaces
                                                                                    sounds, and other such events (with or without additional
                                                                                    metadata such as rankings). The emergence of technology as
General Terms                                                                       a tool to aid in video annotation has raised the possibility of
Design, Human Factors, Measurement                                                  increasing reliability, repeatability, and workflow optimiza-
                                                                                    tions [6]. Three notable limitations of existing video anno-
                                                                                    tation tools are lack of support for the annotation workflow,
Keywords
                                                                                    poor representation of data on a timeline, and poor inter-
Graphical user interfaces (GUI), Annotation, Video                                  action techniques with video, data, and annotations. This
                                                                                    paper details a set of requirements to guide the design of
1.     INTRODUCTION                                                                 video annotation tools. Our model is the direct result of an
                                                                                    analysis of existing tools, current practices by researchers,
  Human behavior does not naturally lend itself to being
                                                                                    and workflow difficulties experienced by real-world video
quantifiable. Yet time and again, researchers in disciplines
                                                                                    coders. By understanding what data researchers are looking
ranging from psychology to ethnography to computer sci-
                                                                                    to gather, and the shortcomings of existing techniques and
ence, are forced to analyze as if it was quantified. Those in
                                                                                    technology utilized by coders, we believe that we have cre-
                                                                                    ated a framework for video annotation that can reach across
                                                                                    disciplines. Our model is demonstrated through the design
                                                                                    and construction of our new system VCode and VData (Fig-
Permission to make digital or hard copies of all or part of this work for
                                                                                    ure 1); two fully functional, open-source tools which bridge
personal or classroom use is granted without fee provided that copies are
                                                                                    the video annotation workflow.
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwise, to         The primary contribution of this paper is the set of design
republish, to post on servers or to redistribute to lists, requires prior specific   requirements for facilitating a system conducive to video an-
permission and/or a fee.
                                                                                    notation. Specifically, we demonstrate how a system could
AVI ’08 May 28-30, 2008, Naples, Italy
                                                                                    be designed and built to meet these requirements through a
Copyright 200X ACM X-XXXXX-XX-X/XX/XX ...$5.00.
set of carefully designed interfaces and graphical represen-          One last limitation is lack of support for the full annota-
tations of data.                                                   tion workflow; 1) collect video 2) create segments to code 3)
                                                                   train coders/demonstrate reliability 4) gather data 5) per-
                                                                   form regular checks on reliability & discuss discrepancies 6)
2.   RELATED WORK
                                                                   perform data analysis. Many tools support small portions
                                                                   of this workflow (i.e. simply facilitating segmentation, an-
2.1 Video Coding In Practice
                                                                   notation, or reliability [22, 6, 12]), but with each break
   The analysis of human behavior is a study that dates back
                                                                   in the process researchers can become delayed. Without ex-
hundreds of years. This has ranged from anthropological
                                                                   port/import data reentry is required. Technology is situated
ethnographies [10] to psychological evaluations. As tech-
                                                                   to optimize this process.
nology has developed, the use of video and creation of an-
                                                                      Researchers have also explored dialogue transcription [17,
notation techniques have aided researchers by providing a
                                                                   13, 12], tagging [2, 12], scene based automatic annotation
referable document that can be used as evidence to back up
                                                                   [9, 19], automatic event logging [5], and object of focus
claims and observations made [21, 16, 20]. These techniques
                                                                   identification [8]. Our work contrasts these other foci by
involve detailed event logging on paper, specifying features
                                                                   demonstrating techniques for supporting human based an-
such as durations, ratings/levels, and time-stamps [15].
                                                                   notation of events that occur in video.
   To ensure a reliable set of data from annotation, researchers
perform agreement calculations between coders [7]. This
                                                                   3. INTERVIEWS & COLLABORATION
agreement is utilized throughout the data gathering process
(by testing some small percentage of data segments to en-            To gain a deeper understanding of methods, analysis pro-
sure consistency throughout), but also during training of          cesses, bottlenecks, and types of data needed for effective
coders (to decide when they fully understand what events           video annotation software, we maintained an active dialogue
they are looking for). There are many techniques for calcu-        with researchers (in Special Education, Speech and Hear-
lating agreement including Cohens Kappa [11], Cochran’s Q-         ing Sciences, and Computer Science) who use video anno-
test, and Point-By-Point Agreement. Regardless, the man-           tation, conducted informal 40 minute interviews with two
agement of data with traditional means is considered “cum-         experienced video coders, and refined functionality through
bersome” [20].                                                     dialog with current users of VCode and VData. Existing
                                                                   tools for video annotation may address a subset of the be-
2.2 Video Coding Tools                                             low described requirements, however, our system more fully
   Digital annotation tools have demonstrated significant ben-      satisfies all of them.
efits from simple copy/paste and undo to increased qual-
                                                                   R1 Facilitate Coding Workflow: The coding workflow con-
ity of coding by the facilitation of multiple passes on video
                                                                      sists of; (1) establishing video clips and coding guide-
and graphical representations [4, 6]. A timeline is com-
                                                                      lines, (2) intense training of coders and checks for re-
monly utilized in these tools and is familiar without ex-
                                                                      liability, (3) annotation of videos, (4) weekly reliability
tensive training [1]. Existing research also indicates that
                                                                      checks on annotated videos, (5) repeat 3 and 4 ad in-
presenting coders with secondary or sensor data on a time-
                                                                      finitum, (6) analyze data in statistical packages. Tools
line helps them outperform coders without sensor data [6].
                                                                      targeting video annotation should attempt to optimize
Increased accuracy, quality, and speed not only enhance the
                                                                      the transition between steps in this workflow.
data collected, but also allow for more annotation to be con-
ducted in the same amount of time. In addition to the com-
                                                                   R2 Video, Annotations, and Coding Guidelines should be
putational benefits of digital annotation tools, they also pro-
                                                                      presented in a synchronized manner: Interviewees de-
vide a controllable mechanism for different forms of reliable
                                                                      scribed their coding process centering around analog
video playback. [4].
                                                                      video on a TV-VCR device, annotating in a Microsoft
   One critical limitation of existing tools is poor represen-
                                                                      Excel file, and referencing lengthy code guidelines. Due
tation of data on a timeline and utilization of screen real-
                                                                      to the visual separation between annotations, source ma-
estate. For example, the VACA solution, while utilizing min-
                                                                      terial, and video, coders had great difficulty during re-
imal screen real-estate by condensing all annotations to one
                                                                      views.
large easy to read track, presents a problem with overlap-
ping and simultaneous events [6]. The VisSTA solution              R3 Capture Appropriate Data: Researchers and existing lit-
takes the contrary approach by showing many small verti-              erature indicate that there are different types of data
cally tiled tracks. Though this allows for a good comparative         that are collected through the annotation process: count-
view, reading individual annotations & holistic interpreta-           ing events/occurences, determining duration of events,
tion is difficult due to scrolling [4]. These and other existing        assigning levels, values, or ranking to events, performing
solutions have not successfully dealt with this problem [14,          phonetic transcription, and general commenting [14].
3, 13, 12, 1, 2, 22, 18].                                             Effective interfaces must provide methods for capturing
   Another limitation of current annotation tools is poor in-         these conceptually different data types while preserving
teraction techniques with video and data. Though robust               each of their unique nuances.
functionality is provided for playback, controls can be cum-
bersome & overly complex, (e.g.         [4]). Too many win-        R4 Additional data should be displayed to coders: Effec-
dows resulting in an over-saturation of information, impre-           tive annotation tools should allow researchers to pro-
cise video interaction & annotation or rigid, inSSexible mark-        vide additional data to coders to aid in their assessment
ing interfaces (e.g. [4, 14, 12, 18, 3]). Each of these are           of video; for example, a volume histogram of the cur-
common stumbling blocks which could result in unreliable              rent video, sensor/log data collected in tandem to the
data.                                                                 video capture, or annotations made automatically or
from another source. Displaying additional datapoints       can make the best annotation decision possible.
     has shown to increase the accuracy of coded events [6].     Video: To facilitate multiple video streams VCode presents
     Further, annotation software should facilitate the man-     one main video at full size, and a dock with other streams
     agement of multiple video streams to get the most ac-       playing in real time. When a docked stream is clicked on,
     curate ”view” on the session, and thus produce the most     it repositions itself into the main video window, while the
     accurate data [4].                                          video which was the previous focus, scales down to the dock,
                                                                 thus equating visual importance with relative size and visual
R5 Allow multiple forms of playback: Researchers mentioned
                                                                 weight.
   that continuous playback is not always the preferred
   method of analyzing a video. Often multiple modes of          Events: When annotating a video, two different classes of
   playback are utilized; continuous or standard playback,       coding events emerge: ranged and momentary. A ranged
   continuous interval playback (play for N seconds, then        event is one which extends over a period of time (mark-
   stop), and skip interval playback (jump N seconds, then       ing action start and duration). Momentary marks have no
   stop). This allows the video to be divided in to smaller      duration, and thus represent one specific moment in time.
   segments for annotation of events that are more diffi-          Comments can be attached to any mark, allowing addi-
   cult to pinpoint (i.e. when a smile starts or ends) [4].      tional observations, levels/ranking, or phonetic transcrip-
   Though conceptually simple, manipulations of video us-        tion (through onscreen phonetic keyboard). Any mark with
   ing a standard VCR was described as “annoying” and            a comment has a inverted outlines to signify that it has a
   “a mess” due to hand eye coordination and repeatability       comment attached. Figure 1 shows a ranged event repre-
   issues.                                                       senting the length of time which a child is making a sound,
                                                                 with additional momentary marks at the start noting other
R6 Agreement calculations should be easy and manipulat-
                                                                 features of the child’s state of being).
   able: Regardless of agreement technique used, researchers
                                                                 Timeline: The timeline is the heart of VCode. It is mod-
   expressed a frustration in attempts to calculate inter-
                                                                 eled after the moving timeline one might find in a video edit-
   observer reliability. Specifically, existing solutions were
                                                                 ing application (e.g. iMovie, Final Cut Pro, etc.). Events,
   limited to importing data into a statistical software pack-
                                                                 graphically represented by diamonds, appear in a spatial lin-
   age for calculation or calculating them by hand. Video
                                                                 ear fashion to sync with the video. Once an event has been
   annotation tools should provide quick & easy reliability
                                                                 placed on the timeline, it can be graphically manipulated by
   calculations for individual variables, as well as overall.
                                                                 dragging, clicking, and double-clicking. The standard solu-
R7 Provide functionality for visual, graphical and contextual    tion for dealing with large numbers of tracks or variables is
   review of annotations: In interviews, coders lamented         to provide a vertical scroll bar or overlay tracks. Rather than
   the process of ensuring reliability on a weekly basis; as     limiting the amount of information on screen by scrolling,
   it consisted of searching through printouts of a spread-      tracks representing momentary events are “stacked,” such
   sheet for discrepancies. Specifically, by lacking context      that they vertically overlap. This optimizes usage of the
   in this spreadsheet coders found it difficult to recognize      screen while still providing enough area for track isolation
   what a given coding mark referred to due to the lack          and selection, even under dense data conditions. Ranged
   of synchronization with video. By providing a visual,         event tracks are unable to benefit from this stacking opti-
   graphical way to review annotations (in the context of        mization because of the more complicated interaction for
   the video) coders would be better able to justify the de-     manipulation and thus are vertically tiled. Researchers can
   cisions, determine the correct solution, and save time        present video volume, sensor data, software log data (from
   identifying the errors.                                       Eclipse or Photoshop for example), and even other annota-
                                                                 tions to the coders. This additional information is presented
4.    VCODE AND VDATA                                            graphically to the users by bar, line, or scatter plot. This
                                                                 secondary data can allow coders to annotate data captured
   VCode and VData are a suite of applications which create
                                                                 by other sources than the video streams, as well as pro-
a set of effective interfaces for the coding workflow following
                                                                 vide additional context to their code. For example, should
the above design requirements. Our system has three main
                                                                 a coder be instructed to mark when a certain noise occurs,
components: VCode (annotation), VCode Admin Window
                                                                 he can line the mark up with an audio peek, rather than
(configuration) and VData (examination of data, coder agree-
                                                                 estimate it and be concerned with reaction time.
ment and training). The interaction with VCode and VData
is demonstrated in Figures 2-4 in which two coders are mark-     Interaction: Annotations can be inserted into the timeline
ing a video of a child in an experiment, and checking the        via UI buttons or keyboard hot keys. To optimize the typi-
agreement between their annotations. The reader should           cally complex transport controls we isolated the key activi-
note our solution is only one possible implementation of the     ties that coders need execute and provided controls limited
design requirements, and that these requirements could be        to play/pause buttons, coarse and fine grained playhead po-
applied to improving existing video annotation software.         sitioning, and step controls. The three modes of playback
                                                                 outlined in R5 are available.
4.1 VCode
                                                                 4.2 VCode Administration Window
  The VCode application (Figure 1) is designed to provide
researchers with an effective way to obtain reliable data from       To ensure consistent configuration between coders and ses-
an observational research video. By allowing researchers to      sions, all administrative features are consolidated in a single
present multiple video streams in addition to other sensor       window. The expected workflow is such that a researcher
data (e.g. log data, annotations from other software, or sig-    would setup a single coding document with all the variables
nals recorded by a computer/monitoring device) the coder         to be used on all the videos. This template would then be
centage agreement. For each event (momentary or ranged)
                                                                 an opportunity is said to occur when the primary coder
                                                                 makes a mark. If the secondary coder also makes a mark
                                                                 within a specified short interval, the marks are said to agree.
                                                                                                     agreements
                                                                 A percentage is calculated from opportunities for easy inter-
                                                                 pretation. A tolerance variable is also present to (1) ac-
                                                                 commodate for variability in the mark placement by the
                                                                 coders, and (2) recognition that there is no quantization of
                                                                 marks beyond the granularity of the millisecond timescale,
                                                                 a property of the system. VData also provides agreement
                                                                 for ranged events and annotations in a similar fashion. It is
                                                                 not uncommon for multiple tracks or variables to be mea-
                                                                 suring slight variations on a theme (e.g. smiling vs. large
                                                                 smile vs. grin ), thus VData implements a track-merging
Figure 2: The code is specified in the Administration             feature which allows opportunities on two distinct tracks to
Window along with the different video angles, screen              be treated indistinguishably. For a holistic view, researchers
capture, and log data.                                           can select tracks to be added into a total agreement calcu-
                                                                 lation. In other words, if analysis determines that a single
duplicated (with media and log files inserted for each trial).    track is not reliable or it is determined that a given track
The main task the Administration Window (Figure 2) is to         will not be used in the future, it can be easily excluded from
facilitate is the creation of tracks, used to code data. Re-     the total agreement calcuation.
searchers can add, remove, and reorder tracks which appear
                                                                 Conflict Resolution & Exporting: We have optimized
in a list format. The name, color and hot key of each tack
                                                                 coder training and reliability analysis by providing a graphi-
can be set through this list presentation. Tracks can be en-
                                                                 cal mechanism to directly compare annotations of two coders.
abled as ranged events through a check box in this interface.
                                                                 VData can create a VCode session containing specific tracks
The Administration Window is also where a researcher spec-
                                                                 of two individual coders for side-by-side comparison. The vi-
ifies videos and data file to be coded, as well as secondary
                                                                 sual, side by side, representation of the data makes it easy to
data for contextual annotation. These elements are speci-
                                                                 recognize systematic errors in context and detect differences
fied and synchronized through a drag and drop interface, all
                                                                 between two coders markings. This reduces the time nec-
of which is hidden from the coder to prevent configuration
                                                                 essary to locate discrepancies and discuss the reasons why
corruption.
                                                                 they might have occurred. It is necessary to keep records
                                                                 of these agreement analyses performed with VData by text
4.3 VData
                                                                 export. Maintaining export at each stage of the process pro-
                                                                 vides additional transparency and maintains traceability of
                                                                 results that come out of the system.

                                                                 4.4 Implementation
                                                                   Our system was implemented in Objective-C using the
                                                                 Cocoa Framework for Mac OS X 10.5. VCode supports all
                                                                 video formats and codecs supported by QuickTime to enable
                                                                 wide compatibility with available video files.


                                                                 5. MEETING REQUIREMENTS
                                                                    To ensure Video, Annotations, and Coding Guidelines are
                                                                 presented in a synchronized manner, VCode provides a uni-
                                                                 fied interface containing the target video, a timeline with
                                                                 graphically represented annotations (ranged event, momen-
Figure 3: Later, analysis is performed on indepen-
                                                                 tary event, or comment depending on data metaphor), ad-
dent codings of the same video. A track with low
                                                                 ditional tracks of signal data (to increase accuracy), and a
agreement can be reconciled by viewing the results
                                                                 list of coding guidelines which place marks and stand as
of two coders side-by-side in VCode, thanks to the
                                                                 a visual reminder. Three forms of video playback (conti-
capabilities of the VData analysis tool.
                                                                 nous/standard, interval playback, skip interval playback)
                                                                 are available via check boxes on the main VCode window
   Critical aspects of the video coding workflow (training, re-   to allow easily switching between modes of playback.
liability, and accuracy) revolve around demonstrating agree-        VData provides a dynamic interface for real time calcula-
ment between coders. VData (Figure 3) is a separate exe-         tion of multiple agreement values to facilitate easy and dy-
cutable application specifically targeted to aid researchers      namic agreement calculations. Through the transparent cal-
in training and agreement analysis of coded data produced        culation process, researchers can see both the raw data, and
in VCode.                                                        the percentages side by side for easy judgements about the
Multi Coder Analysis: By loading two VCode files into             reliability of data collected. Upon request a visual, graphi-
VData, tracks are automatically loaded into the main data        cal and contextual review of annotations for both agreement
table which presents opportunities, agreements, and per-         review and training is supported.
Finally, the Coding Workflow is encouraged through VCode’s      needs of both researchers and coders.
template model in conjunction with the separate VCode              From the reaction of the coders, as well as our own as-
Adminstration Window for easy set up and configuration.           sessment of VCode and VData, we have many directions of
Training, data collection, and inter-coder agreement are en-     possible future work. One avenue is creating a database or
abled through a tight collaboration between annotation envi-     networked system in order to facilitate remote access to con-
ronment and agreement analysis. By consistently providing        tent, and management of coding objects and assignments for
data export, researchers can be assured that any information     individual coders. It is foreseeable that the system could be
annotated by coders can be easily extracted and exported         extended to a tool to prepare coding files; assist in dividing
into the statistical analysis tool of their choice.              up raw footage, syncing data to video enmasse, and other
                                                                 automation hooks. This could leverage some of the other
                                                                 existing work in automatic video segmentation. Lastly, we
6.   INITIAL REACTION                                            hope to address some of the concerns of our coders, including
   To evaluate our system in a cursory fashion, we conducted     creating a richer set of data visualizations.
an informal series of interviews with several coders that used
our system during the course of an independent study. Anal-      8. ACKNOWLEDGMENTS
ysis using VData showed inter-observer agreement was good
                                                                   We would like to thank NSF for their support of this work
and provided valuable coded data for the study. In general,
                                                                 (NSF-0643502).
comments from the coders were positive, especially when
comparing the VCode system to non-computerized meth-
                                                                 9. REFERENCES
ods. One coder wrote: “The software was easy to use in
general, and cut down on coding time.” Several features of        [1] Annotations as multiple perspectives of video content,
VCode stood out in their comments; color coding of tracks             Juan-les-Pins, France, 2002. ACM.
                                                                  [2] The Family Video Archive: an annotation and browsing
provided direct linkage between events on the timeline and
                                                                      environment for home movies, Berkeley, CA, 2003. ACM.
the description panel, the correlation between files was clear
                                                                  [3] Fluid interaction techniques for the control and annotation
to see during review, sensor data helped anticipate events            of digital video, Vancouver, Canada, 2003. ACM.
and accurately code them. It was also noted that the sensor       [4] A Coding Tool for Multimodal Analysis of Meeting Video,
                                                                      Montreal, Quebec, Canada, 2004. IEEE.
data provided reassurance that what they had noticed in the
                                                                  [5] Creating Multi-Modal, User-Centric Records of Meetings
video was actually correct.
                                                                      with the Carnegie Mellon Meeting Recorder Architecture,
   In addition to these positive marks we uncovered several           Montreal, Quebec, Canada, 2004. IEEE.
shortcomings of the interface. The seemingly low-resolution       [6] Work-in-progress: VACA: a tool for qualitative video
                                                                      analysis, Montreal, Quebec, Canada, 2006. ACM.
bar-graph of volume data left coders unsure where precisely
                                                                  [7] K. J. Berry and P. W. Mielke. A generalization of cohen’s
to make their mark. Because the elements of this graph are            kappa agreement measure to interval measurement and
relatively wide, it appears especially coarse in comparison           multiple raters, 1988.
with the precision with which one may place a mark on             [8] M. Bertini, A. Del Bimbo, R. Cucchiara, and A. Prati.
                                                                      Applications ii: Semantic video adaptation based on automatic
the timeline. A spectrogram was suggested as an alternate
                                                                      annotation of sport videos. In Proceedings of the 6th ACM
visualization of the audio data that could help understand            SIGMM international workshop on Multimedia information
sound and video.                                                      retrieval MIR, 2004.
   Overall, results from these interviews were very encour-       [9] S.-C. Chen, M.-L. Shyu, W. Liao, and C. Zhang. Scene change
                                                                      detection by audio and video clues.
aging and suggest a more formal study to determine if per-
                                                                      www.cs.fiu.edu/~chens/PDF/ICME02_Video.pdf.
formance improves in the same way that coders stated that        [10] J. Clifford and G. Marcus, editors. Writing Culture. University
they felt as the tool lowered the amount of time necessary            of California Press, Berkeley, California, 1986.
for coding.                                                      [11] A. J. Conger. Kappa reliabilities for continuous behaviors and
                                                                      events, 1985.
                                                                 [12] S. B. Group. Studiocode business group - supplier of
7.   CONCLUSION AND FUTURE WORK                                       studiocode and stream video analysis and distribution
                                                                      software. http://www.studiocodegroup.com, 2007.
   Video annotation tools can be valuable to researchers by      [13] A. Johnson. About vprism video data analysis software.
enhancing the annotation process through increased relia-             http://www.camse.org/andy/VP/vprism.htm, 2007.
                                                                 [14] M. Kipp. Anvil - the video annotation research tool.
bility, repeatability, and workflow optimizations. However,
                                                                      http://www.anvil-software.de, 2007.
many existing solutions do not fully address all the needs of
                                                                 [15] B. J. Leadholm and J. F. Miller. Language Sample Analysis:
researchers and coders; effective representation of data on            The Wisconsin Guide. Wisconsin Department of Public
a timeline, efficient and robust interaction techniques with            Instruction, Madison, Wisconsin, 1995.
                                                                 [16] L. Lee. Developmental Sentence Analysis. Northwestern
video and data, and support for the full video annotation
                                                                      University Press, Evanston, IL, 1974.
workflow. Our research has provided many contributions in
                                                                 [17] S. S. LLC. Salt software. http://www.saltsoftware.com, 2007.
addressing these weak points.                                    [18] Noldus. The observer.
   We create a set of design requirements based on existing           http://www.noldus.com/site/doc200401012, 2007.
literature and annotation techniques, interviews with expe-      [19] D. Ponceleon and S. Srinivasan. Automatic discovery of salient
                                                                      segments in imperfect speech transcripts. In CIKM ’01:
rienced coders, and discussions with researchers in multiple
                                                                      Proceedings of the tenth international conference on
disciplines. Based on these investigations, we implemented            Information and knowledge management, pages 490–497, New
a system, VCode and VData, that largely satisfies the re-              York, NY, USA, 2001. ACM.
                                                                 [20] K. S. Retherford. Guide to Analysis of Language Transcripts.
quirements we outlined. These systems were then used in
                                                                      Thinking Publications, Eau Claire, Wisconsin, 1993.
ongoing research, and coders were interviewed concurrent
                                                                 [21] K. L. Rosenblum, C. Zeanah, S. McDonough, and M. Muzik.
with and after using the software, and their reactions were           Video-taped coding of working model of the child interviews: a
solicited. Our model demonstrates how video annotation                viable and useful alternative to verbatim transcripts?, 2004.
                                                                 [22] S. Soft. Annotation. http://www.saysosoft.com/, 2006.
software, for many disciplines, can be enhanced to meet the

Contenu connexe

Tendances

International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
A Quick Glance over the Digital Watermarking
A Quick Glance over the Digital WatermarkingA Quick Glance over the Digital Watermarking
A Quick Glance over the Digital Watermarkingijsrd.com
 
Qo E E2 E5 User Centric Approach Katrien De Moor
Qo E E2 E5   User Centric Approach   Katrien De MoorQo E E2 E5   User Centric Approach   Katrien De Moor
Qo E E2 E5 User Centric Approach Katrien De Moorimec.archive
 
Ee472 ieee report loren schwappach
Ee472 ieee report loren schwappachEe472 ieee report loren schwappach
Ee472 ieee report loren schwappachLoren Schwappach
 
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...iosrjce
 
SMARCOS Project Brochure
SMARCOS Project Brochure SMARCOS Project Brochure
SMARCOS Project Brochure Smarcos Eu
 
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)Sung Kim
 
Paper id 312201518
Paper id 312201518Paper id 312201518
Paper id 312201518IJRAT
 
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...Maria Eskevich
 
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTS
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTSA GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTS
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTSIJNSA Journal
 
Form 5 topics combined
Form 5 topics combinedForm 5 topics combined
Form 5 topics combinedMummy Irdina
 
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...Shalu Singh
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)PACE 2.0
 
Hfes 2012 saqer et al
Hfes 2012   saqer et alHfes 2012   saqer et al
Hfes 2012 saqer et alhsaqer
 
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...IDES Editor
 

Tendances (19)

"Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N...
"Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N..."Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N...
"Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N...
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
A Quick Glance over the Digital Watermarking
A Quick Glance over the Digital WatermarkingA Quick Glance over the Digital Watermarking
A Quick Glance over the Digital Watermarking
 
50120140504020 2
50120140504020 250120140504020 2
50120140504020 2
 
Qo E E2 E5 User Centric Approach Katrien De Moor
Qo E E2 E5   User Centric Approach   Katrien De MoorQo E E2 E5   User Centric Approach   Katrien De Moor
Qo E E2 E5 User Centric Approach Katrien De Moor
 
Ee472 ieee report loren schwappach
Ee472 ieee report loren schwappachEe472 ieee report loren schwappach
Ee472 ieee report loren schwappach
 
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...
A Hybrid Model of Watermarking Scheme for Color Image Authentication Using Di...
 
SMARCOS Project Brochure
SMARCOS Project Brochure SMARCOS Project Brochure
SMARCOS Project Brochure
 
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)
CosTriage: A Cost-Aware Algorithm for Bug Reporting Systems (AAAI 2011)
 
6543456
65434566543456
6543456
 
Paper id 312201518
Paper id 312201518Paper id 312201518
Paper id 312201518
 
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...
Towards Methods for Efficient Access to Spoken Content in the AMI Corpus (SSC...
 
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTS
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTSA GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTS
A GENERIC FRAMEWORK FOR DEVICE PAIRING IN UBIQUITOUS COMPUTING ENVIRONMENTS
 
dwm
dwmdwm
dwm
 
Form 5 topics combined
Form 5 topics combinedForm 5 topics combined
Form 5 topics combined
 
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...
A Robust Deinterlacing Multiple Image Watermarking Technique in Discrete Wave...
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)
 
Hfes 2012 saqer et al
Hfes 2012   saqer et alHfes 2012   saqer et al
Hfes 2012 saqer et al
 
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...
Visual Programming and Program Visualization – Towards an Ideal Visual Softwa...
 

En vedette

Secure QR code payment
Secure QR code paymentSecure QR code payment
Secure QR code paymentJames Wu
 
Braç Robot Hidràulic
Braç Robot HidràulicBraç Robot Hidràulic
Braç Robot Hidràulicsergi1navarro
 
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy Medley
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy MedleyShow 68 PERQ INTERVIEW w/ Scott Hill & Andy Medley
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy MedleyErin Sparks
 
April Vacation
April VacationApril Vacation
April VacationDBrandon69
 
Marketing During Recession, Pham Viet Anh
Marketing During Recession, Pham Viet AnhMarketing During Recession, Pham Viet Anh
Marketing During Recession, Pham Viet Anhphamvietanh
 
Getting published – how the library can help
Getting published – how the library can helpGetting published – how the library can help
Getting published – how the library can helpbradscifi
 
Atty Fee 08 2009
Atty Fee 08 2009Atty Fee 08 2009
Atty Fee 08 2009bigfaz2009
 
蘇富惠
蘇富惠蘇富惠
蘇富惠nice567
 
Save Your Client\'s Money With MSA
Save Your Client\'s Money With MSASave Your Client\'s Money With MSA
Save Your Client\'s Money With MSAbigfaz2009
 
黃品豪
黃品豪黃品豪
黃品豪nice567
 
Using Google Analytics to Measure Social Media Success
Using Google Analytics to Measure Social Media SuccessUsing Google Analytics to Measure Social Media Success
Using Google Analytics to Measure Social Media SuccessPerficient
 
M. VRUBEL
M. VRUBELM. VRUBEL
M. VRUBELlensu
 
Founders guild epiphany
Founders guild epiphanyFounders guild epiphany
Founders guild epiphanySteve Newcomb
 

En vedette (20)

Secure QR code payment
Secure QR code paymentSecure QR code payment
Secure QR code payment
 
Braç Robot Hidràulic
Braç Robot HidràulicBraç Robot Hidràulic
Braç Robot Hidràulic
 
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy Medley
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy MedleyShow 68 PERQ INTERVIEW w/ Scott Hill & Andy Medley
Show 68 PERQ INTERVIEW w/ Scott Hill & Andy Medley
 
April Vacation
April VacationApril Vacation
April Vacation
 
Marketing During Recession, Pham Viet Anh
Marketing During Recession, Pham Viet AnhMarketing During Recession, Pham Viet Anh
Marketing During Recession, Pham Viet Anh
 
Getting published – how the library can help
Getting published – how the library can helpGetting published – how the library can help
Getting published – how the library can help
 
Atty Fee 08 2009
Atty Fee 08 2009Atty Fee 08 2009
Atty Fee 08 2009
 
inflation
inflationinflation
inflation
 
Facebook Por Facebook
Facebook Por FacebookFacebook Por Facebook
Facebook Por Facebook
 
蘇富惠
蘇富惠蘇富惠
蘇富惠
 
Websites updated trends
Websites updated trendsWebsites updated trends
Websites updated trends
 
Det Starkaste
Det StarkasteDet Starkaste
Det Starkaste
 
Wc To2009
Wc To2009Wc To2009
Wc To2009
 
Save Your Client\'s Money With MSA
Save Your Client\'s Money With MSASave Your Client\'s Money With MSA
Save Your Client\'s Money With MSA
 
黃品豪
黃品豪黃品豪
黃品豪
 
Using Google Analytics to Measure Social Media Success
Using Google Analytics to Measure Social Media SuccessUsing Google Analytics to Measure Social Media Success
Using Google Analytics to Measure Social Media Success
 
M. VRUBEL
M. VRUBELM. VRUBEL
M. VRUBEL
 
Paisajes
PaisajesPaisajes
Paisajes
 
Founders guild epiphany
Founders guild epiphanyFounders guild epiphany
Founders guild epiphany
 
DEFINICION DE LITERATURA
DEFINICION DE LITERATURADEFINICION DE LITERATURA
DEFINICION DE LITERATURA
 

Similaire à V Code And V Data Illustrating A New Framework For Supporting The Video Annotation Workow

Hardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareHardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareWinstina Kennedy
 
An Stepped Forward Security System for Multimedia Content Material for Cloud ...
An Stepped Forward Security System for Multimedia Content Material for Cloud ...An Stepped Forward Security System for Multimedia Content Material for Cloud ...
An Stepped Forward Security System for Multimedia Content Material for Cloud ...IRJET Journal
 
Video copy detection using segmentation method and
Video copy detection using segmentation method andVideo copy detection using segmentation method and
Video copy detection using segmentation method andeSAT Publishing House
 
Automatic Visual Concept Detection in Videos: Review
Automatic Visual Concept Detection in Videos: ReviewAutomatic Visual Concept Detection in Videos: Review
Automatic Visual Concept Detection in Videos: ReviewIRJET Journal
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONMtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONNEERAJ BAGHEL
 
Content based video retrieval system
Content based video retrieval systemContent based video retrieval system
Content based video retrieval systemeSAT Publishing House
 
Engelman.2011.exploring interaction modes for image retrieval
Engelman.2011.exploring interaction modes for image retrievalEngelman.2011.exploring interaction modes for image retrieval
Engelman.2011.exploring interaction modes for image retrievalmrgazer
 
Inverted File Based Search Technique for Video Copy Retrieval
Inverted File Based Search Technique for Video Copy RetrievalInverted File Based Search Technique for Video Copy Retrieval
Inverted File Based Search Technique for Video Copy Retrievalijcsa
 
BrownResearch_CV
BrownResearch_CVBrownResearch_CV
BrownResearch_CVAbby Brown
 
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNING
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNINGAUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNING
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNINGIRJET Journal
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationIRJET Journal
 
3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hdEmpirix
 
3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hdEmpirix
 
IRJET - Applications of Image and Video Deduplication: A Survey
IRJET -  	  Applications of Image and Video Deduplication: A SurveyIRJET -  	  Applications of Image and Video Deduplication: A Survey
IRJET - Applications of Image and Video Deduplication: A SurveyIRJET Journal
 
Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)IT Industry
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsAndrea Ferracani
 

Similaire à V Code And V Data Illustrating A New Framework For Supporting The Video Annotation Workow (20)

Hardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareHardware Design Practices For Modern Hardware
Hardware Design Practices For Modern Hardware
 
An Stepped Forward Security System for Multimedia Content Material for Cloud ...
An Stepped Forward Security System for Multimedia Content Material for Cloud ...An Stepped Forward Security System for Multimedia Content Material for Cloud ...
An Stepped Forward Security System for Multimedia Content Material for Cloud ...
 
Video copy detection using segmentation method and
Video copy detection using segmentation method andVideo copy detection using segmentation method and
Video copy detection using segmentation method and
 
Guru_poster
Guru_posterGuru_poster
Guru_poster
 
Automatic Visual Concept Detection in Videos: Review
Automatic Visual Concept Detection in Videos: ReviewAutomatic Visual Concept Detection in Videos: Review
Automatic Visual Concept Detection in Videos: Review
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONMtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
 
Content based video retrieval system
Content based video retrieval systemContent based video retrieval system
Content based video retrieval system
 
Engelman.2011.exploring interaction modes for image retrieval
Engelman.2011.exploring interaction modes for image retrievalEngelman.2011.exploring interaction modes for image retrieval
Engelman.2011.exploring interaction modes for image retrieval
 
Inverted File Based Search Technique for Video Copy Retrieval
Inverted File Based Search Technique for Video Copy RetrievalInverted File Based Search Technique for Video Copy Retrieval
Inverted File Based Search Technique for Video Copy Retrieval
 
BrownResearch_CV
BrownResearch_CVBrownResearch_CV
BrownResearch_CV
 
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNING
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNINGAUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNING
AUTOMATIC DETECTION OF TRAFFIC ACCIDENTS FROM VIDEO USING DEEP LEARNING
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage Summarization
 
3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd
 
3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd3 d video coding & streaming real time of hd
3 d video coding & streaming real time of hd
 
IRJET - Applications of Image and Video Deduplication: A Survey
IRJET -  	  Applications of Image and Video Deduplication: A SurveyIRJET -  	  Applications of Image and Video Deduplication: A Survey
IRJET - Applications of Image and Video Deduplication: A Survey
 
Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing Systems
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing Systems
 

Plus de GoogleTecTalks

Web Hooks And The Programmable World Of Tomorrow
Web Hooks And The Programmable World Of TomorrowWeb Hooks And The Programmable World Of Tomorrow
Web Hooks And The Programmable World Of TomorrowGoogleTecTalks
 
Using The Google Collections Library For Java
Using The Google Collections Library For JavaUsing The Google Collections Library For Java
Using The Google Collections Library For JavaGoogleTecTalks
 
Voice Browsing And Multimodal Interaction In 2009
Voice Browsing And Multimodal Interaction In 2009Voice Browsing And Multimodal Interaction In 2009
Voice Browsing And Multimodal Interaction In 2009GoogleTecTalks
 
New Media Mavericks Will The Revolution Be Spidered
New Media Mavericks Will The Revolution Be SpideredNew Media Mavericks Will The Revolution Be Spidered
New Media Mavericks Will The Revolution Be SpideredGoogleTecTalks
 
Performance Improvements In Browsers
Performance Improvements In BrowsersPerformance Improvements In Browsers
Performance Improvements In BrowsersGoogleTecTalks
 
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...GoogleTecTalks
 
Black Cloud Patterns Toward The Future
Black Cloud Patterns Toward The FutureBlack Cloud Patterns Toward The Future
Black Cloud Patterns Toward The FutureGoogleTecTalks
 
Advanced Ruby Scripting For Sketch Up
Advanced Ruby Scripting For Sketch UpAdvanced Ruby Scripting For Sketch Up
Advanced Ruby Scripting For Sketch UpGoogleTecTalks
 
An Introduction To Android
An Introduction To AndroidAn Introduction To Android
An Introduction To AndroidGoogleTecTalks
 
Advanced Gadget And Ui Development Using Googles Ajax Ap Is
Advanced Gadget And Ui Development Using Googles Ajax Ap IsAdvanced Gadget And Ui Development Using Googles Ajax Ap Is
Advanced Gadget And Ui Development Using Googles Ajax Ap IsGoogleTecTalks
 
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...GoogleTecTalks
 
Keynote Client Connectivity And The Cloud
Keynote Client Connectivity And The CloudKeynote Client Connectivity And The Cloud
Keynote Client Connectivity And The CloudGoogleTecTalks
 

Plus de GoogleTecTalks (13)

Web Hooks And The Programmable World Of Tomorrow
Web Hooks And The Programmable World Of TomorrowWeb Hooks And The Programmable World Of Tomorrow
Web Hooks And The Programmable World Of Tomorrow
 
Using The Google Collections Library For Java
Using The Google Collections Library For JavaUsing The Google Collections Library For Java
Using The Google Collections Library For Java
 
Voice Browsing And Multimodal Interaction In 2009
Voice Browsing And Multimodal Interaction In 2009Voice Browsing And Multimodal Interaction In 2009
Voice Browsing And Multimodal Interaction In 2009
 
New Media Mavericks Will The Revolution Be Spidered
New Media Mavericks Will The Revolution Be SpideredNew Media Mavericks Will The Revolution Be Spidered
New Media Mavericks Will The Revolution Be Spidered
 
Performance Improvements In Browsers
Performance Improvements In BrowsersPerformance Improvements In Browsers
Performance Improvements In Browsers
 
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...
13353102 Putting The Fun In Functional Applying Game Mechanics To Functional ...
 
Black Cloud Patterns Toward The Future
Black Cloud Patterns Toward The FutureBlack Cloud Patterns Toward The Future
Black Cloud Patterns Toward The Future
 
Advanced Ruby Scripting For Sketch Up
Advanced Ruby Scripting For Sketch UpAdvanced Ruby Scripting For Sketch Up
Advanced Ruby Scripting For Sketch Up
 
An Introduction To Android
An Introduction To AndroidAn Introduction To Android
An Introduction To Android
 
Advanced Gadget And Ui Development Using Googles Ajax Ap Is
Advanced Gadget And Ui Development Using Googles Ajax Ap IsAdvanced Gadget And Ui Development Using Googles Ajax Ap Is
Advanced Gadget And Ui Development Using Googles Ajax Ap Is
 
Advanced Kml
Advanced KmlAdvanced Kml
Advanced Kml
 
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...
A World Beyond Ajax Accessing Googles Ap Is From Flash And Non Java Script En...
 
Keynote Client Connectivity And The Cloud
Keynote Client Connectivity And The CloudKeynote Client Connectivity And The Cloud
Keynote Client Connectivity And The Cloud
 

Dernier

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Dernier (20)

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

V Code And V Data Illustrating A New Framework For Supporting The Video Annotation Workow

  • 1. VCode and VData: Illustrating a new Framework for Supporting the Video Annotation Workflow Joey Hagedorn, Joshua Hailpern, Karrie G. Karahalios Department of Computer Science University of Illinois at Urbana Champaign Urbana, IL 61801 USA {hagedorn, jhailpe2, kkarahal}@uiuc.edu ABSTRACT Digital tools for annotation of video have the promise to provide immense value to researchers in disciplines rang- ing from psychology to ethnography to computer science. With traditional methods for annotation being cumbersome, time-consuming, and frustrating, technological solutions are situated to aid in video annotation by increasing reliabil- ity, repeatability, and workflow optimizations. Three no- table limitations of existing video annotation tools are lack of support for the annotation workflow, poor representation of data on a timeline, and poor interaction techniques with video, data, and annotations. This paper details a set of design requirements intended to enhance video annotation. Our framework is grounded in existing literature, interviews with experienced coders, and ongoing discussions with re- searchers in multiple disciplines. Our model is demonstrated Figure 1: The VCode application graphically repre- in a new system called VCode and VData. The benefit of sents the behaviors being coded as marks on a time- our system is that is directly addresses the workflow and line. It is easy to see correlation between the marks needs of both researchers and video coders. on the timeline and sensor data displayed below. Categories and Subject Descriptors human centered domains can now rely on video annotation to provide them with measures on which to draw conclu- H.5.1 [Multimedia Information Systems]: sions. Unlike transcription, which is akin to what a court Evaluation/methodology Video stenographer does, annotation is the marking of movements, ; H.5.2 [User Interfaces]: Graphical user interfaces sounds, and other such events (with or without additional metadata such as rankings). The emergence of technology as General Terms a tool to aid in video annotation has raised the possibility of Design, Human Factors, Measurement increasing reliability, repeatability, and workflow optimiza- tions [6]. Three notable limitations of existing video anno- tation tools are lack of support for the annotation workflow, Keywords poor representation of data on a timeline, and poor inter- Graphical user interfaces (GUI), Annotation, Video action techniques with video, data, and annotations. This paper details a set of requirements to guide the design of 1. INTRODUCTION video annotation tools. Our model is the direct result of an analysis of existing tools, current practices by researchers, Human behavior does not naturally lend itself to being and workflow difficulties experienced by real-world video quantifiable. Yet time and again, researchers in disciplines coders. By understanding what data researchers are looking ranging from psychology to ethnography to computer sci- to gather, and the shortcomings of existing techniques and ence, are forced to analyze as if it was quantified. Those in technology utilized by coders, we believe that we have cre- ated a framework for video annotation that can reach across disciplines. Our model is demonstrated through the design and construction of our new system VCode and VData (Fig- Permission to make digital or hard copies of all or part of this work for ure 1); two fully functional, open-source tools which bridge personal or classroom use is granted without fee provided that copies are the video annotation workflow. not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to The primary contribution of this paper is the set of design republish, to post on servers or to redistribute to lists, requires prior specific requirements for facilitating a system conducive to video an- permission and/or a fee. notation. Specifically, we demonstrate how a system could AVI ’08 May 28-30, 2008, Naples, Italy be designed and built to meet these requirements through a Copyright 200X ACM X-XXXXX-XX-X/XX/XX ...$5.00.
  • 2. set of carefully designed interfaces and graphical represen- One last limitation is lack of support for the full annota- tations of data. tion workflow; 1) collect video 2) create segments to code 3) train coders/demonstrate reliability 4) gather data 5) per- form regular checks on reliability & discuss discrepancies 6) 2. RELATED WORK perform data analysis. Many tools support small portions of this workflow (i.e. simply facilitating segmentation, an- 2.1 Video Coding In Practice notation, or reliability [22, 6, 12]), but with each break The analysis of human behavior is a study that dates back in the process researchers can become delayed. Without ex- hundreds of years. This has ranged from anthropological port/import data reentry is required. Technology is situated ethnographies [10] to psychological evaluations. As tech- to optimize this process. nology has developed, the use of video and creation of an- Researchers have also explored dialogue transcription [17, notation techniques have aided researchers by providing a 13, 12], tagging [2, 12], scene based automatic annotation referable document that can be used as evidence to back up [9, 19], automatic event logging [5], and object of focus claims and observations made [21, 16, 20]. These techniques identification [8]. Our work contrasts these other foci by involve detailed event logging on paper, specifying features demonstrating techniques for supporting human based an- such as durations, ratings/levels, and time-stamps [15]. notation of events that occur in video. To ensure a reliable set of data from annotation, researchers perform agreement calculations between coders [7]. This 3. INTERVIEWS & COLLABORATION agreement is utilized throughout the data gathering process (by testing some small percentage of data segments to en- To gain a deeper understanding of methods, analysis pro- sure consistency throughout), but also during training of cesses, bottlenecks, and types of data needed for effective coders (to decide when they fully understand what events video annotation software, we maintained an active dialogue they are looking for). There are many techniques for calcu- with researchers (in Special Education, Speech and Hear- lating agreement including Cohens Kappa [11], Cochran’s Q- ing Sciences, and Computer Science) who use video anno- test, and Point-By-Point Agreement. Regardless, the man- tation, conducted informal 40 minute interviews with two agement of data with traditional means is considered “cum- experienced video coders, and refined functionality through bersome” [20]. dialog with current users of VCode and VData. Existing tools for video annotation may address a subset of the be- 2.2 Video Coding Tools low described requirements, however, our system more fully Digital annotation tools have demonstrated significant ben- satisfies all of them. efits from simple copy/paste and undo to increased qual- R1 Facilitate Coding Workflow: The coding workflow con- ity of coding by the facilitation of multiple passes on video sists of; (1) establishing video clips and coding guide- and graphical representations [4, 6]. A timeline is com- lines, (2) intense training of coders and checks for re- monly utilized in these tools and is familiar without ex- liability, (3) annotation of videos, (4) weekly reliability tensive training [1]. Existing research also indicates that checks on annotated videos, (5) repeat 3 and 4 ad in- presenting coders with secondary or sensor data on a time- finitum, (6) analyze data in statistical packages. Tools line helps them outperform coders without sensor data [6]. targeting video annotation should attempt to optimize Increased accuracy, quality, and speed not only enhance the the transition between steps in this workflow. data collected, but also allow for more annotation to be con- ducted in the same amount of time. In addition to the com- R2 Video, Annotations, and Coding Guidelines should be putational benefits of digital annotation tools, they also pro- presented in a synchronized manner: Interviewees de- vide a controllable mechanism for different forms of reliable scribed their coding process centering around analog video playback. [4]. video on a TV-VCR device, annotating in a Microsoft One critical limitation of existing tools is poor represen- Excel file, and referencing lengthy code guidelines. Due tation of data on a timeline and utilization of screen real- to the visual separation between annotations, source ma- estate. For example, the VACA solution, while utilizing min- terial, and video, coders had great difficulty during re- imal screen real-estate by condensing all annotations to one views. large easy to read track, presents a problem with overlap- ping and simultaneous events [6]. The VisSTA solution R3 Capture Appropriate Data: Researchers and existing lit- takes the contrary approach by showing many small verti- erature indicate that there are different types of data cally tiled tracks. Though this allows for a good comparative that are collected through the annotation process: count- view, reading individual annotations & holistic interpreta- ing events/occurences, determining duration of events, tion is difficult due to scrolling [4]. These and other existing assigning levels, values, or ranking to events, performing solutions have not successfully dealt with this problem [14, phonetic transcription, and general commenting [14]. 3, 13, 12, 1, 2, 22, 18]. Effective interfaces must provide methods for capturing Another limitation of current annotation tools is poor in- these conceptually different data types while preserving teraction techniques with video and data. Though robust each of their unique nuances. functionality is provided for playback, controls can be cum- bersome & overly complex, (e.g. [4]). Too many win- R4 Additional data should be displayed to coders: Effec- dows resulting in an over-saturation of information, impre- tive annotation tools should allow researchers to pro- cise video interaction & annotation or rigid, inSSexible mark- vide additional data to coders to aid in their assessment ing interfaces (e.g. [4, 14, 12, 18, 3]). Each of these are of video; for example, a volume histogram of the cur- common stumbling blocks which could result in unreliable rent video, sensor/log data collected in tandem to the data. video capture, or annotations made automatically or
  • 3. from another source. Displaying additional datapoints can make the best annotation decision possible. has shown to increase the accuracy of coded events [6]. Video: To facilitate multiple video streams VCode presents Further, annotation software should facilitate the man- one main video at full size, and a dock with other streams agement of multiple video streams to get the most ac- playing in real time. When a docked stream is clicked on, curate ”view” on the session, and thus produce the most it repositions itself into the main video window, while the accurate data [4]. video which was the previous focus, scales down to the dock, thus equating visual importance with relative size and visual R5 Allow multiple forms of playback: Researchers mentioned weight. that continuous playback is not always the preferred method of analyzing a video. Often multiple modes of Events: When annotating a video, two different classes of playback are utilized; continuous or standard playback, coding events emerge: ranged and momentary. A ranged continuous interval playback (play for N seconds, then event is one which extends over a period of time (mark- stop), and skip interval playback (jump N seconds, then ing action start and duration). Momentary marks have no stop). This allows the video to be divided in to smaller duration, and thus represent one specific moment in time. segments for annotation of events that are more diffi- Comments can be attached to any mark, allowing addi- cult to pinpoint (i.e. when a smile starts or ends) [4]. tional observations, levels/ranking, or phonetic transcrip- Though conceptually simple, manipulations of video us- tion (through onscreen phonetic keyboard). Any mark with ing a standard VCR was described as “annoying” and a comment has a inverted outlines to signify that it has a “a mess” due to hand eye coordination and repeatability comment attached. Figure 1 shows a ranged event repre- issues. senting the length of time which a child is making a sound, with additional momentary marks at the start noting other R6 Agreement calculations should be easy and manipulat- features of the child’s state of being). able: Regardless of agreement technique used, researchers Timeline: The timeline is the heart of VCode. It is mod- expressed a frustration in attempts to calculate inter- eled after the moving timeline one might find in a video edit- observer reliability. Specifically, existing solutions were ing application (e.g. iMovie, Final Cut Pro, etc.). Events, limited to importing data into a statistical software pack- graphically represented by diamonds, appear in a spatial lin- age for calculation or calculating them by hand. Video ear fashion to sync with the video. Once an event has been annotation tools should provide quick & easy reliability placed on the timeline, it can be graphically manipulated by calculations for individual variables, as well as overall. dragging, clicking, and double-clicking. The standard solu- R7 Provide functionality for visual, graphical and contextual tion for dealing with large numbers of tracks or variables is review of annotations: In interviews, coders lamented to provide a vertical scroll bar or overlay tracks. Rather than the process of ensuring reliability on a weekly basis; as limiting the amount of information on screen by scrolling, it consisted of searching through printouts of a spread- tracks representing momentary events are “stacked,” such sheet for discrepancies. Specifically, by lacking context that they vertically overlap. This optimizes usage of the in this spreadsheet coders found it difficult to recognize screen while still providing enough area for track isolation what a given coding mark referred to due to the lack and selection, even under dense data conditions. Ranged of synchronization with video. By providing a visual, event tracks are unable to benefit from this stacking opti- graphical way to review annotations (in the context of mization because of the more complicated interaction for the video) coders would be better able to justify the de- manipulation and thus are vertically tiled. Researchers can cisions, determine the correct solution, and save time present video volume, sensor data, software log data (from identifying the errors. Eclipse or Photoshop for example), and even other annota- tions to the coders. This additional information is presented 4. VCODE AND VDATA graphically to the users by bar, line, or scatter plot. This secondary data can allow coders to annotate data captured VCode and VData are a suite of applications which create by other sources than the video streams, as well as pro- a set of effective interfaces for the coding workflow following vide additional context to their code. For example, should the above design requirements. Our system has three main a coder be instructed to mark when a certain noise occurs, components: VCode (annotation), VCode Admin Window he can line the mark up with an audio peek, rather than (configuration) and VData (examination of data, coder agree- estimate it and be concerned with reaction time. ment and training). The interaction with VCode and VData is demonstrated in Figures 2-4 in which two coders are mark- Interaction: Annotations can be inserted into the timeline ing a video of a child in an experiment, and checking the via UI buttons or keyboard hot keys. To optimize the typi- agreement between their annotations. The reader should cally complex transport controls we isolated the key activi- note our solution is only one possible implementation of the ties that coders need execute and provided controls limited design requirements, and that these requirements could be to play/pause buttons, coarse and fine grained playhead po- applied to improving existing video annotation software. sitioning, and step controls. The three modes of playback outlined in R5 are available. 4.1 VCode 4.2 VCode Administration Window The VCode application (Figure 1) is designed to provide researchers with an effective way to obtain reliable data from To ensure consistent configuration between coders and ses- an observational research video. By allowing researchers to sions, all administrative features are consolidated in a single present multiple video streams in addition to other sensor window. The expected workflow is such that a researcher data (e.g. log data, annotations from other software, or sig- would setup a single coding document with all the variables nals recorded by a computer/monitoring device) the coder to be used on all the videos. This template would then be
  • 4. centage agreement. For each event (momentary or ranged) an opportunity is said to occur when the primary coder makes a mark. If the secondary coder also makes a mark within a specified short interval, the marks are said to agree. agreements A percentage is calculated from opportunities for easy inter- pretation. A tolerance variable is also present to (1) ac- commodate for variability in the mark placement by the coders, and (2) recognition that there is no quantization of marks beyond the granularity of the millisecond timescale, a property of the system. VData also provides agreement for ranged events and annotations in a similar fashion. It is not uncommon for multiple tracks or variables to be mea- suring slight variations on a theme (e.g. smiling vs. large smile vs. grin ), thus VData implements a track-merging Figure 2: The code is specified in the Administration feature which allows opportunities on two distinct tracks to Window along with the different video angles, screen be treated indistinguishably. For a holistic view, researchers capture, and log data. can select tracks to be added into a total agreement calcu- lation. In other words, if analysis determines that a single duplicated (with media and log files inserted for each trial). track is not reliable or it is determined that a given track The main task the Administration Window (Figure 2) is to will not be used in the future, it can be easily excluded from facilitate is the creation of tracks, used to code data. Re- the total agreement calcuation. searchers can add, remove, and reorder tracks which appear Conflict Resolution & Exporting: We have optimized in a list format. The name, color and hot key of each tack coder training and reliability analysis by providing a graphi- can be set through this list presentation. Tracks can be en- cal mechanism to directly compare annotations of two coders. abled as ranged events through a check box in this interface. VData can create a VCode session containing specific tracks The Administration Window is also where a researcher spec- of two individual coders for side-by-side comparison. The vi- ifies videos and data file to be coded, as well as secondary sual, side by side, representation of the data makes it easy to data for contextual annotation. These elements are speci- recognize systematic errors in context and detect differences fied and synchronized through a drag and drop interface, all between two coders markings. This reduces the time nec- of which is hidden from the coder to prevent configuration essary to locate discrepancies and discuss the reasons why corruption. they might have occurred. It is necessary to keep records of these agreement analyses performed with VData by text 4.3 VData export. Maintaining export at each stage of the process pro- vides additional transparency and maintains traceability of results that come out of the system. 4.4 Implementation Our system was implemented in Objective-C using the Cocoa Framework for Mac OS X 10.5. VCode supports all video formats and codecs supported by QuickTime to enable wide compatibility with available video files. 5. MEETING REQUIREMENTS To ensure Video, Annotations, and Coding Guidelines are presented in a synchronized manner, VCode provides a uni- fied interface containing the target video, a timeline with graphically represented annotations (ranged event, momen- Figure 3: Later, analysis is performed on indepen- tary event, or comment depending on data metaphor), ad- dent codings of the same video. A track with low ditional tracks of signal data (to increase accuracy), and a agreement can be reconciled by viewing the results list of coding guidelines which place marks and stand as of two coders side-by-side in VCode, thanks to the a visual reminder. Three forms of video playback (conti- capabilities of the VData analysis tool. nous/standard, interval playback, skip interval playback) are available via check boxes on the main VCode window Critical aspects of the video coding workflow (training, re- to allow easily switching between modes of playback. liability, and accuracy) revolve around demonstrating agree- VData provides a dynamic interface for real time calcula- ment between coders. VData (Figure 3) is a separate exe- tion of multiple agreement values to facilitate easy and dy- cutable application specifically targeted to aid researchers namic agreement calculations. Through the transparent cal- in training and agreement analysis of coded data produced culation process, researchers can see both the raw data, and in VCode. the percentages side by side for easy judgements about the Multi Coder Analysis: By loading two VCode files into reliability of data collected. Upon request a visual, graphi- VData, tracks are automatically loaded into the main data cal and contextual review of annotations for both agreement table which presents opportunities, agreements, and per- review and training is supported.
  • 5. Finally, the Coding Workflow is encouraged through VCode’s needs of both researchers and coders. template model in conjunction with the separate VCode From the reaction of the coders, as well as our own as- Adminstration Window for easy set up and configuration. sessment of VCode and VData, we have many directions of Training, data collection, and inter-coder agreement are en- possible future work. One avenue is creating a database or abled through a tight collaboration between annotation envi- networked system in order to facilitate remote access to con- ronment and agreement analysis. By consistently providing tent, and management of coding objects and assignments for data export, researchers can be assured that any information individual coders. It is foreseeable that the system could be annotated by coders can be easily extracted and exported extended to a tool to prepare coding files; assist in dividing into the statistical analysis tool of their choice. up raw footage, syncing data to video enmasse, and other automation hooks. This could leverage some of the other existing work in automatic video segmentation. Lastly, we 6. INITIAL REACTION hope to address some of the concerns of our coders, including To evaluate our system in a cursory fashion, we conducted creating a richer set of data visualizations. an informal series of interviews with several coders that used our system during the course of an independent study. Anal- 8. ACKNOWLEDGMENTS ysis using VData showed inter-observer agreement was good We would like to thank NSF for their support of this work and provided valuable coded data for the study. In general, (NSF-0643502). comments from the coders were positive, especially when comparing the VCode system to non-computerized meth- 9. REFERENCES ods. One coder wrote: “The software was easy to use in general, and cut down on coding time.” Several features of [1] Annotations as multiple perspectives of video content, VCode stood out in their comments; color coding of tracks Juan-les-Pins, France, 2002. ACM. [2] The Family Video Archive: an annotation and browsing provided direct linkage between events on the timeline and environment for home movies, Berkeley, CA, 2003. ACM. the description panel, the correlation between files was clear [3] Fluid interaction techniques for the control and annotation to see during review, sensor data helped anticipate events of digital video, Vancouver, Canada, 2003. ACM. and accurately code them. It was also noted that the sensor [4] A Coding Tool for Multimodal Analysis of Meeting Video, Montreal, Quebec, Canada, 2004. IEEE. data provided reassurance that what they had noticed in the [5] Creating Multi-Modal, User-Centric Records of Meetings video was actually correct. with the Carnegie Mellon Meeting Recorder Architecture, In addition to these positive marks we uncovered several Montreal, Quebec, Canada, 2004. IEEE. shortcomings of the interface. The seemingly low-resolution [6] Work-in-progress: VACA: a tool for qualitative video analysis, Montreal, Quebec, Canada, 2006. ACM. bar-graph of volume data left coders unsure where precisely [7] K. J. Berry and P. W. Mielke. A generalization of cohen’s to make their mark. Because the elements of this graph are kappa agreement measure to interval measurement and relatively wide, it appears especially coarse in comparison multiple raters, 1988. with the precision with which one may place a mark on [8] M. Bertini, A. Del Bimbo, R. Cucchiara, and A. Prati. Applications ii: Semantic video adaptation based on automatic the timeline. A spectrogram was suggested as an alternate annotation of sport videos. In Proceedings of the 6th ACM visualization of the audio data that could help understand SIGMM international workshop on Multimedia information sound and video. retrieval MIR, 2004. Overall, results from these interviews were very encour- [9] S.-C. Chen, M.-L. Shyu, W. Liao, and C. Zhang. Scene change detection by audio and video clues. aging and suggest a more formal study to determine if per- www.cs.fiu.edu/~chens/PDF/ICME02_Video.pdf. formance improves in the same way that coders stated that [10] J. Clifford and G. Marcus, editors. Writing Culture. University they felt as the tool lowered the amount of time necessary of California Press, Berkeley, California, 1986. for coding. [11] A. J. Conger. Kappa reliabilities for continuous behaviors and events, 1985. [12] S. B. Group. Studiocode business group - supplier of 7. CONCLUSION AND FUTURE WORK studiocode and stream video analysis and distribution software. http://www.studiocodegroup.com, 2007. Video annotation tools can be valuable to researchers by [13] A. Johnson. About vprism video data analysis software. enhancing the annotation process through increased relia- http://www.camse.org/andy/VP/vprism.htm, 2007. [14] M. Kipp. Anvil - the video annotation research tool. bility, repeatability, and workflow optimizations. However, http://www.anvil-software.de, 2007. many existing solutions do not fully address all the needs of [15] B. J. Leadholm and J. F. Miller. Language Sample Analysis: researchers and coders; effective representation of data on The Wisconsin Guide. Wisconsin Department of Public a timeline, efficient and robust interaction techniques with Instruction, Madison, Wisconsin, 1995. [16] L. Lee. Developmental Sentence Analysis. Northwestern video and data, and support for the full video annotation University Press, Evanston, IL, 1974. workflow. Our research has provided many contributions in [17] S. S. LLC. Salt software. http://www.saltsoftware.com, 2007. addressing these weak points. [18] Noldus. The observer. We create a set of design requirements based on existing http://www.noldus.com/site/doc200401012, 2007. literature and annotation techniques, interviews with expe- [19] D. Ponceleon and S. Srinivasan. Automatic discovery of salient segments in imperfect speech transcripts. In CIKM ’01: rienced coders, and discussions with researchers in multiple Proceedings of the tenth international conference on disciplines. Based on these investigations, we implemented Information and knowledge management, pages 490–497, New a system, VCode and VData, that largely satisfies the re- York, NY, USA, 2001. ACM. [20] K. S. Retherford. Guide to Analysis of Language Transcripts. quirements we outlined. These systems were then used in Thinking Publications, Eau Claire, Wisconsin, 1993. ongoing research, and coders were interviewed concurrent [21] K. L. Rosenblum, C. Zeanah, S. McDonough, and M. Muzik. with and after using the software, and their reactions were Video-taped coding of working model of the child interviews: a solicited. Our model demonstrates how video annotation viable and useful alternative to verbatim transcripts?, 2004. [22] S. Soft. Annotation. http://www.saysosoft.com/, 2006. software, for many disciplines, can be enhanced to meet the