1. Joint Inference of Groups, Events and
Human Roles in Aerial Videos
School of Electrical Engineering and Computer Science
Sinisa Todorovic
Autonomous Systems @ OSU Event
June 30, 2015
5. Our Approach
Input Inference Output
Exchange Box
Input Inference Output
Role AssignmentGrouping
Event Recognition
Guide
Consultant
Receiver
Box
Car
Info Consult
Exchange BoxGroup Tour
Guide
Consultant
Visitor
Deliverer
Receiver
Box
Trajectories
Frame registration
Tourist
Exchange Box
Group Tour
Info Consult
Exchange Box
Group Tour
Info Consult
18. Results
• Baseline: Hierarchical clustering of trajectories + SVM
• 3-fold cross validation
Method Group Event Role
Baseline 39.64% 16.94% 5.53%
Ours w/o sub-
event layer
40.41% 18.51% 8.69%
Our full model 49.47% 32.84% 18.92%
19. Summary
• New domain: low-res aerial videos
• Unified representation and joint inference of
groups, events and human roles
• New mid-level feature: ST-templates
• New aerial video dataset with detailed
annotations of:
– Human trajectories
– Objects
– Groups
– Events
– Human roles