This research proposes a multimodal fusion framework for high-level data fusion between two or more modalities. It takes as input low-level features extracted from different system devices, then analyses and identifies intrinsic meanings in these data. Extracted meanings are mutually compared to identify complementarities, ambiguities, and inconsistencies, so as to better understand the user's intention when interacting with the system. The whole fusion life cycle is described
and evaluated in an OCE environment scenario, where two co-workers interact by voice and movement, which may reveal their intentions. Fusion in this case focuses on combining modalities to capture context and enhance the user experience.
A Fusion Framework for Multimodal Interactive Applications
1. A Fusion Framework for Multimodal Interactive Applications
Presented by: Hildeberto Mendonça
Jean-Yves Lionel Lawson
Olga Vybornova
Benoit Macq
Jean Vanderdonckt
ICMI-MLMI 2009 – Cambridge MA, USA, November 2-6, 2009
Special Session: Fusion Engines for Multimodal Interfaces
November 3, 2009
2. Motivations
How to support multimodal fusion so as to maximize reuse and minimize complexity?
If there is complexity in multimodal fusion, it should be about the fusion itself.
What already exists should be reused with minimal adaptation.
A general life cycle can guarantee a standard treatment for each modality.
3. Research Goal
To define and develop a multipurpose framework for high-level data fusion in multimodal interactive applications.
4. Fusion Principles
Type: parallel + combined = synergistic
Each modality is endowed with meanings
Level: feature (i.e., pattern extraction) + decision (i.e., recognized task)
Input devices: multiple
Notation: defined by the developer
Ambiguity resolution: defined by the developer
Time representation (quantitative and qualitative): both, as illustrated in the sketch after this list
Application type: the domain is defined using ontologies
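To make these principles concrete, here is a minimal sketch of a modality event carrying quantitative time, from which a qualitative temporal relation can be derived. The names (ModalityEvent, qualitative_relation) and the data shape are illustrative assumptions, not the framework's actual API.

```python
# Illustrative sketch only: class and function names are assumptions,
# not the framework's actual API.
from dataclasses import dataclass, field

@dataclass
class ModalityEvent:
    """One meaningful unit extracted from a single modality."""
    modality: str                 # e.g. "speech" or "movement"
    start: float                  # quantitative time (seconds)
    end: float
    concept: str                  # domain concept from the ontology
    features: dict = field(default_factory=dict)

def qualitative_relation(a: ModalityEvent, b: ModalityEvent) -> str:
    """Derive a qualitative temporal relation from quantitative times."""
    if a.end < b.start:
        return "before"
    if b.end < a.start:
        return "after"
    return "overlaps"

# Two events that do not coincide exactly in time can still be related:
speech = ModalityEvent("speech", 1.0, 2.5, "FindBook")
move = ModalityEvent("movement", 2.0, 6.0, "ApproachShelves")
print(qualitative_relation(speech, move))  # overlaps
```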
5. Process
Recognition: identification of patterns in input signals.
Segmentation: delimitation of the identified areas.
Meaning extraction: deeper analysis to identify meanings and correlations between segments according to specific domains.
Annotation: formal description of segments through domain concepts.
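A minimal sketch of this four-stage life cycle, with toy stand-in implementations for each stage; the function names and data shapes below are assumptions for illustration, not the framework's real interfaces.

```python
# Toy stand-ins for the four stages of the fusion life cycle.
def recognition(signal):
    """Identify patterns in the raw input signal."""
    return {"patterns": ["voice-activity"], "signal": signal}

def segmentation(recognized):
    """Delimit the areas identified by recognition."""
    return [{"span": (0.0, 2.5), "pattern": p} for p in recognized["patterns"]]

def meaning_extraction(segments, domain):
    """Analyse segments for domain-specific meanings."""
    return [{**s, "meaning": f"{domain}:utterance"} for s in segments]

def annotation(meanings):
    """Describe each segment formally through domain concepts."""
    return [{**m, "concept": m["meaning"].split(":")[-1]} for m in meanings]

result = annotation(meaning_extraction(segmentation(recognition("mic-frame")),
                                       "library"))
print(result)
```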
6. Process
The flow is fixed, but it can start at any point, respecting the sequence.
Not tied to any particular method; the method is "plugged" in.
Focus on a good level of analysis, not on real-time processing.
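One way to read the "plugged method" idea, sketched under the assumption of a simple stage registry: the stage order is fixed, but execution may enter at any stage. The Pipeline class is hypothetical, not the framework's actual mechanism.

```python
# Hypothetical sketch of a pluggable pipeline with a fixed stage order.
STAGES = ["recognition", "segmentation", "meaning_extraction", "annotation"]

class Pipeline:
    def __init__(self):
        self.methods = {}           # stage name -> plugged callable

    def plug(self, stage, method):
        self.methods[stage] = method

    def run(self, data, start="recognition"):
        # Respect the fixed sequence, starting at the requested stage;
        # unplugged stages simply pass the data through.
        for stage in STAGES[STAGES.index(start):]:
            data = self.methods.get(stage, lambda d: d)(data)
        return data

pipe = Pipeline()
pipe.plug("segmentation", lambda d: d + ["segmented"])
pipe.plug("meaning_extraction", lambda d: d + ["meanings"])
pipe.plug("annotation", lambda d: d + ["annotated"])

# Already-recognized input can enter the flow directly at segmentation:
print(pipe.run(["recognized"], start="segmentation"))
```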
10. Fusion Mechanism
Define a process for each modality and run them in parallel.
Data from each stage is buffered and processed together for the purpose of fusion.
Agent-oriented: the problem is solved in a distributed fashion.
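A toy sketch of the buffering idea: one thread stands in for each modality's process, and the fusion step consumes the buffered outputs together. The queue-based design is an assumption for illustration; the actual framework is agent-oriented and distributed.

```python
# Minimal sketch: one buffer per modality, filled in parallel, fused together.
import queue
import threading

def modality_process(name, samples, buffer):
    """Run one modality's pipeline and buffer its annotated events."""
    for s in samples:
        buffer.put((name, s))     # stand-in for the four-stage life cycle

speech_buf, move_buf = queue.Queue(), queue.Queue()

threads = [
    threading.Thread(target=modality_process,
                     args=("speech", ["find book"], speech_buf)),
    threading.Thread(target=modality_process,
                     args=("movement", ["towards shelves"], move_buf)),
]
for t in threads:
    t.start()
for t in threads:
    t.join()

# The fusion step consumes both buffers together:
while not (speech_buf.empty() or move_buf.empty()):
    print("fuse:", speech_buf.get(), "+", move_buf.get())
```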
15. Scenario
"Maybe I can find a book about it in the library."
Ronald is moving towards the bookshelves.
16. Results
Managed spatial relationships based on the fixed objects in the room.
Performed semantic fusion of events not coinciding in time.
Achieved good results in speaker identification, with synchronization between image and speech identification.
Created an open framework to manage fusion between two (in our case) or more modalities (in future work).
Designed the system so that each component can run on a separate machine, thanks to a distribution mechanism interchanging data through a TCP/IP network (sketched below).
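For the last point, here is a minimal, hypothetical sketch of two components exchanging an annotated event over TCP/IP on one machine; the JSON message format and the use of raw sockets are assumptions, not the framework's actual distribution protocol.

```python
# Hypothetical sketch: two components exchange an event over TCP/IP.
import json
import socket
import threading

# The receiving component's socket is set up first, so the sending
# component cannot connect before it is listening.
srv = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
srv.bind(("127.0.0.1", 0))        # any free port
srv.listen(1)
port = srv.getsockname()[1]

def component():
    """One framework component receiving annotated events."""
    conn, _ = srv.accept()
    print("received:", json.loads(conn.recv(4096).decode()))
    conn.close()

t = threading.Thread(target=component)
t.start()

# Another component (possibly on a different machine) sends an event:
cli = socket.create_connection(("127.0.0.1", port))
cli.sendall(json.dumps({"modality": "speech", "concept": "FindBook"}).encode())
cli.close()
t.join()
srv.close()
```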
17. Next Steps
Implement the segmentation and annotation of 3D content.
Migrate the framework to a real-time implementation.
Evaluate other methods under the rules of the framework.
Continuously extend the framework to support other fusion concepts and implementation methods.