Automated Information Retrieval
   and Text Categorization:
    The RIKS Demonstrator

           Acknowledge final event
             November 25, 2008
     Marie-Francine Moens, Erik Boiy, Javier Arias (HMDB-LIIR)
                      Saskia Debergh (i.Know)
         Philippe De Lombaerde, Birger Fühne (UNU-CRIS)




                     Overview
• UNU-CRIS: The RIKS Demonstrator
• K.U.Leuven:
    – Content extraction from multilingual Web pages
    – Text categorization: machine learning approach
    – Search engine and indexing infrastructure
    – Interfacing the Acknowledge platform
• i.Know:
    – Information forensics


                       Acknowledge 25-11-2008




The RIKS Demonstrator
• United Nations University – Comparative Regional
  Integration Studies (UNU-CRIS)
• Issues addressed in research and capacity building:
   – (i) emergence of regional (= supra-national)
     governance level
   – (ii) linkages with other governance levels (national,
     global/UN)
   – (iii) building of regional institutions
   – (iv) growing regional interdependence, etc.
• RIKS = Regional Integration Knowledge System
  (UNU-CRIS and GARNET NoE)




The RIKS Demonstrator
Issues addressed in the demonstrator:
  How to automate retrieval and processing
  (cleaning, search, categorization,
  presentation) of particular types of relevant
  information in an e-learning environment?
   – ‘News’: short texts, various formats, dynamic
     collection, short life cycle, role of news in
     e-learning application
   – ‘Documentation’: heterogeneous texts: scientific
     articles, theses, essays, ... , rather static collection
   – Treaty texts: long and complex texts, static
     collection, issue of accessibility




                   RIKS example output
                   [figure]




Demo








K.U.Leuven: Content extraction
 from multilingual Web pages

• Content extraction = extracting the main content from a
  Web page and removing extraneous data (navigation menus,
  advertisements, etc.)
• Requirements of the tool:
   – Accurate
   – Generic
   – Multilingual
   – Fast
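
The requirements above can be illustrated with a minimal sketch. It is not the method of Arias et al., which the slides do not describe; it is a common link-density heuristic, swapped in here only as an example. It keeps text blocks that are long enough and contain little link text, and it uses no vocabulary, so it does not depend on the page language. The names `BlockCollector` and `extract_main_content`, the tag sets, and the thresholds `min_chars` and `max_link_density` are assumptions chosen for the example.

```python
from html.parser import HTMLParser

# Assumption: illustrative tag sets, not taken from the slides.
SKIP_TAGS = {"script", "style", "noscript"}
BLOCK_TAGS = {"p", "div", "td", "li", "ul", "h1", "h2", "h3", "article", "section"}


class BlockCollector(HTMLParser):
    """Collects text blocks and counts the characters that sit inside links."""

    def __init__(self):
        super().__init__()
        self.blocks = []        # list of (text, link_chars) tuples
        self._text = []         # text pieces of the block being built
        self._link_chars = 0    # characters of the current block inside <a>
        self._skip_depth = 0    # > 0 while inside script/style
        self._link_depth = 0    # > 0 while inside <a>

    def flush_block(self):
        """Close the current block and store it if it contains any text."""
        text = " ".join("".join(self._text).split())
        if text:
            self.blocks.append((text, self._link_chars))
        self._text, self._link_chars = [], 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag == "a":
            self._link_depth += 1
        elif tag in BLOCK_TAGS:
            self.flush_block()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self._skip_depth = max(0, self._skip_depth - 1)
        elif tag == "a":
            self._link_depth = max(0, self._link_depth - 1)
        elif tag in BLOCK_TAGS:
            self.flush_block()

    def handle_data(self, data):
        if self._skip_depth:
            return
        self._text.append(data)
        if self._link_depth:
            self._link_chars += len(data.strip())


def extract_main_content(html, min_chars=40, max_link_density=0.3):
    """Keep blocks that are long enough and are not dominated by link text."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    parser.flush_block()
    kept = [
        text
        for text, link_chars in parser.blocks
        if len(text) >= min_chars and link_chars / len(text) <= max_link_density
    ]
    return "\n".join(kept)


if __name__ == "__main__":
    sample = "<ul><li><a href='/'>Home</a></li></ul><p>" + "Main article text. " * 5 + "</p>"
    print(extract_main_content(sample))
```

A single pass over the markup keeps such a heuristic fast, but fixed thresholds are a weak point: short news items or link-rich articles would need different settings.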

[Figures not reproduced. Source: Arias et al., submitted; in the second figure, [5] = Gottron 2008.]





 K.U.Leuven: Text categorization
• Heterogeneous documentation and Google News
  classified into 27 categories (e.g., trade, poverty, ...)
• Supervised classifiers: Multinomial Naïve Bayes, Support
  Vector Machine, ...
• Features:
   – unigrams, bigrams, feature item sets, ...
• Additional feature selection:
   – chi-square, information gain, linear classifier
      weights, orthogonal centroid feature selection
• Different test setups
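
As an illustration of one of the listed classifiers, here is a minimal sketch of Multinomial Naïve Bayes over unigram and bigram features with Laplace smoothing. It is written from the textbook definition, not from the K.U.Leuven implementation, and it leaves out the feature selection step. The class name `TinyMultinomialNB`, the helper `features`, and the four training sentences are hypothetical.

```python
import math
from collections import Counter, defaultdict


def features(text):
    """Return lowercased unigrams plus adjacent-word bigrams."""
    tokens = text.lower().split()
    bigrams = [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + bigrams


class TinyMultinomialNB:
    """Multinomial Naive Bayes with add-alpha (Laplace) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, texts, labels):
        self.class_docs = Counter(labels)            # documents per class
        self.feature_counts = defaultdict(Counter)   # feature counts per class
        for text, label in zip(texts, labels):
            self.feature_counts[label].update(features(text))
        self.vocab = {f for counts in self.feature_counts.values() for f in counts}
        return self

    def log_scores(self, text):
        """Log prior plus summed log likelihoods, per class."""
        n_docs = sum(self.class_docs.values())
        scores = {}
        for label, counts in self.feature_counts.items():
            total = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(self.class_docs[label] / n_docs)
            for f in features(text):
                if f in self.vocab:  # features never seen in training are ignored
                    score += math.log((counts[f] + self.alpha) / total)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


if __name__ == "__main__":
    # Hypothetical toy training data for two of the categories named above.
    texts = [
        "tariff cuts boost regional trade",
        "new trade agreement lowers tariffs",
        "poverty rates fall in rural areas",
        "aid programme targets extreme poverty",
    ]
    labels = ["trade", "trade", "poverty", "poverty"]
    clf = TinyMultinomialNB().fit(texts, labels)
    print(clf.predict("regional trade agreement signed"))
```

With 27 categories and heterogeneous documents, the feature space grows quickly once bigrams are added, which is where a selection step such as chi-square or information gain would come in.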




K.U.Leuven: Text categorization
[figure]

RIKS
K.U.Leuven: search engine
[figure]




Demo




i.Know: "Knowing that you don't know what you should know"

          1. Information Forensics - Smart Indexing
                        more than just an index
                        distinguishes between concepts and relations
                        starts from unstructured text (bottom-up instead of top-down)
                        recognises word groups as meaningful units

 Top-down:  knowledge -> keywords -> text
 Bottom-up: text -> concepts and relations -> knowledge

           © i.Know NV - All rights reserved.




      1. Information Forensics - Smart Indexing

      Example sentence: "De Fortis Bank werd overgenomen door BNP Paribas."
      (Dutch: "Fortis Bank was taken over by BNP Paribas.")

      Traditional indexing (keywords):
      Processing steps: stopwords, stemming, calculation, correlation
      The sentence is split into single words:
      De | Fortis | Bank | werd | overgenomen | door | BNP | Paribas

      Keyword Index
      Keyword        Weight
      Fortis         0.23
      Bank           0.38
      werd           0.08
      overgenomen    0.21
      door           0.12
      BNP            0.34
      Paribas        0.27
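
The keyword approach can be sketched as follows, assuming a tiny illustrative Dutch stopword list and a plain relative-frequency weight; stemming and the slide's own weighting scheme, which is not specified, are left out. The function name `keyword_index` and the `STOPWORDS` set are hypothetical.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative Dutch stopword list.
STOPWORDS = {"de", "het", "een"}


def keyword_index(sentence):
    """Return a bag of lowercased keywords with relative-frequency weights."""
    tokens = re.findall(r"\w+", sentence.lower())
    keywords = [t for t in tokens if t not in STOPWORDS]
    counts = Counter(keywords)
    total = sum(counts.values())
    # Stemming is omitted; every keyword is weighted by its share of the sentence.
    return {word: count / total for word, count in counts.items()}


if __name__ == "__main__":
    original = keyword_index("De Fortis Bank werd overgenomen door BNP Paribas.")
    reversed_roles = keyword_index("BNP Paribas werd overgenomen door de Fortis Bank.")
    # Both sentences produce the same bag of keywords.
    print(original == reversed_roles)
```

Under these assumptions, swapping the two banks yields an identical index: a bag of single keywords records neither the word groups ("Fortis Bank") nor who took over whom.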





     1. Information Forensics - Smart Indexing

     Example sentence: "De Fortis Bank werd overgenomen door BNP Paribas."

     Smart Indexing (concepts and relations):
     Processing steps: relation detection, concept detection
     The sentence is split into word groups:
     De | Fortis Bank | werd overgenomen door | BNP Paribas

     Smart Index
     Concept    Fortis Bank
     Relation   werd overgenomen door ("was taken over by")
     Concept    BNP Paribas
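
The shape of such an index can be sketched with a toy example. The slides do not say how i.Know detects concepts and relations, and they describe the approach as bottom-up, so the fixed relation list used here is only a stand-in to show the resulting data structure. `RELATION_PHRASES`, `ARTICLES`, and `smart_index` are hypothetical names.

```python
import re

# Assumption: hand-written relation phrases stand in for real relation detection.
RELATION_PHRASES = ["werd overgenomen door"]
# Assumption: leading articles are dropped from concepts.
ARTICLES = {"de", "het", "een"}


def smart_index(sentence):
    """Split a sentence into ordered (kind, text) entries: concepts and relations."""
    text = sentence.strip().rstrip(".")
    pattern = "|".join(re.escape(p) for p in RELATION_PHRASES)
    # The capturing group keeps the relation phrases in the split result.
    parts = re.split(f"({pattern})", text)
    entries = []
    for part in parts:
        part = part.strip()
        if not part:
            continue
        if part in RELATION_PHRASES:
            entries.append(("relation", part))
            continue
        words = part.split()
        if words[0].lower() in ARTICLES:
            words = words[1:]
        if words:
            entries.append(("concept", " ".join(words)))
    return entries


if __name__ == "__main__":
    for entry in smart_index("De Fortis Bank werd overgenomen door BNP Paribas."):
        print(entry)
```

Because the entries keep word groups together and preserve their order around the relation, the index can tell which bank took over which.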





     2. Categorisation based on Smart Indexing
 Preconditions:
     Pre-defined taxonomy/ontology
     Top-down processing


 Advantages of Smart Indexing:
 Smart Indexing results can be used to fill and enrich the taxonomy, thus ensuring
 the entries are
           relevant
           precise
           complete




  2. Categorisation

Input:
   The Agreement will be applied with the European Union and with the EFTA states.

Smart Indexing (concepts and relations):
   The | Agreement | will be applied with | the | European Union | and with | the | EFTA states

Categorisation:
   European Union -> EU
   EFTA states    -> EFTA
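
A minimal sketch of this last step, assuming the concepts have already been extracted and that a pre-defined taxonomy maps concept labels to categories. The two-entry `TAXONOMY`, the `ARTICLES` set, and the function `categorise` are hypothetical and serve only to illustrate the lookup.

```python
# Assumption: a two-entry taxonomy covering only the example sentence.
TAXONOMY = {
    "european union": "EU",
    "efta states": "EFTA",
}
# Assumption: articles are ignored when matching concepts.
ARTICLES = {"the"}


def categorise(concepts):
    """Map extracted concepts to taxonomy categories, keeping first-seen order."""
    categories = []
    for concept in concepts:
        key = " ".join(w for w in concept.lower().split() if w not in ARTICLES)
        category = TAXONOMY.get(key)
        if category and category not in categories:
            categories.append(category)
    return categories


if __name__ == "__main__":
    concepts = ["The Agreement", "the European Union", "the EFTA states"]
    print(categorise(concepts))
```

Concepts without a taxonomy entry ("Agreement" here) are simply skipped; in the workflow described on the previous slide, such unmatched concepts are candidates for enriching the taxonomy.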





                                                           RIKS
                             i.Know: news categorization




RIKS
i.Know: news categorization








Demo




Thank you




