SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
the DISCUS project & SEASR

               Xavier Llorà1,2, David E. Goldberg1 & Michael Welge2
                                                                  

  1Illinois Genetic Algorithms Lab, Department of Industrial and Enterprise Systems Engineering,!
                            University of Illinois at Urbana-Champaign!

2Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, !
                            University of Illinois at Urbana-Champaign!
The Vision

•  Computers have become mediators of collaborations
   –  Email, chat rooms, blogs, wikis…

   –  A flood of available information
   –  Different modes of communication

•  Let’s take advantage of such information 
   –  Logs of conversations

   –  Archive of documents (email attachments, blogs, personal web
      pages…)

   –  Human-computer interactions

   –  Social aspect of the communication and collaboration

   –  Needs to work for multiple languages
The Project

•  DISCUS started in 2003 as an IlliGAL & NCSA collaboration

•  Supports innovation and creativity:

  DISCUS: Distributed Innovation and Scalable Collaboration in Uncertain Settings

•  Basic research components

     –  Competent genetic algorithms (HBGA, iGA)

     –  Advance chance discovery components

     –  Adapt and expand the analysis of social interaction

     –  Efficient data mining techniques for conversations 

     –  Develop a social network analysis for creativity and innovation processes 



the
DISCUS
project
(May
2007)
         Xavier
Llorà
                                  3

The Project

•  Technology development

     –  Infrastructure to support creativity and innovation processes

     –  Reusable repositories of analytic components 

     –  Standardize heterogeneous data storage to boost interoperability

     –  Create hooks for non-intrusive usage and deployment

     –  Rapid adaptation cycle to new technologies




the
DISCUS
project
(May
2007)
         Xavier
Llorà
                        4

Research and Commercial Partners

•  Some research partners along the quot;
   way
   –  University of Illinois (IlliGAL, NCSA & CEE)

   –  University of Osaka

   –  University of Tokyo (School of Management, quot;
      School of Engineering)

   –  University of Kyushu

•  Commercial partner
   –  Hakuhodo Inc and HOW

   –  Mazda

   –  Toyota
The Research Picture
                                   Analysis


                                                                    Data mining




                 Social networks




                                                                                  Content




                                                         Knowledge management



               Social aspects
the
DISCUS
project
(May
2007)
           Xavier
Llorà
                                      6

DISCUS in Action
Online Communities




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   8

Online Communities




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   9

Content Analysis




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   10

Social Network Analysis




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   11

Topic Overlap




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   12

Topic Dynamics




the
DISCUS
project
(May
2007)
   Xavier
Llorà
   13

CSPAN

•  CSPAN digital library
   –  Videos

   –  Transcripts
   –  Annotations

•  Example of real-time analysis
•  Crawling and results
Some Facts 


•  Number of document: 110,234
•  Number of persons: 78,915
•  Number of total sentences: 252,132
•  Number of total word: 2,034,209
Documents per Year



                             5000
                             500
       Number of documents

                             50 100
                             10
                             5
                             1




                                      1940   1960          1980   2000

                                                    Year
Number of words

              1e+01   1e+02      1e+03      1e+04   1e+05




       1940
                                                            Words per Year




       1960

Year
       1980
       2000
the DISCUS project & SEASR

               Xavier Llorà1,2, David E. Goldberg1 & Michael Welge2
                                                                  

  1Illinois Genetic Algorithms Lab, Department of Industrial and Enterprise Systems Engineering,!
                            University of Illinois at Urbana-Champaign!

2Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, !
                            University of Illinois at Urbana-Champaign!

Contenu connexe

En vedette

Text Mining Wksp Auvil
Text Mining Wksp AuvilText Mining Wksp Auvil
Text Mining Wksp AuvilLoretta Auvil
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASRLoretta Auvil
 
SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009Loretta Auvil
 

En vedette (6)

Text Mining Wksp Auvil
Text Mining Wksp AuvilText Mining Wksp Auvil
Text Mining Wksp Auvil
 
SEASR and UIMA
SEASR and UIMASEASR and UIMA
SEASR and UIMA
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASR
 
SEASR Installation
SEASR InstallationSEASR Installation
SEASR Installation
 
SEASR Community Hub
SEASR Community HubSEASR Community Hub
SEASR Community Hub
 
SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009
 

Similaire à DISCUS Project Overview

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveYuwei Lin
 
Using Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online CommunitiesUsing Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online CommunitiesAlexandre Passant
 
Facilitating a Digital Commons with Free and Open Source Software: Paving the...
Facilitating a Digital Commons with Free and Open Source Software: Paving the...Facilitating a Digital Commons with Free and Open Source Software: Paving the...
Facilitating a Digital Commons with Free and Open Source Software: Paving the...Sameer Verma
 
Canada 3.0 Keynote Address Day 1
Canada 3.0 Keynote Address Day 1Canada 3.0 Keynote Address Day 1
Canada 3.0 Keynote Address Day 1canada30
 
Datos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbADatos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbADaniel Vila Suero
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3guru122
 
Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...Alexandre Passant
 
Teaching 2.0 Learning & Leading in the Digital Age
Teaching 2.0 Learning & Leading in the Digital AgeTeaching 2.0 Learning & Leading in the Digital Age
Teaching 2.0 Learning & Leading in the Digital AgeMatthew Hayden
 
Web 2.0 E Oltre
Web 2.0 E OltreWeb 2.0 E Oltre
Web 2.0 E Oltreronchet
 
Enhancing the Social Web through Augmented Social Cognition Research
Enhancing the Social Web through Augmented Social Cognition ResearchEnhancing the Social Web through Augmented Social Cognition Research
Enhancing the Social Web through Augmented Social Cognition ResearchEd Chi
 
Hello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperHello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperAlexandre Passant
 
Where Is eXtension
Where Is eXtensionWhere Is eXtension
Where Is eXtensionchwood
 
Web 2.0 and e-elearning
Web 2.0 and e-elearningWeb 2.0 and e-elearning
Web 2.0 and e-elearningDavid Wilcox
 
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARCEd Chi
 
Social Media: Why and how to take advantage of it
Social Media:  Why and how to take advantage of itSocial Media:  Why and how to take advantage of it
Social Media: Why and how to take advantage of itAlexandre Passant
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital LibraryEd Fay
 

Similaire à DISCUS Project Overview (20)

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical Perspective
 
Social Media and Web 2.0
Social Media and Web 2.0Social Media and Web 2.0
Social Media and Web 2.0
 
Using Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online CommunitiesUsing Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online Communities
 
Facilitating a Digital Commons with Free and Open Source Software: Paving the...
Facilitating a Digital Commons with Free and Open Source Software: Paving the...Facilitating a Digital Commons with Free and Open Source Software: Paving the...
Facilitating a Digital Commons with Free and Open Source Software: Paving the...
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
Canada 3.0 Keynote Address Day 1
Canada 3.0 Keynote Address Day 1Canada 3.0 Keynote Address Day 1
Canada 3.0 Keynote Address Day 1
 
Datos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbADatos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbA
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...
 
Teaching 2.0 Learning & Leading in the Digital Age
Teaching 2.0 Learning & Leading in the Digital AgeTeaching 2.0 Learning & Leading in the Digital Age
Teaching 2.0 Learning & Leading in the Digital Age
 
Web 2.0 E Oltre
Web 2.0 E OltreWeb 2.0 E Oltre
Web 2.0 E Oltre
 
Enhancing the Social Web through Augmented Social Cognition Research
Enhancing the Social Web through Augmented Social Cognition ResearchEnhancing the Social Web through Augmented Social Cognition Research
Enhancing the Social Web through Augmented Social Cognition Research
 
Hello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperHello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic Developer
 
Where Is eXtension
Where Is eXtensionWhere Is eXtension
Where Is eXtension
 
Web 2.0 and e-elearning
Web 2.0 and e-elearningWeb 2.0 and e-elearning
Web 2.0 and e-elearning
 
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
 
Social Media: Why and how to take advantage of it
Social Media:  Why and how to take advantage of itSocial Media:  Why and how to take advantage of it
Social Media: Why and how to take advantage of it
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital Library
 

Plus de Loretta Auvil

Fedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacFedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacLoretta Auvil
 
Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009Loretta Auvil
 
ICHASS Workshop Seasr
ICHASS Workshop SeasrICHASS Workshop Seasr
ICHASS Workshop SeasrLoretta Auvil
 
ICHASS Workshop Text Mining
ICHASS Workshop Text MiningICHASS Workshop Text Mining
ICHASS Workshop Text MiningLoretta Auvil
 

Plus de Loretta Auvil (9)

Fedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacFedora App Slide 2009 Hastac
Fedora App Slide 2009 Hastac
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
SEASR Text
SEASR TextSEASR Text
SEASR Text
 
SEASR-Fedora App
SEASR-Fedora AppSEASR-Fedora App
SEASR-Fedora App
 
Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009
 
SEASR eScience 2008
SEASR eScience 2008SEASR eScience 2008
SEASR eScience 2008
 
ICHASS Workshop Lab
ICHASS Workshop LabICHASS Workshop Lab
ICHASS Workshop Lab
 
ICHASS Workshop Seasr
ICHASS Workshop SeasrICHASS Workshop Seasr
ICHASS Workshop Seasr
 
ICHASS Workshop Text Mining
ICHASS Workshop Text MiningICHASS Workshop Text Mining
ICHASS Workshop Text Mining
 

Dernier

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 

Dernier (20)

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 

DISCUS Project Overview

  • 1. the DISCUS project & SEASR Xavier Llorà1,2, David E. Goldberg1 & Michael Welge2 1Illinois Genetic Algorithms Lab, Department of Industrial and Enterprise Systems Engineering,! University of Illinois at Urbana-Champaign! 2Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, ! University of Illinois at Urbana-Champaign!
  • 2. The Vision •  Computers have become mediators of collaborations –  Email, chat rooms, blogs, wikis… –  A flood of available information –  Different modes of communication •  Let’s take advantage of such information –  Logs of conversations –  Archive of documents (email attachments, blogs, personal web pages…) –  Human-computer interactions –  Social aspect of the communication and collaboration –  Needs to work for multiple languages
  • 3. The Project •  DISCUS started in 2003 as an IlliGAL & NCSA collaboration •  Supports innovation and creativity: DISCUS: Distributed Innovation and Scalable Collaboration in Uncertain Settings •  Basic research components –  Competent genetic algorithms (HBGA, iGA) –  Advance chance discovery components –  Adapt and expand the analysis of social interaction –  Efficient data mining techniques for conversations –  Develop a social network analysis for creativity and innovation processes the
DISCUS
project
(May
2007)
 Xavier
Llorà
 3

  • 4. The Project •  Technology development –  Infrastructure to support creativity and innovation processes –  Reusable repositories of analytic components –  Standardize heterogeneous data storage to boost interoperability –  Create hooks for non-intrusive usage and deployment –  Rapid adaptation cycle to new technologies the
DISCUS
project
(May
2007)
 Xavier
Llorà
 4

  • 5. Research and Commercial Partners •  Some research partners along the quot; way –  University of Illinois (IlliGAL, NCSA & CEE) –  University of Osaka –  University of Tokyo (School of Management, quot; School of Engineering) –  University of Kyushu •  Commercial partner –  Hakuhodo Inc and HOW –  Mazda –  Toyota
  • 6. The Research Picture Analysis Data mining Social networks Content Knowledge management Social aspects the
DISCUS
project
(May
2007)
 Xavier
Llorà
 6

  • 14. CSPAN •  CSPAN digital library –  Videos –  Transcripts –  Annotations •  Example of real-time analysis •  Crawling and results
  • 15. Some Facts •  Number of document: 110,234 •  Number of persons: 78,915 •  Number of total sentences: 252,132 •  Number of total word: 2,034,209
  • 16. Documents per Year 5000 500 Number of documents 50 100 10 5 1 1940 1960 1980 2000 Year
  • 17. Number of words 1e+01 1e+02 1e+03 1e+04 1e+05 1940 Words per Year 1960 Year 1980 2000
  • 18. the DISCUS project & SEASR Xavier Llorà1,2, David E. Goldberg1 & Michael Welge2 1Illinois Genetic Algorithms Lab, Department of Industrial and Enterprise Systems Engineering,! University of Illinois at Urbana-Champaign! 2Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, ! University of Illinois at Urbana-Champaign!