SlideShare une entreprise Scribd logo
1  sur  33
ResUbic Research Seminar


           ResUbic Research Lab Dresden
           EDYRA Engineering of Do-it-Yourself Analytic
           Rich Internet Applications

           Wolfgang Lehner
           Maik Thiele
           Katrin Braunschweig
           Julian Eberius

© Prof. Dr. -Ing. Wolfgang Lehner
>




                                                 MAD Skills

                     [Jeffrey Cohen, Brian Dolan, Mark Dunlap, Joseph M. Hellerstein, Caleb Welton:
                               MAD Skills: New Analysis Practices for Big Data. PVLDB 2009]




© Prof. Dr.-Ing. Wolfgang Lehner|                                                ResUbic Research Seminar   |   2
> Motivation (1)

         In the days of Kings
         and Priests
           Computers and Data: Crown Jewels
           Executives depend on computers
             But cannot work with them directly
           The DBA “Priesthood”
             And their Acronymia: EDW, BI, OLAP


         The architected Enterprise DWH
           Rational behavior…for a bygone era
           “There is no point in bringing data … into the
            data warehouse environment without
            integrating it.”
            —Bill Inmon, Building the Data Warehouse,
              2005



© Prof. Dr.-Ing. Wolfgang Lehner|                            ResUbic Research Seminar   3
> Motivation (2)

         New Realities
           TB disks < $100
           Everything is data
           Rise of data-driven culture
             Very publicly espoused by Google,
              Wired, etc.
             Sloan Digital Sky Survey, Terraserver, etc.

                The quest for knowledge used
                to begin with grand theories.

                   Now it begins with massive
                       amounts of data.

                     Welcome to the Petabyte
                              Age.


© Prof. Dr.-Ing. Wolfgang Lehner|                           ResUbic Research Seminar   4
> MAD Skills

         Magnetic
           „Attract data and practitioners“
           Usage of all data source
            independet of their data
            quality

         Agile
           „Rapid iteration: ingest, analyze, productionalize“
           Continous evolution of the logical and physical
            structures
           ELT (Extraction, Loading, Transformation)

         Deep
           „Sophisticated analytics in Big Data“
           Extended algorithmic run-time
           Ad-hoc advanced analytics and statistics


© Prof. Dr.-Ing. Wolfgang Lehner|                                 ResUbic Research Seminar   5
> Open Data, Services and Mashups

       Web of Data
                E-Government 2.0, Initiative i2010
                Europeana, World Digital Library
                Public data catalogs
                     http://data.gov/
                     http://data.gov.uk/
                Free to
                    Copy, distribute and transmit the data
                    Adapt the data
                    Exploiting the data commercially, whether by sub-licensing it, combining it with other
                     data, or by including it in your own product
       Web of Services
                  OpenSocial-API (Google, Yahoo!, MySpace, Xing)
                  Scientific Computations (http://www.wolframalpha.com)
                  Entitiy Detection (http://www.yooname.com)
                  Visualization (http://manyeyes.alphaworks.ibm.com/manyeyes)
       Web of Mashups
                Programmale Web (http://www.programmableweb.com/)


© Prof. Dr.-Ing. Wolfgang Lehner|                                                 ResUbic Research Seminar    6
> Principles of Open Data

       Data shall be considered open if it is made public in a way that complies with
       the principles below
        Complete: All public data is made available. Public data is data that is not subject to valid privacy,
         security or privilege limitations.
        Primary: Data is as collected at the source, with the highest possible level of granularity, not in
         aggregate or modified forms
        Timely: Data is made available as quickly as necessary to preserve the value of the data.
        Accessible: Data is available to the widest range of users for the widest range of purposes.
        Machine processable: Data is reasonably structured to allow automated processing.
        Non-discriminatory: Data is available to anyone, with no requirement of registration.
        Non-proprietary: Data is available in a format over which no entity has exclusive control.
        License-free: Data is not subject to any copyright, patent, trademark or trade secret regulation.
         Reasonable privacy, security and privilege restrictions may be allowed.


         Quelle: http://resource.org/8_principles.html


© Prof. Dr.-Ing. Wolfgang Lehner|                                                 ResUbic Research Seminar        7
>

        „Daten gehören den Menschen“ – typische Beispiele: Genome, Daten von
         Organismen, medizinische Forschung, umweltwissenschaftliche Daten
        öffentliche Gelder haben die Generierung der Daten erst ermöglicht, also
         müssen sie auch öffentlich zugänglich sein (tatsächlich treten Wissenschaftler
         in der Regel die Rechte an den von ihnen generierten Daten an private Verlage
         ab, wenn sie ihre Ergebnisse publizieren)
        Fakten können nicht dem Urheberrecht unterliegen
        Forschung wird gefördert, wenn wissenschaftliche Erkenntnisse für alle
         Forscher frei zugänglich sind




© Prof. Dr.-Ing. Wolfgang Lehner|                               ResUbic Research Seminar   8
> Gapminder




 http://www.gapminder.org/
© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   9
> Gapminder (2)

      Vision: making sense of the world by having fun with statistics!
          Gapminder is a non-profit venture for development and provision of free software to
           visualize human development trends
          Gapminder will ultimately be integrated into Google: this is the first time global
           datasets will be searchable over the Internet

      Hans Rosling @ TED
          TEDTalks: annual technology conference in California, USA
           http://www.ted.com/tedtalks/
          Hans Rosling is a professor of global health at the Karolinska Institute, data
           visualization extraordinaire and the creator of the Gapminder tools
          see http://www.youtube.com/watch?v=YpKbO6O3O3M




© Prof. Dr.-Ing. Wolfgang Lehner|                                        ResUbic Research Seminar   10
> Public.Resource.Org

         Idea: Make government more transparent
         Project funded: Public.Resource.Org is a non-profit organization focused on
         enabling online access to public government documents in the United States.
         We are providing $2 million to Public.Resource.Org to support the Law.Gov
         initiative, which aims to make all primary legal materials in the United States
         available to all.
         Gewinner des Projekts 10100
         http://www.project10tothe100.com/intl/DE/index.html




© Prof. Dr.-Ing. Wolfgang Lehner|                                ResUbic Research Seminar   11
>     Microsoft’s Open Government Data Initiative



       • The Open Government Data Initiative (OGDI) is a cloud-based
         collection of software assets that enables publicly available government
         data to be easily accessible. Using open standards and application
         programming interfaces (API), developers and government agencies can
         retrieve the data programmatically for use in new and innovative online
         applications, or mash-ups that can help:
          – Improve citizen services
          – Enhance collaboration between
            government agencies and private organizations
          – Increase government transparency
       • OGDI promotes the use of this data by capturing and publishing re-
         usable software assets, patterns, and practices. The data repository
         already holds over 60 different government datasets that are readily
         available for use in new applications, and is continuously updated with
         additional government datasets.
       • More: http://www.microsoft.com/industry/government/opengovdata/

© Prof. Dr.-Ing. Wolfgang Lehner|                           ResUbic Research Seminar   12
> Civic Commons




                             http://civiccommons.com/
© Prof. Dr.-Ing. Wolfgang Lehner|                       ResUbic Research Seminar   13
> data.gov




© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   14
> data.gov.uk




© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   15
> data.worldbank.org




© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   16
> unData




                         http://data.un.org/

© Prof. Dr.-Ing. Wolfgang Lehner|              ResUbic Research Seminar   17
> Ushahidi




               http://www.ushahidi.com/
© Prof. Dr.-Ing. Wolfgang Lehner|         ResUbic Research Seminar   18
> Statistisches Bundesamt Deutschland




https://www-genesis.destatis.de/genesis/online/

© Prof. Dr.-Ing. Wolfgang Lehner|                 ResUbic Research Seminar   19
> offenedaten.de




© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   20
> Data360




   http://www.data360.org
© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   21
> IBM ManyEyes




   http://manyeyes.alphaworks.ibm.com)/manyeyes/
© Prof. Dr.-Ing. Wolfgang Lehner|                  ResUbic Research Seminar   22
> Open Citizen‘s Platform

       Public issue tracking provides increased engagement, transparency, and
       participation in the community
       Manage issues in urban environments, like pot-holes, broken street lighting or
       lack of accessibility
        What are the benefits to…


         Governments                                        Citizens
           Reduce time, effort and resources in             Open access to complete, formatted data
            fulfilling public information requests            rather than relying on third party
           Increase data quality by providing correct        interpretations or subsets
            data to public from the source                   Information accessibility leads to greater
           Reduce duplication of effort                      government accountability
           Increase data access, availability, and speed    Fosters better community action on social
            of delivery                                       issues, e.g. crime, pollution, permits,
           Improve citizen satisfaction and create           accidents, and education
            good public relations with your community        Improves regional competitiveness by giving
                                                              businesses quicker and fuller access to data

© Prof. Dr.-Ing. Wolfgang Lehner|                                              ResUbic Research Seminar      23
> What are the goals of the project?

        Long Term…
           Build a open citizen platform for Dresden  www.opendresden.de
           Process it.. compare it... mix it.. filter it... visualize it…
           Basic premises
                Build a simple system and let it evolve
                Design for participation
                Openness

        For now…
           Start with a series of value-added municipal services (e.g.
            Mapnificient, Schooloscope, Cycling Planner, see following slides)
             Transport, Education, Economy, (Local) Politics, Environment, Entertainment
           Promote the open data principle in Saxony
           Develop a fluid data repository (for municipal data)
           Design a domain specific language in order to integrate and analyze data
             Different levels of abstraction
             Reuse existing apps  Visual dataflow languages  Textual DSL editors

© Prof. Dr.-Ing. Wolfgang Lehner|                                     ResUbic Research Seminar   24
> Mapnificient




 http://www.mapnificent.net/london/
© Prof. Dr.-Ing. Wolfgang Lehner|     ResUbic Research Seminar   25
> Schooloscope




 http://schooloscope.com

© Prof. Dr.-Ing. Wolfgang Lehner|   ResUbic Research Seminar   26
> Where can I live




   http://www.where-can-i-live.com/londonproperty
© Prof. Dr.-Ing. Wolfgang Lehner|                   ResUbic Research Seminar   27
> UBC/Google cycling planner




              http://www.cyclevancouver.ubc.ca/cv.aspx

© Prof. Dr.-Ing. Wolfgang Lehner|                        ResUbic Research Seminar   28
> CitySourced




           http://www.citysourced.com
© Prof. Dr.-Ing. Wolfgang Lehner|       ResUbic Research Seminar   29
> EveryBlock




           http://chicago.everyblock.com/
© Prof. Dr.-Ing. Wolfgang Lehner|           ResUbic Research Seminar   30
> Architecture – Sketch

                                    Lightweight Integration Techniques
                                    • Join across dimensions (e.g. Entity + Time        REST                             Google
       Public Data Sources



                                      + Place)




                                                                                                        Visualization
         Open Data and




                                                                                                                          Maps
                                    • Aggregations
                                                                                        JSON
                                    Lightweight Composite Applications
                                                                                                                        Openstreet
                                    •    Create information from the data                                                 Map
                                    •    Uncover hidden aspects of data                  KML
                                    •    Which becomes new data itself                                                    IBM
                                    •    Classification, prediction, clustering                                         ManyEyes
                                                                                        GeoRSS
                                    •    Embrace recursion


                                                                                   API for location-based
                                                                                   collaborative issue-tracking
                                                                                   http://open311.org
            http://www.omgstandard.com
                                        Repository
                                        Fluid Data




                                                                             Citizen
                                Geo Data
                                                                            Request‘s
                                                           Municipal
                                                            Data

© Prof. Dr.-Ing. Wolfgang Lehner|                                                                ResUbic Research Seminar            31
> Fluid Data Repository

        Platform for the web of things, each represented by an openly writable
         „social“ object
        Share, annotate, augment and re-use information
        Mainly concerns data mediation and integration
        Need to access and integrate data residing in multiple and heterogeneous
         sources
        Adaptive, add metrics, aggregations,
         data sources or data connections
         without re-building analysis processes
         or visualizations  “non-destructive
         change”




© Prof. Dr.-Ing. Wolfgang Lehner|                              ResUbic Research Seminar   32
> Alternative Data Models
                           BigTable
                                         HBase                                                 RavenDB
         SimpleDB
                                                                                MongoDB                   OrientDB
    Cassandra
                                                                             CouchDB                            ThruDB
           Hypertable
                                      Column
                                      Families                                                           Terrastore
                                                                        Documents

                                                                                                                            FluidDB
                                                                                                       other
                   Voldemort
                                                            NoSQL
        Dynomite
                                        Key/Value
     Dynamo
                                                                                                     Triple                   RedStore

Tokio Cabinet                                     GT.M                                               Stores
                                                                                                                               Viruoso
                                          Redis
                                                           Graph
       Scalaris                                                                  Sones                   Jena
                                                                                                                   Sesame     YARS
                                    Pahoehoe
                      Riak
                                                                                       Neo4J
                                                         AllegroGraph

                                                                                 HyperGraphDB
                                                                   FlockDB

© Prof. Dr.-Ing. Wolfgang Lehner|                                                              ResUbic Research Seminar              33

Contenu connexe

Tendances

20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data ThingsKatina Toufexis
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide WebJames Hendler
 
Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...Ian Foster
 
The Web of Things: Enabling the Physical World to the Web
The Web of Things: Enabling the Physical World to the WebThe Web of Things: Enabling the Physical World to the Web
The Web of Things: Enabling the Physical World to the WebAndreas Kamilaris
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - TogetherKennisalliantie
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach3 Round Stones
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data ScienceFeyzi R. Bagirov
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sangerChris Dwan
 
Open Research Problems in Linked Data - WWW2010
Open Research Problems in Linked Data - WWW2010Open Research Problems in Linked Data - WWW2010
Open Research Problems in Linked Data - WWW2010Juan Sequeda
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceBeth Plale
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformUCLDH
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
SemanticXO: connecting the XO with the World’s largest information network
SemanticXO: connecting the XO with the World’s largest information networkSemanticXO: connecting the XO with the World’s largest information network
SemanticXO: connecting the XO with the World’s largest information networkChristophe Guéret
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?Daniel S. Katz
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
 
DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDATAVERSITY
 

Tendances (20)

20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data Things
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...
 
The Web of Things: Enabling the Physical World to the Web
The Web of Things: Enabling the Physical World to the WebThe Web of Things: Enabling the Physical World to the Web
The Web of Things: Enabling the Physical World to the Web
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - Together
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sanger
 
Open Research Problems in Linked Data - WWW2010
Open Research Problems in Linked Data - WWW2010Open Research Problems in Linked Data - WWW2010
Open Research Problems in Linked Data - WWW2010
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail Science
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse Platform
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
SemanticXO: connecting the XO with the World’s largest information network
SemanticXO: connecting the XO with the World’s largest information networkSemanticXO: connecting the XO with the World’s largest information network
SemanticXO: connecting the XO with the World’s largest information network
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
 
Intro to Data Science Concepts
Intro to Data Science ConceptsIntro to Data Science Concepts
Intro to Data Science Concepts
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big Data
 

En vedette

Aprile Company Profile - 2013
Aprile Company Profile - 2013Aprile Company Profile - 2013
Aprile Company Profile - 2013Dario Morandotti
 
B'nai Aviv Monitor Presentation
B'nai Aviv Monitor PresentationB'nai Aviv Monitor Presentation
B'nai Aviv Monitor PresentationEric
 
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...MaikThiele
 
MS Cognitive Issues by Justin C. Koenitzer, Psy.d.
MS Cognitive Issues   by Justin C. Koenitzer, Psy.d.MS Cognitive Issues   by Justin C. Koenitzer, Psy.d.
MS Cognitive Issues by Justin C. Koenitzer, Psy.d.Eric
 

En vedette (6)

Aprile Company Profile - 2013
Aprile Company Profile - 2013Aprile Company Profile - 2013
Aprile Company Profile - 2013
 
B'nai Aviv Monitor Presentation
B'nai Aviv Monitor PresentationB'nai Aviv Monitor Presentation
B'nai Aviv Monitor Presentation
 
How To Use Excel
How To Use ExcelHow To Use Excel
How To Use Excel
 
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...
In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-...
 
MS Cognitive Issues by Justin C. Koenitzer, Psy.d.
MS Cognitive Issues   by Justin C. Koenitzer, Psy.d.MS Cognitive Issues   by Justin C. Koenitzer, Psy.d.
MS Cognitive Issues by Justin C. Koenitzer, Psy.d.
 
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
 

Similaire à Edyra

Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataMartin Kaltenböck
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacySarah Jones
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenDigital Science
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectPRELIDA Project
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
MMEA final seminar opening speech
MMEA final seminar opening speechMMEA final seminar opening speech
MMEA final seminar opening speechCLIC Innovation Ltd
 
GROUND Lab Presentation at WCS
GROUND Lab Presentation at WCSGROUND Lab Presentation at WCS
GROUND Lab Presentation at WCSGROUND Lab LLC
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceMartin Donnelly
 
Open science / open research
Open science / open researchOpen science / open research
Open science / open researchheila1
 
The FOSTER project - general overview
The FOSTER project - general overviewThe FOSTER project - general overview
The FOSTER project - general overviewMartin Donnelly
 
Open Knowledge and the Benefits for University-based Research
Open Knowledge and the Benefits for University-based ResearchOpen Knowledge and the Benefits for University-based Research
Open Knowledge and the Benefits for University-based ResearchUQSCADS
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceGSDI Association
 
Improving Access to Research Data: What does changing legislation mean for y...
Improving Access to Research Data:  What does changing legislation mean for y...Improving Access to Research Data:  What does changing legislation mean for y...
Improving Access to Research Data: What does changing legislation mean for y...Marieke Guy
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
 

Similaire à Edyra (20)

Open Science
Open ScienceOpen Science
Open Science
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacy
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan Valen
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA project
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
MMEA final seminar opening speech
MMEA final seminar opening speechMMEA final seminar opening speech
MMEA final seminar opening speech
 
GROUND Lab Presentation at WCS
GROUND Lab Presentation at WCSGROUND Lab Presentation at WCS
GROUND Lab Presentation at WCS
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
 
Open science / open research
Open science / open researchOpen science / open research
Open science / open research
 
The FOSTER project - general overview
The FOSTER project - general overviewThe FOSTER project - general overview
The FOSTER project - general overview
 
Open Knowledge and the Benefits for University-based Research
Open Knowledge and the Benefits for University-based ResearchOpen Knowledge and the Benefits for University-based Research
Open Knowledge and the Benefits for University-based Research
 
BLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, FigshareBLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, Figshare
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 Conference
 
Improving Access to Research Data: What does changing legislation mean for y...
Improving Access to Research Data:  What does changing legislation mean for y...Improving Access to Research Data:  What does changing legislation mean for y...
Improving Access to Research Data: What does changing legislation mean for y...
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 

Dernier

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Dernier (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Edyra

  • 1. ResUbic Research Seminar ResUbic Research Lab Dresden EDYRA Engineering of Do-it-Yourself Analytic Rich Internet Applications Wolfgang Lehner Maik Thiele Katrin Braunschweig Julian Eberius © Prof. Dr. -Ing. Wolfgang Lehner
  • 2. > MAD Skills [Jeffrey Cohen, Brian Dolan, Mark Dunlap, Joseph M. Hellerstein, Caleb Welton: MAD Skills: New Analysis Practices for Big Data. PVLDB 2009] © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar | 2
  • 3. > Motivation (1) In the days of Kings and Priests  Computers and Data: Crown Jewels  Executives depend on computers  But cannot work with them directly  The DBA “Priesthood”  And their Acronymia: EDW, BI, OLAP The architected Enterprise DWH  Rational behavior…for a bygone era  “There is no point in bringing data … into the data warehouse environment without integrating it.” —Bill Inmon, Building the Data Warehouse, 2005 © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 3
  • 4. > Motivation (2) New Realities  TB disks < $100  Everything is data  Rise of data-driven culture  Very publicly espoused by Google, Wired, etc.  Sloan Digital Sky Survey, Terraserver, etc. The quest for knowledge used to begin with grand theories. Now it begins with massive amounts of data. Welcome to the Petabyte Age. © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 4
  • 5. > MAD Skills Magnetic  „Attract data and practitioners“  Usage of all data source independet of their data quality Agile  „Rapid iteration: ingest, analyze, productionalize“  Continous evolution of the logical and physical structures  ELT (Extraction, Loading, Transformation) Deep  „Sophisticated analytics in Big Data“  Extended algorithmic run-time  Ad-hoc advanced analytics and statistics © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 5
  • 6. > Open Data, Services and Mashups Web of Data  E-Government 2.0, Initiative i2010  Europeana, World Digital Library  Public data catalogs  http://data.gov/  http://data.gov.uk/  Free to  Copy, distribute and transmit the data  Adapt the data  Exploiting the data commercially, whether by sub-licensing it, combining it with other data, or by including it in your own product Web of Services  OpenSocial-API (Google, Yahoo!, MySpace, Xing)  Scientific Computations (http://www.wolframalpha.com)  Entitiy Detection (http://www.yooname.com)  Visualization (http://manyeyes.alphaworks.ibm.com/manyeyes) Web of Mashups  Programmale Web (http://www.programmableweb.com/) © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 6
  • 7. > Principles of Open Data Data shall be considered open if it is made public in a way that complies with the principles below  Complete: All public data is made available. Public data is data that is not subject to valid privacy, security or privilege limitations.  Primary: Data is as collected at the source, with the highest possible level of granularity, not in aggregate or modified forms  Timely: Data is made available as quickly as necessary to preserve the value of the data.  Accessible: Data is available to the widest range of users for the widest range of purposes.  Machine processable: Data is reasonably structured to allow automated processing.  Non-discriminatory: Data is available to anyone, with no requirement of registration.  Non-proprietary: Data is available in a format over which no entity has exclusive control.  License-free: Data is not subject to any copyright, patent, trademark or trade secret regulation. Reasonable privacy, security and privilege restrictions may be allowed. Quelle: http://resource.org/8_principles.html © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 7
  • 8. >  „Daten gehören den Menschen“ – typische Beispiele: Genome, Daten von Organismen, medizinische Forschung, umweltwissenschaftliche Daten  öffentliche Gelder haben die Generierung der Daten erst ermöglicht, also müssen sie auch öffentlich zugänglich sein (tatsächlich treten Wissenschaftler in der Regel die Rechte an den von ihnen generierten Daten an private Verlage ab, wenn sie ihre Ergebnisse publizieren)  Fakten können nicht dem Urheberrecht unterliegen  Forschung wird gefördert, wenn wissenschaftliche Erkenntnisse für alle Forscher frei zugänglich sind © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 8
  • 9. > Gapminder http://www.gapminder.org/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 9
  • 10. > Gapminder (2) Vision: making sense of the world by having fun with statistics!  Gapminder is a non-profit venture for development and provision of free software to visualize human development trends  Gapminder will ultimately be integrated into Google: this is the first time global datasets will be searchable over the Internet Hans Rosling @ TED  TEDTalks: annual technology conference in California, USA http://www.ted.com/tedtalks/  Hans Rosling is a professor of global health at the Karolinska Institute, data visualization extraordinaire and the creator of the Gapminder tools  see http://www.youtube.com/watch?v=YpKbO6O3O3M © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 10
  • 11. > Public.Resource.Org Idea: Make government more transparent Project funded: Public.Resource.Org is a non-profit organization focused on enabling online access to public government documents in the United States. We are providing $2 million to Public.Resource.Org to support the Law.Gov initiative, which aims to make all primary legal materials in the United States available to all. Gewinner des Projekts 10100 http://www.project10tothe100.com/intl/DE/index.html © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 11
  • 12. > Microsoft’s Open Government Data Initiative • The Open Government Data Initiative (OGDI) is a cloud-based collection of software assets that enables publicly available government data to be easily accessible. Using open standards and application programming interfaces (API), developers and government agencies can retrieve the data programmatically for use in new and innovative online applications, or mash-ups that can help: – Improve citizen services – Enhance collaboration between government agencies and private organizations – Increase government transparency • OGDI promotes the use of this data by capturing and publishing re- usable software assets, patterns, and practices. The data repository already holds over 60 different government datasets that are readily available for use in new applications, and is continuously updated with additional government datasets. • More: http://www.microsoft.com/industry/government/opengovdata/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 12
  • 13. > Civic Commons http://civiccommons.com/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 13
  • 14. > data.gov © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 14
  • 15. > data.gov.uk © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 15
  • 16. > data.worldbank.org © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 16
  • 17. > unData http://data.un.org/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 17
  • 18. > Ushahidi http://www.ushahidi.com/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 18
  • 19. > Statistisches Bundesamt Deutschland https://www-genesis.destatis.de/genesis/online/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 19
  • 20. > offenedaten.de © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 20
  • 21. > Data360 http://www.data360.org © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 21
  • 22. > IBM ManyEyes http://manyeyes.alphaworks.ibm.com)/manyeyes/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 22
  • 23. > Open Citizen‘s Platform Public issue tracking provides increased engagement, transparency, and participation in the community Manage issues in urban environments, like pot-holes, broken street lighting or lack of accessibility  What are the benefits to… Governments Citizens  Reduce time, effort and resources in  Open access to complete, formatted data fulfilling public information requests rather than relying on third party  Increase data quality by providing correct interpretations or subsets data to public from the source  Information accessibility leads to greater  Reduce duplication of effort government accountability  Increase data access, availability, and speed  Fosters better community action on social of delivery issues, e.g. crime, pollution, permits,  Improve citizen satisfaction and create accidents, and education good public relations with your community  Improves regional competitiveness by giving businesses quicker and fuller access to data © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 23
  • 24. > What are the goals of the project?  Long Term…  Build a open citizen platform for Dresden  www.opendresden.de  Process it.. compare it... mix it.. filter it... visualize it…  Basic premises  Build a simple system and let it evolve  Design for participation  Openness  For now…  Start with a series of value-added municipal services (e.g. Mapnificient, Schooloscope, Cycling Planner, see following slides)  Transport, Education, Economy, (Local) Politics, Environment, Entertainment  Promote the open data principle in Saxony  Develop a fluid data repository (for municipal data)  Design a domain specific language in order to integrate and analyze data  Different levels of abstraction  Reuse existing apps  Visual dataflow languages  Textual DSL editors © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 24
  • 25. > Mapnificient http://www.mapnificent.net/london/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 25
  • 26. > Schooloscope http://schooloscope.com © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 26
  • 27. > Where can I live http://www.where-can-i-live.com/londonproperty © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 27
  • 28. > UBC/Google cycling planner http://www.cyclevancouver.ubc.ca/cv.aspx © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 28
  • 29. > CitySourced http://www.citysourced.com © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 29
  • 30. > EveryBlock http://chicago.everyblock.com/ © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 30
  • 31. > Architecture – Sketch Lightweight Integration Techniques • Join across dimensions (e.g. Entity + Time REST Google Public Data Sources + Place) Visualization Open Data and Maps • Aggregations JSON Lightweight Composite Applications Openstreet • Create information from the data Map • Uncover hidden aspects of data KML • Which becomes new data itself IBM • Classification, prediction, clustering ManyEyes GeoRSS • Embrace recursion API for location-based collaborative issue-tracking http://open311.org http://www.omgstandard.com Repository Fluid Data Citizen Geo Data Request‘s Municipal Data © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 31
  • 32. > Fluid Data Repository  Platform for the web of things, each represented by an openly writable „social“ object  Share, annotate, augment and re-use information  Mainly concerns data mediation and integration  Need to access and integrate data residing in multiple and heterogeneous sources  Adaptive, add metrics, aggregations, data sources or data connections without re-building analysis processes or visualizations  “non-destructive change” © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 32
  • 33. > Alternative Data Models BigTable HBase RavenDB SimpleDB MongoDB OrientDB Cassandra CouchDB ThruDB Hypertable Column Families Terrastore Documents FluidDB other Voldemort NoSQL Dynomite Key/Value Dynamo Triple RedStore Tokio Cabinet GT.M Stores Viruoso Redis Graph Scalaris Sones Jena Sesame YARS Pahoehoe Riak Neo4J AllegroGraph HyperGraphDB FlockDB © Prof. Dr.-Ing. Wolfgang Lehner| ResUbic Research Seminar 33