SlideShare a Scribd company logo
1 of 47
@openaire_eu
OpenAIRE Services
Paolo Manghi
Istituto di Scienza e Tecnologie
dell’Informazione, CNR
Research
communities
Researchers (All)
Content providers
Innovators
Research
managers
Funders
Building the graph and Dashboards
OpenAIRE Dashboards
Validation
Cleaning De-duplication
Inference
Research Graph Services
Project communiity
FunderFunding
Product
Publicatio
n
Data Software
Organizatio
n
TERMS
OF USE
Harvesting Uploading
Brokering
Source
ORP
Publications
repositories
Data
repositories
Hybrid
repositories
Registries
OA
Journals
Software
repositories
Content Providers Research
Infras
GUIDE
LINES
Metadata
records
files
cleaned
records
Full-text
cache
Transform
Clean
Identify
equivelent
products
and
organisation
s
Aggregation subsystem
De-duplication
subsystem
Information Inference subsystem
Data Sources
Populate
Merge equivalent objects
Data provision
subsystem
Collect
Native graph
“slices”
Publishing
subsystem
Data Monitoring
Action Sets
(similarity
rels)
Front-end
Native
graph
Deduped
graph
Extract full-text
Copy of deduped
graph
Enrich graphs with links
Action Set
(inferred
links)
Enriched
graph
Propagation
Text-mining of
the full-texts and
the graph to
derive new
semantic links
Architecture and technologies: today
Round-table of Open Source Technologies
Resources
Public
System
20srv
122CPU
320GB
8TB
Mining
System
21srv
406CPU
2TB
385TB
Data provision
System
23srv
154CPU
430GB
23TB
Testing
System
5srv
30CPU
100GB
3TB
Public
System
44srv
274CPU
905GB
20TB
Mining
System
22srv
414CPU
2.2TB
388TB
Data provision
System
23srv
154CPU
430GB
24TB
Testing
System
14srv
86CPU
302GB
9TB
6
OpenAIRE technical staff (40+ members)
The OpenAIRE
Research Graph
Materializing the Open Science Graph
Project
communit
y
FunderFunding
Product
Publicatio
n
Researc
h Data
Software
Organizatio
n
Source
Other
res.
products
Mining
Deduplication
End-user feedback
Harvesting
GUIDE
LINES
Research Infrastructures Publishing
IT
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Providing an open metadata
research graph of interlinked
scientific products, with Open
Access information, linked to
funding information and research
communities
The OpenAIRE research graph
Open
Complete
De-duplicated
Transparent
Participatory
Decentralized
Trusted
Complete: community-trusted sources
Academic Graph
… and more
… and more
… and more
… and more
… and more
… and more
Harvesting/transformation workflows
Source A
Collect Transform
Source B
Native
XML
Cleaned
XML
Collect Transform
Native
XML
Cleaned
XML
Data Collection Workflow
Sub-Workflow Sub-Workflow
Monitoring Data Quality/Expectations
across sources, within sources, etc.
• Workflow templates and workflow
executions (scheduled)
• Provenance
• Types of products
• Etc.
Transformation
• Moving from XML to JSON
frameworks: XSLT to JSON, XML to
JSON
GUIDE
LINES
GUIDE
LINES
Fine-grained classification of Research Products
Publications
• Article
• Preprint
• Report
• …
Datasets
• Dataset
• Collection
• Clinical Trials
• …
Software
• Research
Software
• …
Other Research
Products
• Service
• Workflow
• Interactive
Resource
• …
Institutional/
publication
repositories
Journals/
publishers
Data
repositories
Other
Products
repositories
Software
repositories
OpenAIRE-Advance Review, January 2019
Pre-processed sources
Article-datasetlinks
480Milinks
CrossRefenriched
85Mipublicationrecords
DOIBoost
Academic Graph
Published every 6 months
(new versions to be published next week)
Generating and
maintaing dumps
overtime
• Versions
• Incremental
• MapReduce on HDFS/Spark
• 13 Millions full-texts
• Java/Python framework
Mining
Find new metadata and links
• Identification of links to entities (URLs, PIDs)
• Semantics for documents, datasets, software
• Semantics of links
• Links to web docs
• Ecc
Collect Open Access PDFs
• Pro-actively collect pre-prints
• Identify Open Access versions
Context Propagation
Product
Source
Country
Project
Organization
communit
y
Product
Project Source
Product
Project
Product
supplementedBy
fundedBy
hostedBy
(institutional repository)
located
Funder
funds
(National Funder)
fundedBy
jurisdiction
located
ofInterestofInterest
fundedBy
hostedBy
Product
supplementedBy
157K
8Mi 10K
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
De-duplication (BETA Content)
More information about the de-duplication framework used by
OpenAIRE can be found searching on Zenodo for :
• “De-duplicating the OpenAIRE Scholarly Communication Big Graph”
(poster)
• “GDup: De-Duplication of Scholarly Communication Big Graphs”
Deduplication techniques
(MapReduce based, Java)
• Improving results by adding
context
Production: Open Access CAPs
BETA: Open Science CAPs
0
10000000
20000000
30000000
40000000
50000000
60000000
70000000
80000000
90000000
100000000
Old CAP New CAP
literature
0
2000000
4000000
6000000
8000000
10000000
12000000
Old CAP New CAP
research data
0
20000
40000
60000
80000
100000
120000
140000
Old CAP New CAP
software
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
4500000
Old CAP New CAP
other
110Mi
30Mi
1Mi
10Mi
100K
180K
3Mi
7Mi
Harvested content
• Data sources
12K +
• Records
450Mi
• Publication full-texts
11,6Mi (Springer N. coming)
• Links (also text-mined)
680Mi
PROD BETA PROD BETA
PROD BETAPROD BETA
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
How to access the
services
API and access
Bulk
OAI-PMH
Dumps in
Zenodo for large
datasets
HTTP
Search
Search REST
APIs
Linked Open
Data
SparQL
LOD dumps
Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
http://develop.openaire.eu
Average unique visitors per month 25,000
Average hits per month 2,2Mi
DOIBoost
Result DOI
Preprint 10.5281/zenodo.1492766
Software toolkit 10.5281/zenodo.1492210
Dataset dump 10.5281/zenodo.1438356
Scholexplorer
• October-November 2019:
OpenAIRE Research Graph open for consultation
Collecting feedback via Trello (operational end of
September)
• December 2019:
OpenAIRE Research Graph
in production
BETA Graph Open Consultation
http://beta.explore.openaire.eu
• Identify errors/inconsistencies (semi-)automatically
• Crowd-sourcing
OpenAIRE Stand-Alone
Services
Access use-cases: APIs and web portal
Harvesting of article-
dataset and dataset-
dataset scholarly links
API
WebUI: link
discovery/navigation
API: link
search/resolution
Other
sources
17,5Mi literature
objects, 50,7Mi
datasets, 481,3Mi
Scholix links;
Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
40Mi hits/month
(~1Bi hits since Jan
2018)
• Numbers
17,5Mi literature objects, 50,7Mi datasets, 481,3Mi Scholix
links;
• API Adoption
40Mi hits per month
Scholexplorer
Access use-cases: APIs and web portal
Other
sources
Harvesting of links
API
API: link
search/resolutio
n
WebUI: link
discovery/navigation
40Mi hits/month
(~1Bi hits since Jan
2018)
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
• Data: 141TB
• Files: 3.5M
• Records: 1,389,303
• Largest File: 516GB
• Largest Dataset: 2.5TB
• Visitors: 2M / year
Zenodo: Content & Usage
27
Growth
Is it a questionnaire
management system? Definite
no!
• Articulated handling of a DMP
Publishing, discovery, reuse, statistics onDMPs
• Actionable DMPs
Validation ofstatements viaexternal services
• Collaborative DMP composition
Researchers intheloop
ArgOS
Machine-actionable data
management planning
Powered by OpenDMP
Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
• Amnesia is a data anonymization tool available at
https://amnesia.openaire.eu
Amnesiacanbeusedlocallyoron-line
On-line is for demos and training, not safe
• Offers true anonymity and not pseudo-anonymity
k-anonymityandkm-anonymity
• Numbers in 2019 till now:
33Khits
7Kusesoftheon-lineservice
470installations
Amnesia
Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
OpenAIRE Dashboards
High-Level View
Harvesting
GUIDE
LINES
Research Infrastructures Publishing
IT
• Repository registration
and validation
• Repository Usage
Statistics
• Repository Broker
Service
Services for Content Providers
http://provide.openaire.eu
Screenshot
• 24 repositories defined at least one
subscription
• Integrate with repositories (Zenodo) and
aggregators (LA Referencia)
• Towards PlanS implementation (PDF
brokering)
Broker Service
Example of record enrichments: From
LaReferencia to OpenAIRE
• Topics have data sources as targets
• Events regard an object in a given data source
• Data sources:
Publication repositories from OpenDOAR
Data Archives from re3data.org
Topics
Event (potential notification):
• Message
• Topic
• TargetRepository
• Trust
Events
Properties or links that are not
available in the records
Merge
Inference
Claims
Enrichments
Records that should be in
the repository but are NOT
in the repository
Deduction from authors
Deduction from
affiliation
Additions
Wrong links
End-user feedbacks
Alerts
Broker User interfaces
37
Usage statistics service for Content Providers
● Join OpenAIRE Usage Statistics
○ enable “usage metrics” for your data source
○ download & configure tracking plugin in your data source
○ confirmation by OpenAIRE once usage events are tracked in PIWIK
● or enter SUSHI endpoint to let OpenAIRE collect COUNTER
reports
Metrics
Download
tracker
Configure Deploy & Test
Validation & Confirmation
Enable Metrics for content providers
Summarized Usage Statistics on the
content provider level
Research Community Dashboard and
Gateways
Research Community
Dashboard
Researcher
Search-Navigate-Monitor
Research Products
Community
Gateway
Community
Gateway
Community Manager
Configure criteria of
inclusion into Gateway
as-a-Service
IT
• Subjects of pertinence
• Provenance (data source) + critieria
• Zenodo communities
• Projects
• Propagation via relationships
Publication «supplementedBy» Data/Software
Project «funds» Publication/Data/Software
Criteria for inclusion
New criteria
• Via ORCID
• Others?
Monitoring trends and impact
MONITOR
Funding
impact
Funding
attraction
Open
Science
impact
Open
Access
impact
Research
Impact
28 Funders in BETA
Monitoring trends and impact
MONITOR
Funding
impact
Funding
attraction
Open
Science
impact
Open
Access
impact
Research
Impact
28 Funders in BETA
Funders
• Trends in research fields: new (multidisciplinary)
disciplines
Institutions
• OA/OS behavior, ability to attract cross-funder
grants
Projects
• Success, interconnections, possible liaisons
Funders
• Recent and past EC and other funders’ activities
(representing various funding levels)
• Checking compliance to funder mandates
Institutions
• Collaboration network (by institution) via projects and
products
• Ability to attract funds from different funders
Projects
• Check if projects are eligible for Post-Grant APC
funding
• Compare project portfolio against that of other similar
institutions (anonymized)
Search and discovery portal
http://explore.openaire.euhttp://beta.explore.openaire.eu
Thank you!
Paolo Manghi
paolo.manghi@isti.cnr.it

More Related Content

What's hot

Moving ahead: The ARIADNE integration process
Moving ahead: The ARIADNE integration processMoving ahead: The ARIADNE integration process
Moving ahead: The ARIADNE integration processariadnenetwork
 
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...Geoffrey Fox
 
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC Geoffrey Fox
 
Classification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsClassification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsGeoffrey Fox
 
Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archiveLewis Crawford
 
07 data structures_and_representations
07 data structures_and_representations07 data structures_and_representations
07 data structures_and_representationsMarco Quartulli
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingTobias Kuhn
 
51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data Stack51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data StackGeoffrey Fox
 
Search Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureSearch Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureChris Bizer
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...inside-BigData.com
 

What's hot (15)

Moving ahead: The ARIADNE integration process
Moving ahead: The ARIADNE integration processMoving ahead: The ARIADNE integration process
Moving ahead: The ARIADNE integration process
 
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
 
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
 
Classification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsClassification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different Facets
 
Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archive
 
07 data structures_and_representations
07 data structures_and_representations07 data structures_and_representations
07 data structures_and_representations
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
04 open source_tools
04 open source_tools04 open source_tools
04 open source_tools
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
 
51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data Stack51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data Stack
 
Search Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureSearch Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited Lecture
 
RDF data clustering
RDF data clusteringRDF data clustering
RDF data clustering
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
 
Session 2
Session 2Session 2
Session 2
 

Similar to Introduction to OpenAIRE services and the OpenAIRE Research Graph

20191119_The OpenAIRE Research Graph
20191119_The OpenAIRE Research Graph 20191119_The OpenAIRE Research Graph
20191119_The OpenAIRE Research Graph OpenAIRE
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...Pedro Príncipe
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...OpenAIRE
 
OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016OpenAIRE
 
20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoringOpenAIRE
 
Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...OpenAIRE
 
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Paolo Manghi
 
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Pedro Príncipe
 
Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...OpenAIRE
 
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...OpenAIRE
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraphOpenAIRE
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community CallOpenAIRE
 
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)OpenAIRE
 
OpenAIRE services and tools for Open Science
OpenAIRE services and tools for Open Science OpenAIRE services and tools for Open Science
OpenAIRE services and tools for Open Science Pedro Príncipe
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Blue BRIDGE
 
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020OpenAIRE
 
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)Pedro Príncipe
 
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLOpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLPlatforma Otwartej Nauki
 

Similar to Introduction to OpenAIRE services and the OpenAIRE Research Graph (20)

20191119_The OpenAIRE Research Graph
20191119_The OpenAIRE Research Graph 20191119_The OpenAIRE Research Graph
20191119_The OpenAIRE Research Graph
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
 
OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016
 
20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring
 
Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...
 
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
 
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
 
Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...
 
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)
Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)
 
OpenAIRE services and tools for Open Science
OpenAIRE services and tools for Open Science OpenAIRE services and tools for Open Science
OpenAIRE services and tools for Open Science
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
 
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
 
CORE APIv3
CORE APIv3CORE APIv3
CORE APIv3
 
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
 
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLOpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
 

More from OpenAIRE

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community CallOpenAIRE
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\OpenAIRE
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community CallOpenAIRE
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community CallOpenAIRE
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?OpenAIRE
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)OpenAIRE
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in GreeceOpenAIRE
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community CallOpenAIRE
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community CallOpenAIRE
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community CallOpenAIRE
 

More from OpenAIRE (20)

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community Call
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managers
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community Call
 

Recently uploaded

Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfSumit Kumar yadav
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingadibshanto115
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....muralinath2
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptRakeshMohan42
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 

Recently uploaded (20)

Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 

Introduction to OpenAIRE services and the OpenAIRE Research Graph

  • 1. @openaire_eu OpenAIRE Services Paolo Manghi Istituto di Scienza e Tecnologie dell’Informazione, CNR
  • 2. Research communities Researchers (All) Content providers Innovators Research managers Funders Building the graph and Dashboards OpenAIRE Dashboards Validation Cleaning De-duplication Inference Research Graph Services Project communiity FunderFunding Product Publicatio n Data Software Organizatio n TERMS OF USE Harvesting Uploading Brokering Source ORP Publications repositories Data repositories Hybrid repositories Registries OA Journals Software repositories Content Providers Research Infras GUIDE LINES
  • 3. Metadata records files cleaned records Full-text cache Transform Clean Identify equivelent products and organisation s Aggregation subsystem De-duplication subsystem Information Inference subsystem Data Sources Populate Merge equivalent objects Data provision subsystem Collect Native graph “slices” Publishing subsystem Data Monitoring Action Sets (similarity rels) Front-end Native graph Deduped graph Extract full-text Copy of deduped graph Enrich graphs with links Action Set (inferred links) Enriched graph Propagation Text-mining of the full-texts and the graph to derive new semantic links Architecture and technologies: today
  • 4. Round-table of Open Source Technologies
  • 8. Materializing the Open Science Graph Project communit y FunderFunding Product Publicatio n Researc h Data Software Organizatio n Source Other res. products Mining Deduplication End-user feedback Harvesting GUIDE LINES Research Infrastructures Publishing IT OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 9. Providing an open metadata research graph of interlinked scientific products, with Open Access information, linked to funding information and research communities The OpenAIRE research graph Open Complete De-duplicated Transparent Participatory Decentralized Trusted
  • 10. Complete: community-trusted sources Academic Graph … and more … and more … and more … and more … and more … and more
  • 11. Harvesting/transformation workflows Source A Collect Transform Source B Native XML Cleaned XML Collect Transform Native XML Cleaned XML Data Collection Workflow Sub-Workflow Sub-Workflow Monitoring Data Quality/Expectations across sources, within sources, etc. • Workflow templates and workflow executions (scheduled) • Provenance • Types of products • Etc. Transformation • Moving from XML to JSON frameworks: XSLT to JSON, XML to JSON GUIDE LINES GUIDE LINES
  • 12. Fine-grained classification of Research Products Publications • Article • Preprint • Report • … Datasets • Dataset • Collection • Clinical Trials • … Software • Research Software • … Other Research Products • Service • Workflow • Interactive Resource • … Institutional/ publication repositories Journals/ publishers Data repositories Other Products repositories Software repositories OpenAIRE-Advance Review, January 2019
  • 13. Pre-processed sources Article-datasetlinks 480Milinks CrossRefenriched 85Mipublicationrecords DOIBoost Academic Graph Published every 6 months (new versions to be published next week) Generating and maintaing dumps overtime • Versions • Incremental
  • 14. • MapReduce on HDFS/Spark • 13 Millions full-texts • Java/Python framework Mining Find new metadata and links • Identification of links to entities (URLs, PIDs) • Semantics for documents, datasets, software • Semantics of links • Links to web docs • Ecc Collect Open Access PDFs • Pro-actively collect pre-prints • Identify Open Access versions
  • 15. Context Propagation Product Source Country Project Organization communit y Product Project Source Product Project Product supplementedBy fundedBy hostedBy (institutional repository) located Funder funds (National Funder) fundedBy jurisdiction located ofInterestofInterest fundedBy hostedBy Product supplementedBy 157K 8Mi 10K OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 16. De-duplication (BETA Content) More information about the de-duplication framework used by OpenAIRE can be found searching on Zenodo for : • “De-duplicating the OpenAIRE Scholarly Communication Big Graph” (poster) • “GDup: De-Duplication of Scholarly Communication Big Graphs” Deduplication techniques (MapReduce based, Java) • Improving results by adding context
  • 17. Production: Open Access CAPs BETA: Open Science CAPs 0 10000000 20000000 30000000 40000000 50000000 60000000 70000000 80000000 90000000 100000000 Old CAP New CAP literature 0 2000000 4000000 6000000 8000000 10000000 12000000 Old CAP New CAP research data 0 20000 40000 60000 80000 100000 120000 140000 Old CAP New CAP software 0 500000 1000000 1500000 2000000 2500000 3000000 3500000 4000000 4500000 Old CAP New CAP other 110Mi 30Mi 1Mi 10Mi 100K 180K 3Mi 7Mi Harvested content • Data sources 12K + • Records 450Mi • Publication full-texts 11,6Mi (Springer N. coming) • Links (also text-mined) 680Mi PROD BETA PROD BETA PROD BETAPROD BETA OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 18. How to access the services
  • 19. API and access Bulk OAI-PMH Dumps in Zenodo for large datasets HTTP Search Search REST APIs Linked Open Data SparQL LOD dumps Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica http://develop.openaire.eu Average unique visitors per month 25,000 Average hits per month 2,2Mi
  • 20. DOIBoost Result DOI Preprint 10.5281/zenodo.1492766 Software toolkit 10.5281/zenodo.1492210 Dataset dump 10.5281/zenodo.1438356
  • 22. • October-November 2019: OpenAIRE Research Graph open for consultation Collecting feedback via Trello (operational end of September) • December 2019: OpenAIRE Research Graph in production BETA Graph Open Consultation http://beta.explore.openaire.eu • Identify errors/inconsistencies (semi-)automatically • Crowd-sourcing
  • 24. Access use-cases: APIs and web portal Harvesting of article- dataset and dataset- dataset scholarly links API WebUI: link discovery/navigation API: link search/resolution Other sources 17,5Mi literature objects, 50,7Mi datasets, 481,3Mi Scholix links; Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica 40Mi hits/month (~1Bi hits since Jan 2018)
  • 25. • Numbers 17,5Mi literature objects, 50,7Mi datasets, 481,3Mi Scholix links; • API Adoption 40Mi hits per month Scholexplorer
  • 26. Access use-cases: APIs and web portal Other sources Harvesting of links API API: link search/resolutio n WebUI: link discovery/navigation 40Mi hits/month (~1Bi hits since Jan 2018) OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 27. OpenAIREAdvance1stReview|Luxembourg|10Oct2019 • Data: 141TB • Files: 3.5M • Records: 1,389,303 • Largest File: 516GB • Largest Dataset: 2.5TB • Visitors: 2M / year Zenodo: Content & Usage 27 Growth
  • 28. Is it a questionnaire management system? Definite no! • Articulated handling of a DMP Publishing, discovery, reuse, statistics onDMPs • Actionable DMPs Validation ofstatements viaexternal services • Collaborative DMP composition Researchers intheloop ArgOS Machine-actionable data management planning Powered by OpenDMP Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
  • 29. • Amnesia is a data anonymization tool available at https://amnesia.openaire.eu Amnesiacanbeusedlocallyoron-line On-line is for demos and training, not safe • Offers true anonymity and not pseudo-anonymity k-anonymityandkm-anonymity • Numbers in 2019 till now: 33Khits 7Kusesoftheon-lineservice 470installations Amnesia Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
  • 32. • Repository registration and validation • Repository Usage Statistics • Repository Broker Service Services for Content Providers http://provide.openaire.eu Screenshot
  • 33. • 24 repositories defined at least one subscription • Integrate with repositories (Zenodo) and aggregators (LA Referencia) • Towards PlanS implementation (PDF brokering) Broker Service
  • 34. Example of record enrichments: From LaReferencia to OpenAIRE
  • 35. • Topics have data sources as targets • Events regard an object in a given data source • Data sources: Publication repositories from OpenDOAR Data Archives from re3data.org Topics Event (potential notification): • Message • Topic • TargetRepository • Trust
  • 36. Events Properties or links that are not available in the records Merge Inference Claims Enrichments Records that should be in the repository but are NOT in the repository Deduction from authors Deduction from affiliation Additions Wrong links End-user feedbacks Alerts
  • 38. Usage statistics service for Content Providers
  • 39. ● Join OpenAIRE Usage Statistics ○ enable “usage metrics” for your data source ○ download & configure tracking plugin in your data source ○ confirmation by OpenAIRE once usage events are tracked in PIWIK ● or enter SUSHI endpoint to let OpenAIRE collect COUNTER reports Metrics Download tracker Configure Deploy & Test Validation & Confirmation
  • 40. Enable Metrics for content providers
  • 41. Summarized Usage Statistics on the content provider level
  • 42. Research Community Dashboard and Gateways Research Community Dashboard Researcher Search-Navigate-Monitor Research Products Community Gateway Community Gateway Community Manager Configure criteria of inclusion into Gateway as-a-Service IT
  • 43. • Subjects of pertinence • Provenance (data source) + critieria • Zenodo communities • Projects • Propagation via relationships Publication «supplementedBy» Data/Software Project «funds» Publication/Data/Software Criteria for inclusion New criteria • Via ORCID • Others?
  • 44. Monitoring trends and impact MONITOR Funding impact Funding attraction Open Science impact Open Access impact Research Impact 28 Funders in BETA
  • 45. Monitoring trends and impact MONITOR Funding impact Funding attraction Open Science impact Open Access impact Research Impact 28 Funders in BETA Funders • Trends in research fields: new (multidisciplinary) disciplines Institutions • OA/OS behavior, ability to attract cross-funder grants Projects • Success, interconnections, possible liaisons Funders • Recent and past EC and other funders’ activities (representing various funding levels) • Checking compliance to funder mandates Institutions • Collaboration network (by institution) via projects and products • Ability to attract funds from different funders Projects • Check if projects are eligible for Post-Grant APC funding • Compare project portfolio against that of other similar institutions (anonymized)
  • 46. Search and discovery portal http://explore.openaire.euhttp://beta.explore.openaire.eu