SlideShare une entreprise Scribd logo
1  sur  94
www.cineca.it
~
Integrate external services in
DSpace submission process
How to make self-deposit easy and improve
metadata quality and presence of full-text
Andrea Bollini – Susanna Mornati
Topics
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Make the repository an active actor:
⁄Discovering missing content
⁄Improve Fulltext presence
⁄Integration of external services:
⁄Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄Publishers policy: Sherpa/Romeo
⁄ Owned companies: Kion, SCS.
⁄ Employees: 400 (+150 Kion)
⁄ Total turnover:  70M€
The Company
⁄ Interuniversity Consortium
⁄ No-Profit
⁄ Founded in 1969
⁄ Headquarter in Bologna
⁄ 57 Members
⁄ 54 Universities
⁄ 2 Research institutes
⁄ MIUR
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Owned companies: Kion, SCS.
⁄ Employees: 400 (+150 Kion)
⁄ Total turnover:  70M€
The Company
⁄ Interuniversity Consortium
⁄ No-Profit
⁄ Founded in 1969
⁄ Headquarter in Bologna
⁄ 57 Members
⁄ 54 Universities
⁄ 2 Research institutes
⁄ MIUR
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Owned companies: Kion, SCS.
⁄ Employees: 400 (+150 Kion)
⁄ Total turnover:  70M€
The Company
⁄ Interuniversity Consortium
⁄ No-Profit
⁄ Founded in 1969
⁄ Headquarter in Bologna
⁄ 57 Members
⁄ 54 Universities
⁄ 2 Research institutes
⁄ MIUR
as last week!
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ The “merging process” of the three Italian Consortia started in September 2012
⁄ It was concluded in July 1st 2013 (last week!)
The Merge
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ The “merging process” of the three Italian Consortia started in September 2012
⁄ It was concluded in July 1st 2013 (last week!)
The Merge
2.0
⁄ 67 Members
⁄ More than 700 employees (+ 150 Kion)
⁄ The only Italian Interuniversity Consortium
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Higher Education
• Solutions & Services for the University Administration
• Services for the Ministry of Education, University and
Research (MIUR)
Scientific Research
• High Performance Computing – FERMI: 2° in EU / 7° WW)
• Scientific Visualization & Interactive Virtual Environments
Technological Innovation
• Data Center
• Information and Knowledge Management Services
• Health Care Systems
What CINECA does
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
•Cineca Board of Directors
Product
Managers
Board
U-GOV &
SURplus
Restricted Board
Customer
Service
Board
Technical &
Delivery Board
Apps
Road Map
Tech
Road Map
•University Customers
•Focus Groups
•University Customers
•Cineca Technical Board
How we work with Universities
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Solutions for HE
= ERP = Best of Breed
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Solutions for HE
= ERP = Best of Breed
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
SURplus: CINECA’ CRIS System
⁄ An interoperable infrastructure made of different components
⁄ Ingestion of data from any legacy systems adopted by an institution
⁄ Maintenance of specific functional requirements, data model and preferred technologies at
the level of applications
⁄ Data warehouse and Business Intelligence tools to facilitate aggregations of data and the
application of measurement parameters and algorithms
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
SURplus: Dimension
⁄ Beginning of activities: 2004
⁄ 9 institutions
⁄ 22 institutional repositories
⁄ Total modules: 77
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Topics
⁄ Integration of external services:
⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄ Publishers policy: Sherpa/Romeo
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Make the repository an active actor:
⁄ Discovering missing content
⁄ Improve Fulltext presence
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
CINECA is a registered service provider at DuraSpace
Long-term collaboration with DSpace community, since 2003
DSpace: SURplus’ Open Archive Module
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
CINECA is a registered service provider at DuraSpace
Long-term collaboration with DSpace community, since 2003
Upgrades are periodically released to the open source community
DSpace: SURplus’ Open Archive Module
⁄ Manages collection and dissemination of research results
⁄ Simplifies data collection’s processes
⁄ Service Integration
The OA Module, developed on DSpace:
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
“dissemination of
entities’ descriptions in
the research
environment which go
beyond publications”
DSpace-CRIS: SURplus’ Expertise & Skills
DSpace-CRIS: designed together with the Hong Kong University & released as open-source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
IR as part of a CRIS system: what
change?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Benefits:
⁄Strong deposit mandate
⁄More funding
⁄Issues to mitigate:
⁄IR become a critical application
⁄Author have a “requirements” perception
 Wasting time
 Late submission
IR as part of a CRIS system: what
change?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Benefits:
⁄Strong deposit mandate
⁄More funding
⁄Issues to mitigate:
⁄IR become a critical application
⁄Author have a “requirements” perception
 Wasting time
 Late submission
Professional support
HA infrastructure
Dedicated team
IR as part of a CRIS system: what
change?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Benefits:
⁄Strong deposit mandate
⁄More funding
⁄Issues to mitigate:
⁄IR become a critical application
⁄Author have a “requirements” perception
 Wasting time
 Late submission
Professional support
HA infrastructure
Dedicated team
advocacy
IR as part of a CRIS system: what
change?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Benefits:
⁄Strong deposit mandate
⁄More funding
⁄Issues to mitigate:
⁄IR become a critical application
⁄Author have a “requirements” perception
 Wasting time
 Late submission
Professional support
HA infrastructure
Dedicated team
advocacy
Make the submission
process easy
The information already exists in
other database!
Topics
⁄ Integration of external services:
⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄ Publishers policy: Sherpa/Romeo
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Make the repository an active actor:
⁄ Discovering missing content
⁄ Improve Fulltext presence
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Free search form
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Available providers: each provider
is a spring service
Free search form
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Available providers: each provider
is a spring service
Free search form
Main metadata common to all publication
types (article, book, etc.)
Title of the contribution
Year
Authors/Editors
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Available providers: each provider
is a spring service
Free search form
Main metadata common to all publication
types (article, book, etc.)
Title of the contribution
Year
Authors/Editors
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Lookup by unique identifier
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Lookup by unique identifier
Each provider declares which identifiers is able to manage
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Lookup by unique identifier
Each provider declares which identifiers is able to manage
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Lookup by unique identifier
Each provider declares which identifiers is able to manage
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
New first submission step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
For each result providers are
shown that match the record.
Grouping is done via DOI
Modal box publication details
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Modal box publication details
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Modal box publication details
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Records from different providers
are merged to get richer metadata
The system guesses a collection
for the submission but the user
can change it if required
Modal box publication details
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Records from different providers
are merged to get richer metadata
The system guesses a collection
for the submission but the user
can change it if required
Manual submission
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Manual submission
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
When lookup fails the user can
always proceed manually
Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Import data (identifiers or
structured text) can be inputed
manually or uploaded as a file
Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Import data (identifiers or
structured text) can be inputed
manually or uploaded as a file
Format/provider must be
specified by the user
Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Request are processed:
⁄Inline for specific providers and/or within configured data
limits
 Submitter can immediately complete the pre-filled
submissions


Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Request are processed:
⁄Inline for specific providers and/or within configured data
limits
 Submitter can immediately complete the pre-filled
submissions
⁄In a background process
 Submitter will receive a summary email with import
result
 Pre-filled submissions are available as in-progress
submission in the MyDSpace
Batch import from external source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄Request are processed:
⁄Inline for specific providers and/or within configured data
limits
 Submitter can immediately complete the pre-filled
submissions
⁄In a background process
 Submitter will receive a summary email with import
result
 Pre-filled submissions are available as in-progress
submission in the MyDSpace
The legacy batch import feature for JSPUI has been already shared as pull request
on GitHub, see DS-1252
Enhanced Describe step: showing
metadata source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Enhanced Describe step: showing
metadata source
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Technical details
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
arXiv Lookup Provider
Scopus Lookup Provider
…
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
WGET http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi
?db=pubmed&id=23297105&retmode=xml&rettype=full
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
public class PubmedLookupProvider extends
ConfigurableLookupProvider
public abstract class ConfigurableLookupProvider
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
public class PubmedLookupProvider extends
ConfigurableLookupProvider
public abstract class ConfigurableLookupProvider
public class PubmedItem
{
private String pubmedID;
private String doi;
private String issn;
private String eissn;
private String journalTitle;
private String title;
private String pubblicationModel;
private String year;
private String volume;
private String issue;
private String language;
private List<String> type;
private List<String> primaryKeywords;
private List<String> secondaryKeywords;
…
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
arXiv Lookup Provider
Scopus Lookup Provider
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
public class PubmedLookupProvider extends
ConfigurableLookupProvider
public abstract class ConfigurableLookupProvider
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
arXiv Lookup Provider
arXiv
record
JAVA
Bean
Mapping
file
Scopus Lookup Provider
Scopus
record
JAVA
Bean
Mapping
file
…
<bean name="pubmedService"
class=“...service.PubmedService"/>
<bean name="pubmedLookupProvider"
class=“...lookup.PubmedLookupProvider">
<property name="pubmedService"
ref="pubmedService"/>
</bean>
implements SubmissionLookupProvider
public class PubmedLookupProvider extends
ConfigurableLookupProvider
public abstract class ConfigurableLookupProvider
Translationlogic
originalnormalized
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
Normalized
record
arXiv Lookup Provider
arXiv
record
JAVA
Bean
Mapping
file
Scopus Lookup Provider
Scopus
record
JAVA
Bean
Mapping
file
…
Translationlogic
originalnormalized
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
Normalized
record
Enhancer
plugins
Split, aggregate fields
Derive data
ISSN  Journal title
…
arXiv Lookup Provider
arXiv
record
JAVA
Bean
Mapping
file
Scopus Lookup Provider
Scopus
record
JAVA
Bean
Mapping
file
…
Translationlogic
originalnormalized
PubMed Lookup Provider
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
PubMed
record
JAVA
Bean
Mapping
file
DSpace
Item
Normalized
record
Enhancer
plugins
Split, aggregate fields
Derive data
ISSN  Journal title
…
arXiv Lookup Provider
arXiv
record
JAVA
Bean
Mapping
file
Scopus Lookup Provider
Scopus
record
JAVA
Bean
Mapping
file
…
Translationlogic
NormalizedRepository
Mapping
file
Topics
⁄ Integration of external services:
⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄ Publishers policy: Sherpa/Romeo
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Make the repository an active actor:
⁄ Discovering missing content
⁄ Improve Fulltext presence
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Using the ISSN or EISSN provided in the describe step
the upload form is improved showing on the right side the
publisher policy from the Sherpa/Romeo database
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Access policy for the bitstream:
Open access, embargo, intranet,
etc.
Enhanced upload step
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Access policy for the bitstream:
Open access, embargo, intranet,
etc.
Deposit of fulltext to the national
database for individual CVs
Topics
⁄ Integration of external services:
⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄ Publishers policy: Sherpa/Romeo
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Make the repository an active actor:
⁄ Discovering missing content
⁄ Improve Fulltext presence
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
What is the problem?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
(very) late submissions produce some issues for the repository
both at technical and organization level:
/ The system is subjected to periods of intense input activities.
DSpace, but in general IR software, scales well for read
operations less well for write operations
/ IR staff involved in workflow get lot of task to perform in
small period
What is the problem?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
(very) late submissions produce some issues for the repository
both at technical and organization level:
/ The system is subjected to periods of intense input activities.
DSpace, but in general IR software, scales well for read
operations less well for write operations
/ IR staff involved in workflow get lot of task to perform in
small period
Get researcher aware
Remind researcher
about IR presence
Intercept early
new content
How we plan to mitigate the
problem?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Citation databases provide APIs to perform search (we already
use them for the lookup) and in some cases they provide
additional APIs or search filters/indexes to make more
raffinated search and allow scanning of the database.
The interesting filters/indexes are:
/ Time based (much better if related to insertion in the
citation database)
/ Author ID (better if related to a «standard/common»
identifier as ORCID)
/ Affiliation
/ Subject category
Implementation idea
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Allow the researcher to store personal preferences about
scanning:
/ Enabled providers (e.g disable arXiv if you are not a
physicist)
/ Frequencies
/ Subject categories filters
AuthorIDs will be stored/retrieved from the Researcher profile.
Subject categories could be proposed from previous items or
researcher profile.
DSpace-CRIS: Researcher profile
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
DSpace-CRIS: Researcher profile
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
DSpace-CRIS: Researcher profile
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Who are the potential targets?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ ORCID
⁄ Scopus
⁄ Web of Science
⁄ arXiv
⁄ PubMed Central
⁄ DBLP
⁄ REPEC
Who are the potential targets?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ ORCID
⁄ Scopus
⁄ Web of Science
⁄ arXiv
⁄ PubMed Central
⁄ DBLP
⁄ REPEC
The Repository itself!
The repository as source of
missing content?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ The submitter has to match authors of publication
with the University staff to higthlight internal
authors
⁄ Sometimes matches are missing
⁄ Othertimes matches are wrong (homonymous)
⁄ External authors could become «internal» at some
point in the future
The repository as source of
missing content?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Send email to internal «co-authors» when a
submission is done  prevent wrong attribution
(and reduce duplication)
⁄ Allow researcher to unclaim publications from her
profile  last chance to fix wrong attribution
⁄ Allow researcher to claim publications  fix missing
attribution and/or engagement of new researcher
The repository as source of
missing content?
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Send email to internal «co-authors» when a
submission is done  prevent wrong attribution
(and reduce duplication)
⁄ Allow researcher to unclaim publications from her
profile  last chance to fix wrong attribution
⁄ Allow researcher to claim publications  fix missing
attribution and/or engagement of new researcher
The last two features are included in the
DSpace-CRIS addon
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
This is the current status of the
publication
U  Unlinked
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
This is the current status of the
publication
U  Unlinked
You can claim it
A  Active, simple claim
S  Make it a selected publication
H  Claim it but hide from you public
profile
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
You can unclaim a publication
U  Unlink
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Current implementation: claim/unclaim
publications in the repository
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
Topics
⁄ Integration of external services:
⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc.
⁄ Publishers policy: Sherpa/Romeo
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Make the repository an active actor:
⁄ Discovering missing content
⁄ Improve Fulltext presence
⁄ Some context:
⁄ CINECA a brief overview
⁄ DSpace as part of a CRIS solution
Improve fulltext presence
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
⁄ Use the Sherpa/Romeo policy database to analyze
repository content
⁄ Use external database API to find an actual fulltext
(arXiv, pubmed, ...why not the publisher version via
library subscription?)
⁄ Send email to researcher to validate found PDFs or
ask for an «author» versions
⁄ Use statistics to encourage upload
127.000+ items
65.000+ items
9,4% 17,2%
Sherpa/Romeo Statistics (Example)
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
51%
ISSN
36%
Not in Sherpa
24.000 items
32% green
21.000 items
127.000+ items
65.000+ items
9,4% 17,2%
Sherpa/Romeo Statistics (Example)
www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
51%
ISSN
36%
Not in Sherpa
24.000 items
7,3% have a
fulltext…
5,3% open access
32% green
21.000 items
www.cineca.it | Innovative Open Source Technologies for a CRIS: SURplus | euroCRIS | May 2013
SURplus: prevision 2014
⁄ 50+ institutional repositories (DSpace)
⁄ 10 research portals (DSpace-CRIS)
www.cineca.it
~
Thank you!
Andrea Bollini
a.bollini@cineca.it
SURplus - http://www.cineca.it/en/content/surplus
DSpace-CRIS - http://cilea.github.com/dspace-cris

Contenu connexe

Dernier

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Dernier (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Integrate external bibliographic services in DSpace submission process to make self-deposit easy and improve metadata quality and presence of full-text

  • 1. www.cineca.it ~ Integrate external services in DSpace submission process How to make self-deposit easy and improve metadata quality and presence of full-text Andrea Bollini – Susanna Mornati
  • 2. Topics ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Make the repository an active actor: ⁄Discovering missing content ⁄Improve Fulltext presence ⁄Integration of external services: ⁄Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄Publishers policy: Sherpa/Romeo
  • 3. ⁄ Owned companies: Kion, SCS. ⁄ Employees: 400 (+150 Kion) ⁄ Total turnover:  70M€ The Company ⁄ Interuniversity Consortium ⁄ No-Profit ⁄ Founded in 1969 ⁄ Headquarter in Bologna ⁄ 57 Members ⁄ 54 Universities ⁄ 2 Research institutes ⁄ MIUR www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 4. ⁄ Owned companies: Kion, SCS. ⁄ Employees: 400 (+150 Kion) ⁄ Total turnover:  70M€ The Company ⁄ Interuniversity Consortium ⁄ No-Profit ⁄ Founded in 1969 ⁄ Headquarter in Bologna ⁄ 57 Members ⁄ 54 Universities ⁄ 2 Research institutes ⁄ MIUR www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 5. ⁄ Owned companies: Kion, SCS. ⁄ Employees: 400 (+150 Kion) ⁄ Total turnover:  70M€ The Company ⁄ Interuniversity Consortium ⁄ No-Profit ⁄ Founded in 1969 ⁄ Headquarter in Bologna ⁄ 57 Members ⁄ 54 Universities ⁄ 2 Research institutes ⁄ MIUR as last week! www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 6. ⁄ The “merging process” of the three Italian Consortia started in September 2012 ⁄ It was concluded in July 1st 2013 (last week!) The Merge www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 7. ⁄ The “merging process” of the three Italian Consortia started in September 2012 ⁄ It was concluded in July 1st 2013 (last week!) The Merge 2.0 ⁄ 67 Members ⁄ More than 700 employees (+ 150 Kion) ⁄ The only Italian Interuniversity Consortium www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 8. Higher Education • Solutions & Services for the University Administration • Services for the Ministry of Education, University and Research (MIUR) Scientific Research • High Performance Computing – FERMI: 2° in EU / 7° WW) • Scientific Visualization & Interactive Virtual Environments Technological Innovation • Data Center • Information and Knowledge Management Services • Health Care Systems What CINECA does www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 9. •Cineca Board of Directors Product Managers Board U-GOV & SURplus Restricted Board Customer Service Board Technical & Delivery Board Apps Road Map Tech Road Map •University Customers •Focus Groups •University Customers •Cineca Technical Board How we work with Universities www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 10. Solutions for HE = ERP = Best of Breed www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 11. Solutions for HE = ERP = Best of Breed www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 12. SURplus: CINECA’ CRIS System ⁄ An interoperable infrastructure made of different components ⁄ Ingestion of data from any legacy systems adopted by an institution ⁄ Maintenance of specific functional requirements, data model and preferred technologies at the level of applications ⁄ Data warehouse and Business Intelligence tools to facilitate aggregations of data and the application of measurement parameters and algorithms www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 13. SURplus: Dimension ⁄ Beginning of activities: 2004 ⁄ 9 institutions ⁄ 22 institutional repositories ⁄ Total modules: 77 www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 14. Topics ⁄ Integration of external services: ⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄ Publishers policy: Sherpa/Romeo www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Make the repository an active actor: ⁄ Discovering missing content ⁄ Improve Fulltext presence ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution
  • 15. CINECA is a registered service provider at DuraSpace Long-term collaboration with DSpace community, since 2003 DSpace: SURplus’ Open Archive Module www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 16. CINECA is a registered service provider at DuraSpace Long-term collaboration with DSpace community, since 2003 Upgrades are periodically released to the open source community DSpace: SURplus’ Open Archive Module ⁄ Manages collection and dissemination of research results ⁄ Simplifies data collection’s processes ⁄ Service Integration The OA Module, developed on DSpace: www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 17. “dissemination of entities’ descriptions in the research environment which go beyond publications” DSpace-CRIS: SURplus’ Expertise & Skills DSpace-CRIS: designed together with the Hong Kong University & released as open-source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 18. IR as part of a CRIS system: what change? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Benefits: ⁄Strong deposit mandate ⁄More funding ⁄Issues to mitigate: ⁄IR become a critical application ⁄Author have a “requirements” perception  Wasting time  Late submission
  • 19. IR as part of a CRIS system: what change? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Benefits: ⁄Strong deposit mandate ⁄More funding ⁄Issues to mitigate: ⁄IR become a critical application ⁄Author have a “requirements” perception  Wasting time  Late submission Professional support HA infrastructure Dedicated team
  • 20. IR as part of a CRIS system: what change? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Benefits: ⁄Strong deposit mandate ⁄More funding ⁄Issues to mitigate: ⁄IR become a critical application ⁄Author have a “requirements” perception  Wasting time  Late submission Professional support HA infrastructure Dedicated team advocacy
  • 21. IR as part of a CRIS system: what change? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Benefits: ⁄Strong deposit mandate ⁄More funding ⁄Issues to mitigate: ⁄IR become a critical application ⁄Author have a “requirements” perception  Wasting time  Late submission Professional support HA infrastructure Dedicated team advocacy Make the submission process easy The information already exists in other database!
  • 22. Topics ⁄ Integration of external services: ⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄ Publishers policy: Sherpa/Romeo www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Make the repository an active actor: ⁄ Discovering missing content ⁄ Improve Fulltext presence ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution
  • 23. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 24. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Free search form
  • 25. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Available providers: each provider is a spring service Free search form
  • 26. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Available providers: each provider is a spring service Free search form Main metadata common to all publication types (article, book, etc.) Title of the contribution Year Authors/Editors
  • 27. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Available providers: each provider is a spring service Free search form Main metadata common to all publication types (article, book, etc.) Title of the contribution Year Authors/Editors
  • 28. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 29. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Lookup by unique identifier
  • 30. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Lookup by unique identifier Each provider declares which identifiers is able to manage
  • 31. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Lookup by unique identifier Each provider declares which identifiers is able to manage
  • 32. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Lookup by unique identifier Each provider declares which identifiers is able to manage
  • 33. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 34. New first submission step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 For each result providers are shown that match the record. Grouping is done via DOI
  • 35. Modal box publication details www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 36. Modal box publication details www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 37. Modal box publication details www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Records from different providers are merged to get richer metadata The system guesses a collection for the submission but the user can change it if required
  • 38. Modal box publication details www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Records from different providers are merged to get richer metadata The system guesses a collection for the submission but the user can change it if required
  • 39. Manual submission www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 40. Manual submission www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 When lookup fails the user can always proceed manually
  • 41. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 42. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Import data (identifiers or structured text) can be inputed manually or uploaded as a file
  • 43. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Import data (identifiers or structured text) can be inputed manually or uploaded as a file Format/provider must be specified by the user
  • 44. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Request are processed: ⁄Inline for specific providers and/or within configured data limits  Submitter can immediately complete the pre-filled submissions  
  • 45. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Request are processed: ⁄Inline for specific providers and/or within configured data limits  Submitter can immediately complete the pre-filled submissions ⁄In a background process  Submitter will receive a summary email with import result  Pre-filled submissions are available as in-progress submission in the MyDSpace
  • 46. Batch import from external source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄Request are processed: ⁄Inline for specific providers and/or within configured data limits  Submitter can immediately complete the pre-filled submissions ⁄In a background process  Submitter will receive a summary email with import result  Pre-filled submissions are available as in-progress submission in the MyDSpace The legacy batch import feature for JSPUI has been already shared as pull request on GitHub, see DS-1252
  • 47. Enhanced Describe step: showing metadata source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 48. Enhanced Describe step: showing metadata source www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 49. Technical details www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 50. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 arXiv Lookup Provider Scopus Lookup Provider …
  • 51. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider
  • 52. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider
  • 53. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider WGET http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi ?db=pubmed&id=23297105&retmode=xml&rettype=full
  • 54. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider
  • 55. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider public class PubmedLookupProvider extends ConfigurableLookupProvider public abstract class ConfigurableLookupProvider
  • 56. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider public class PubmedLookupProvider extends ConfigurableLookupProvider public abstract class ConfigurableLookupProvider public class PubmedItem { private String pubmedID; private String doi; private String issn; private String eissn; private String journalTitle; private String title; private String pubblicationModel; private String year; private String volume; private String issue; private String language; private List<String> type; private List<String> primaryKeywords; private List<String> secondaryKeywords; …
  • 57. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file arXiv Lookup Provider Scopus Lookup Provider … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider public class PubmedLookupProvider extends ConfigurableLookupProvider public abstract class ConfigurableLookupProvider
  • 58. PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file arXiv Lookup Provider arXiv record JAVA Bean Mapping file Scopus Lookup Provider Scopus record JAVA Bean Mapping file … <bean name="pubmedService" class=“...service.PubmedService"/> <bean name="pubmedLookupProvider" class=“...lookup.PubmedLookupProvider"> <property name="pubmedService" ref="pubmedService"/> </bean> implements SubmissionLookupProvider public class PubmedLookupProvider extends ConfigurableLookupProvider public abstract class ConfigurableLookupProvider
  • 59. Translationlogic originalnormalized PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file Normalized record arXiv Lookup Provider arXiv record JAVA Bean Mapping file Scopus Lookup Provider Scopus record JAVA Bean Mapping file …
  • 60. Translationlogic originalnormalized PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file Normalized record Enhancer plugins Split, aggregate fields Derive data ISSN  Journal title … arXiv Lookup Provider arXiv record JAVA Bean Mapping file Scopus Lookup Provider Scopus record JAVA Bean Mapping file …
  • 61. Translationlogic originalnormalized PubMed Lookup Provider www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file DSpace Item Normalized record Enhancer plugins Split, aggregate fields Derive data ISSN  Journal title … arXiv Lookup Provider arXiv record JAVA Bean Mapping file Scopus Lookup Provider Scopus record JAVA Bean Mapping file … Translationlogic NormalizedRepository Mapping file
  • 62. Topics ⁄ Integration of external services: ⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄ Publishers policy: Sherpa/Romeo www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Make the repository an active actor: ⁄ Discovering missing content ⁄ Improve Fulltext presence ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution
  • 63. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Using the ISSN or EISSN provided in the describe step the upload form is improved showing on the right side the publisher policy from the Sherpa/Romeo database
  • 64. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 65. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 66. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 67. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Access policy for the bitstream: Open access, embargo, intranet, etc.
  • 68. Enhanced upload step www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Access policy for the bitstream: Open access, embargo, intranet, etc. Deposit of fulltext to the national database for individual CVs
  • 69. Topics ⁄ Integration of external services: ⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄ Publishers policy: Sherpa/Romeo www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Make the repository an active actor: ⁄ Discovering missing content ⁄ Improve Fulltext presence ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution
  • 70. What is the problem? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 (very) late submissions produce some issues for the repository both at technical and organization level: / The system is subjected to periods of intense input activities. DSpace, but in general IR software, scales well for read operations less well for write operations / IR staff involved in workflow get lot of task to perform in small period
  • 71. What is the problem? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 (very) late submissions produce some issues for the repository both at technical and organization level: / The system is subjected to periods of intense input activities. DSpace, but in general IR software, scales well for read operations less well for write operations / IR staff involved in workflow get lot of task to perform in small period Get researcher aware Remind researcher about IR presence Intercept early new content
  • 72. How we plan to mitigate the problem? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Citation databases provide APIs to perform search (we already use them for the lookup) and in some cases they provide additional APIs or search filters/indexes to make more raffinated search and allow scanning of the database. The interesting filters/indexes are: / Time based (much better if related to insertion in the citation database) / Author ID (better if related to a «standard/common» identifier as ORCID) / Affiliation / Subject category
  • 73. Implementation idea www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 Allow the researcher to store personal preferences about scanning: / Enabled providers (e.g disable arXiv if you are not a physicist) / Frequencies / Subject categories filters AuthorIDs will be stored/retrieved from the Researcher profile. Subject categories could be proposed from previous items or researcher profile.
  • 74. DSpace-CRIS: Researcher profile www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 75. DSpace-CRIS: Researcher profile www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 76. DSpace-CRIS: Researcher profile www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 77. Who are the potential targets? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ ORCID ⁄ Scopus ⁄ Web of Science ⁄ arXiv ⁄ PubMed Central ⁄ DBLP ⁄ REPEC
  • 78. Who are the potential targets? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ ORCID ⁄ Scopus ⁄ Web of Science ⁄ arXiv ⁄ PubMed Central ⁄ DBLP ⁄ REPEC The Repository itself!
  • 79. The repository as source of missing content? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ The submitter has to match authors of publication with the University staff to higthlight internal authors ⁄ Sometimes matches are missing ⁄ Othertimes matches are wrong (homonymous) ⁄ External authors could become «internal» at some point in the future
  • 80. The repository as source of missing content? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Send email to internal «co-authors» when a submission is done  prevent wrong attribution (and reduce duplication) ⁄ Allow researcher to unclaim publications from her profile  last chance to fix wrong attribution ⁄ Allow researcher to claim publications  fix missing attribution and/or engagement of new researcher
  • 81. The repository as source of missing content? www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Send email to internal «co-authors» when a submission is done  prevent wrong attribution (and reduce duplication) ⁄ Allow researcher to unclaim publications from her profile  last chance to fix wrong attribution ⁄ Allow researcher to claim publications  fix missing attribution and/or engagement of new researcher The last two features are included in the DSpace-CRIS addon
  • 82. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 83. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 This is the current status of the publication U  Unlinked
  • 84. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 This is the current status of the publication U  Unlinked You can claim it A  Active, simple claim S  Make it a selected publication H  Claim it but hide from you public profile
  • 85. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 86. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 You can unclaim a publication U  Unlink
  • 87. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 88. Current implementation: claim/unclaim publications in the repository www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013
  • 89. Topics ⁄ Integration of external services: ⁄ Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. ⁄ Publishers policy: Sherpa/Romeo www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Make the repository an active actor: ⁄ Discovering missing content ⁄ Improve Fulltext presence ⁄ Some context: ⁄ CINECA a brief overview ⁄ DSpace as part of a CRIS solution
  • 90. Improve fulltext presence www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 ⁄ Use the Sherpa/Romeo policy database to analyze repository content ⁄ Use external database API to find an actual fulltext (arXiv, pubmed, ...why not the publisher version via library subscription?) ⁄ Send email to researcher to validate found PDFs or ask for an «author» versions ⁄ Use statistics to encourage upload
  • 91. 127.000+ items 65.000+ items 9,4% 17,2% Sherpa/Romeo Statistics (Example) www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 51% ISSN 36% Not in Sherpa 24.000 items 32% green 21.000 items
  • 92. 127.000+ items 65.000+ items 9,4% 17,2% Sherpa/Romeo Statistics (Example) www.cineca.it | Integrate external services in DSpace submission process | OR2013| July 2013 51% ISSN 36% Not in Sherpa 24.000 items 7,3% have a fulltext… 5,3% open access 32% green 21.000 items
  • 93. www.cineca.it | Innovative Open Source Technologies for a CRIS: SURplus | euroCRIS | May 2013 SURplus: prevision 2014 ⁄ 50+ institutional repositories (DSpace) ⁄ 10 research portals (DSpace-CRIS)
  • 94. www.cineca.it ~ Thank you! Andrea Bollini a.bollini@cineca.it SURplus - http://www.cineca.it/en/content/surplus DSpace-CRIS - http://cilea.github.com/dspace-cris