UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is reading what in your institution Thomas Porquet COUPERIN.ORG Consortium and Cécilia Fabry CNRS: National Centre for Scientific Research
EzPAARSE is open source software that analyses your locally gathered proxy logfiles and provides you with COUNTER-deduplicated, KBART-formatted and geolocalised reports of your users’ accesses to subscribed e-resources. Come and watch us demo it live to understand how it works and learn how to install it in your institution for producing your own enriched measures and indicators.
Levine-Clark, Michael, “E-Resources in Academic Libraries: Trends, Strategies...
Similaire à UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is reading what in your institution Thomas Porquet COUPERIN.ORG Consortium and Cécilia Fabry CNRS: National Centre for Scientific Research
Karuta: Design Your Own Portfolio ProcessJanice Smith
Similaire à UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is reading what in your institution Thomas Porquet COUPERIN.ORG Consortium and Cécilia Fabry CNRS: National Centre for Scientific Research (20)
UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is reading what in your institution Thomas Porquet COUPERIN.ORG Consortium and Cécilia Fabry CNRS: National Centre for Scientific Research
1. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
E-RESOURCES: EZPAARSE HELPS YOU
DISCOVER WHO IS READING WHAT IN
YOUR INSTITUTION.
http://ezpaarse.couperin.org
http://analogist.couperin.org
cecilia.fabry@inist.fr
thomas.porquet@couperin.org
https://github.com/ezpaarse-project/ezpaarse
ezpaarse@couperin.org
2. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A. An Introduction to the ezPAARSE/AnalogIST project
1- A Need for Better Figures
2- Gathering and analysing Local Data
3- AnalogIST and ezPAARSE
4- Results and Visualization
B. Live demo!
1. Installation
2. Processing and analyses
Presentation outline
3. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A. An Introduction to the ezPAARSE/AnalogIST project
⇒1- A Need for Better Figures
2- Gathering and analysing Local Data
3- AnalogIST and ezPAARSE
4- Results and Visualization
B. Live demo!
1. Installation
2. Processing and analyses
4. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
About some well-known facts
● $25 billion global revenue in 2012, + 4-5 % / year
● The 4 biggest publishers make up half the market
● For 10 years the price of most journals increases from 3% to
5% / year
● 1.5 billion articles downloaded per year and by 10M users
The Scientific and Technical Information Market
We need to assess and evaluate the use
of these e-resources
1. A Need for better figures
5. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
What we’ve currently got
… not available
… available and COUNTER-compliant
… available but not COUNTER- compliant
Publisherprovided statistics are
1. A Need for better figures
6. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
the ezPAARSE solution :
→ locally-gathered usage quantification
Vendors are the only source
These numbers just offer mere quantification
→ We need to assess these
numbers
→ We need to qualify them
Some limitations with publisher provided statistics
1. A Need for better figures
7. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A. An Introduction to the ezPAARSE/AnalogIST project
1- A Need for Better Figures
⇒ 2- Gathering and analysing Local Data
3- AnalogIST and ezPAARSE
4- Results and Visualization
B. Live demo!
1. Installation
2. Processing and analyses
8. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
The reverse proxy...
2. Gathering and analysing usage data
1
9. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
...and where ezPAARSE comes into play
3
2. Gathering and analysing usage data
1
2
10. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
1
3
2
2. Gathering and analysing usage data
...and where ezPAARSE comes into play
11. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
http://pdn.sciencedirect.com/science?
_ob=MiamiImageURL&_cid=271664&_user=4046427&_pii=S0001
457512000747&_check=y&_origin=browse&_zone=rslt_list_item&
_coverDate=2012-07-31&wchp=dGLbVlt-
zSkWb&md5=f5d8d157ccda6d597cb466af123dbff3/1-s2.0-
S0001457512000747-main.pdf
2. Gathering and analysing usage data
Example of a URL structuration
12. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Example of a URL structuration
http://pdn.sciencedirect.com/science?
_ob=MiamiImageURL&_cid=271664&_user=4046427&_pii=S0001
457512000747&_check=y&_origin=browse&_zone=rslt_list_item&
_coverDate=2012-07-31&wchp=dGLbVlt-
zSkWb&md5=f5d8d157ccda6d597cb466af123dbff3/1-s2.0-
S0001457512000747-main.pdf
ISSN & type of the downloaded file
2. Gathering and analysing usage data
13. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
http://www.cairn.info/load_pdf.php?ID_ARTICLE=RFG_218_0009
We know it’s a PDF but
we only get a publisher-
specific identifier.
We need a correspondence table:
the Publisher Knowledge Base
(ideally a KBART formated file)
Publisher id ISSN
RFG 0338-4551
LMS 0027-2671
...
Example of a URL structuration
2. Gathering and analysing usage data
14. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
http://pdn.sciencedirect.com/science?
_ob=MiamiImageURL&_cid=271664&_user=4046427&_pii=S000145751200074
7&_check=y&_origin=browse&_zone=rslt_list_item&_coverDate=2012-07-
31&wchp=dGLbVlt-zSkWb&md5=f5d8d157ccda6d597cb466af123dbff3/1-s2.0-
S0001457512000747-main.pdf
/_pii=S([0-9]{0,7}[0-9X])/i
How to parse the URL?
2. Gathering and analysing usage data
15. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
we need one parser for each platform
Recognized platforms
...today ezPAARSE covers about
65 platforms
2. Gathering and analysing usage data
16. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Opaque URLs : session ids, encryption… (Ex: EbscoHOST platform)
http://web.b.ebscohost.com.bases-doc.univ-
lorraine.fr/ehost/pdfviewer/pdfviewer?vid=3&sid=af50e721-1536-4e25-
a18e-d09532b5aa01%40sessionmgr110&hid=118
Publisher IDs, needing to be linked to a knowledge base (ex: Cairn)
http://www.cairn.info/load_pdf.php?ID_ARTICLE=RFG_218_0009
- Opaque URLs (session ids, encryption…)
- Knowledge bases having to be manually edited
But some limitations apply...
2. Gathering and analysing usage data
17. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
The same principle may be applied to
Institutional repositories (IRs)
http://docnum.univ-lorraine.fr/public/UPV-M/Theses/2005/Germain.Lionel.SMZ0518.pdf
http://hal-univ-rennes1.archives-ouvertes.fr/file/index/docid/
797465/filename/These_Morgane_Gicquel_2012.pdf
HAL (Hyper Articles en Ligne), the main French National Repository :
The Université de Lorraine repository :
Examples :
2. Gathering and analysing usage data
for which we need
to filter out robots
accesses
18. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A. An Introduction to the ezPAARSE/AnalogIST project
1- A Need for Better Figures
2- Gathering and analysing Local Data
⇒ 3- AnalogIST and ezPAARSE
4- Results and Visualization
B. Live demo!
1. Installation
2. Processing and analyses
19. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
the software
ez : easy / PAARSE : Progiciel d'Analyse des
Accès aux RessourceS Electroniques
= Software for Analysing the Accesses to
Online Resources
- as a local installation
- as an online service (SaaS)
Free (libre) software
Cross platform
Available online here :
http://ezpaarse.couperin.org
the wiki portal
Analyse des Logs de l'IST = Analysing
the logs of Scientific and Technical
Information
The place where we:
→ mutualize the participations
→ provide the users with tools to work
→ publish the platform analyses
http://analogist.couperin.org
AnalogIST and ezPAARSE
3. AnalogIST and ezPAARSE
20. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
3. AnalogIST and ezPAARSE
AnalogIST : organizing the collaborative work
21. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
http://analogist.couperin.org/platforms/analyse-helper/start
The rest is
automatically
processedThe URL is the only information
you need to enter
dokuwiki syntax
generated
3. AnalogIST and ezPAARSE
22. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Example of an ezPAARSE output
KBART fields
geoip fields
Deduplicatedconsultationevents:
COUNTERrecommendation
Text file
(CSV or JSON format)
3. AnalogIST and ezPAARSE
23. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A. An Introduction to the ezPAARSE/AnalogIST project
1- A Need for Better Figures
2- Gathering and analysing Local Data
3- AnalogIST and ezPAARSE
⇒ 4- Results and Visualization
B. Live demo!
1. Installation
2. Treatment and analyses
25. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Enrichment of results
Link from external tables to ezPAARSE results to enrich data
Indicators
and
dashboards
User
Data
Pricing
information
Data on
journal
(ILS)
Scientific
disciplineLanguage
Usage events
from
ezPAARSE
4. Results and Visualization
(CNRS)
(CNRS/UL
)
(CNRS/UL)
(CNRS/UL
)
(CNRS/UL
)
26. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Exploiting the Results with
23
4. Results and Visualization
27. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Who (student, researcher, staff) consults what? (UL)
Repartition of consultations of paid content (books, journals, law
references…) by user type at the University of Lorraine
4. Results and Visualization
28. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Consultations by research unit in Astronomy
Consultations of articles from Jan 2014 to
October 2014 by research units in
Astronomy at CNRS
4. Results and Visualization
29. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Detection of an anomaly (CNRS)
The consultation peak corresponds to an abuse of an e-resource.
Detection allows to react promptly to this incident.
4. Results and Visualization
30. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Difference between publishers and ezpaarse for the major journals in astronomy on Elsevier
4. Results and Visualization
31. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Publishers numbers vs ezPAARSE numbers for EDP & ACS
4. Results and Visualization
32. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Geolocation of consultations (CNRS)
4. Results and Visualization
33. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Agile development process
SCRUM
31
34. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
The French SCRUM team
32
Paris
Nancy
Cécilia
Thomas
35. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
4. Results and Visualization
A new tool : ezvis http://couperin_ezpaarseezvis640v3_3.board.inist.fr/index.html
36. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
A new tool : ezvis http://couperin_ezpaarseezvis640v3_3.board.inist.fr/index.html
4. Results and Visualization
37. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
ezPAARSE working live : bibliomap
4. Results and Visualization
38. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
● download ezPAARSE on http://analogist.couperin.org
● install ezPAARSE locally
● start ezPAARSE
● Use the online instance : http://ezpaarse.couperin.org
● if necessary: set up your input parameters
● process some sample logs and check it’s running correctly
● open the csv results
● use the macro to get a first possible graphical layout
● present the project to your team and install ezPAARSE on
a (Unix) server for a daily / weekly / monthly processing
A live demo...
4. Results and Visualization
39. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
In conclusion
● ezPAARSE is free and open source
● Simple to install and to use
● Innovative technologies (NodeJS, AngularJS, MongoDB, etc.)
● Test ezPAARSE
● send us log samples
● give us feedback, negative or positive
● Contribute to a platform analysis
● create an account on Trello and start a platform analysis
● ask us an account on http://analogist.couperin.org
33
40. UKSG 38th Annual Conference: Glasgow - 2015/03/30 & 31
Questions?
http://ezpaarse.couperin.org
http://analogist.couperin.org
https://twitter.com/ezpaarse
nuage de tag avec termes appropriés
https://github.com/ezpaarse-project/ezpaarse