During the workshop 'Demands and requirements of scientific Text and Data Mining', organized by the Priority Initiative ' Digital Information' of the Deutsche Forschungs Gemeinschaft, I presented some of the findings and results of our Leiden University Libraries project on Text- & Data Mining (TDM) and I gave an overview of the barriers for TDM at a national level in relation to license and Intellectual Property Rights.
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
Presentation DFG Bonn 16 september 2015
1. Discover theworld at Leiden UniversityDiscover theworld at Leiden University
Text- & Data Mining at Leiden University
from a Library Perspective
Drs. Isabel Brouwer | Leiden University Libraries
DFG | Bonn - 16 September 2015
2. Discover theworld at Leiden University
New areas of expertise in research support
Leiden University Libraries (UBL)
• Virtual Research Environments
• Open Access
• Data Management & Data Curation
• IPR
• Publication Support
• GIS
• Text- & Data Mining
3. Discover theworld at Leiden University
UBL Project TDM - Limitations
• Limitation to text (collections)
• Focus on two international trends:
1) Humanities: increasing use of TDM on digital text corpora
NL: eHumanities group - Royal Netherlands Academy of Arts &
Sciences (KNAW)
2) Science: growing importance of TDM digital content for Literature
Based Discovery (Swanson linking)
4. Discover theworld at Leiden University
UBL Project TDM - Goals
Gain insight into:
• The impact of the 2 above mentioned international trends on Leiden
researchers
• To what extend would they use our collections for TDM?
• What do researchers need and wish for this purpose?
• What does the library have to do to make collections suitable for
TDM?
• What supporting services could the library offer?
5. Discover theworld at Leiden University
UBL Project TDM – Project Team
• Librarians:
• Leiden University Libraries (UBL)
• Walaeus Library-Leiden University Medical Centre (LUMC)
• Researchers:
• Faculty of Humanities:
• Leiden University Centre for Linguistics (LUCL)
• Book & Digital Media Studies (Leiden University Centre for the Arts in Society-
LUCAS)
• Biosemantics group-Leiden University Medical Centre (LUMC)
6. Discover theworld at Leiden University
UBL Project TDM – Approach
• Phase 1 : exploratory study
- Desk research
- Symposia, workshops
• Phase 2: execution
- Interviews
- Pilots
- Symposium ‘ Digital Scholarship and the Role of the Library’
- Available sources, TDM license agreements, criteria digitized collections
- TDM Website
- Selection TDM tools
• Phase 3: evaluation, recommendations
- End of Project Report
7. Discover theworld at Leiden University
UBL Project TDM – Findings (1)
• Humanities
- TDM mainly done by Computational Linguistics, History, Area Studies and Book & Digital Media
Studies
- Texts from digitization projects (libraries, archives, Google Books)
- Researchers digitizing themselves
- Databases like Factiva (newspapers) becoming popular
• Science
- TDM mainly done in Biomedical Sciences (-omics)
- Literature Based Discovery – not at the scale that we expected
• Bibliometrics
- Science
- Centre for Science and Technology Studies (CWTS), Faculty of Social Sciences
8. Discover theworld at Leiden University
UBL Project TDM – Findings (2)
• Leiden researchers that use TDM have encountered various difficulties in relation
to accessibility and usability of textual data for TDM
• Difficulties in relation to copyright law and license agreements: no access at all,
restrictions, etc.
• Different ways to deal with barriers by researchers
• And libraries
- Elsevier TDM agreement not accepted by the UKB (partnership of Dutch
University Libraries and the National Library of the Netherlands )
- Negotiations with Elsevier now part of negotiations on OA and done at a
national level by the VSNU (Association of Universities in the Netherlands)
- No agreement yet
9. Discover theworld at Leiden University
Open Access position of the Dutch Government
• 2013: State Secretary Sander Dekker announced a national transition to OA
• 2016: if necessary legislation
• 2018: 60 % OA
• 2024: 100 % OA
• Preference for gold OA
• Negotiations will all publishers involved will be done by:
- VSNU (The Association of Universities in the Netherlands)
- UKB (the partnership of University Libraries and the National Library of the
Netherlands)
- SURF market (the collaborative ICT organization for Dutch higher education and
research)
• No new contracts without agreements about OA
10. Discover theworld at Leiden University
Removing barriers
Researchers play a vital role in OA!
International level:
• European Commission
• The League of European Research Universities (LERU)
• The Association of European Research Libraries (LIBER)
• The International Federation of Library Associations and Institutions (IFLA)
National level:
• Association of Universities in the Netherlands (VSNU)
• The Partnership of University Libraries and the National Library of the
Netherlands (UKB)
11. Discover theworld at Leiden University
Thank you very much for your attention!
http://www.library.leiden.edu
http://www.library.leiden.edu/teaching-
researching-publishing/manage-your-
research/data-and-text-mining/
@bellabrouwer
@ubleiden