SlideShare une entreprise Scribd logo
1  sur  14
Data Reference Interview
Stuart Macdonald
CISER Data Services Librarian
Email: srm262@cornell.edu
Data Archive - Collection and Services
•Established over 30 years ago
•Collection of numeric datasets to support quantitative
research
c. 27,000 online files in addition to thousands of studies on CD/DVD

•Emphasis on demography (state/federal censuses),
economics, health, labor, election studies, attitudinal and
behavioral studies, family life etc.
•Consulting services to match user needs with appropriate data
•finding, accessing and using data

•Current Cornell researchers can download archive files from online
catalog (search & browse) in formats conversant with statistical software

•Data files are identified by a ‘traffic light’ icon that indicates usage level:
• Green – downloadable by anyone
• Yellow – downloadable from links in the catalog with CUWebAuth authentication
(for use within the CISER research computing environment - CISERRSCH) –
Cornell researchers can apply for a computing account
• Red – data to be used in restriction ( via conditions imposed by data provider)

• Cornell Restricted Access Data Center
•Provides Cornell social science researchers with a repository for
sharing and providing long-term preservation of their numeric/statistical
research data
•Participates in Cornell’s Research Data Management Service Group
•Assist Cornell social science researchers with Research Data
Management (RDM) plans
•Provide Cornell social science researchers with support and expertise
in obtaining and using restricted data
Believe it or not – not all data are the same …
• Data means different things to different people (informatics,
geography, art history, system biology, architecture,
archaeology etc)
• Definition of data / value of data in a commercial sense is
different to that in an academic sense
• Data requirements differ for the undergraduate, postgraduate,
teacher, researcher

• Data catalogs, data libraries, gateways, portals exist for a range
of disciplinary domains
Research data may include all of the following:
• Text or Word documents, spreadsheets
• Laboratory notebooks, field notebooks, diaries
• Questionnaires, transcripts, codebooks
• Audiotapes, videotapes
• Photographs, films
• Slides, artifacts, specimens, samples
• Database contents including video, audio, text, images
• Models, algorithms, scripts
• Contents of an application such as input, output, log files for analysis software, simulation
software, schemas
• Methodologies and workflows
• Standard operating procedures and protocols
Formats, size, volume, open, confidentiality, complexity, flat files – factors to consider as part of
the reference interview (computing capabilities, software dependencies, copyright and ethical
considerations)
Data Reference Interview - establish what the user actually needs (not
what they think they may need!) :
• Statistics or data? Summary statistics, secondary use datasets, raw or derived data
•

•
•
•
•
•
•
•

Software requirements, contingencies

What is the subject or topic? Health, unemployment, deprivation
Type of analysis? Visualization, map, statistical analysis, modelling
What is the unit of analysis? Individual, family, county-level, country-level
Geographic constraints?
Time constraints? Range of years, daily, monthly, quarterly, annual
Cross-sectional or longitudinal?
Data type? Historic, demographic, financial, administrative, geospatial
Sets the goals and structure for the data interview and helps articulate any
decisions made by the data librarian

Establishes the ‘learning stage of the user’ and helps put them at ease

Observations:
Establish time-line for research and data needs (can buy data librarian time, set
priorities, allow time for further investigation)
Fine balance between assistance and exploitation!!
Recognition that data finding, data handling etc may be the learning objective itself
(e.g. identifying variables and using a codebook)
All data queries should be viewed as new. It will soon become evident if the
request has similarities with previous enquiries.
Important not to use too much jargon and to double-check understanding of
unfamiliar terms – often we use the same word to mean something different,
conversely we can use different words but mean the same thing
Sometimes users will say they understand but often don’t. If there’s any doubt
ask and explain again.

Supply of up-to-date user guides to hand
Call Management Systems are great knowledge banks
Be familiar with available expertise (colleagues, organization, national,
international)
Google is a friend. A very good friend.
Two recent examples:
Q. Grad student wanting # of plastic surgery clinics in Seoul, South Korea from 19902009
A. the International Society of Aesthetic Plastic Surgery (ISAPS http://www.isaps.org/ ) in particular the ISAPS International Survey on
Aesthetic/Cosmetic Procedures – there’s data for 2010 and 2011
(http://www.isaps.org/isaps-global-statistics.html ).
Process:
Check NGO sources (World Bank, UN etc)
Check Google – deep searching in to results using a variety of related terms. Time
consuming but often productive. Searches often find references in literature which
can be followed up or discussion forums.
User needs statistical data about agrarian violence (originated by land disputes) variables
include: food riots, assassinations (if occurred as result of land dispute), imprisonments etc
unit of investigation is country-year; area of interest: Latin American countries; period: from
1960 until now, yearly
Process:
Not likely to available through NGO sources
Try deep searching through Google – find literature sources with summary statistics about
land disputes for individual countries – no time series
Responded:
Check Latin America Network Information Center (LANIC) at Univ. Taxas at Austin
Speak with our Cornell Colleague Sean Knowlton who has expertise in Latin American
statistical resources.
Check CEPALSTAT - gateway to statistical information of Latin America and the Caribbean
countries published by Economic Commission for Latin America and the Caribbean
11
Social Science research data resources
•Inter-University Consortium for Political and Social Research (ICPSR)
•National Archive of Criminal Justice Data
•Minority Data Resource Center
•National Archive of Computerized Data on Aging

•Roper Center for Public Opinion Archives
•International Data Archives e.g. CESSDA, UKDA, Eurostat
• CESSDA catalog (DDI) provides a multi-lingual interface to datasets from member social
science data archives across Europe
• Study description and online documentation are free

•Non-Govenmental Organizations
•National / Governmental Statistical Agencies
Social science statistical data on the internet:
CISER Internet Data Sources:
https://ciser.cornell.edu/info/datasource.shtml

MIT Data Sources:
http://libguides.mit.edu/ssds/any-subject
Columbia University Social Science Data
http://library.columbia.edu/locations/dssc/data/socsc.html
University California, San Diego – Data on the Web
http://3stages.org/idata/

Most research-driven universities have similar listings via Data Library webpages
Location & hours:
CISER Data Archive is located at 391 Pine Tree Road, Ithaca
CISER is open 8.30am – 4.30pm (Mon-Fri) – walk-in assistance
is not always available – so appointments are recommended

Contacts:
Tel.: (607) 255 4801
Email: ciser@cornell.edu

Contenu connexe

Tendances

Digital Scholar Webinar: Recruiting Research Participants Online Using Reddit
Digital Scholar Webinar: Recruiting Research Participants Online Using RedditDigital Scholar Webinar: Recruiting Research Participants Online Using Reddit
Digital Scholar Webinar: Recruiting Research Participants Online Using RedditSC CTSI at USC and CHLA
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesMicah Altman
 
Doing research better: The role of meta‐data
Doing research better: The role of meta‐dataDoing research better: The role of meta‐data
Doing research better: The role of meta‐dataGarethKnight
 
Teaching research
Teaching researchTeaching research
Teaching researchsroof
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data miningYashwant Rautela
 
Types of Information Needs
Types of Information NeedsTypes of Information Needs
Types of Information NeedsShivakumar G.T.
 
Data Citation and DOIs
Data Citation and DOIsData Citation and DOIs
Data Citation and DOIsARDC
 
Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesAshley Sanders, Ph.D.
 
Informetrics final
Informetrics finalInformetrics final
Informetrics finalAamir Abbas
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR
 
Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523ORCID, Inc
 
Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016Scott Lee
 
Upgrading the Scholarly Infrastructure
Upgrading the Scholarly InfrastructureUpgrading the Scholarly Infrastructure
Upgrading the Scholarly InfrastructureBjörn Brembs
 
Survey Research Data Archive: Current Status and Challenges
Survey Research Data Archive: Current Status and ChallengesSurvey Research Data Archive: Current Status and Challenges
Survey Research Data Archive: Current Status and ChallengesBob Chao
 
Green, gold, uncle sam, and information literacy
Green, gold, uncle sam, and information literacyGreen, gold, uncle sam, and information literacy
Green, gold, uncle sam, and information literacySeth Porter, MA, MLIS
 

Tendances (19)

Data and Research Infrastructures and Open Science
Data and Research Infrastructures and Open ScienceData and Research Infrastructures and Open Science
Data and Research Infrastructures and Open Science
 
Digital Scholar Webinar: Recruiting Research Participants Online Using Reddit
Digital Scholar Webinar: Recruiting Research Participants Online Using RedditDigital Scholar Webinar: Recruiting Research Participants Online Using Reddit
Digital Scholar Webinar: Recruiting Research Participants Online Using Reddit
 
Research4C4U
Research4C4UResearch4C4U
Research4C4U
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use Cases
 
Doing research better: The role of meta‐data
Doing research better: The role of meta‐dataDoing research better: The role of meta‐data
Doing research better: The role of meta‐data
 
Teaching research
Teaching researchTeaching research
Teaching research
 
LOD-SEM
LOD-SEMLOD-SEM
LOD-SEM
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
 
Types of Information Needs
Types of Information NeedsTypes of Information Needs
Types of Information Needs
 
Data Citation and DOIs
Data Citation and DOIsData Citation and DOIs
Data Citation and DOIs
 
Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont Colleges
 
Informetrics final
Informetrics finalInformetrics final
Informetrics final
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration Tools
 
Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016
 
Upgrading the Scholarly Infrastructure
Upgrading the Scholarly InfrastructureUpgrading the Scholarly Infrastructure
Upgrading the Scholarly Infrastructure
 
Survey Research Data Archive: Current Status and Challenges
Survey Research Data Archive: Current Status and ChallengesSurvey Research Data Archive: Current Status and Challenges
Survey Research Data Archive: Current Status and Challenges
 
Green, gold, uncle sam, and information literacy
Green, gold, uncle sam, and information literacyGreen, gold, uncle sam, and information literacy
Green, gold, uncle sam, and information literacy
 

Similaire à CISER & the Data Reference Interview

Ada slide presentation rsc day_feb2017_v2
Ada slide presentation rsc day_feb2017_v2Ada slide presentation rsc day_feb2017_v2
Ada slide presentation rsc day_feb2017_v2SusanMRob
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott LibraryRebekah Cummings
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive Louise Corti
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collectionsabedejesus
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...AKSHAY BHAGAT
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR
 
2013 ICPSR Data Services
2013 ICPSR Data Services2013 ICPSR Data Services
2013 ICPSR Data ServicesICPSR
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...ariadnenetwork
 
ICPSR-RCMD 2012 Presentation from HACU conference
ICPSR-RCMD 2012 Presentation from HACU conferenceICPSR-RCMD 2012 Presentation from HACU conference
ICPSR-RCMD 2012 Presentation from HACU conferenceDavid457
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceSusanna-Assunta Sansone
 
From Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipFrom Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipICPSR
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptxAkhirulAminulloh2
 

Similaire à CISER & the Data Reference Interview (20)

Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Ada slide presentation rsc day_feb2017_v2
Ada slide presentation rsc day_feb2017_v2Ada slide presentation rsc day_feb2017_v2
Ada slide presentation rsc day_feb2017_v2
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13
 
2013 ICPSR Data Services
2013 ICPSR Data Services2013 ICPSR Data Services
2013 ICPSR Data Services
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
ICPSR-RCMD 2012 Presentation from HACU conference
ICPSR-RCMD 2012 Presentation from HACU conferenceICPSR-RCMD 2012 Presentation from HACU conference
ICPSR-RCMD 2012 Presentation from HACU conference
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Qs4 group c corti
Qs4 group c cortiQs4 group c corti
Qs4 group c corti
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
 
From Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipFrom Data Sharing to Data Stewardship
From Data Sharing to Data Stewardship
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 

Plus de Historic Environment Scotland

Digital Archiving for Archaeological Units at Historic Environment Scotland
Digital Archiving for Archaeological Units at Historic Environment ScotlandDigital Archiving for Archaeological Units at Historic Environment Scotland
Digital Archiving for Archaeological Units at Historic Environment ScotlandHistoric Environment Scotland
 
Archives & Records Association summer seminar Edinburgh 7 June 2019
Archives & Records Association summer seminar   Edinburgh 7 June 2019Archives & Records Association summer seminar   Edinburgh 7 June 2019
Archives & Records Association summer seminar Edinburgh 7 June 2019Historic Environment Scotland
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software managementHistoric Environment Scotland
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policyHistoric Environment Scotland
 
Certifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case StudyCertifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case StudyHistoric Environment Scotland
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowHistoric Environment Scotland
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 

Plus de Historic Environment Scotland (20)

Digital Archiving for Archaeological Units at Historic Environment Scotland
Digital Archiving for Archaeological Units at Historic Environment ScotlandDigital Archiving for Archaeological Units at Historic Environment Scotland
Digital Archiving for Archaeological Units at Historic Environment Scotland
 
Archives & Records Association summer seminar Edinburgh 7 June 2019
Archives & Records Association summer seminar   Edinburgh 7 June 2019Archives & Records Association summer seminar   Edinburgh 7 June 2019
Archives & Records Association summer seminar Edinburgh 7 June 2019
 
Bonares presentation oct2016v2
Bonares presentation oct2016v2Bonares presentation oct2016v2
Bonares presentation oct2016v2
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software management
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policy
 
Certifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case StudyCertifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case Study
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
RDM Programme @ Edinburgh
RDM Programme @ Edinburgh RDM Programme @ Edinburgh
RDM Programme @ Edinburgh
 
RDM @ Edinburgh - Arkivum Workshop
RDM @ Edinburgh - Arkivum WorkshopRDM @ Edinburgh - Arkivum Workshop
RDM @ Edinburgh - Arkivum Workshop
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
RDM Programme@Edinburgh
RDM Programme@EdinburghRDM Programme@Edinburgh
RDM Programme@Edinburgh
 
Edinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for DataEdinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for Data
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 

Dernier

mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 

Dernier (20)

mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 

CISER & the Data Reference Interview

  • 1. Data Reference Interview Stuart Macdonald CISER Data Services Librarian Email: srm262@cornell.edu
  • 2. Data Archive - Collection and Services •Established over 30 years ago •Collection of numeric datasets to support quantitative research c. 27,000 online files in addition to thousands of studies on CD/DVD •Emphasis on demography (state/federal censuses), economics, health, labor, election studies, attitudinal and behavioral studies, family life etc.
  • 3. •Consulting services to match user needs with appropriate data •finding, accessing and using data •Current Cornell researchers can download archive files from online catalog (search & browse) in formats conversant with statistical software •Data files are identified by a ‘traffic light’ icon that indicates usage level: • Green – downloadable by anyone • Yellow – downloadable from links in the catalog with CUWebAuth authentication (for use within the CISER research computing environment - CISERRSCH) – Cornell researchers can apply for a computing account • Red – data to be used in restriction ( via conditions imposed by data provider) • Cornell Restricted Access Data Center
  • 4. •Provides Cornell social science researchers with a repository for sharing and providing long-term preservation of their numeric/statistical research data •Participates in Cornell’s Research Data Management Service Group •Assist Cornell social science researchers with Research Data Management (RDM) plans •Provide Cornell social science researchers with support and expertise in obtaining and using restricted data
  • 5. Believe it or not – not all data are the same … • Data means different things to different people (informatics, geography, art history, system biology, architecture, archaeology etc) • Definition of data / value of data in a commercial sense is different to that in an academic sense • Data requirements differ for the undergraduate, postgraduate, teacher, researcher • Data catalogs, data libraries, gateways, portals exist for a range of disciplinary domains
  • 6. Research data may include all of the following: • Text or Word documents, spreadsheets • Laboratory notebooks, field notebooks, diaries • Questionnaires, transcripts, codebooks • Audiotapes, videotapes • Photographs, films • Slides, artifacts, specimens, samples • Database contents including video, audio, text, images • Models, algorithms, scripts • Contents of an application such as input, output, log files for analysis software, simulation software, schemas • Methodologies and workflows • Standard operating procedures and protocols Formats, size, volume, open, confidentiality, complexity, flat files – factors to consider as part of the reference interview (computing capabilities, software dependencies, copyright and ethical considerations)
  • 7. Data Reference Interview - establish what the user actually needs (not what they think they may need!) : • Statistics or data? Summary statistics, secondary use datasets, raw or derived data • • • • • • • • Software requirements, contingencies What is the subject or topic? Health, unemployment, deprivation Type of analysis? Visualization, map, statistical analysis, modelling What is the unit of analysis? Individual, family, county-level, country-level Geographic constraints? Time constraints? Range of years, daily, monthly, quarterly, annual Cross-sectional or longitudinal? Data type? Historic, demographic, financial, administrative, geospatial
  • 8. Sets the goals and structure for the data interview and helps articulate any decisions made by the data librarian Establishes the ‘learning stage of the user’ and helps put them at ease Observations: Establish time-line for research and data needs (can buy data librarian time, set priorities, allow time for further investigation) Fine balance between assistance and exploitation!! Recognition that data finding, data handling etc may be the learning objective itself (e.g. identifying variables and using a codebook) All data queries should be viewed as new. It will soon become evident if the request has similarities with previous enquiries.
  • 9. Important not to use too much jargon and to double-check understanding of unfamiliar terms – often we use the same word to mean something different, conversely we can use different words but mean the same thing Sometimes users will say they understand but often don’t. If there’s any doubt ask and explain again. Supply of up-to-date user guides to hand Call Management Systems are great knowledge banks Be familiar with available expertise (colleagues, organization, national, international) Google is a friend. A very good friend.
  • 10. Two recent examples: Q. Grad student wanting # of plastic surgery clinics in Seoul, South Korea from 19902009 A. the International Society of Aesthetic Plastic Surgery (ISAPS http://www.isaps.org/ ) in particular the ISAPS International Survey on Aesthetic/Cosmetic Procedures – there’s data for 2010 and 2011 (http://www.isaps.org/isaps-global-statistics.html ). Process: Check NGO sources (World Bank, UN etc) Check Google – deep searching in to results using a variety of related terms. Time consuming but often productive. Searches often find references in literature which can be followed up or discussion forums.
  • 11. User needs statistical data about agrarian violence (originated by land disputes) variables include: food riots, assassinations (if occurred as result of land dispute), imprisonments etc unit of investigation is country-year; area of interest: Latin American countries; period: from 1960 until now, yearly Process: Not likely to available through NGO sources Try deep searching through Google – find literature sources with summary statistics about land disputes for individual countries – no time series Responded: Check Latin America Network Information Center (LANIC) at Univ. Taxas at Austin Speak with our Cornell Colleague Sean Knowlton who has expertise in Latin American statistical resources. Check CEPALSTAT - gateway to statistical information of Latin America and the Caribbean countries published by Economic Commission for Latin America and the Caribbean 11
  • 12. Social Science research data resources •Inter-University Consortium for Political and Social Research (ICPSR) •National Archive of Criminal Justice Data •Minority Data Resource Center •National Archive of Computerized Data on Aging •Roper Center for Public Opinion Archives •International Data Archives e.g. CESSDA, UKDA, Eurostat • CESSDA catalog (DDI) provides a multi-lingual interface to datasets from member social science data archives across Europe • Study description and online documentation are free •Non-Govenmental Organizations •National / Governmental Statistical Agencies
  • 13. Social science statistical data on the internet: CISER Internet Data Sources: https://ciser.cornell.edu/info/datasource.shtml MIT Data Sources: http://libguides.mit.edu/ssds/any-subject Columbia University Social Science Data http://library.columbia.edu/locations/dssc/data/socsc.html University California, San Diego – Data on the Web http://3stages.org/idata/ Most research-driven universities have similar listings via Data Library webpages
  • 14. Location & hours: CISER Data Archive is located at 391 Pine Tree Road, Ithaca CISER is open 8.30am – 4.30pm (Mon-Fri) – walk-in assistance is not always available – so appointments are recommended Contacts: Tel.: (607) 255 4801 Email: ciser@cornell.edu

Notes de l'éditeur

  1. Data, documentation and associated files (e.g. SAS, SPSS, Stata) are housed on the CISER file server. Files are downloaded from the catalog in ZIP compressed format..Cross-National Time Series data
  2. UG – more general enquiries – summary statistics rather than raw data – what they ask for is often not what they really needPG – nature of enquiry more specific, more often again, summary statistics. May be raw data as PhD progresses. Often data collection may be involved, to be used in conjunction with other sources, visualized etcTeacher – teaching datasets or sample data. Or data subsets (NGO, IGO)Researcher – Have a better idea as to what data they need, usually raw data, need to identify variables, help with codebook / questionnaire. Use of statistical analysis packages, GIS
  3. As CISER is an ICPSR member, researchers can gain access to data held in those CESSDA Archives that are themselves ICPSR membersCESSDAT member organisations adhere to a Trans-border Data Access Agreement
  4. As CISER is an ICPSR member, researchers can gain access to data held in those CESSDA Archives that are themselves ICPSR membersCESSDAT member organisations adhere to a Trans-border Data Access Agreement