SlideShare a Scribd company logo
1 of 41
Advanced Statistics for Librarians How to use and evaluate statistical information in library research ,[object Object],Caltech ,[object Object],Acquisitions Librarian ,[object Object],John McDonald
Advanced Statistics ,[object Object],[object Object],[object Object]
Research Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Research Design Steps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Research Question ,[object Object],[object Object],[object Object],[object Object]
Hypothesis ,[object Object],[object Object],[object Object],[object Object],[object Object]
Data collection ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Collection: Sampling ,[object Object],[object Object],[object Object],[object Object],[object Object]
Simple Stratified Assumes homogeneity Assumes heterogeneity Sampling Designs
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Sample size spreadsheet Calculating Sample Sizes
[object Object],[object Object],[object Object],[object Object],M&M Sampling
[object Object],[object Object],M&M Sampling
Data Definitions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Scales ,[object Object],[object Object],[object Object],[object Object],[object Object]
Name that data type! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Distributions ,[object Object],Non-normal  (skewed): extreme values with steep slopes Normal : bell shaped curve with gradual slopes
Fulltime Students at ARL Schools N=114 Mean = 22K SD = 10K
Total Salaries & Wages at ARL Libraries N=114 Mean = 10M SD = 6.5M
Variables ,[object Object],[object Object],[object Object]
Data analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Correlational Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Correlational Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Correlation
Inferential Statistics ,[object Object],[object Object]
T-Test ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
T-Test
Regression ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Regression
ANOVA ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
ANOVA
Chi Square Test ,[object Object],[object Object],[object Object],[object Object],[object Object]
Chi Square Test Pepsi Challenge Observed : Pepsi 85, Coke 57, RC 78  Expected  (equal) = 73.33 Degrees of freedom = rows - 1 = 3 - 1 =  2 Critical value of χ 2  =  5.99  at alpha = 0.05 Observed value of χ 2  =  5.8 Decision:  Fail to reject H 0 5.8  χ 2  =  219.99  220  Totals  0.3  21.81  4.67  73.33  78  RC  3.64  266.67  -16.33  73.33  57  Coke  1.86  136.19  11.67  73.33  85  Pepsi  (O-E) 2 /E (O-E) 2   O-E  E  O
Inferential Statistics ,[object Object],[object Object],OLS Regression Predict value from measured variables ,[object Object],[object Object],T-test Compare sample to a hypothetical value ,[object Object],[object Object],ANOVA Compare 3+ unmatched groups ,[object Object],[object Object],Standard two-group t-test Compare 2 paired groups ,[object Object],[object Object],Unpaired t-test Compare 2 unpaired groups ,[object Object],[object Object],Pearson correlation Quantify association between variables Non-parametric Parametric Goal
Review: Research Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Case Studies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“ Changing the Face of Instruction…” Is an online tutorial as effective in teaching library instruction as a classroom setting? H3. Students will report as much or more satisfaction with online instruction as students taking traditional instruction. Research Question Hypotheses H1. Students will have higher scores in information literacy tests after library instruction. H2. Students will have the same or higher scores in info-lit tests after taking online tutorials as students taking traditional instruction.
“ Changing the Face of Instruction…” Variables: Test scores & survey results Data Collection: Pretest/Posttest & Survey Variables &  Data Collection Statistical Tests Conclusions Accept H1:  Instruction improves literacy.  Desc Stats incl. mean, standard deviation, standard error, T-tests (1 & 2 tailed) Accept H3 alternative hypothesis – Student satisfaction is equal with both methods. Accept H2 alternative hypothesis – Online has no significant difference from traditional.
“ Do Open-Access Articles…” ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“ Do Open-Access Articles…” Do freely available articles have a greater research impact? Research impact: citation rates Open Access: freely available Research Question Hypotheses H1. Scholarly articles have a greater research impact if the articles are freely available online than if they are not. Ho: (null hypothesis): There is no difference between the mean citation rates: Ho: d1 = d0 Measures
“ Do Open-Access Articles…” Variables: Mean citation rates Data Collection: At least 50 articles from 10 leading journals in 4 disciplines.  Variables &  Data Collection Statistical Tests Conclusions Reject Ho:  Open Access articles are citation more than those that are not OA.  Desc Stats incl. mean, standard deviation, standard error, Wilcoxon sign-rank Validity?  Reliability of Measures? Generalizability? Alternate hypotheses? Discussion
My favorite statistic… Baseball is 90% mental –  the other half is physical.

More Related Content

What's hot

quantitative data analysis using spss
quantitative  data analysis using spssquantitative  data analysis using spss
quantitative data analysis using spss
Yagami7
 
Statistical Significance Testing in Information Retrieval: An Empirical Analy...
Statistical Significance Testing in Information Retrieval: An Empirical Analy...Statistical Significance Testing in Information Retrieval: An Empirical Analy...
Statistical Significance Testing in Information Retrieval: An Empirical Analy...
Julián Urbano
 

What's hot (19)

quantitative data analysis using spss
quantitative  data analysis using spssquantitative  data analysis using spss
quantitative data analysis using spss
 
Analyzing survey data
Analyzing survey dataAnalyzing survey data
Analyzing survey data
 
Parametric & non-parametric
Parametric & non-parametricParametric & non-parametric
Parametric & non-parametric
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Multivariate
MultivariateMultivariate
Multivariate
 
Thiyagu statistics
Thiyagu   statisticsThiyagu   statistics
Thiyagu statistics
 
Statistical test in spss
Statistical test in spssStatistical test in spss
Statistical test in spss
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
 
Quantitative data analysis
Quantitative data analysisQuantitative data analysis
Quantitative data analysis
 
Research Methodology: Questionnaire, Sampling, Data Preparation
Research Methodology: Questionnaire, Sampling, Data PreparationResearch Methodology: Questionnaire, Sampling, Data Preparation
Research Methodology: Questionnaire, Sampling, Data Preparation
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Non parametric study; Statistical approach for med student
Non parametric study; Statistical approach for med student Non parametric study; Statistical approach for med student
Non parametric study; Statistical approach for med student
 
Review of "Survey Research Methods & Design in Psychology"
Review of "Survey Research Methods & Design in Psychology"Review of "Survey Research Methods & Design in Psychology"
Review of "Survey Research Methods & Design in Psychology"
 
Data Analysis
Data AnalysisData Analysis
Data Analysis
 
3.1 non parametric test
3.1 non parametric test3.1 non parametric test
3.1 non parametric test
 
Commonly Used Statistics in Medical Research Part I
Commonly Used Statistics in Medical Research Part ICommonly Used Statistics in Medical Research Part I
Commonly Used Statistics in Medical Research Part I
 
Analysis of Data - Dr. K. Thiyagu
Analysis of Data - Dr. K. ThiyaguAnalysis of Data - Dr. K. Thiyagu
Analysis of Data - Dr. K. Thiyagu
 
Statistical test
Statistical testStatistical test
Statistical test
 
Statistical Significance Testing in Information Retrieval: An Empirical Analy...
Statistical Significance Testing in Information Retrieval: An Empirical Analy...Statistical Significance Testing in Information Retrieval: An Empirical Analy...
Statistical Significance Testing in Information Retrieval: An Empirical Analy...
 

Viewers also liked (6)

February 2014 Library Statistics
February 2014 Library StatisticsFebruary 2014 Library Statistics
February 2014 Library Statistics
 
Statistics for Librarians: How to Use and Evaluate Statistical Evidence
Statistics for Librarians: How to Use and Evaluate Statistical EvidenceStatistics for Librarians: How to Use and Evaluate Statistical Evidence
Statistics for Librarians: How to Use and Evaluate Statistical Evidence
 
Changes in library standards : Statistics and evaluation as mirror of library...
Changes in library standards : Statistics and evaluation as mirror of library...Changes in library standards : Statistics and evaluation as mirror of library...
Changes in library standards : Statistics and evaluation as mirror of library...
 
Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.
 
Annual Reports
Annual ReportsAnnual Reports
Annual Reports
 
Library management system
Library management systemLibrary management system
Library management system
 

Similar to Advanced statistics for librarians

Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
albertlaporte
 
Week 6 DQ1. What is your research questionIs there a differen.docx
Week 6 DQ1. What is your research questionIs there a differen.docxWeek 6 DQ1. What is your research questionIs there a differen.docx
Week 6 DQ1. What is your research questionIs there a differen.docx
cockekeshia
 
Survey and correlational research (1)
Survey and correlational research (1)Survey and correlational research (1)
Survey and correlational research (1)
zuraiberahim
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
lea6nklmattu
 

Similar to Advanced statistics for librarians (20)

Research Procedure
Research ProcedureResearch Procedure
Research Procedure
 
April Heyward Research Methods Class Session - 7-29-2021
April Heyward Research Methods Class Session - 7-29-2021April Heyward Research Methods Class Session - 7-29-2021
April Heyward Research Methods Class Session - 7-29-2021
 
April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021
 
Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
 
Day 11 t test for independent samples
Day 11 t test for independent samplesDay 11 t test for independent samples
Day 11 t test for independent samples
 
BASIC STATISTICAL TREATMENT IN RESEARCH.pptx
BASIC STATISTICAL TREATMENT IN RESEARCH.pptxBASIC STATISTICAL TREATMENT IN RESEARCH.pptx
BASIC STATISTICAL TREATMENT IN RESEARCH.pptx
 
Quantitative research
Quantitative researchQuantitative research
Quantitative research
 
Meta analysis with R
Meta analysis with RMeta analysis with R
Meta analysis with R
 
Week 6 DQ1. What is your research questionIs there a differen.docx
Week 6 DQ1. What is your research questionIs there a differen.docxWeek 6 DQ1. What is your research questionIs there a differen.docx
Week 6 DQ1. What is your research questionIs there a differen.docx
 
Quantitative Research Design.pptx
Quantitative Research Design.pptxQuantitative Research Design.pptx
Quantitative Research Design.pptx
 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSS
 
Week 7 a statistics
Week 7 a statisticsWeek 7 a statistics
Week 7 a statistics
 
statistical analysis.pptx
statistical analysis.pptxstatistical analysis.pptx
statistical analysis.pptx
 
Methodology
MethodologyMethodology
Methodology
 
statistical analysis gr12.pptx lesson in research
statistical analysis gr12.pptx lesson in researchstatistical analysis gr12.pptx lesson in research
statistical analysis gr12.pptx lesson in research
 
Experimental
ExperimentalExperimental
Experimental
 
MELJUN CORTES research designing_research_methodology
MELJUN CORTES research designing_research_methodologyMELJUN CORTES research designing_research_methodology
MELJUN CORTES research designing_research_methodology
 
Survey and correlational research (1)
Survey and correlational research (1)Survey and correlational research (1)
Survey and correlational research (1)
 
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptxLec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
 

More from John McDonald

Making the Data Work: Telling your story with Usage Statistics
Making the Data Work: Telling your story with Usage StatisticsMaking the Data Work: Telling your story with Usage Statistics
Making the Data Work: Telling your story with Usage Statistics
John McDonald
 
SerialsSolutions Visit
SerialsSolutions VisitSerialsSolutions Visit
SerialsSolutions Visit
John McDonald
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007
John McDonald
 

More from John McDonald (20)

Discovery or Displacement?: A Large Scale Longitudinal Study of the Effect of...
Discovery or Displacement?: A Large Scale Longitudinal Study of the Effect of...Discovery or Displacement?: A Large Scale Longitudinal Study of the Effect of...
Discovery or Displacement?: A Large Scale Longitudinal Study of the Effect of...
 
Springer Symposium on Scholarly Communications
Springer Symposium on Scholarly CommunicationsSpringer Symposium on Scholarly Communications
Springer Symposium on Scholarly Communications
 
Making the Data Work: Telling your story with Usage Statistics
Making the Data Work: Telling your story with Usage StatisticsMaking the Data Work: Telling your story with Usage Statistics
Making the Data Work: Telling your story with Usage Statistics
 
Transforming the Library
Transforming the LibraryTransforming the Library
Transforming the Library
 
Collaboration in Information Technology Services
Collaboration in Information Technology ServicesCollaboration in Information Technology Services
Collaboration in Information Technology Services
 
Ebook Availability Revisited: A Quantitative Analysis of the 2012 Ebook Aggre...
Ebook Availability Revisited: A Quantitative Analysis of the 2012 Ebook Aggre...Ebook Availability Revisited: A Quantitative Analysis of the 2012 Ebook Aggre...
Ebook Availability Revisited: A Quantitative Analysis of the 2012 Ebook Aggre...
 
What OCLC Data Analysis Reveals About SCELC Libraries
What OCLC Data Analysis Reveals About SCELC LibrariesWhat OCLC Data Analysis Reveals About SCELC Libraries
What OCLC Data Analysis Reveals About SCELC Libraries
 
SerialsSolutions Visit
SerialsSolutions VisitSerialsSolutions Visit
SerialsSolutions Visit
 
Communication Strategies for Pushing the Boundaries of Collaboration
Communication Strategies for Pushing the Boundaries of CollaborationCommunication Strategies for Pushing the Boundaries of Collaboration
Communication Strategies for Pushing the Boundaries of Collaboration
 
Fear Factor, Amazing Race, or Survivor: Threats & Opportunities for Libraries...
Fear Factor, Amazing Race, or Survivor: Threats & Opportunities for Libraries...Fear Factor, Amazing Race, or Survivor: Threats & Opportunities for Libraries...
Fear Factor, Amazing Race, or Survivor: Threats & Opportunities for Libraries...
 
Tipping the Cow: Reorganizing Staff to Support Electronic Resources
Tipping the Cow: Reorganizing Staff to Support Electronic ResourcesTipping the Cow: Reorganizing Staff to Support Electronic Resources
Tipping the Cow: Reorganizing Staff to Support Electronic Resources
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007
 
Size Matters: Engaging Your Users Where They Are @
Size Matters: Engaging Your Users Where They Are @Size Matters: Engaging Your Users Where They Are @
Size Matters: Engaging Your Users Where They Are @
 
Oberlin Group Library Statistics
Oberlin Group Library StatisticsOberlin Group Library Statistics
Oberlin Group Library Statistics
 
bX at Claremont
bX at ClaremontbX at Claremont
bX at Claremont
 
Sherlock: The Summon Experience at Claremont
Sherlock: The Summon Experience at ClaremontSherlock: The Summon Experience at Claremont
Sherlock: The Summon Experience at Claremont
 
Copyright 2.0: Issues for Digital Natives
Copyright 2.0: Issues for Digital NativesCopyright 2.0: Issues for Digital Natives
Copyright 2.0: Issues for Digital Natives
 
NISO Webinar on Usage Data: An Overview of Recent Usage Data Research
NISO Webinar on Usage Data: An Overview of Recent Usage Data ResearchNISO Webinar on Usage Data: An Overview of Recent Usage Data Research
NISO Webinar on Usage Data: An Overview of Recent Usage Data Research
 
Usage Factor: Final Report & Next Steps
Usage Factor: Final Report & Next StepsUsage Factor: Final Report & Next Steps
Usage Factor: Final Report & Next Steps
 
Changing the Structure of Scholarly Publishing: Open Access, Open Archives, a...
Changing the Structure of Scholarly Publishing: Open Access, Open Archives, a...Changing the Structure of Scholarly Publishing: Open Access, Open Archives, a...
Changing the Structure of Scholarly Publishing: Open Access, Open Archives, a...
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 

Advanced statistics for librarians

  • 1.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9. Simple Stratified Assumes homogeneity Assumes heterogeneity Sampling Designs
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. Fulltime Students at ARL Schools N=114 Mean = 22K SD = 10K
  • 18. Total Salaries & Wages at ARL Libraries N=114 Mean = 10M SD = 6.5M
  • 19.
  • 20.
  • 21.
  • 22.
  • 24.
  • 25.
  • 27.
  • 29.
  • 30. ANOVA
  • 31.
  • 32. Chi Square Test Pepsi Challenge Observed : Pepsi 85, Coke 57, RC 78 Expected (equal) = 73.33 Degrees of freedom = rows - 1 = 3 - 1 = 2 Critical value of χ 2 = 5.99 at alpha = 0.05 Observed value of χ 2 = 5.8 Decision: Fail to reject H 0 5.8 χ 2 = 219.99 220 Totals 0.3 21.81 4.67 73.33 78 RC 3.64 266.67 -16.33 73.33 57 Coke 1.86 136.19 11.67 73.33 85 Pepsi (O-E) 2 /E (O-E) 2 O-E E O
  • 33.
  • 34.
  • 35.
  • 36. “ Changing the Face of Instruction…” Is an online tutorial as effective in teaching library instruction as a classroom setting? H3. Students will report as much or more satisfaction with online instruction as students taking traditional instruction. Research Question Hypotheses H1. Students will have higher scores in information literacy tests after library instruction. H2. Students will have the same or higher scores in info-lit tests after taking online tutorials as students taking traditional instruction.
  • 37. “ Changing the Face of Instruction…” Variables: Test scores & survey results Data Collection: Pretest/Posttest & Survey Variables & Data Collection Statistical Tests Conclusions Accept H1: Instruction improves literacy. Desc Stats incl. mean, standard deviation, standard error, T-tests (1 & 2 tailed) Accept H3 alternative hypothesis – Student satisfaction is equal with both methods. Accept H2 alternative hypothesis – Online has no significant difference from traditional.
  • 38.
  • 39. “ Do Open-Access Articles…” Do freely available articles have a greater research impact? Research impact: citation rates Open Access: freely available Research Question Hypotheses H1. Scholarly articles have a greater research impact if the articles are freely available online than if they are not. Ho: (null hypothesis): There is no difference between the mean citation rates: Ho: d1 = d0 Measures
  • 40. “ Do Open-Access Articles…” Variables: Mean citation rates Data Collection: At least 50 articles from 10 leading journals in 4 disciplines. Variables & Data Collection Statistical Tests Conclusions Reject Ho: Open Access articles are citation more than those that are not OA. Desc Stats incl. mean, standard deviation, standard error, Wilcoxon sign-rank Validity? Reliability of Measures? Generalizability? Alternate hypotheses? Discussion
  • 41. My favorite statistic… Baseball is 90% mental – the other half is physical.

Editor's Notes

  1. Science & Electronic Resources Librarian Libraries of the Claremont Colleges
  2. Part I will be an overview of developing a research project with the aim of using statistics as a methodology in the analysis. Part II will be an overview of statistical concepts and language. Part III will be an exercise in evaluating library statistics.
  3. Three key concepts to remember when designing a research project of any kind, but especially statistical projects are Validitiy, Reliability and Generalizability. Validity is how well a variable measures a particular concept; For example – if we are measuring use, is it valid to count reshelving figures as use? Fulltext downloads? Reliability, the consistency of the variable, measurement, or test; One basic of scientific and statistical analysis is that the results can be confirmed by others repeating the experiment or data analysis. Without reliability, we would have Generalizability, means can the results be applied to other situations. For example: if you observed students using group study rooms in this library, could you generalize that use across all hours, days, buildings, user groups, or other institutions? If you can, your research can become a general model, a universal law, or immutable truth. But if you can’t, that just means that the results are applicable in common situations.
  4. Designing research includes formulating initial hypotheses, or statements about what the researcher thinks the data will show, data collection (and manipulation) through a variety of techniques, and statistical analysis that is suited to the hypotheses and data. Here are the key stages for doing research. Statistics are only a tool to help us understand the outcome of the research. Much research can be done not employing statistical techniques – most ethnographic research relies on direct observation and not on analysis of statistics. Take medicine for example: drug works, drug is safe, prescribe drug. Observational data or microscopic data may suffice. But most research relies on statistical analysis of research data, no matter how it’s collected.
  5. There are two basic designs to sample: a simple random sample and a stratified random sample, and they are pretty similar. Draw a single circle and a circle composed of other circles inside to provide visual aid. A simple random sample is what it sounds. A group of subjects is chosen from the whole population and each subject has an equal chance of being samples. If you took the campus directory and randomly selected a 100 names, that would be a simple random sample. Draw examples comparing simple and stratified. A stratified random sample is a bit more complex. It assumes that your population is composed of different types of individuals and that you want some knowledge about each group. For example, libraries often want to know how well they serve their communities and want to know something about students, faculty and staff. Are they meeting each of their needs? The solution to this problem is to divide up the population into each group and then randomly sample each group. Samples from each group are generally proportional to the size of each population.
  6. Now comes to a really fun and interactive part of the workshop. In this study, we are going to sample M&Ms and try to figure out the frequency of colors. Not only that, but we’re going to test our results against what the Mars Candy Company says should be the frequency. Lets think about our M&M packs. At the plant, the company loads millions of these little candies into a big hopper and tries to mix them so that they are randomly distributed. When they get packaged, the company wants you to get some of each color, but does not regulate the number of colored candies going into your package – some may have more blues, some may have more yellows. Each of these packages, you can consider a random sample of the large hopper or bin of M&M candies. And if we sample enough of these packages, we should start getting close to the distribution of colors at the company. Remember, we are doing samples because we don’t have enough money to count all M&Ms sold in every store.
  7. How is accuracy affected by size of sample? What would explain a difference between our observed results and M&M’s reported figures? Was our sample a good representation of the population? Is our methodology valid? Are our results generalizable?
  8. A review of the Basic Statistics for Librarians workshop. The five components were statistical concepts, evaluation of literature, sampling, an introduction to usage statistics, and designing a research study. Concepts included frequency distributions including flat (no change), normal ( a bell curve shape), and skewed (very many sloping to very few or vice versa). Mean is the average of a group and median is the middle value of a set of ordered values. A standard deviation is the measure of the dispersion or variation in a sample. For a normal distribution, 68% of the data is found within +/- 1 SD, 95% is +/- 2, and over 99% is +/- 3. Three key concepts to remember when evaluating literature are Validity, how well a variable measures the concept being studied; Reliability, the consistency of the variable, measurement, or test; and Generalizability, can the results be applied to other situations. Sampling is the act of drawing a portion of subjects to measure from a larger population. A random sample is the strongest type of sample since it is assumed to be a fair representation of the population. More complex sampling includes stratified random sampling, where a portion from each representation group of the population is taken, or convenience sampling where the sample includes a non-random sample. Sample size is important and for small populations, more subjects are needed. Usage statistics are very important in applied librarianship today but researchers need to remember to ask questions such as what is being measured, who did what is measured, why they did it, and how many of them did it. Most datasets will include outliers and missing data that can impact the statistical tests but there are many techniques for dealing with these problem data.
  9. Quiz – scale these types of data? Take a few minutes to write down what type of data these are, then we’ll go over them: Salary: ratio Author: nominal Hours: ratio Patron: nominal Publication: interval Ranked: Ordinal Tests: interval Articles: interval FTE: ratio
  10. This is a histogram of fulltime enrollments at ARL schools Fulltime students average about 22 thousand The standard deviation is about 10 thousand. QUIZ How many schools fall between 12 and 32 thousand students? Answer: 68%
  11. Lets now look at some real data from libraries to apply the concepts of mean and standard deviation This is histogram I generated from data I collected from American Research Libraries on total salaries and wages. There are 114 libraries included in this histogram Mean salary and wages at ARL libraries is about 10 million SD is about 6 and a half million
  12. How do you get a Law named after you? The key stages of statistical research, for collection & analysis, I’ve just listed a few examples and will briefly go over them.
  13. I’ll explain this in depth – how to get DF, how to do Chi-Square, etc.
  14. I’ll explain this in depth – how to get DF, how to do Chi-Square, etc.
  15. Designing research includes formulating initial hypotheses, or statements about what the researcher thinks the data will show, data collection (and manipulation) through a variety of techniques, and statistical analysis that is suited to the hypotheses and data. Here are the key stages for doing research. Statistics are only a tool to help us understand the outcome of the research. Much research can be done not employing statistical techniques – most ethnographic research relies on direct observation and not on analysis of statistics. Take medicine for example: drug works, drug is safe, prescribe drug. Observational data or microscopic data may suffice. But most research relies on statistical analysis of research data, no matter how it’s collected.
  16. I’ll state some general introduction about each types of analysis. And then introduce Nichols’ et. al study as the first example.
  17. Everyone will read the article and then we’ll go through these together, with each item coming out after someone states it.
  18. After going through this, we’ll discuss what the study did right (pretest, posttest, survey), and did wrong, including assumptions (Not stating the null hypotheses, accepting the alternate hypothesis when should have been rejected.
  19. As a group, the participants will read through this study and come up with the answers to the 5 questions, with discussion centering around the reliabilibity, validity, and generalizability, with a focus on finding out if the methods, variables, and tests fit the question.
  20. Everyone will read the article and then we’ll go through these together, with each item coming out after someone states it.
  21. After going through this, we’ll discuss what the study did right (pretest, posttest, survey), and did wrong, including assumptions (Not stating the null hypotheses, accepting the alternate hypothesis when should have been rejected.
  22. Just for fun….