Advanced statistics for librarians

Advanced Statistics for Librarians How to use and evaluate statistical information in library research ,[object Object],Caltech ,[object Object],Acquisitions Librarian ,[object Object],John McDonald

Advanced Statistics ,[object Object],[object Object],[object Object]

Research Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Research Design Steps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Research Question ,[object Object],[object Object],[object Object],[object Object]

Hypothesis ,[object Object],[object Object],[object Object],[object Object],[object Object]

Data collection ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Data Collection: Sampling ,[object Object],[object Object],[object Object],[object Object],[object Object]

Simple Stratified Assumes homogeneity Assumes heterogeneity Sampling Designs

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Sample size spreadsheet Calculating Sample Sizes

[object Object],[object Object],[object Object],[object Object],M&M Sampling

[object Object],[object Object],M&M Sampling

Data Definitions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Data Scales ,[object Object],[object Object],[object Object],[object Object],[object Object]

Name that data type! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Data Distributions ,[object Object],Non-normal (skewed): extreme values with steep slopes Normal : bell shaped curve with gradual slopes

Fulltime Students at ARL Schools N=114 Mean = 22K SD = 10K

Total Salaries & Wages at ARL Libraries N=114 Mean = 10M SD = 6.5M

Variables ,[object Object],[object Object],[object Object]

Data analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Correlational Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Correlational Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Inferential Statistics ,[object Object],[object Object]

T-Test ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Regression ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

ANOVA ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Chi Square Test ,[object Object],[object Object],[object Object],[object Object],[object Object]

Chi Square Test Pepsi Challenge Observed : Pepsi 85, Coke 57, RC 78 Expected (equal) = 73.33 Degrees of freedom = rows - 1 = 3 - 1 = 2 Critical value of χ 2 = 5.99 at alpha = 0.05 Observed value of χ 2 = 5.8 Decision: Fail to reject H 0 5.8 χ 2 = 219.99 220 Totals 0.3 21.81 4.67 73.33 78 RC 3.64 266.67 -16.33 73.33 57 Coke 1.86 136.19 11.67 73.33 85 Pepsi (O-E) 2 /E (O-E) 2 O-E E O

Inferential Statistics ,[object Object],[object Object],OLS Regression Predict value from measured variables ,[object Object],[object Object],T-test Compare sample to a hypothetical value ,[object Object],[object Object],ANOVA Compare 3+ unmatched groups ,[object Object],[object Object],Standard two-group t-test Compare 2 paired groups ,[object Object],[object Object],Unpaired t-test Compare 2 unpaired groups ,[object Object],[object Object],Pearson correlation Quantify association between variables Non-parametric Parametric Goal

Review: Research Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Case Studies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

“ Changing the Face of Instruction…” Is an online tutorial as effective in teaching library instruction as a classroom setting? H3. Students will report as much or more satisfaction with online instruction as students taking traditional instruction. Research Question Hypotheses H1. Students will have higher scores in information literacy tests after library instruction. H2. Students will have the same or higher scores in info-lit tests after taking online tutorials as students taking traditional instruction.

“ Changing the Face of Instruction…” Variables: Test scores & survey results Data Collection: Pretest/Posttest & Survey Variables & Data Collection Statistical Tests Conclusions Accept H1: Instruction improves literacy. Desc Stats incl. mean, standard deviation, standard error, T-tests (1 & 2 tailed) Accept H3 alternative hypothesis – Student satisfaction is equal with both methods. Accept H2 alternative hypothesis – Online has no significant difference from traditional.

“ Do Open-Access Articles…” ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

“ Do Open-Access Articles…” Do freely available articles have a greater research impact? Research impact: citation rates Open Access: freely available Research Question Hypotheses H1. Scholarly articles have a greater research impact if the articles are freely available online than if they are not. Ho: (null hypothesis): There is no difference between the mean citation rates: Ho: d1 = d0 Measures

“ Do Open-Access Articles…” Variables: Mean citation rates Data Collection: At least 50 articles from 10 leading journals in 4 disciplines. Variables & Data Collection Statistical Tests Conclusions Reject Ho: Open Access articles are citation more than those that are not OA. Desc Stats incl. mean, standard deviation, standard error, Wilcoxon sign-rank Validity? Reliability of Measures? Generalizability? Alternate hypotheses? Discussion

My favorite statistic… Baseball is 90% mental – the other half is physical.

Advanced statistics for librarians

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Viewers also liked

Viewers also liked (6)

Similar to Advanced statistics for librarians

Similar to Advanced statistics for librarians (20)

More from John McDonald

More from John McDonald (20)

Recently uploaded

Recently uploaded (20)

Advanced statistics for librarians

Editor's Notes