SlideShare une entreprise Scribd logo
1  sur  19
Analyzing Data
 There are three kinds of lies -
    lies, damned lies and
       statistics.  
      ~Benjamin Disraeli

                            Advanced
                             Biology
                          Mrs. Morgan
Using Data
Statistics: The only science that enables
 different experts using the same figures
     to draw different conclusions.
               - Evan Esar



   After collecting data during lab
 investigations there are many ways to
        organize and analyze it.
Presenting Data
• Always present data in charts       Subject
                                                     HR                HR

  and graphs as well as in
                                                Before Exercise   After Exercise

                                        1             60               84

  words                                 2             76               80

                                        3             62               90


• Example:                              4             78               110

                                        5             70               92
  – Table 1 shows the heart rate of     6             66               92
    subjects before and after           7             70               88
    exercise. The average of
                                        8             74               80
    subjects’ heart rates shows a
    rise of 10.2 beats per minute       9             78               100


    after exercise.                     10            68               88

                                       Avg           70.2             80.4
Simple Data Analysis
Mean (average): sum of all
measurements divided by the                   Example
total # of measurements (duh…)       Data set: 2 4 5    7 10


Median: the middle number in a                 Mean
series of measurements.                (2+4+5+7+10)/5 = 5.6
                                              Median
                                          middle number = 5
Range: the difference between
                                               Range
the highest and lowest values in a
                                             10 – 2 = 8
series of measurements
More
 Analysis
The Q-Test
 – Used to determine if a data point should be left out
   of analysis calculations
 – Example: data set includes
    45, 48, 52, 43, 89, 56, 48, 47, 44, 51, 50
            (One of these things is not like the others…)


    A Q-test decides if the analysis of the data set
             should include the 89 or not
Q-Test
 Q = gap         Gap: distance between the
                 outlier and nearest data point
    range

   45, 48, 52, 43, 89, 56, 48, 47, 44, 51, 50

   Q = (89-56) = 33
                    = .717                          It helps to put
                                                  the data points in
       (89-43) = 46                                numerical order


                         So what do we do
                         with this number?
Q-
                                   Test
      Use a Q-table for the expected Q value
N-1     Q-value   N = number of data points
3        .94
                  N-1 = 10
4        .76
5        .64             If calculated Q value is greater
6        .56                 than expected Q value -
7        .51                   discard the data point
8        .47                    Qcalc = .717 > Qexp = .41
9        .44
                                  Discard point 89
10       .41
The last and most useful type of
            analysis

            The T-Test
• Determines if the averages of two sets of results
  are statistically different from each other, thus
  allowing for a confident conclusion to be made

• The chance that the results are due to
  coincidence must be below 5%
Say what?
Statistically different: t-test result is less than 0.05

What this means: if results are statistically different, there
                     is less than a 5% chance the results
                     are coincidence - therefore your
                     hypothesis is more likely to be
                     supported
         Calculate a t-test value for 2
        sets of data and compare it to .
                       05
Types of Data in a T-Test
• Tails:
  – One-tailed: experimenter has expected results (one
    group being higher/lower than another)

  – Two-tailed: experimenter only assumes a difference in
    results

• Paired/Two-Sample
  – Paired: same group used in each experiment;
    dependent (before and after)

  – Two-Sample: two separate groups; independent (men
    v.women)
T-Test Formula
In words: the mean of the first set minus the mean of the
 second set over the square root of the variance of each
  group divided by the number of results in each group.




                                                That’s a crap
                                                load of math
                                                 – we’ll use
                                                 PowerPoint
Using Microsoft Excel

Open the program and
create a new workbook.

Under “View” choose to
see the “Formula Builder”
T-Test using Microsoft Excel


  Type your data in,
  using one column
  for each group of
       results:
T-Test using Microsoft Excel
• Find the average for each set of data:
   – Select the group of data
   – Click on the equal (=) sign at the top of the
     screen

   – A window unfolds that looks like this:
T-Test using Microsoft Excel
• Select “average” from the pull-down menu,
  and a screen appears:
T-Test using Microsoft Excel
 • To take a t-test, choose an
   empty cell and enter a “=“
   which will bring up the
   formula builder.

 • If “TTEST” isn’t on the list
   of functions, search for it at
   the top of the builder.

 • Double click on “TTEST”
T-Test using Microsoft Excel
      Fill in the required data:
• Each of the categories are described

• Array = group of data
  (highlight the column to select group –
  don’t include any headings)


• Tails = one or two tailed (1 or 2)

• Type = paired or two-sample (1 or 2)
                                            And the answer just
                                                appears…
Tips for a Better T-Test
• The more results you have, the better and more
  accurate the results.

• If you have several sets of results, perform
  t-tests for all of them versus each other.

• The columns of data can also be used to
  generate graphs if the lab calls for it.
Works Cited
• http://trochim.human.cornell.edu/kb/stat_t.htm
• http://davidmlane.com/hyperstat/A29337.html

Contenu connexe

Tendances

www1.cs.columbia.edu
www1.cs.columbia.eduwww1.cs.columbia.edu
www1.cs.columbia.edu
butest
 

Tendances (20)

Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
 
www1.cs.columbia.edu
www1.cs.columbia.eduwww1.cs.columbia.edu
www1.cs.columbia.edu
 
Week 6 lecture_math_221_apr_2012
Week 6 lecture_math_221_apr_2012Week 6 lecture_math_221_apr_2012
Week 6 lecture_math_221_apr_2012
 
Two-sample Hypothesis Tests
Two-sample Hypothesis Tests Two-sample Hypothesis Tests
Two-sample Hypothesis Tests
 
Measures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsMeasures of Relative Standing and Boxplots
Measures of Relative Standing and Boxplots
 
PG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data AnalysisPG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data Analysis
 
Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...
 
Conistency of random forests
Conistency of random forestsConistency of random forests
Conistency of random forests
 
Lesson 6 measures of central tendency
Lesson 6 measures of central tendencyLesson 6 measures of central tendency
Lesson 6 measures of central tendency
 
CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"
 
Measure of central tendency
Measure of central tendencyMeasure of central tendency
Measure of central tendency
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Classification techniques in data mining
Classification techniques in data miningClassification techniques in data mining
Classification techniques in data mining
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision tree
 
Eda sri
Eda sriEda sri
Eda sri
 
Krupa rm
Krupa rmKrupa rm
Krupa rm
 
evaluation and credibility-Part 2
evaluation and credibility-Part 2evaluation and credibility-Part 2
evaluation and credibility-Part 2
 
2.4 rule based classification
2.4 rule based classification2.4 rule based classification
2.4 rule based classification
 
Stat 3203 -multphase sampling
Stat 3203 -multphase samplingStat 3203 -multphase sampling
Stat 3203 -multphase sampling
 
Chapter 4 Classification
Chapter 4 ClassificationChapter 4 Classification
Chapter 4 Classification
 

Similaire à Statistics Notes

Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
Yee Bee Choo
 
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhgjhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
UMAIRASHFAQ20
 

Similaire à Statistics Notes (20)

Factorial Experiments
Factorial ExperimentsFactorial Experiments
Factorial Experiments
 
T test
T testT test
T test
 
Inferential statistics nominal data
Inferential statistics   nominal dataInferential statistics   nominal data
Inferential statistics nominal data
 
Spss basic Dr Marwa Zalat
Spss basic Dr Marwa ZalatSpss basic Dr Marwa Zalat
Spss basic Dr Marwa Zalat
 
T test statistics
T test statisticsT test statistics
T test statistics
 
Test of significance
Test of significanceTest of significance
Test of significance
 
MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY  MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY
 
Designing Test Collections for Comparing Many Systems
Designing Test Collections for Comparing Many SystemsDesigning Test Collections for Comparing Many Systems
Designing Test Collections for Comparing Many Systems
 
univariate and bivariate analysis in spss
univariate and bivariate analysis in spss univariate and bivariate analysis in spss
univariate and bivariate analysis in spss
 
Day 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdfDay 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdf
 
Unit 2 - Statistics
Unit 2 - StatisticsUnit 2 - Statistics
Unit 2 - Statistics
 
Non parametric-tests
Non parametric-testsNon parametric-tests
Non parametric-tests
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
 
Week 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model EvaluationWeek 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model Evaluation
 
analytical representation of data
 analytical representation of data analytical representation of data
analytical representation of data
 
evaluation and credibility-Part 1
evaluation and credibility-Part 1evaluation and credibility-Part 1
evaluation and credibility-Part 1
 
data analysis in research.pptx
data analysis in research.pptxdata analysis in research.pptx
data analysis in research.pptx
 
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhgjhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
 
SP and R.pptx
SP and R.pptxSP and R.pptx
SP and R.pptx
 
central tendency
central tendency central tendency
central tendency
 

Plus de Leah Morgan (20)

Unit 1 jeopardy game
Unit 1 jeopardy gameUnit 1 jeopardy game
Unit 1 jeopardy game
 
Imaging powerpoint
Imaging powerpointImaging powerpoint
Imaging powerpoint
 
Anatomical positioning notes
Anatomical positioning notesAnatomical positioning notes
Anatomical positioning notes
 
History of forensics 2012
History of forensics 2012History of forensics 2012
History of forensics 2012
 
Biology Exam Student Preparation Booklet
Biology Exam Student Preparation BookletBiology Exam Student Preparation Booklet
Biology Exam Student Preparation Booklet
 
Ch 16 lab info
Ch 16 lab infoCh 16 lab info
Ch 16 lab info
 
Ch 16 due dates
Ch 16 due datesCh 16 due dates
Ch 16 due dates
 
Fs ch 16 documents
Fs ch 16 documentsFs ch 16 documents
Fs ch 16 documents
 
Karyotype notes
Karyotype notesKaryotype notes
Karyotype notes
 
Genetics review sheet
Genetics review sheetGenetics review sheet
Genetics review sheet
 
CHapter 11 Notes - Blood Analysis
CHapter 11 Notes - Blood AnalysisCHapter 11 Notes - Blood Analysis
CHapter 11 Notes - Blood Analysis
 
Chapter 13 - Process of Death
Chapter 13 - Process of DeathChapter 13 - Process of Death
Chapter 13 - Process of Death
 
Cell bio review sheet
Cell bio review sheetCell bio review sheet
Cell bio review sheet
 
forensics ch 5 & 6 notes
forensics ch 5 & 6 notesforensics ch 5 & 6 notes
forensics ch 5 & 6 notes
 
Forensics ch 4 notes
Forensics ch 4 notesForensics ch 4 notes
Forensics ch 4 notes
 
sketching/photo lab sheet
sketching/photo lab sheetsketching/photo lab sheet
sketching/photo lab sheet
 
Forensics ch 3 notes
Forensics ch 3 notesForensics ch 3 notes
Forensics ch 3 notes
 
Morgan History part 2
Morgan History part 2Morgan History part 2
Morgan History part 2
 
Morgan History part 1
Morgan History part 1Morgan History part 1
Morgan History part 1
 
Forensics Ch 2 notes
Forensics Ch 2 notesForensics Ch 2 notes
Forensics Ch 2 notes
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Statistics Notes

  • 1. Analyzing Data There are three kinds of lies - lies, damned lies and statistics.   ~Benjamin Disraeli Advanced Biology Mrs. Morgan
  • 2. Using Data Statistics: The only science that enables different experts using the same figures to draw different conclusions. - Evan Esar After collecting data during lab investigations there are many ways to organize and analyze it.
  • 3. Presenting Data • Always present data in charts Subject HR HR and graphs as well as in Before Exercise After Exercise 1 60 84 words 2 76 80 3 62 90 • Example: 4 78 110 5 70 92 – Table 1 shows the heart rate of 6 66 92 subjects before and after 7 70 88 exercise. The average of 8 74 80 subjects’ heart rates shows a rise of 10.2 beats per minute 9 78 100 after exercise. 10 68 88 Avg 70.2 80.4
  • 4. Simple Data Analysis Mean (average): sum of all measurements divided by the Example total # of measurements (duh…) Data set: 2 4 5 7 10 Median: the middle number in a Mean series of measurements. (2+4+5+7+10)/5 = 5.6 Median middle number = 5 Range: the difference between Range the highest and lowest values in a 10 – 2 = 8 series of measurements
  • 5. More Analysis The Q-Test – Used to determine if a data point should be left out of analysis calculations – Example: data set includes 45, 48, 52, 43, 89, 56, 48, 47, 44, 51, 50 (One of these things is not like the others…) A Q-test decides if the analysis of the data set should include the 89 or not
  • 6. Q-Test Q = gap Gap: distance between the outlier and nearest data point range 45, 48, 52, 43, 89, 56, 48, 47, 44, 51, 50 Q = (89-56) = 33 = .717 It helps to put the data points in (89-43) = 46 numerical order So what do we do with this number?
  • 7. Q- Test Use a Q-table for the expected Q value N-1 Q-value N = number of data points 3 .94 N-1 = 10 4 .76 5 .64 If calculated Q value is greater 6 .56 than expected Q value - 7 .51 discard the data point 8 .47 Qcalc = .717 > Qexp = .41 9 .44 Discard point 89 10 .41
  • 8. The last and most useful type of analysis The T-Test • Determines if the averages of two sets of results are statistically different from each other, thus allowing for a confident conclusion to be made • The chance that the results are due to coincidence must be below 5%
  • 9. Say what? Statistically different: t-test result is less than 0.05 What this means: if results are statistically different, there is less than a 5% chance the results are coincidence - therefore your hypothesis is more likely to be supported Calculate a t-test value for 2 sets of data and compare it to . 05
  • 10. Types of Data in a T-Test • Tails: – One-tailed: experimenter has expected results (one group being higher/lower than another) – Two-tailed: experimenter only assumes a difference in results • Paired/Two-Sample – Paired: same group used in each experiment; dependent (before and after) – Two-Sample: two separate groups; independent (men v.women)
  • 11. T-Test Formula In words: the mean of the first set minus the mean of the second set over the square root of the variance of each group divided by the number of results in each group. That’s a crap load of math – we’ll use PowerPoint
  • 12. Using Microsoft Excel Open the program and create a new workbook. Under “View” choose to see the “Formula Builder”
  • 13. T-Test using Microsoft Excel Type your data in, using one column for each group of results:
  • 14. T-Test using Microsoft Excel • Find the average for each set of data: – Select the group of data – Click on the equal (=) sign at the top of the screen – A window unfolds that looks like this:
  • 15. T-Test using Microsoft Excel • Select “average” from the pull-down menu, and a screen appears:
  • 16. T-Test using Microsoft Excel • To take a t-test, choose an empty cell and enter a “=“ which will bring up the formula builder. • If “TTEST” isn’t on the list of functions, search for it at the top of the builder. • Double click on “TTEST”
  • 17. T-Test using Microsoft Excel Fill in the required data: • Each of the categories are described • Array = group of data (highlight the column to select group – don’t include any headings) • Tails = one or two tailed (1 or 2) • Type = paired or two-sample (1 or 2) And the answer just appears…
  • 18. Tips for a Better T-Test • The more results you have, the better and more accurate the results. • If you have several sets of results, perform t-tests for all of them versus each other. • The columns of data can also be used to generate graphs if the lab calls for it.
  • 19. Works Cited • http://trochim.human.cornell.edu/kb/stat_t.htm • http://davidmlane.com/hyperstat/A29337.html