SlideShare une entreprise Scribd logo
1  sur  36
@kaimrc_edu




       1
DATA ENTRY USING IBM SPSS


        Yusuf O.B.
      Biostatistician
       KAIMRC-WR
Lecture Outline
•   What is SPSS
•   Uses of SPSS
•   Preparing to enter data
•   Preparing a data dictionary
•   Data Structures
•   Errors in data
•   Data Cleaning
What is SPSS

Statistical Package for the Social Sciences
Uses of SPSS

•   Data entry
•   Data cleaning and editing
•   Data analysis
•   Data presentation
•   Data Importing and Exporting
•   SPSS Data Library
Preparing a Data Dictionary
• What is a data dictionary?
  – A book/document containing all variables and
    the codes/categories assigned to them
  – Also contains how the variables will be
    entered and other remarks necessary
  – Specifies width/ length of variables
  – Specifies how missing values will be
    assigned
Data coding
• Translation of responses on the
  questionnaires or data collection sheets to
  specific categories for the purpose of
  analysis.
• Assignment of numbers to the various
  levels of the variables.
• Load of work light for pre-coded
  questionnaires
• Important and tedious for open ended
  questions.
• Need to assign numerical codes to
  categorical data before entering
• For example, you may choose to assign
  codes of 1, 2, 3 and 4 to categories of “no
  pain”, „mild pain”, “moderate pain” and
  “severe pain” respectively
• These codes can be put in the
  questionnaire when collecting the data.
• For binary data e.g. yes/no answers, it is
  often convenient to assign codes 1 (e.g.
  for yes) and 0 or 2 (for no).
NEED FOR CODING GUIDE/Data
         Dictionary
• Prepare data in format to allow use of
  computers for statistical analysis.
• Prepare code book or data dictionary for
  the questionnaire.
• Specify range of values expected.
• Unit of measurement should be consistent
  for all observations on a variable. E.g.
  weight should be recorded in kg or in
  pounds , but not both interchangeably
• Time: days? Hours?
  – For example length of hospital stay
Example of data dictionary
Variable    Variable label Value labels   width   Remark
Name
1. Age      AGE            ---------      2       Missing=99
2. Sex      SEX            1=male         1       -----
                           2=female


3. Do you   SMOKE          1=YES          1       Missing=9
smoke?                     2=NO
Example
• Topic: Smoking among medical students
• 200 questionnaires/records
• 6 questions/variables
Variables in questionnaire
• Serial Number
• Age
• Place of residence
   1=on campus, 2=off campus
• Sex
   1=male, 2= female
• Do you smoke?
   1=yes 2= No
• At what age did you start to
  smoke? -------
Exercise

– Complete the data dictionary above
SPSS windows
• Data editor
  – For data entry
  – For statistical analysis
• Viewer
  – Results are displayed
Data Editor
• Two views
  – Data view: for data entry
  – Variable view: to define variable
    characteristics
Preparing Data Structures in
            SPSS
• Variable views
  – Variable names
  – Variable types
  – Value labels
  – Variable width
  – Column
  – Measure
Data Entry
• Use of computer packages such as SPSS
  – Improves the accuracy and speed of data analysis
     • Makes it easy to check for errors, produces graphical
       summaries and generates new variables
  - Log in data as it arrives
• Frequent backing-up
• Problems with dates and times:
  – dates and times should be entered in a
    consistent manner, e.g. as day/month/year or
    month/day/year but not interchangeably.
  – It is important to find out what format the
    statistical package can read
Handling missing data

• Consider what to do with missing values before data is
  entered.
• In most cases, need to use some symbol to represent a
  missing value
• Statistical packages deal with missing values in different
  ways
• Some use special characters (e.g. a full stop or asterisk)
  to indicate missing values, whereas others require you to
  define your own code for a missing value (commonly
  used values are 9, 99 or 999)
• The value that is chosen should be one that is not
  possible for that variable
• For example when entering a categorical
  variable with four categories (coded 1,2,3,
  and 4), you may choose the value 9 to
  represent missing values.
• However, if the variable is age of a child,
  then a different code should be chosen.
• If a large proportion of data is missing, then the results
  are likely to be unreliable

• Reasons why data are missing should always be
  investigated: how much is missing and why?

• If missing data tend to cluster around a particular
  variable, or in a particular sub group of individuals, then
  it ,may indicate that the variable is not applicable or has
  not been measured for that group of individuals
• Then the group of individuals should be
  excluded from any analysis on that
  variable
• Or it may be that the data is simply sitting
  on a piece of paper in someone‟s drawer
  and are yet to be entered!
Errors in data
• In any study, there is always the potential
  for errors to occur in a data set, either at
  the outset when taking the measurements,
  or when collecting, and entering data onto
  a computer

• It is hard to eliminate all of these errors

• But one can reduce the number of typing
  errors by checking the data carefully once
  they have been entered.
Common sources of error
• „not applicable‟ or „blank‟ coded as “0”
• typing errors on data entry- 18 INSTEAD
  of 81
• column shift- data for one variable column
  was entered under the adjacent column
• coding errors
• Loss of concentration
Data cleaning
Two-step process-
• detection and
• correction of errors
.
Detecting errors
- Check for completeness and
  correctness of records.
     - Indicate admissible values during
       data entry
     - Range checks-permissible responses.

    - Statistical editing
How to Detect Errors via
        Statistical editing
• Produce descriptive statistics for all
  variables.
• Check frequency distribution of each
  variable 1=male, 2=female, 3?
• Standard deviation higher than mean;
  check for outlying observation
Quality Control

- Record verification (double entry)
  - Does not rule out the possibility that the same error
    has been incorrectly entered on the two occasions
  - Disadvantage of this approach is that it takes twice as
    long to enter the data, which may have major cost or
    time implications
    - Creating check files
- Random checking: selection at random but
  should represent all forms being entered
Error checking
• Categorical Data: relatively easy, values not allowable
  must be errors
• Check frequency distribution of each variable 1=male,
  2=female, 3?

• Numerical data: Produce descriptive statistics for all
  variables.
• Standard deviation higher than mean; check for outlying
  observation

• range checks, upper and lower limits can be specified for
  each variable
• Dates: not easy to check accuracy of dates, for
  example 30th feb. must be incorrect, any day of
  the month greater than 31, any month greater
  than 12
• Apply logical checks:
  – date of birth should correspond to patient‟s age
  – subjects should usually have been born before
    entering the study( at least in most studies)
  – patients who have died should not appear on
    subsequent follow up visits
  – there should be no pregnant men
• With all error checks, a value should only
  be corrected if there is evidence that a
  mistake has been made
• Do not change values simply because
  they look unusual; investigate
Summary
• Experience comes with practice
• Input influences output

Contenu connexe

Tendances

Creating a Histogram in SPSS
Creating a Histogram in SPSSCreating a Histogram in SPSS
Creating a Histogram in SPSS
florentinoz
 
SPSS an intro...
SPSS an intro...SPSS an intro...
SPSS an intro...
Jithin Zcs
 
Introduction To SPSS
Introduction To SPSSIntroduction To SPSS
Introduction To SPSS
Phi Jack
 
Final spss hands on training (descriptive analysis) may 24th 2013
Final spss  hands on training (descriptive analysis) may 24th 2013Final spss  hands on training (descriptive analysis) may 24th 2013
Final spss hands on training (descriptive analysis) may 24th 2013
Tin Myo Han
 
Evaluation Spss
Evaluation SpssEvaluation Spss
Evaluation Spss
jackng
 

Tendances (20)

Data processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overviewData processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overview
 
"A basic guide to SPSS"
"A basic guide to SPSS""A basic guide to SPSS"
"A basic guide to SPSS"
 
Statistical Procedures using SPSSi
Statistical Procedures using SPSSiStatistical Procedures using SPSSi
Statistical Procedures using SPSSi
 
Spss
SpssSpss
Spss
 
An Introduction to SPSS
An Introduction to SPSSAn Introduction to SPSS
An Introduction to SPSS
 
Creating a Histogram in SPSS
Creating a Histogram in SPSSCreating a Histogram in SPSS
Creating a Histogram in SPSS
 
Spss
SpssSpss
Spss
 
Applied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSSApplied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSS
 
SPSS an intro...
SPSS an intro...SPSS an intro...
SPSS an intro...
 
Application of spss usha (1)
Application of spss usha (1)Application of spss usha (1)
Application of spss usha (1)
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate Level
 
Introduction To SPSS
Introduction To SPSSIntroduction To SPSS
Introduction To SPSS
 
Final spss hands on training (descriptive analysis) may 24th 2013
Final spss  hands on training (descriptive analysis) may 24th 2013Final spss  hands on training (descriptive analysis) may 24th 2013
Final spss hands on training (descriptive analysis) may 24th 2013
 
Data management through spss
Data management through spssData management through spss
Data management through spss
 
Evaluation Spss
Evaluation SpssEvaluation Spss
Evaluation Spss
 
SPSS
SPSSSPSS
SPSS
 
(Manual spss)
(Manual spss)(Manual spss)
(Manual spss)
 
Spss tutorial 1
Spss tutorial 1Spss tutorial 1
Spss tutorial 1
 
Introduction to spss 2
Introduction to spss 2Introduction to spss 2
Introduction to spss 2
 
Spss
SpssSpss
Spss
 

En vedette

Data and Statistics library research at UCSD
Data and Statistics library research at UCSDData and Statistics library research at UCSD
Data and Statistics library research at UCSD
Annelise Sklar
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
madan kumar
 
Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
albertlaporte
 

En vedette (7)

PrePARe: Store It Safely
PrePARe: Store It SafelyPrePARe: Store It Safely
PrePARe: Store It Safely
 
Data and Statistics library research at UCSD
Data and Statistics library research at UCSDData and Statistics library research at UCSD
Data and Statistics library research at UCSD
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
We Are Social's Digital Statshot 002
We Are Social's Digital Statshot 002We Are Social's Digital Statshot 002
We Are Social's Digital Statshot 002
 
Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017
 

Similaire à RSS 2012 Data Entry SPSS

Basics of Data Analysis
Basics of Data AnalysisBasics of Data Analysis
Basics of Data Analysis
ankurjain1909
 

Similaire à RSS 2012 Data Entry SPSS (20)

Data Collection Preparation
Data Collection PreparationData Collection Preparation
Data Collection Preparation
 
APSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxAPSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptx
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 
Data analysis using spss
Data analysis using spssData analysis using spss
Data analysis using spss
 
Research 101: Quantitative Data Preparation
Research 101: Quantitative Data PreparationResearch 101: Quantitative Data Preparation
Research 101: Quantitative Data Preparation
 
Introduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersIntroduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse Researchers
 
Spss basic Dr Marwa Zalat
Spss basic Dr Marwa ZalatSpss basic Dr Marwa Zalat
Spss basic Dr Marwa Zalat
 
Principles of data collection.pptx
Principles of data collection.pptxPrinciples of data collection.pptx
Principles of data collection.pptx
 
Basics of Data Analysis
Basics of Data AnalysisBasics of Data Analysis
Basics of Data Analysis
 
1. chapter i(pasw)
1. chapter i(pasw)1. chapter i(pasw)
1. chapter i(pasw)
 
5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptx5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptx
 
4 Statistical Software.pptx
4 Statistical Software.pptx4 Statistical Software.pptx
4 Statistical Software.pptx
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitization
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningBrief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
 
analysis of data.pptx
analysis of data.pptxanalysis of data.pptx
analysis of data.pptx
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 

Plus de Wesam Abuznadah

Plus de Wesam Abuznadah (14)

RSS 2012 Preparing & Submitting the Manuscript
RSS 2012 Preparing & Submitting the ManuscriptRSS 2012 Preparing & Submitting the Manuscript
RSS 2012 Preparing & Submitting the Manuscript
 
RSS 2012 Interview Techniques
RSS 2012 Interview TechniquesRSS 2012 Interview Techniques
RSS 2012 Interview Techniques
 
RSS 2012 Sampling Techniques
RSS 2012 Sampling TechniquesRSS 2012 Sampling Techniques
RSS 2012 Sampling Techniques
 
RSS 2012 How to Write a Health Survey
RSS 2012 How to Write a Health SurveyRSS 2012 How to Write a Health Survey
RSS 2012 How to Write a Health Survey
 
Referencing Guide APA Style --Curtin-Handout
Referencing Guide APA Style --Curtin-HandoutReferencing Guide APA Style --Curtin-Handout
Referencing Guide APA Style --Curtin-Handout
 
Referencing Guide Vancouver Style
Referencing Guide Vancouver StyleReferencing Guide Vancouver Style
Referencing Guide Vancouver Style
 
RSS 2012 Introduction to Referencing
RSS 2012 Introduction to ReferencingRSS 2012 Introduction to Referencing
RSS 2012 Introduction to Referencing
 
RSS 2012 Study designs
RSS 2012 Study designsRSS 2012 Study designs
RSS 2012 Study designs
 
RSS 2012 Email Etiquette
RSS 2012 Email EtiquetteRSS 2012 Email Etiquette
RSS 2012 Email Etiquette
 
RSS 2012 Literature Reviews
RSS 2012 Literature ReviewsRSS 2012 Literature Reviews
RSS 2012 Literature Reviews
 
RSS 2012 Literature Searches
RSS 2012 Literature SearchesRSS 2012 Literature Searches
RSS 2012 Literature Searches
 
RSS 2012 Developing Research Idea and Question
RSS 2012 Developing Research Idea and QuestionRSS 2012 Developing Research Idea and Question
RSS 2012 Developing Research Idea and Question
 
RSS 2012 Introduction: A Student Perspective
RSS 2012 Introduction: A Student PerspectiveRSS 2012 Introduction: A Student Perspective
RSS 2012 Introduction: A Student Perspective
 
Introduction to Research Research Summer School RSS 2012
Introduction to Research Research Summer School RSS 2012Introduction to Research Research Summer School RSS 2012
Introduction to Research Research Summer School RSS 2012
 

Dernier

Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
chetankumar9855
 
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
mahaiklolahd
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
adilkhan87451
 

Dernier (20)

All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
Top Rated Bangalore Call Girls Majestic ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Majestic ⟟  9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Majestic ⟟  9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Majestic ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
 
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
 
Call Girls Shimla Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Shimla Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Shimla Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Shimla Just Call 8617370543 Top Class Call Girl Service Available
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Call Girls Kakinada Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kakinada Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kakinada Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kakinada Just Call 9907093804 Top Class Call Girl Service Available
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
 

RSS 2012 Data Entry SPSS

  • 2. DATA ENTRY USING IBM SPSS Yusuf O.B. Biostatistician KAIMRC-WR
  • 3. Lecture Outline • What is SPSS • Uses of SPSS • Preparing to enter data • Preparing a data dictionary • Data Structures • Errors in data • Data Cleaning
  • 4. What is SPSS Statistical Package for the Social Sciences
  • 5. Uses of SPSS • Data entry • Data cleaning and editing • Data analysis • Data presentation • Data Importing and Exporting • SPSS Data Library
  • 6. Preparing a Data Dictionary • What is a data dictionary? – A book/document containing all variables and the codes/categories assigned to them – Also contains how the variables will be entered and other remarks necessary – Specifies width/ length of variables – Specifies how missing values will be assigned
  • 7. Data coding • Translation of responses on the questionnaires or data collection sheets to specific categories for the purpose of analysis. • Assignment of numbers to the various levels of the variables. • Load of work light for pre-coded questionnaires
  • 8. • Important and tedious for open ended questions.
  • 9. • Need to assign numerical codes to categorical data before entering • For example, you may choose to assign codes of 1, 2, 3 and 4 to categories of “no pain”, „mild pain”, “moderate pain” and “severe pain” respectively
  • 10. • These codes can be put in the questionnaire when collecting the data. • For binary data e.g. yes/no answers, it is often convenient to assign codes 1 (e.g. for yes) and 0 or 2 (for no).
  • 11. NEED FOR CODING GUIDE/Data Dictionary • Prepare data in format to allow use of computers for statistical analysis. • Prepare code book or data dictionary for the questionnaire. • Specify range of values expected.
  • 12. • Unit of measurement should be consistent for all observations on a variable. E.g. weight should be recorded in kg or in pounds , but not both interchangeably • Time: days? Hours? – For example length of hospital stay
  • 13. Example of data dictionary Variable Variable label Value labels width Remark Name 1. Age AGE --------- 2 Missing=99 2. Sex SEX 1=male 1 ----- 2=female 3. Do you SMOKE 1=YES 1 Missing=9 smoke? 2=NO
  • 14. Example • Topic: Smoking among medical students • 200 questionnaires/records • 6 questions/variables
  • 15. Variables in questionnaire • Serial Number • Age • Place of residence 1=on campus, 2=off campus • Sex 1=male, 2= female • Do you smoke? 1=yes 2= No
  • 16. • At what age did you start to smoke? -------
  • 17. Exercise – Complete the data dictionary above
  • 18. SPSS windows • Data editor – For data entry – For statistical analysis • Viewer – Results are displayed
  • 19. Data Editor • Two views – Data view: for data entry – Variable view: to define variable characteristics
  • 20. Preparing Data Structures in SPSS • Variable views – Variable names – Variable types – Value labels – Variable width – Column – Measure
  • 21. Data Entry • Use of computer packages such as SPSS – Improves the accuracy and speed of data analysis • Makes it easy to check for errors, produces graphical summaries and generates new variables - Log in data as it arrives • Frequent backing-up
  • 22. • Problems with dates and times: – dates and times should be entered in a consistent manner, e.g. as day/month/year or month/day/year but not interchangeably. – It is important to find out what format the statistical package can read
  • 23. Handling missing data • Consider what to do with missing values before data is entered. • In most cases, need to use some symbol to represent a missing value • Statistical packages deal with missing values in different ways • Some use special characters (e.g. a full stop or asterisk) to indicate missing values, whereas others require you to define your own code for a missing value (commonly used values are 9, 99 or 999) • The value that is chosen should be one that is not possible for that variable
  • 24. • For example when entering a categorical variable with four categories (coded 1,2,3, and 4), you may choose the value 9 to represent missing values. • However, if the variable is age of a child, then a different code should be chosen.
  • 25. • If a large proportion of data is missing, then the results are likely to be unreliable • Reasons why data are missing should always be investigated: how much is missing and why? • If missing data tend to cluster around a particular variable, or in a particular sub group of individuals, then it ,may indicate that the variable is not applicable or has not been measured for that group of individuals
  • 26. • Then the group of individuals should be excluded from any analysis on that variable • Or it may be that the data is simply sitting on a piece of paper in someone‟s drawer and are yet to be entered!
  • 27. Errors in data • In any study, there is always the potential for errors to occur in a data set, either at the outset when taking the measurements, or when collecting, and entering data onto a computer • It is hard to eliminate all of these errors • But one can reduce the number of typing errors by checking the data carefully once they have been entered.
  • 28. Common sources of error • „not applicable‟ or „blank‟ coded as “0” • typing errors on data entry- 18 INSTEAD of 81 • column shift- data for one variable column was entered under the adjacent column • coding errors • Loss of concentration
  • 29. Data cleaning Two-step process- • detection and • correction of errors .
  • 30. Detecting errors - Check for completeness and correctness of records. - Indicate admissible values during data entry - Range checks-permissible responses. - Statistical editing
  • 31. How to Detect Errors via Statistical editing • Produce descriptive statistics for all variables. • Check frequency distribution of each variable 1=male, 2=female, 3? • Standard deviation higher than mean; check for outlying observation
  • 32. Quality Control - Record verification (double entry) - Does not rule out the possibility that the same error has been incorrectly entered on the two occasions - Disadvantage of this approach is that it takes twice as long to enter the data, which may have major cost or time implications - Creating check files - Random checking: selection at random but should represent all forms being entered
  • 33. Error checking • Categorical Data: relatively easy, values not allowable must be errors • Check frequency distribution of each variable 1=male, 2=female, 3? • Numerical data: Produce descriptive statistics for all variables. • Standard deviation higher than mean; check for outlying observation • range checks, upper and lower limits can be specified for each variable
  • 34. • Dates: not easy to check accuracy of dates, for example 30th feb. must be incorrect, any day of the month greater than 31, any month greater than 12 • Apply logical checks: – date of birth should correspond to patient‟s age – subjects should usually have been born before entering the study( at least in most studies) – patients who have died should not appear on subsequent follow up visits – there should be no pregnant men
  • 35. • With all error checks, a value should only be corrected if there is evidence that a mistake has been made • Do not change values simply because they look unusual; investigate
  • 36. Summary • Experience comes with practice • Input influences output