EXERCISES AND REVIEW QUESTIONS
PART ONE: GETTING STARTED
Before attempting these questions read through Chapters 1, 2 and 3 of the SPSS Survival Manual.
Designing a study
1.1 When choosing a scale for use in research, what are the two main characteristics you need to
be aware of?
1.2 What are the two main types of reliability of a scale?
1.3 What measure is often used to indicate the internal consistency of a scale?
1.4 If you read that a scale had a Cronbach alpha value of .4 what would you think?
1.5 There are many different types of validity of a scale. Describe three.
1.6 If you were designing a questionnaire and wanted to measure respondents’ ages, which of the
following formats, (a) or (b), would be better? Explain your choice.
(a) Please tick one of the following categories to indicate
your age:
18-30 ___
31-45 ___
46-60 ___
61-80 ___
81+ ___
(b) Please indicate your age in years: _______
Preparing a codebook
1.7 There are a number of rules that must be obeyed when choosing variable names to use in
SPSS (see Chapter 2). Use the following questions to review some of these rules.
(a) Can a variable name start with a number?
(b) What is the maximum number of characters that a variable name can have?
(c) Can variable names contain spaces?
Copyright © Julie Pallant 2001
-1-
1.8 For each of the following, indicate which is a suitable variable name. If not suitable, explain
why.
(a) *q1
(b) and
(c) religion
(d) martialstatus
(e) a110q
(f) incom.hous
(g) 5optim
(h) optim5
Getting to know SPSS
1.9 In Chapter 3 of the SPSS Survival Manual you are taken on a guided tour of the basics of
SPSS. The best way to learn this program is by using it. Open the data file survey.sav. To get you
familiar with the program, try some of the activities below.
(a) Using the Data Editor window, go to the bottom of the file (using the scroll bar) and find out
the ID number of the last case in the file.
(b) Explore the different menus available in SPSS.
Click on Graphs and find out what types of graphs are available.
Click on Analyze and discover the wide range of statistics available.
(c) Practise using the dialogue and sub-dialogue boxes by clicking on Analyze and then on
Frequencies. Next, highlight the following variables and move them into the Variables box:
sex, age, marital, educ, op1, op2, op3, op4, op5, op6. Then, move these variables out of the
Variables box and use Cancel to escape from the dialogue box.
Copyright © Julie Pallant 2001
-2-
PART TWO: PREPARING THE DATA FILE
Before attempting these questions read through Chapters 4 and 5 of the SPSS Survival Manual.
Creating a data file and entering data
2.1 Before you start using SPSS to prepare a data file and run analyses, it is important to check
the SPSS Options. This is covered at the start of Chapter 4 of the SPSS Survival Manual. Use the
following questions to review this material.
(a) How do you change the order in which your variables are displayed (e.g. alphabetical order
instead of file order)?
(b) How do you change the number of decimal places that are used for a particular variable?
(c) If you wished to change the format used to display the output tables, where in the Options
would you do this?
2.2 The best way to learn how to set up an SPSS data file is to actually work through each of the
steps yourself. At the back of the SPSS Survival Manual you will find a codebook for suver.sav
(the file that was used to generate some of the output throughout the book) and the questionnaire
that this came from. Use this codebook to set up a data file from scratch. Follow the procedures
described in Chapter 4 to define each of the variables listed in the codebook. When you have
finished, enter some pretend data—you can generate this data yourself by completing the survey
presented in the appendix at the back of the Manual. Save the file as mydata.sav.
Screening and cleaning the data
It is very important that you check your data file for errors before beginning statistical analyses.
These exercises give you practice with the process of screening your data and correcting errors.
2.3 Using the instructions provided in Chapter 5 of the SPSS Survival Manual, check the
following categorical variables for out-of-range cases. You will need to open the survey.sav data
file. Use the codebook provided in the appendix at the back of the Manual to guide you.
(a) sex
(b) marital status (marital)
(c) children (child)
(d) major source of stress (source)
(e) do you smoke? (smoke)
Copyright © Julie Pallant 2001
-3-
2.4 Using the instructions provided in Chapter 5 of the SPSS Survival Manual, check the
following continuous variables for out-of-range cases. You will need to open the survey.sav data
file. Use the codebook provided in the appendix at the back of the Manual to guide you.
(a) each of the items in the Optimism scale (op1 to op6)
(b) each of the items in the Life Satisfaction scale (lifsat1 to lifsat5)
(c) each of the items in the Perceived Stress scale (pss1 to pss10)
Copyright © Julie Pallant 2001
-4-
PART THREE: PRELIMINARY ANALYSES
Before attempting these questions read through Chapters 6, 7, 8, 9 and 10 of the SPSS Survival
Manual.
Descriptive statistics
The first step in the analysis of any data file is to obtain descriptive statistics on each of your
variables. These can be used to check for out-of-range cases, to explore the distribution of the
scores, and to describe your sample in the Method section of a report.
3.1 Use the instructions in Chapter 6 and Chapter 7 of the SPSS Survival Manual to answer the
following questions concerning the variables included in the survey.sav data file.
(a) What is the mean age of the sample? What is the age range of the sample (minimum and
maximum values)?
(b) What is the percentage of males and females in the sample? Did any of the sample fail to
indicate their gender?
(c) What percentage of the sample were smokers?
(d) Assess the distribution of scores on the age variable. How normal is the distribution?
(e) Inspect the distribution of scores on the Total Negative Affect scale. How normal is the
distribution? Are there any cases that you would consider outliers?
(f) Following the instructions in Chapter 7 generate a histogram to explore the distribution of
scores on the Optimism scale (Total Optimism). How closely does it resemble the normal curve?
(g) Generate a scatterplot to explore the relationship between scores on the tpcoiss and tposaff.
Describe and interpret what this scatterplot tells you about the relationship between these two
variables.
Copyright © Julie Pallant 2001
-5-
Using graphs to describe and explore the data
3.2 Using the data file survey.sav, follow the instructions in Chapter 7 of the SPSS Survival
Manual to obtain the following graphs.
(a) histogram of scores on the Total Self-esteem scale (tslfest)
(b) bar graph of scores on the Total Self-esteem scale (tslfest) for males and females (sex), across
the three age groups (agegp3)
(c) scatterplot of scores on age and total scores on the Optimism scale (toptim)
(d) boxplot of scores on the Total Negative Affect scale (tnegaff) for males and females
(e) line graph of the Total Self-esteem scale (tslfest) for males and females (sex), across the three
age groups (agegp3)
Manipulating the data
This section includes a number of activities to help you review, and to apply, the material
covered in Chapter 8 of the SPSS Survival Manual. You should read through this chapter before
attempting these questions.
3.3 One of the things that many students initially find difficult is being able to identify when
items in a scale need to be ‘reversed’ before being added to give a total score. It is essential that
this is done properly, otherwise the values obtained for the total scale do not mean anything. To
give you some practice at this we will use the Perceived Control of Internal States scale (an 18-
item scale on p. 276 of the Manual). The aim of this exercise is to identify which items to reverse
(not to actually carry out the reversals on the items in the survey.sav data file as these have
already been correctly reversed).
(a) Identify which items in the scale would need to be reversed so that high total scores would
indicate high levels of perceived control.
As discussed in Chapter 8 of the SPSS Survival Manual the next step is to calculate total scores
by adding together the items that make up each scale. The following two exercises give you some
practice with this process.
3.4 Use the procedures covered in Chapter 8 to create (compute) the following new total scale
scores. Create new total subscale scores for the Perceived Control of Internal States scale (this
scale is shown on p. 276).
(a) To calculate the Emotion subscale, add items pc1 to pc6. Call this new variable pcemot.
(b) To calculate the Thoughts subscale, add items pc7 to pc12. Call this new variable pcthou.
(c) To calculate the Physical subscale, add items pc13 to pc18. Call this new variable pcphys.
Copyright © Julie Pallant 2001
-6-
3.5 In this exercise a variable with eight different responses will be recoded into another variable
which has only two possible values. You will need to use information from the questionnaire and
codebook provided in the appendix of the Manual concerning the marital status question. You
will also need to open the survey.sav data file.
(a) Run Frequencies on the variable marital status (marital) to find out how many people fall
into each of the categories.
(b) Follow the instructions in Chapter 8 to create a new variable (relship) from the variable in the
data file (marital). The new variable will only have two values, indicating whether a person is or
is not in a relationship.
In the first group include people who are not in a relationship (single, separated, divorced,
widowed). These will be coded 1.
In the second group include people who are in a relationship (steady relationship, living
with partner, married for the first time, remarried). These will be coded 2.
(c) Run Frequencies on the new variable (relship) and compare this with the results of the
Frequencies on the original variable (marital). Are there the correct number of cases in each of
the new groups?
Checking the reliability of a scale
If you use scales or standardised measures in your research (this is common in psychological
research) it is important to assess the reliability (internal consistency) of the scores on the scale in
your sample. The following exercise gives you some practice in this process.
3.6 Follow the procedure in Chapter 9 of the SPSS Survival Manual to assess the reliability of the
following scales. You will need to refer to the codebook in the appendix to identify the items that
make up each of the scales.
(a) Optimism scale (op1 to op6)
(b) Perceived Stress scale (pc1 to pc18)
(c) Self-esteem scale (sest1 to sest10)
Copyright © Julie Pallant 2001
-7-
Choosing the right statistic
Many students find it difficult to identify which statistical technique to use to address their
research questions. Chapter 10 of the SPSS Survival Manual will help you with this process.
3.7 For each of the following research situations identify which statistical technique could be
used.
(a) Ann is interested in exploring the possibility of gender differences in levels of perceived
stress.
(b) Ann would also like to explore the relationship between optimism and perceived stress. She
suspects that higher levels of optimism would be associated with lower levels of perceived stress.
(c) Bill is interested in exploring the effect of both sex and age group on self-esteem scores. He is
interested in the effect of each variable individually, and any interaction that may exist.
(d) Celia would like to know which is a better predictor of negative affect: optimism or self-
esteem.
(e) If Celia were also concerned that age may be a confounding variable, how would she go about
controlling for this variable in the analyses?
(f) David is interested in the question: Are younger people (18-29yrs) more likely to be smokers
than older people (30-44yrs or 45+yrs)?
(g) Ellie conducts a study to find out if there is a significant change in depression levels across
three time periods (prior to an intervention, after the intervention and at a three-month follow-
up).
(h) Frank, a lecturer, wants to know who performed better on a statistics exam, males or females?
(i) Frank is also interested to find out if there was a relationship between age and exam score.
3.8 Review each of the situations listed in Exercise 3.7 and consider what non-parametric
technique you would use if it was not appropriate to use a parametric test. (Hint: Not all will have
a non-parametric alternative.)
Copyright © Julie Pallant 2001
-8-
PART FOUR: STATISTICAL TECHNIQUES TO EXPLORE RELATIONSHIPS
AMONG VARIABLES
You should review the material in the introduction to Part Four and in Chapters 11, 12, 13 and 14
of the SPSS Survival Manual before attempting these exercises.
Correlation
4.1 Using the data file survey.sav follow the instructions in Chapter 11 to explore the relationship
between age and life satisfaction (tlifesat). Present the results in a brief report.
4.2 Use the instructions in Chapter 11 to generate a full correlation matrix to check the
intercorrelations among the following variables.
(a) age
(b) perceived stress (tpstress)
(c) positive affect (tposaff)
(d) negative affect (tnegaff)
(e) life satisfaction (tlifesat)
4.3 Gill, a researcher, is interested in exploring the impact of a number of variables on the
experience of positive affect (tposaff), negative affect (tnegaff) and perceived stress (tpstress). In
particular, he is interested in the impact of age, self-esteem (tslfest), optimism (toptim), control
of internal states (tpcoiss) and mastery (tmast).
(a) Follow the instructions in Chapter 11 of the SPSS Survival Manual to generate a condensed
correlation matrix which presents the correlations between positive affect, negative affect and
perceived stress with each of the other variables listed.
(b) Repeat the analysis in (a), but first split the sample by sex. Compare the pattern of
correlations for males and females. Remember to turn off the Split File option after you have
finished this analysis.
Partial correlation
4.4 Follow the procedures detailed in Chapter 12 of the SPSS Survival Manual to calculate the
partial correlation between optimism (toptim) and perceived stress (tpstress) while controlling for
the effects of age. Compare the zero order correlations with the partial correlation coefficients to
see if controlling for age had any effect.
Copyright © Julie Pallant 2001
-9-
Multiple regression
4.5 There are three main types of multiple regression analyses. What are they? When would you
use each approach?
4.6 You wish to run a standard multiple regression to explore the impact of three independent
variables on one dependent variable. What would be the minimum sample size that you should
have? (Hint: Use the formula presented in Chapter 13 of the SPSS Survival Manual.)
4.7 As part of the preliminary screening process it is recommended that you inspect the
Mahalanobis distances produced by SPSS. What do these tell you?
4.8 The example used in the SPSS Survival Manual to demonstrate the use of standard multiple
regression compares two control measures (PCOISS and Mastery) in terms of their ability to
predict perceived stress. Repeat this analysis, this time using negative affect (tnegaff) as your
dependent variable. Use the output to answer the following questions.
(a) Overall, how much of the variance in negative affect is explained by these two variables?
(b) Which of the independent variables (tpcoiss, tmast) is the best predictor of negative affect?
(c) Do both variables make a statistically significant contribution to the prediction of negative
affect?
4.9 Follow the instructions on p. 146 of the SPSS Survival Manual to perform an hierarchical
multiple regression, this time using negative affect as the dependent variable.
Factor analysis
4.10 There is some controversy in the literature concerning the underlying factor structure of one
of the scales included in the questionnaire presented in the appendix of the SPSS Survival
Manual. The Optimism scale was originally designed as a one-dimension (factor) scale which
included some positively worded items and some negatively worded items. Recent studies
suggest that it may in fact consist of two factors representing optimism and pessimism. Conduct
a factor analysis using the instructions presented in Chapter 14 to explore the factor structure of
the optimism scale (op1 to op6).
Copyright © Julie Pallant 2001
-10-
PART FIVE: STATISTICAL TECHNIQUES TO COMPARE GROUPS
Before attempting these questions read through the introduction to Part Five and Chapters 15-21
of the SPSS Survival Manual.
T-tests
5.1 Using the data file survey.sav follow the instructions in Chapter 15 of the SPSS Survival
Manual to find out if there is a statistically significant difference in the mean score for males and
females on the Total Life Satisfaction Scale (tlifesat). Present this information in a brief report.
5.2 Using the data file experim.sav apply whichever of the t-test procedures covered in Chapter
15 of the SPSS Survival Manual that you think are appropriate to answer the following questions.
(a) Who has the greatest fear of statistics at time 1, males or females?
(b) Was the intervention effective in increasing students’ confidence in their ability to cope with
statistics? You will need to use the variables, confidence time1 (conf1) and confidence time2
(conf2). Write your results up in a report.
(c) What impact did the intervention have on students’ levels of depression?
One-way analysis of variance
For exercises 5.3, 5.4 and 5.5, you will need to open the data file survey.sav.
5.3 Perform a one-way between-groups ANOVA to compare the levels of perceived stress
(tpstress) for the five different age groups (agegp5), 18-24yrs, 25-32yrs, 33-40yrs, 41-49yrs and
50+yrs.
5.4 Perform post-hoc tests to identify where the significant differences between the groups occur.
5.5 Perform planned comparisons to compare the Perceived Stress scores for people aged 18-
29yrs with those 45+yrs. Use the agegp3 variable, groups 1 and 3.
For the following exercise you will need to open the data file experim.sav.
5.6 Use one-way repeated measures ANOVA to compare the Fear of Statistics scores for the
three time periods (time1, time2 and time3). Inspect the means plots and describe the impact of
the intervention and the subsequent follow-up three months later.
Copyright © Julie Pallant 2001
-11-
Two-way between-groups ANOVA
5.7 For this exercise you will need to open the data file survey.sav. Follow the instructions in
Chapter 17 of the SPSS Survival Manual to conduct a two-way ANOVA to explore the impact of
sex and age group on levels of perceived stress. The three variables you will need are sex, agegp5
and tpstress.
(a) Interpret the results. Is there a significant interaction effect? Are the two main effects
significant?
(b) Write up this analysis and the results in a report. (Don’t forget to report the means and
standard deviations for each group.)
Mixed between-within subjects analysis of variance
5.8 In Chapter 18 of the SPSS Survival Manual we explored the impact of two different
intervention programs (maths skills/confidence building) on participants’ fear of statistics. We
found that both interventions were equally effective in reducing participants’ fear—that is, we
found no differences between groups—but a significant difference across the three time periods.
Repeat these analyses, but this time use confidence scores as the dependent variable. You will
need to use the following variables: group, conf1, conf2 and conf3.
(a) Is there a significant interaction effect between type of intervention (group) and time?
(b) Is there a significant main effect for the within-subjects independent variable, time?
(c) Is there a significant main effect for the between-subjects independent variable, group (maths
skills/confidence building)?
Multivariate analysis of variance
5.9 How does MANOVA differ from ANOVA?
5.10 In Chapter 19 of the SPSS Survival Manual it is recommended that you check the
Mahalonobis distances before proceeding with MANOVA. What does this allow you to check
for?
5.11 Multicollinearity can be a problem when you are intending to use MANOVA. What does
this word mean and how can you check for it?
5.12 Which assumption is Box’s Test used to assess?
Copyright © Julie Pallant 2001
-12-
5.13 Follow the procedure detailed in Chapter 19 of the SPSS Survival Manual to perform a
MANOVA to explore positive and negative affect scores for the three age groups (18-29yrs, 30-
44yrs, 45+yrs). The three variables you will need are tposaff, tnegaff, agegp3. Remember to
check your assumptions first.
5.14 Write a brief report of your results of the procedure you conducted in Exercise 5.13.
Analysis of covariance
5.15 Under what circumstances would you want to consider using analysis of covariance?
5.16 What issues do you need to consider when you are selecting possible covariates?
5.17 Using the experim.sav data file, perform the appropriate analyses (including assumption
testing) to compare the confidence scores for the two groups (maths skills, confidence building)
at time 2, while controlling for confidence scores at time 1. The variables you will need are
group, conf1, conf2.
5.18 Perform a two-way analysis of covariance to explore the question: Does gender influence
the effectiveness of the two intervention programs designed to increase participants’ confidence
in being able to cope with statistics training? You will need to assess the impact of sex and type
of intervention (group) on confidence at time 2, controlling for confidence scores at time 1.
Non-parametric statistics
5.19 What is the difference between parametric techniques and non-parametric techniques?
5.20 What factors would you consider when choosing whether to use a parametric or a non-
parametric technique?
5.21 For each of the following parametric techniques indicate the non-parametric alternative (if
one exists).
(a) one-way between-groups ANOVA
(b) Pearson’s product-moment correlation
(c) independent samples t-test
(d) multivariate analysis of variance
(e) one-way repeated measures ANOVA
(f) paired samples t-test
(g) partial correlation
(h) one-way repeated measures ANOVA
Copyright © Julie Pallant 2001
-13-
5.22 Choose and perform the appropriate non-parametric test to address each of the following
research questions.
(a) Using the survey.sav data file find out whether smokers are significantly more stressed than
non-smokers. The variables you will need are smoke and total perceived stress (tpstress).
(b) Using the survey.sav data file compare the self-esteem scores across the three different age
groups (18-29yrs, 30-44yrs, 45+yrs). The variables you will need are tslfest and agegp3.
(c) Using the survey.sav data file explore the relationship between optimism and negative affect.
The variables you will need are toptim and tnegaff.
(d) Using the survey.sav data file explore the association between education level and smoking.
The variables you will need are educ2 and smoke. Check the codebook and the questionnaire in
the appendix of the SPSS Survival Manual for details on these two variables.
(e) Using the experim.sav data file compare the depression scores at time 1 and the depression
scores at time 2. Did the intervention result in a significant change in depression scores? The
variables you will need are depress1 and depress2.
(f) Using the experim.sav data file compare the depression scores for the three time periods
involved in the study (before the intervention, after the intervention and at the three-month
follow up). The variables you will need are depress1, depress2 and depress3.
Copyright © Julie Pallant 2001
-14-
CONSOLIDATION EXERCISES
The remaining questions give you the opportunity to not only practise using SPSS, but also to
choose which analyses can be used to address different research questions. Run whatever
analyses or procedures you think appropriate to answer the following questions. Wherever
appropriate, support your conclusions with graphs. Consider how you would present your
answers to these questions in a report.
Part I: For this set of questions you will need to open the survey.sav data file.
(a) Obtain the information necessary to describe, in a Methods section of a report, the
demographic characteristics of the sample used in the data file survey.sav. In particular describe:
the number (and percentages) of males and females in the sample
the age range of the sample and the mean and standard deviation for age
the number (and percentages) of married, single and divorced participants
(b) Are scores on the Perceived Control of Internal States scale (tpcoiss) normally distributed?
(c) What is the ID number of the respondent(s) with the lowest score on the self-esteem scale
(tslfest)?
(d) How many people in the sample smoke?
(e) Who is most stressed: males or females (use the Total Perceived Stress scale scores, tpstress)?
(f) Do older people have higher levels of self-esteem (use the variable tslfest)? (Hint: there are
two different ways that this question can be addressed.)
(g) Are optimists happier? Or, rephrased: Is there a relationship between a person’s level of
optimism (toptim) and their scores on the Positive Affect scale (tposaff)?
(h) Do smokers and non-smokers differ in terms of their perceived stress (tpstress) or their self-
esteem (tslfest)?
(i) How strong is the relationship between perceived stress (tpstress) and overall life satisfaction
(tlifesat)?
(j) Are younger people more likely to smoke than older people?
(k) In this sample, how many people fell into the age category 50+ (use agegp5)? Did these
people have higher or lower mean scores on the Optimism scale (toptim) than people in the 18-
24 yr age group?
(l) Is there a linear (straight-line) relationship between control of internal states (tpcoiss) and
perceived stress (tpstress)? Is this relationship positive or negative? How strongly are these two
variables related?
Copyright © Julie Pallant 2001
-15-
Part II: For the following questions you will need to open the experim.sav data file.
(a) Using the experim.sav data file find out whether there is a significant change in depression
levels across the three time periods, and whether this pattern was the same for males and females.
(b) Who performed better on the statistics exam—males or females?
(c) Is there a relationship between age and exam score? Describe the nature of this relationship.
(d) Review each of the questions listed in both Part A and Part B above and consider what non-
parametric technique you would use if it was not appropriate to use a parametric test. (Hint: Not
all will have a non-parametric alternative.) Follow the instructions provided in Chapter 21 to
carry out any non-parametric analyses that are appropriate. Compare the conclusions you obtain
using the parametric and non-parametric alternatives.
Copyright © Julie Pallant 2001
-16-