Data & statistics
for writers
(some of)
what you’ll need to know
Walter W. Giesbrecht AS/WRIT 2300
Data Librarian, York University 2008.02.05
numeric panic!
• related conditions are numerophobia,
arithmophobia, statistophobia
• in writers, a condition brought on by a
need for a statistical fact, figure, table
or data
• symptoms include
– a blank mind
– feeling of a clenched fist in your stomach
– urge to change the subject of your article
to avoid the need for numbers
how to deal with numeric panic?
• understand what you’re looking for
(i.e., become more statistically literate)
• ask the right questions
• search the right sources
what are data?
• facts or figures from which conclusions
can be drawn
• numeric files created and organized
– for analysis, or to create a new table
• includes geographic data
– (to make maps)
Without data, all you are is just another
person with an opinion.
–Unknown
Data is a lot like humans: It is born. Matures.
Gets married to other data, divorced. Gets old.
One thing that it doesn’t do is die. It has to be
killed.
–Arthur Miller
Errors using inadequate data are much less
than those using no data at all.
- Charles Babbage
what data are not
"The plural of anecdote is not data."
-- Roger Brinner
“… cursed the alchemy by which anecdote
is transformed into data …”
what are statistics?
• type of information obtained through
mathematical operations on numerical
data
• statistics are processed data, or data
that have been analyzed in some way
• generally used to support an argument
or position in a study or report
There are two kinds of statistics, the kind you
look up, and the kind you make up.
- Rex Stout
If you want to inspire confidence, give plenty of
statistics. It does not matter that they should
be accurate, or even intelligible, as long as
there is enough of them.
- Lewis Carroll
The statistics on sanity are that one out of
every four Americans is suffering from some
form of mental illness. Think of your three best
friends. If they are okay, then it’s you.
- Rita Mae Brown
statistics
• in print, typically found in statistical
abstracts, census and other government
publications (monograph or serial)
• in digital form, found in online databases
data vs. statistics
• difference between looking at a
photograph and taking the photograph
yourself
• statistics are like a photograph or
postcard
– a captured image of the data chosen by
someone else
• data are like the view through a camera
– you choose the view you want
aggregate data
• data that have been grouped or
summarized in some way
– e.g., by geography or age group
• boundary between aggregate data and
statistics sometimes blurry
aggregate data structure
• time
– e.g., time series data from CANSIM,
Labour Force Historical Review,
multiple Census years
• geography
– e.g., Census data
– neighbourhood --> national
• social content
– e.g., injury data from Health Indicators
Database
example of aggregate data table
microdata
• unsummarized data
– often samples of actual responses to
surveys
• two types of microdata files
– master file -- raw data, usually of
restricted availability
– PUMF (public-use microdata file)
-- anonymized version of master file
-- available for purchase
National Population Health Survey
microdata file excerpt
column 42-44
-- # visits to eye specialist
column 13 – pets?
column 8
-- sex of respondent
aggregate data vs. microdata
• aggregate data is what you’ll be
working with (most of the time)
• microdata usually requires referral to
data librarian or Statistics Canada
– requires use of statistical software
essential factors in
data research
• geography
• time
• level of observation
• format
• intended use
• cost (and ability to pay!)
Go to the source ...
Canada United States International
Federal Federal Departments United Nations
• National Accounts • Commerce OECD
• Census • Labor IMF
• Trade • Justice World Bank
• Agriculture Eurostats
Provincial
• Health
• Education
Canadian data
• Statistics Canada is generally the first
stop for Canadian data
• search tools:
– the Daily
– Online catalogue
– E-STAT
– Canadian Statistics
– Community Profiles
E-STAT
• intended for use by education
community, and DSP libraries
• provides “free” access to CANSIM
– CANSIM on E-STAT only updated once a
year
• census data from 1986-2006, and
selected censuses from 1665-1871
• data can be mapped/exported
maps generated in E-STAT
professional occupations people earning > $60K / year
in arts and culture
Sources of international data
• United States
• United Nations
• International agencies
United States
• Bureau of the Census
• fedstats.gov
• Statistical Abstracts of the United States
United Nations
• United Nations Statistics Division
– many useful print publications as well,
e.g., Statistical Yearbook
• most specialized agencies have their
own statistical publications and/or
websites
Other sources of
international statistics
• OECD
• World Bank
• National statistical agencies
• OFFSTATS
sources of help for writers
• Statistics every writer should know
• Statistics: Power from Data!
• STATS (“checking out the facts &
figures behind the news”)
• your local librarian!
Be prepared for
disappointment!
• Some of what you want
– Won’t exist
– Won’t be readily available to you
– Will cost too much
One last thing ...
• Don’t be afraid to ask for help!
• All the stupid questions have
already been asked … by “experts”
Library home page
> Course-related research guides
> WRIT 2300
walterg@yorku.ca
Walter W. Giesbrecht AS/WRIT 2300
Data Librarian, York University 2008.02.05