The author(s) shown below used Federal funds provided by the U.S. Department of Justice and prepared the following final report: Document Title: Analysis of the United Nations Data Set on Crime Trends and the Operations of Criminal Justice Systems: Part I Author(s): R.W. Burnham Document No.: 183461 Date Received: July 5, 2000 Award Number: 96-MU-CX-0002 This report has not been published by the U.S. Department of Justice. To provide better customer service, NCJRS has made this Federallyfunnde grant final report available electronically in addition to traditional paper copies. Opinions or points of view expressed are those of the author(s) and do not necessarily reflect the official position or policies of the U.S. Department of Justice.L FINAL REPORT OF NIJBJS VISITING FELLOWSHIP R. W.BURNHAM November 1998 PART I: THE ANALYSIS OF THE UNITED NATIONS DATA SET ON CRIME TRENDS AND THE OPERATIONS OF CRIMINAL JUSTICE SYSTEMS Section I: Introduction and Background. The United Nations has collected data on crime and criminal justice since the mid 1 9 7 0 ~ ~ but little systematic use has been made of the data, with some exceptions. The data were collected in a series of five sweeps, called by the UN “surveys”. Thus the first survey covered 1970-74, the second 1975-80, the thlrd 1980-86, the fourth 1986-90 and the fifth 1990-94. In this report the word “sweep” is used, so that the term “survey” can be retained for the activity as a whole. The sixth sweep, covering 1995-97 is being administered in early 1999. The exceptions known to the author are: (i) by Dr. Freda Adler,in 1982, with an analysis which eventually formed the basis of her book “Countries not Obsessed with Crime”. While that was an original and interesting attempt at a new approach to the analysis of crime data in the context of other socioeconnomi data, it suffered from the fact that the data available from the second sweep of the UN surveys were even less complete and reliable than Adler allowed for, thus making most of the conclusions at best tentative; (ii) a series of analyses of the European regional data made bv the Helsinlu European Institute for Crime and Justice affiliated with the United Nations (HEUNI) for the thrd and each of the subsequent sweeps. Most of the output is descriptive, giving tabular and graphical representations of the data with various summary statistics. These reports are thorough and careful. As most of the countries, but not all, which supplied the best data in the UN surveys are West European, to follow the HELJNI model in this work would have been largely to replicate it. Readers are therefore referred to those reports as being one good way of using the UN data. (iii) some individual articles by criminologsts in professional journals on specific aspects within the data set. (iv) most recently the Global Report on Crime and Justice, edited by Dr. Graeme Newman. This is in the final stages of publication as ths report is written, and I believe that it makes extensive use of the data collected in the fifth sweep. To the best of my knowledge no research worker had tried to survey the whole of the data 1 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This reportset, in all sweeps, and to consider the main patterns which may be seen, and, above all, the uses to which the whole data set could be put. I therefore made a proposal for a Visiting Fellowshlp to the National Institute of Justice and the Bureau of Justice Statistics in July 1996 to undertake that task. Ln the proposal as accepted by NUBJS, the original terms of reference proposed by the Fellow were to carry out a comprehensive analysis of the complete UN data set. As described in the next section, that turned out to be not feasible, and also of limited interest or value. In August 1997, therefore, I requested authorization for reformulating the objectives and outputs, and that request was approved. The new objectives and outputs are described on page 3, in Section II. SECTION Il. THE NATURE OF THE DATA SET The data set was provided by the Crime Prevention and Criminal Justice Division (as it then was) of the United Nations Office in Vienna. It was also available in the same format on the Internet. The different sweeps were all in SPSS+, the generation of the software package for use on MS DOS based machines. These were easily convertible into SPSS for Windows. One of the first problems encountered was the general structure of the data set. The first and most critical factor to emerge was that analysis across sweeps was very difficult, because of the file structure which had been used. The second sweep had been entered using the SAS application on a main frame computer, as that was all that was available. ‘The third sweep had been entered into SPSS-t, which was the leading application at the time. One problem with the applications of that day, itself deriving from the limitations of space and power on the early micro computers, was that the size of files was limited. Therefore each file could contain only a relatively small number of variables by the standards of today. The file structure was built around the agencies of the system, and basically there was one file for each agency. Each variable within one agency file was described by coded letters. “nointh80 “represented recorded non intentional homicides in When the data from the fourth and fifth sweeps were added, the same file structure was retained, along with variable names constructed in the same way, although the improving technology through the early and mid 1990s would have made a more flexible file structure feasible. The outcome was that it was extremely difficult to create a mode of analysis which crossed the five or six years of each sweep, and that in effect there were four different sweeps whch could be related to each other only in a clumsy, painstalung and limited manner. The data set in its previous form was described by one methodologist as not so much userunheendl as user-repellant. With the advice and assistance of Professor Helmut Anheier, of Rutgers and Johns Hopkins Universities, a specialist in the methodology of social surveys, the data set was reconfigured into its present form, which is a standard format for such survey data bases. In its present confi_wation, there is a separate file for each year, and each file contains all 2 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This reportthe variables and cases for that year. The variables are numbered so that they are the same for each year, but distinguishable by the year suffix, for instance total recorded non intentional homicide in 1980 now becomes VO38-80. This makes it much easier to compare any variable for any year with that of any other year. This is not to say that any such comparison would be meaningful; it is a reasonable prediction that the majority would be meaningless however good the quality of the data; but such comparisons are feasible. The data set currently is in SPSS for windows, and can be manipulated and translated in any way.that application allows. The re-configuration took some considerable time, so that the nine months which the original proposal had foreseen was quite inadequate to carry out any extensive analysis. It did provide me with an opportunity, however, to thinli at some length what are and are not appropriate uses of the data set, and which are the types of questions which it can properly be used to answer, and which types of questions are inappropriate, as requiring other kinds of data. The conclusion of that meta analysis, that is the conceptual analysis of what type of data analysis would provide the most useful outcome of the fellowship led to a reformulation of the objectives of the fellowship, with the outputs being as follows: 1. the data set in a configuration which is easy to manipulate in any way, and as a seamless whole, not a series of unconnected sweeps; 2. an examination of the quality of the data, and a methodological account of the rationale and techques by whch the quality is assessed; 3 a review of the types of questions which can sensibly and appropriately be asked of such data; 4. a series of exemplary analyses, that is examples of the type of analysis needed to address and answer the questions identified in output 3 above; 5. (if possible) a set of purpose written small programs which will enable other users of the data set to carry out the analyses described, but on whichever variables and cases they wish, That would require further technical assistance from, or through, NIJ. 6. a set of recommendations to NLTBJS whch they may refer to the United Nations if they consider it appropriate as to how to improve the quality and utility of data gathered in the future. The emphasis in the work of the Fellow therefore changed over time, to the point where his task is seen now as preparing the material for others to use, and providing basic tools for those who are not themselves comfortable with statistical applications, and are more concerned with the results of the analysis rather than the methodology. The rationale for the change from the orig~nal objective of undertalung some form of comprehensive analysis was based on the growing appreciation of the number of possible analyses to be made, combined with the fact that any one of them would be of interest to only a certain proportion of those interested in the data sei. Therefore the most desirable end state would be if as many users as possible could be encouraged to undertake their own analysis on whatever variables and cases interested them the most. 3 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This reportSECTION III: THE QUALITY OF THE DATA One major concern in the analysis of any data set must be with the quality of the data, traditionally evaluated by validity (the accuacy with which the data relate to what is being measured in the external world), and reliability (the accuracy of the apparent relationship of one datum to the other data internally to the data set). The data in this set had been collected by the UN in five different sweeps, as described n the first section. At the expert group meeting in 1986 convened by OJP in Washington DC to plan the thlrd sweep, one of the participants proposed that the questionnaire for that and subsequent sweeps should include two new features. These were, administratively, a place at the beginning of each section intended for completion by the different agencies for the name and contact address of the individual who completed the return, and substantively the inclusion of the last year of the time period covered by the previous sweep. The purpose of the first was to enable a dialogue between the UN and the returning office to question possible miss entries and clan@any uncertainty; the purpose of the second was to obtain information on how consistent countries were or are in submitting data. This instrument was included to make some form of validity check feasible. It seems that the first proposal was implemented up to a point, namely that a space was incorporated, with the appropriate questions, for the provision of identifying specifics by the returning officials, and that these details were often provided; but it seems also that the UN never made use of the contact points so created. It seems also the explanation was that the workload and time pressures of the CPCJB were such that data entry had to be done by volunteers or short term staff, and that it was therefore done some time after the returns had been received, and m
merge files > add variables open Huncj90(5) use vOO1 as key variable] With this new file, it is then possible to carry out the compute statements. The following is an example of the compute statement in syntax for finding matches in overlapping years, where dim4 is the new variable showing the difference between the same variable in the two sweeps: COMPUTE dm4 = ~034-90 -~034-904 . EXECUTE. FREQUENCIES VARIABLES=diff34 This gives the cases which are exact matches (Le. the number is the same for both sweeps L giving a result of 0); a closer look is required to see whether the matches are within 10%. For example, in the variable given above, the number of O’s, indicating that the two numbers are exactly the same is 48.7%. In the table1 on p.5, the percentage is given as 66. 30 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This reportThe figures for the comparison across years were generated by the follohing procedure, For a given country and variable, the availability of data was checked, and a separate file created. For the adjacent years, the smaller was subtracted from the larger figure, the product of that was divided by the larger figure, and the product of that multiplied by 100. As the purpose of the procedure was to establish to size of the change from one year to the next, but was not concerned with the direction of the change, it did not matter whether the first or second of the two figures is the larger, i.e. whether the change is an increase or decrease in the incidence. For the lagged years, the base year was established as 1975 when the data were available in the second sweep and 1980 when they were available only in the third sweep. Alternatively, for the analysis within a given sweep, the first year of that sweep became the base year. The base year was then subtracted from the year each of the years in turn, or they from it if they were the smaller number, and the product again divided by the larger number and multiplied by 100. These procedures gave the value for the rate of change of the incidence of the variable in question as a percentage relative to either the preceding or the base year, and were plotted on a h e chart as exemplified by Charts 1-6. An example of the syntax of the compute statement in SPSS is as follows, where ‘d42d7576’ is a new variable representing the difference between 1975 & 1976 ‘ADJACENT YEARS V042 (RECORDED ROBBERY) IF (~042-76 >= ~042-75) d42d7576 = (~042-76 -~042-75) /~042-76 * 100 . EXECUTE. IF (~042-75 >= ~042-76) d42d7576 = (~042-75 -~042-76) /~042-75 100 . IF (~042-77 >= ~042-76) d42d7677 = (~042-77 -~042-76) 1~042-77 * 100 . IF (~042-76 >= ~042-77) d42d7677 = (~042-76 -~042-77) 1~042-76 100 . IF (~042-78.>= ~042-77) d42d7778 = (~042-78 -~042-77) /VO42-78 100 . IF (~042-77 >= ~042-78) d42d7778 = (~042-77 -~042-78) 1~042-77 * 100 . IF (~042-79 >= ~042-78) d42d7879 = (~042-79 -~042-78) 1~042-79 * 100 . EXECUTE. EXECUTE EXECUTE. EXECUTE EXECUTE. EXECUTE. IF (~042-78 >= ~042-79) d42d7879 = (~042-78 -~042-79) /~042-78 100 . EXECUTE IF (~042-80 >= ~042-79) d42d7980 = (~042-80 -~042-79) Iv042-80 * 100 . EXECUTE. IF (~042-79 >= vO42-80) d42d7980 = (~042-79 -~042-80) /~042-79 100 . EXECUTE. IF (~042-81 >= ~042-80) d42d8081 = (~042-81 -~042-80) 1~042-81 100 EXECUTE IF (~042-80 >= ~042-81) d42d8081 = (~042-80 -~042-81) Iv042-80 100 EXECUTE IF (~042-82 >= ~042-81) d42d8182 = (~042-82 -~042-81) 1~042-62 100 EXECUTE IF (~042-81 >= ~042-82) d42d8182 = (~042-81 -~042-82) /~042-81 100 EXECUTE IF (~042-83 >= ~042-82) d42d8283 = (~042-83 -~042-82) /~042-83 100 EXECUTE 31 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This reportIt ( v U ~ L -~ L >= Vb4i-8 j) d4LObL 83 = (vu4L-O~ -VULIL-OJ) I vW~L-OL I uv EXECUTE . IF (vO42-84 >= ~042-83) d42d8384 = (~042-84 -~042-83) I ~042-84 100 . EXECUTE IF (~042 83 >= ~042 84) d42d8384 = (~042-83 -~042-84) I ~042-83 100 . -. EXECUTE. IF (~042-85 >= ~042-84) d42d8485 = (~042-85 -~042-84) 1~042-85 * 100 . EXECUTE. IF (~042-84 >= ~042-85) d42d8485 = (~042-84 -~042-85) /~042-84 100 , EXECUTE. EXECUTE. EXECUTE. IF (~042-86 >= ~042-85) d42d8586 = (~042-86 -~042-85) /~042-86 * 100.. IF (~042-85 >= ~042-86) d42d8586 = (~042-85 -~042-86) I ~042-85 100 . IF (~042-87 >= ~042-86) d42d8687 = (~042-87 -~042-86) I V042-87 100 . EXECUTE. IF (~042 86 >= ~042-87) d42d8687 = (~042-86 -VO42-87) 1~042-86 100 . EXECUTE. -IF (~042-88 >= ~042-87) d42d8788 = (~042-88 -~042-87) 1~042-88 100 . IF (~042-87 >= ~042-88) d42d8788 = (~042-87 -~042-88) I V042-87 100 . IF (~042-89 >= ~042-88) d42d8889 = (~042-89 -~042-88) IV042-89 100 . IF (~042-88 >= ~042-89) d42d8889 = (V042-88 -~042-89) I ~042-88 * 100 . IF (~042-90 >= ~042-89) d42d8990 = (~042-90 -~042-89) I V042-90 100. IF (~042-89 >= ~042-90) d42d8990 = (~042-89 -~042-90) Iv042-89 * 100 . IF (~042-91 >= ~042-90) d42d9091 = (~042-91 -~042-90) IV042-91 100. IF (~042-90 >= ~042-91) d42d9091 = (~042-90 -~042-91) I V042-90 100 . EXECUTE. EXECUTE. EXECUTE. EXECUTE. EXECUTE. EXECUTE. EXECUTE. EXECUTE. IF (~042-92 >= ~042-91) d42d9192 = (~042-92 -~042-91) IvO42-92 100 . EXECUTE. IF (~042-91 >= ~042-92) d42d9192 = (~042-91 -~042-92) IV042-91 100. EXECUTE. IF (~042-93 >= ~042-92) d42d9293 = (~042-93 -~042-92) I ~042-93 * 100 . EXECUTE. IF (V042-92 >= ~042-93) d42d9293 = (~042-92 -~042-93) I V042-92 * 100 . IF (~042-94 >= ~042-93) d42d9394 = (~042-94 -~042-93) I V042-94 * 100 . IF (~042-93 >= ~042-94) d42d9394 = (~042-93 -~042-94) 1~042-93 * 100 . EXECUTE. EXECUTE. EXECUTE. ' 32 U.S. Department of Justice. of the author(s) and do not necessarily reflect the official position or policies of the has not been published by the Department. Opinions or points of view expressed are those This document is a research report submitted to the U.S. Department of Justice. This report
|