Links to resources and background information used
during the CERIF Revision activities
CERIF Recommendation as published in 1991:
Terms of Reference for the CERIF Revision :
Code of Good Practice for Current Research Information Systems
Development of a European Service for Information on Research and Education
DESIRE project (The project is looking at Web technology and the implementation of
pilot information services on behalf of European researchers.)
DESIRE is a large project funded by the Telematics for Research Sector of the Fourth
Framework programme of the European Union. The project is looking at Web
technology and the implementation of pilot information services on behalf of
UKOLN, together with the other ROADS partners - the Institute of Learning and
Research Technology (ILRT) at the University of Bristol and the Department of
Computer Studies at Loughborough University - are involved in Work Package 3
(WP3) on Resource discovery and indexing. The other partners involved in WP3 are
NetLab at the University of Lund, Sweden and the Koninklijke Bibliotheek - the
National Library of the Netherlands.
See also the project's Web pages at URL
The Nordic metadata project - Final report
Dublin Core Metadata Template
Category/Genre (Resource Type) The Dublin Core proposal for metadata contains a
Resource Type element. This element is to describe the genre of the item being
described. Attached is an example of how we use it. This option seeks to identify
resource types down to approximately three layers of specificity.
Format the data representation of the resource, such as text/html, ASCII, Postscript
file, executable application, or JPEG image. The intent of specifying this element is to
provide information necessary to allow people or machines to make decisions about
the usability of the encoded data (what hardware and software might be required to
display or execute it, for example). As with the RESOURCE TYPE elements,
FORMAT will be assigned from enumerated lists such as registered Internet Media
Types - IMT (MIME types). In principal, formats can include physical media such as
books, serials, or other non-electronic media. The MIME types are defined according
to the RFC2046 standard. http://sunsite.auc.dk/RFC/rfc/rfc2046.html
General www technology
INTERNET & NETWORKING: Standards and Organizations
A beginners guide to URLs
About HTML, SGML, XML, see htmlsgml.doc
EUROsearch is a Language Engineering Programme project
Multilingual European Federated Search Service, see EUROsearch.htm
http://www.w3.org/Metadata/ overviews everything important that's going on
(including RDF, PICS, DSIG, Dublin Core.....)
Some interesting papers on the development of metadata at the WWW-conference in
Brisbane, available at:
http://www7.conf.au/, see in particular:
Another important source by Anders Ardo and Sigfrid Lundberg: "A regional
distributed WWW search and indexing service - the DESIRE way"
DESIRE and metadata
"A review of metadata : a survey of current resource description formats".
(Part 1 of the DESIRE project)
Source : http://www.ukoln.ac.uk/metadata/desire/overview
About WWW Technology (WWW Consortium)
The World Wide Web Consortium Issues HTML 4.0 as a W3C Recommendation
Metadata: Microsoft and others propose XML-Data
Microsoft are trying to get agreement for XML-Data which in some ways is against
the work on RDF and other metadata formats
HTML4 accepted as standard: http://www.w3.org/MarkUp/
XML Information: http://www.w3.org/XML/
The W3C 'standard' on metadata - RDF - is ratified and published. URL is:
Classification schemes and Indexing technologies
Data Mining - An Introduction
See also datamine.doc
"Old" CERIF scheme
Swedish pilot project using CERIF classification and Meta data.
List of 34 main areas in the "old" CERIF Research Classification Scheme.
Second example where the old CERIF is used :
University of Helsinki :
See in particular:
UDC, DDC, LCC
The DESIRE Report compared UDC, DDC, LCC, national general schemes and
international subject specific schemes.
About the differences between UDC and DDC ...
"European nomenclature for research and development activities (NABS)",
You can have a look to the NABS codes at the URL :
See also NABS.DOC extracted for the CERIF-revision experts from the above
mentioned URL (EN and FR).
International Patent Classification (IPC)
All information about the International Patent Classification can be found at the
following URL :
For details on the structure of the classification :
SIGLE is the System for Information on Grey Literature in Europe.
- SIGLE Classification : see SIGLE1.DOC (56 pages)
- Short decription of SIGLE: see SIGLE2.DOC)
About Ortelius : ORTELIUS.DOC
Alphabetical list of terma : ORTELIUSA.DOC
Hierarchical list of terms : ORTELIUSH.DOC
List of "top level" : ORTELIUSTOP.DOC
For the CPA we do not yet have an electronic version in English.
A French version can be consulted at the Internet :
The Grants Keyword Thesaurus™
Candidate thesaurus for building and maintaining a strong R&D - enhanced Ortelius ?
Thesaurus in use by the Community of Science
Similarly, check http://expertise.cos.com/cgi-bin/genkeys
for the. You will note that the two lists are very similar.
Thesaurus used by InfoEd International
URLs related to B. Lutes' presentation on indexing activities in the ProCORDIS
Links to the various chapters of the ProCORDIS final report:
Improved Retrieval Engine:
Multilingual Classification Methods:
Thesaurus-aided Free-text Search / Query Translation:
DESIRE and Classification
DESIRE PROJECT - part 3: "The role of classification schemes in Internet resource
description and discovery"
Source : http://www.ukoln.ac.uk/metadata/desire
DESIRE project: Comparison of classification schemes. Conclusions:
Executive summary of the DESIRE study:
ISCED 97 : Classification scheme for education, suitable for R&D
ISCED is the International Standard Classification of Education (UNESCO), which is
also recommended in the "Frascati manual" for R&D classification (OECD).
The full documents contains about 50 pages. The attached extract (6 pages) of the
ISCED covers the ISDEC classification scheme that we might consider to compare
with the other existing schemes for the research area codes.
From the Office for Official Publications of the European Communities.
The Eurovoc thesaurus is a multilanguage and multidisciplinary thesaurus of the
European Parliament and the Office for Official Publications of the EC, which is
available in nine languages of the European Union countries and used for indexing in
the EPOQUE database (all external and internal documentation of the European
Parliament); in the Celex database (the complete legislation of the European Union -
as from next year records are to be provided with subject descriptions); in the
CATEL system (enquiry and ordering system for EU publications); in the Library of
the European Parliament; in numerous national parliamentary and other libraries.
Third edition FX-07-92-000-EN-C ECU 120
Volume 1 - Permuted alphabetical version (1995 438 pp. FX-07-92-001-EN-C
Volume 2 - Subject Oriented Version (1995 232 pp. FX-07-92-002-EN-C ECU
Volume 3 - Multilingual version (1995 245 pp. FX-07-92-003-EN-C ECU 38.00)
CD-ROM ! not on-line
Can be ordered : Mr. BRACK Tel. 301.42567
Multilingual; Language: English, Danish, German, Greek, Spanish, French, Italian,
EUROPEAN COMMISSION TRANSLATION SERVICE - Directorate for General
and Language Matters
Compendium of classifications/thesauri
Link to Barbara Lutes' collection of online classifications and thesauri in many
domains, some of which may be of interest as potential candidates for CERIF:
Grants Keywords, OECD Macrothesaurus, CPV
In the ProCORDIS final report there is a chapter comparing various candidate
indexing vocabularies, including Grants Keyword Thesaurus (used by US Gov't).
Agencies for indexing research grants), the OECD Macrothesaurus, and the Common
Procurement Vocabulary of the EU. There is also some discussion and short
descriptions of the UDC and various other candidates
Here you will find the ProCORDIS implementation of the OECD Macrothesaurus,
which, in my opinion, is a strong candidate for recommended use in CERIF, if it can
be complemented by thesauri/classifications covering the domains which it doesn't
adequately contain. The OECD Macrothesaurus is available in English, French,
German, Spanish, covering social and economic sciences, humanities, agriculture,
medicine; weak on natural sciences and technology. It is both thesaurus and
LCC: Library of Congress Classification
Description and evaluation as reported from the DESIRE Project :