Digital Library in Mathematics First Problems Related to Contents and Access Bernd Wegner Institute of Mathematics, TU Berlin DLM-Workshop, NSF, Washington, July 2002 Part 1 Problems and questions related to contents General Question 1 Do we really want to cover all mathematical publications world-wide by the DLM? If not so, determine what is “important, useful, good“ or whatever. Probably the items 1 and 5 in the subsequent list are the easiest to be handled systematically. DLM - document types What are the documents to be covered by the DLM? 1. Articles in refereed journals (in the narrower sense) 2. Articles in Newsletters and other non-refereed journals 3. Articles in series of collections of publications 4. Articles in conference proceedings and other non-periodic collections of publications 5. Series of advanced level monographs DLM - document types, ctd. 6. Single advanced level monographs 7. Series of textbooks 8. Single textbooks 9. Collected works, handbooks, encyclopaedias, bibliographies and similar publications 10. Publications related to education in mathematics (including curricula), popularisation 11. Dissertations 12. Reports, preprints and other grey literature What else? Legacies, Letters General Question 2 How can we determine all documents belonging to one of these items? Presumably all the methods listed subsequently will have to be applied to be really comprehensive. DLM - determining the content 1. Relying on the archival information provided by publishing houses and academic publishers 2. Consulting catalogues provided by national reference libraries 3. Consulting catalogues provided by comprehensive reference libraries for mathematics DLM - determining the content -ctd. 4. Consulting the databases and printed versions of comprehensive reviewing services like Jahrbuch über die Fortschritte der Mathematik, Mathematical Reviews, Referativnyi Zhurnal Matematika, and Zentralblatt MATH 5. Checking special bibliographies and historical surveys in mathematics for older material which is not covered by these reviewing services What else? General Question 3 Which publications may be considered as a part of mathematics according to subject area? DLM - subject domain 1. Publications and serials dealing with pure and applied mathematics. 2. Serials with mixed content, but containing a relevant amount of mathematical publications 3. Serials publishing articles on the borderline of mathematics with applications to other sciences DLM - subject domain -ctd. 4. Statistics 5. Logics 6. Theoretical computer science 7. Theoretical physics 8. Theoretical mechanics 8 + n A lot of other areas in applications More? General Question 4 Who should take part in arranging a list of contents and who should administrate such a list? Should we not care about such a list at all and digitise what will be just in our mind or easily available? DLM dimensions - T T - Time: When should it start and how far back should should it go? Current digitally produced publications Moving walls for non-profit distribution Classical material (after Gutenberg) Ancient material (Chinese, Arabic, Indian, Maya, Greek, Egypt etc.) DLM dimensions - M M - Side to side: Where is the borderline between mathematics and ”non-mathematics”? How much mathematics is supposed to be in the document suitable for DLM? Decide on the formal handling of journals with mixed content according to scope. It will be impossible to make a distinction in most documents before 1800. DLM dimensions - L L - Top to bottom: Different kinds of levels Impact on research Potential user interest in having the document available, depending also on different user communities (research, education, applications, history etc.) Quality Availability DLM dimensions - G G - Back to front: Initial activities are and will be mainly located in Europe and North-America In several cases material outside of this part of the world is difficult to access and not well-represented by information sources. How should the DLM project spread out from current initial activities covering content from all over the world ? Part 2 Problems and questions related to access Access - assumptions and requirements DLM will have a distributed architecture? Integrated access will be desirable. Provider oriented access will be desirable as well. Common interface will be an essential user requirement In a comprehensive system different business models for different providers will have to be allowed. Access options Integrated access through links from reviewing services like JFM, MR, RZ and ZBL Integrated access through a special DLM-oriented system based on finer meta-data specifications Provider-oriented access according to the special requirements from the provider Integrated access through portals Integrated access within wider bundles of electronic offers (not only mathematics) First questions What are the metadata structures used by the current retrodigitization projects? What are the metadata structures used by the current digital offers? Is there a core group where these structures are more or less compatible already? Are there current providers of a repository of interest to the DLM which do not want to be integrated into the access structure? Do they reveal their metadata structure, and if so, is this compatible with that of the others?