"Study report on SMM process"
ISO/IEC JTC1/SC32 WG2 Interim Meeting Seoul, Korea Study report on SMM process 2007. 12. 6 Tae-Hoon Lim and Tae-Sul Seo email@example.com firstname.lastname@example.org Background According to the resolution of SC32 New York meeting (SC32N1604a), the study on Semantic Harmonization of Metadata was performed. Reference: SC32N1658 2007-12-06 SC32 WG2 Interim Meeting, 2 Seoul, Korea Summary Title was changed. The procedures were modified. Name of each step was changed The 2nd and 3rd steps can be replaced by each other. A description system for mapping was established. 2007-12-06 SC32 WG2 Interim Meeting, 3 Seoul, Korea Title change From “semantic harmonization of metadata” To “semantic metadata mapping (SMM) process” The later is more specific expression than the former. 2007-12-06 SC32 WG2 Interim Meeting, 4 Seoul, Korea Procedure modification 1st Surveying metadata sets 1st Collecting metadata schema 2nd Constructing common DECs 2nd Grouping attributes based on 11179 3rd Grouping data elements 3rd Finding common DECs by the DECs 4th Completing crosswalks 4th Mapping into a table 2007-12-06 SC32 WG2 Interim Meeting, 5 Seoul, Korea Semantic Metadata Mapping Process Overall Process 1st Collecting metadata schema 2nd Grouping attributes 3rd Finding common DECs 4th Mapping into a table The 2nd and 3rd steps can be replaced by each other. 2007-12-06 SC32 WG2 Interim Meeting, 6 Seoul, Korea Semantic Metadata Mapping Process 1st. Collecting metadata schema Survey and identify candidate metadata schema in a domain. Surveying form includes: Domain name, Service DB name, or an other equivalent name. Number of fields Sample data Value domains 2007-12-06 SC32 WG2 Interim Meeting, 7 Seoul, Korea Semantic Metadata Mapping Process 2nd. Grouping attributes Selecting a metadata set as a primary metadata set. The simplest or the highest level metadata set is desirable to be the primary one. For all available metadata schema, attributes should be aggregated by the attributes of the primary metadata set. There may exist attributes which aren’t fitted to any of them. Some attributes, which are not important, may be removed. The remaining are grouped separately. Metadata experts should perform the work along with domain experts. 2007-12-06 SC32 WG2 Interim Meeting, 8 Seoul, Korea Semantic Metadata Mapping Process 3rd. Finding common DECs Analyzing each attribute of the primary metadata set and find out an object class and a property hidden in and related to the attribute. Constructing common DECs based on ISO/IEC 11179 standard using the object classes and the properties. If there exists an attribute which isn’t fitted to any of the DECs, a new DEC may be constructed for them. 2007-12-06 SC32 WG2 Interim Meeting, 9 Seoul, Korea Semantic Metadata Mapping Process 4th. Mapping into a table Finally, arranging all attributes into a table by the common DECs. Comments on the types of mapping can be included in the table as bellow. Same, no difference: no description Level difference: upper/lower terms Domain difference: generic/specific (book, technical report, article, …) Term difference: synonym, antonym or preferred term Naming rule difference: Order or representation rules A recommended set of metadata can be provided for guiding future standardization. 2007-12-06 SC32 WG2 Interim Meeting, 10 Seoul, Korea Application to e-Book Domain: e-Book (1st) Available metadata sets: OpenEBPS, MODS and TEI primary metadata set: OpenEBPS OpenEBPS MODS TEI header Domain name Description of Description of Library Encoding methods Electronic Book resources for machine-readable texts Number of fields 15 About 60 (top level: 20) Over 20 Sample data yes no yes 2007-12-06 SC32 WG2 Interim Meeting, 11 Seoul, Korea Application to e-Book (2nd) Grouping attributes OpenEBPS MODS TEI title titleInfor:title fileDesc:titleStmt:title titleInfor:subTitle fileDesc:seriesStmt:title titleInfor:partNumber fileDesc:seriesStmt:idno titleInfor:partName titleInfor:nonSort creator(role) name:role creator(file-as) name:namePart fileDesc:titleStmt:author name:displayForm name:affiliation name:discription subject subject:topic profileDesc:textClass:keyword classification profileDesc:textClass:classCode subject:catographics profileDesc:textClass:catRef subject:occupation 2007-12-06 SC32 WG2 Interim Meeting, 12 Seoul, Korea Application to e-Book (3rd) Constructing common DECs based on 11179: Object class: e-Book Properties: title, author, subject, abstract, publisher, distributor, authority, contributor, publication-date, genre, format, extent, identifier, language, coverage-geographic, coverage-temporal, right, location, edition DECs: ebookTitle, ebookAuthor, ebookSubject, ebookAbstract, ebookPublisher, ebookDistributor, ebookAuthority, ebookContributor, ebookPublication-date, ebookGenre, ebookFormat, ebookExtent, ebookIdentifier, ebookLanguage, ebookCoverage-geographic, ebookCoverage-temporal, ebookRight, ebookLocation, ebookEdition 2007-12-06 SC32 WG2 Interim Meeting, 13 Seoul, Korea Application to e-Book (4th) Mapping into a table DEC OpenEBPS MODS TEI Recommaned DE ebookTitle title titleInfo:title titleStmt:title ebookTitle titleInfo:subTitle seriesStmt:title T:pre ebookSubtitle ebookAuthor creator(role) name:role creator(file-as) T:pre name:namePart D:gen titleStmt:author N:rep ebookAuthorName ebookSubject subject N:rep subject:topic T:pre textClass:keyword N:rep ebookSubjectWord classification N:rep textClass:classCode N:rep ebookSubject-classCode textClass:catRef T:pre L - up: upper term/lo: lower term D - generic: gen/… T - syn: synonym/ant: antonym/pre: preferred term N - ord: order/rep: representation 2007-12-06 SC32 WG2 Interim Meeting, 14 Seoul, Korea Future plan The SMM process will be elaborated more in order to be proposed as a new work item in ISO/IEC JTC1/SC32 next year. 2007-12-06 SC32 WG2 Interim Meeting, 15 Seoul, Korea Thank you! 2007-12-06 SC32 WG2 Interim Meeting, 16 Seoul, Korea