Archival description and linked data: a preliminary study of opportunities and implementation challenges
详细信息    查看全文
  • 作者:Karen F. Gracy
  • 关键词:Archival description and access ; Linked data ; Semantic interoperability ; Encoded Archival Description (EAD) ; Machine ; readable cataloging (MARC)
  • 刊名:Archival Science
  • 出版年:2015
  • 出版时间:September 2015
  • 年:2015
  • 卷:15
  • 期:3
  • 页码:239-294
  • 全文大小:2,416 KB
  • 参考文献:Bermes E (2011) Convergence and interoperability: a Linked Data perspective. IFLA 2011, 13鈥?8 August, 2011, San Juan, Puerto Rico pp. 1鈥?2 http://鈥媍onference.鈥媔fla.鈥媜rg/鈥媝ast/鈥?011/鈥?49-bermes-en.鈥媝df . Accessed 30 Dec 2013
    Berner RC (1971) Manuscript catalogs and other finding aids: what are their relationships? Amer Arch 34:367鈥?72
    Berners-Lee T, Hendler J, Lassila O (2001) The Semantic Web. Sci Am (May):29鈥?7
    Blanke T, Bryant M, Speck R, Kristel C (2012) Information extraction on noisy texts for historical research. Digital Humanities 2012, 16鈥?0 July 2012, Hamburg, Germany. http://鈥媤ww.鈥媎h2012.鈥媢ni-hamburg.鈥媎e/鈥媍onference/鈥媝rogramme/鈥媋bstracts/鈥媔nformation-extraction-on-noisy-texts-for-historical-research/鈥?/span> . Accessed 30 Dec 2013
    Brickley D, Miller L (2010) FOAF vocabulary specification 0.98. http://鈥媥mlns.鈥媍om/鈥媐oaf/鈥媠pec/鈥?/span> . Accessed 30 Dec 2013
    Catone J (2008) Australian museum uses Open Calais to tag collection. http://鈥媟eadwrite.鈥媍om/鈥?008/鈥?4/鈥?1/鈥媋ustralian_鈥媘useum_鈥媢ses_鈥媜pen_鈥媍alais . Accessed 30 Dec 2013
    Civil War Data 150 (2010) Civil War Data 150: linking Civil War data across state and federal archives and libraries. http://鈥媤ww.鈥媍ivilwardata150.鈥媙et . Accessed 30 Dec 2013
    Clough P, Tang J, Hall M, Warner A (2011) Linking archival data to location: a case study at the UK National Archives. Aslib Proc 63(2鈥?):127鈥?47
    Coats L (2004) Users of EAD finding aids: who are they and are they satisfied? J Arch Organ 2(3):25鈥?9View Article
    Cox RJ (2007) Revisiting the archival finding aid. J Arch Organ 5(4):5鈥?2View Article
    Cox E, Czechowski L (2007) Subject access points in the MARC record and archival finding aid: enough or too many? J Arch Organ 5(4):51鈥?9View Article
    Coyle K (2012) Linked data tools: connecting on the Web. ALA Lib Tech Reports 48(4):10鈥?4
    Cyganiak R (2011) The linking open data cloud. http://鈥媗od-cloud.鈥媙et . Accessed 30 Dec 2013
    Davies T (2011) Elements of a linked open data stack. http://鈥媤ww.鈥媡imdavies.鈥媜rg.鈥媢k/鈥媤p-content/鈥媢ploads/鈥婭KMLI-LOD-Stack-Draft-Diagram.鈥媝ng . Accessed 30 Dec 2013
    Digital Collections and Archives, Tufts University (2013) LiAM: Linked Archival Metadata. http://鈥媠ites.鈥媡ufts.鈥媏du/鈥媗iam/鈥媎eliverables/鈥媝rospectus-for-linked-archival-metadata-a-guidebook/鈥?/span> . Accessed 30 Dec 2013
    Dooley J (1992) Subject indexing in context. Am Arch 55:344鈥?54
    Dooley J, Luce K (2010) Taking our pulse: the OCLC research survey of special collections and archives. OCLC Research, Dublin
    Dooley J, Beckett R, Cullingford A, Sambrook K, Sheppard C, Worrall S (2013) Survey of special collections and archives in the United Kingdom and Ireland. OCLC Research and RLUK, Dublin
    Duff W (2001) Evaluating metadata on a metalevel. Arch Sci 1:285鈥?94View Article
    Duff W, Johnson C (2003) Where is the list with all the names? Information-seeking behavior of genealogists. Am Arch 66:79鈥?5
    Duff W, Stoyanova P (1998) Transforming the crazy quilt: archival displays from a user鈥檚 point of view. Archivaria 45:44鈥?9
    Eidson MY (2002) Describing anything that walks: the problem behind the problem with EAD. J Arch Organ 1(4):5鈥?8View Article
    Erp M, Oomen J, Segers R et al. (2011) Automatic heritage metadata enrichment with historic events. Museums and the Web 2011, 6鈥? April 2011, Philadelphia, PA. http://鈥媤ww.鈥媘useumsandtheweb鈥?鈥媍om/鈥媘w2011/鈥媝apers/鈥媋utomatic_鈥媓eritage_鈥媘etadata_鈥媏nrichment_鈥媤ith_鈥媓i . Accessed 30 Dec 2013
    Feeney K (1999) Retrieval of archival finding aids using world-wide-web search engines. Am Arch 62:206鈥?28
    Fons T, Penka J, Wallis R (2012) OCLC鈥檚 Linked Data initiative: using Schema.org to make library data relevant on the web. Inf Stand Q 24(2鈥?):29鈥?3
    Gabriel C (2002) Subject access to archives and manuscript collections: an historical overview. J Arch Organ 1(4):53鈥?3View Article
    Hamburger S (2004) How researchers search for manuscript and archival collections. J Arch Organ 2(1鈥?):79鈥?02View Article
    Heath T, Bizer C (2011) Linked data: evolving the Web into a global data space. Morgan & Claypool, San Rafael
    Hienert D, Luciano F (2012) Extraction of historical events from Wikipedia. Proceedings of knowledge discovery and data mining meets linked open data (Know@LOD) workshop at ESWC 2012, 27-31 May 2012, Heraklion, Crete. http://鈥媋rxiv.鈥媜rg/鈥媝df/鈥?205.鈥?138.鈥媝df . Accessed 30 Dec 2013
    Hyv枚nen E, Lindquist T, T枚rnroos J, M盲kel盲 E (2012) History on the semantic web as linked data: an event gazetteer and timeline for the World War I. Proceedings of CIDOC 2012, Enriching Cultural Heritage, 10鈥?4 June 2012, Helsinki, Finland. http://鈥媤ww.鈥媍idoc2012.鈥媐i/鈥媏n/鈥婩ile/鈥?609/鈥媓yvonen.鈥媝df . Accessed 30 Dec 2013
    Isaac A, Clayphan R, Haslhofer B (2012) Europeana: moving to linked open data. Inf Stand Q 24(2鈥?):34鈥?0View Article
    Larson R, Janakiraman K (2011) Connecting archival collections: the social networks and archival context project. In Gradmann S, et al. (eds) Proceedings of the International Conference on Theory and Practice of Digital Libraries (TPDL 2011), 26鈥?8 September 2011, Berlin, Germany. Lecture Notes in Computer Science 6966. Springer, Berlin, pp. 3鈥?4
    Library of Congress (2006) Encoded Archival Description tag library, version 2002, appendix A: EAD crosswalks. http://鈥媤ww.鈥媗oc.鈥媑ov/鈥媏ad/鈥媡glib/鈥媋ppendix_鈥媋.鈥媓tml . Accessed 30 Dec 2013
    Light M, Hyry T (2002) Colophons and annotations: new directions for the finding aid. Am Arch 65:216鈥?30
    Lynch C (2002) Digital collections, digital libraries, and the digitization of cultural heritage information. First Monday 7(5). http://鈥媤ww.鈥媐irstmonday.鈥媜rg/鈥媜js/鈥媔ndex.鈥媝hp/鈥媐m/鈥媋rticle/鈥媣iew/鈥?49/鈥?70 . Accessed 30 Dec 2013
    Lytle R (1980) Intellectual access to archives: I. Provenance and content indexing methods of subject retrieval. Am Arch 43 (Winter 1980): 64鈥?5
    MacNeil H (2012) What finding aids do: archival description as rhetorical genre in traditional and web-based environments? Arch Sci 12:485鈥?00. doi:10.鈥?007/鈥媠10502-012-9175-4 View Article
    Mascaro M (2011) Controlled access headings in EAD finding aids: current practices in number of and types of headings assigned. J Arch Organ 9:208鈥?25. doi:10.鈥?080/鈥?5332748.鈥?011.鈥?43690 View Article
    Mazzini S, Ricci F (2011) EAC-CPF Ontology and linked archival data. Proceedings of the 1st international workshop on semantic digital archives, 29 Sept 2011, Berlin, Germany. CEUR Workshop Proceedings, vol. 801, pp. 72鈥?1. http://鈥媍eur-ws.鈥媜rg/鈥媀ol-801/鈥媝aper6.鈥媝df . Accessed 30 Dec 2013
    Michelson A (1987) Description and reference in the age of automation. Am Arch 50:192鈥?08
    Nesmith T (2005) Reopening archives: bringing new contextualities into archival theory and practice. Archivaria 60:259鈥?74
    Nimer C (2011). Applying inheritance: single-level displays and repurposeable metadata. Society of American Archivists, Chicago. http://鈥媤ww2.鈥媋rchivists.鈥媜rg/鈥媠ites/鈥媋ll/鈥媐iles/鈥婥NFinal.鈥媝df . Accessed 30 Dec 2013
    OpenCalais (2013). How does Calais work? http://鈥媤ww.鈥媜pencalais.鈥媍om/鈥媋bout . Accessed 30 Dec 2013
    Pattuelli MC (2012) Personal name vocabularies as Linked Open Data: a case study of jazz artist names. J Inf Sci 38(6):558鈥?65View Article
    Perkins J, Yoose B (2011) Case study: mining oral history for enhanced access. Poster presentation, Society of American Archivists annual conference, 22鈥?7 August 2011, Chicago, IL. http://鈥媜hda.鈥媘atrix.鈥媘su.鈥媏du/鈥?012/鈥?6/鈥媘ining-oral-history-for-enhanced-access/鈥?/span> . Accessed 30 Dec 2013
    Prom C (2004) User interactions with electronic finding aids in a controlled setting. Am Arch 67:234鈥?68
    Pugh MJ (1982) The illusion of omniscience: subject access and the reference archivist. Am Arch 45(1):33鈥?4
    Raimond Y, Abdallah S (2007) The Event Ontology. http://鈥媘otools.鈥媠ourceforge.鈥媙et/鈥媏vent/鈥媏vent.鈥媓tml . Accessed 30 Dec 2013
    Redding C (2002) Reengineering finding aids revisited: current archival descriptive practice and its effect on EAD implementation. J Arch Organ 1(3):35鈥?9View Article
    Rizzo G, Troncy R (2011) NERD: evaluating named entity recognition tools in the web of data. Workshop on Web Scale Knowledge Extraction, ISWC 2011, 23鈥?7 October 2011, Bonn, Germany. http://鈥媝orto.鈥媝olito.鈥媔t/鈥?440793/鈥?/鈥媤ekex2011_鈥媠ubmission_鈥?.鈥媝df . Accessed 30 Dec 2013
    Ruddock B, Stevenson J (2011) Creating linked open data for library and archive descriptions. Multimed Inf Technol 37(4):19鈥?0
    Schaffner J (2009) The metadata is the interface: better description for better discovery of archives and special collections, synthesized from user studies. OCLC Research, Dublin
    Scheir W (2006) First entry: report on a qualitative exploratory study of novice user experience with online finding aids. J Arch Org 3(4):49鈥?5
    Schema.org (2011) FAQ [Frequently Asked Questions]. http://鈥媠chema.鈥媜rg/鈥媎ocs/鈥媐aq.鈥媓tml . Accessed 30 Dec 2013
    Shaw E (2001) Rethinking EAD: balancing flexibility and interoperability. New Rev Info Netw 7:117鈥?32View Article
    Shaw R (2010) LODE: An ontology for Linking Open Descriptions of Events. http://鈥媗inkedevents.鈥媜rg/鈥媜ntology/鈥?/span> . Accessed 30 Dec 2013
    Singhal A (2012) Introducing the Knowledge Graph: things not strings. http://鈥媑oogleblog.鈥媌logspot.鈥媍om/鈥?012/鈥?5/鈥媔ntroducing-knowledge-graph-things-not.鈥媓tml . Accessed 30 Dec 2013
    Society of American Archivists, Technical Subcommittee on Encoded Archival Description (2011) EAD: Technical considerations. http://鈥媤ww2.鈥媋rchivists.鈥媜rg/鈥媠ites/鈥媋ll/鈥媐iles/鈥婨ADRevisionTechn鈥媔calConsideratio鈥媙s_鈥?.鈥媝df . Accessed 30 Dec 2013
    Spindler R, Pearce-Moses R (1993) Does AMC mean archives made complicated? Am Arch 56:330鈥?41
    Stevenson J (2012) Linking data: linking lives: the creation and display of Linked Open Data for archives. International Council on Archives, 20鈥?4 August 2012, Brisbane, Australia. http://鈥媔ca2012.鈥媔ca.鈥媜rg/鈥媐iles/鈥媝df/鈥婩ull%20鈥媝apers%20鈥媢pload/鈥媔ca12Final00029.鈥媝df . Accessed 30 Dec 2013
    Tibbo H (2003) Primarily history: historians and the search for primary source materials. Am Arch 66:9鈥?0
    Trace CB, Dillon A (2012) The evolution of the finding aid in the United States: from physical to digital document genre. Arch Sci 12:501鈥?19View Article
    University of California, Berkeley. Library (1994) Berkeley Finding Aid Project. https://鈥媤eb.鈥媋rchive.鈥媜rg/鈥媤eb/鈥?0130427232556 . Accessed 30 Dec 2013 http://鈥媠unsite.鈥媌erkeley.鈥媏du/鈥婩indingAids/鈥婨AD/鈥媌fap.鈥媓tml . Accessed 30 Dec 2013
    Vatant B (2012) GeoNames Ontology. http://鈥媤ww.鈥媑eonames.鈥媜rg/鈥媜ntology/鈥媎ocumentation.鈥媓tml . Accessed 30 Dec 2013
    Yakel E (2003) Archival representation. Arch Sci 3:1鈥?5View Article
    Yakel E (2004) Encoded Archival Description: are finding aids boundary spanners or barriers for users? J Arch Organ 2(1鈥?):63鈥?7View Article
  • 作者单位:Karen F. Gracy (1)

    1. School of Library and Information Science, Kent State University, P.O. Box 5190, Kent, OH, 44242, USA
  • 刊物类别:Humanities, Social Sciences and Law
  • 刊物主题:Humanities / Arts
    Cultural Heritage
    History
    Library Science
    Information Storage and Retrieval
    Organization and Planning
  • 出版者:Springer Netherlands
  • ISSN:1573-7519
文摘
This paper presents the results of a study to investigate how archives can connect their collections to related data sources through the use of Semantic Web technologies, specifically Linked Data. Questions explored included (a) What types of data currently available in archival surrogates such as Encoded Archival Description (EAD) finding aids and Machine-Readable Cataloging (MARC) records may be useful if converted to Linked Data? (b) For those potentially useful data points identified in archival surrogates, how might one align data structures found in those surrogates to the data structures of other relevant internal or external information sources? (c) What features of current standards and data structures present impediments or challenges that must be overcome in order to achieve interoperability among disparate data sources? To answer these questions, the researcher identified metadata elements of potential use as Linked Data in archival surrogates, as well as metadata element sets and vocabularies of data sets that could serve as pathways to relevant external data sources. Data sets chosen for the study included DBpedia and schema.org; metadata element sets examined included Friend of a Friend (FOAF), GeoNames, and Linking Open Description of Events (LODE). The researcher then aligned tags found in the EAD encoding standard to related classes and properties found in these Linked Data sources and metadata element sets. To investigate the third question about impediments to incorporating Linked Data in archival descriptions, the researcher analyzed the locations and frequencies at which controlled and uncontrolled access points (personal and family name, corporate name, geographic name, and genre/form entities) appeared in a sample of MARC and EAD archival descriptive records by using a combination of hand counts and the natural language processing (NLP) tool, OpenCalais. The results of the location and frequency analysis, combined with the results of the alignment process, helped the researcher identify several critical challenges currently impeding interoperability among archival information systems and relevant Linked Data sources, including differences in granularity between archival and other data source vocabularies, and inadequacies of current encoding standards to support semantic tagging of potential access points embedded in free text areas of archival surrogates.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700