Workshops and Tutorials
Contents
- Workshops
- Innovation Measurement for Scientific Communication (IMSC) in the Era of Big Data
- Joint Workshop of the 4th Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE2023) and the 3rd AI + Informetrics (AII2023)
- 2nd International Workshop on Digital Language Archives
- Persistent Identifiers & US Federal Agency Policies
- FAIR Data for Large Research Facilities
- Web Archiving and Digital Libraries (WADL) 2023
- Tutorials
Workshops
Innovation Measurement for Scientific Communication (IMSC) in the Era of Big Data

Zhongyi Wang
Central China Normal University

Haihua Chen
University of North Texas

Chengzhi Zhang
Nanjing University of Science and Technology

Wei Lu
Wuhan University

Jian Wu
Old Dominion University
Duration: Half-Day
Website: https://jcdl2023workshop-imsc.github.io/IMSC2023-workshop/
Submission page: https://easychair.org/conferences/?conf=imsc20230
Delivery method: Hybrid
Abstract: The increasingly mature artificial intelligence technologies, such as big data, deep learning, and natural language processing, provide technical support for research on automatic text understanding and bring development opportunities for innovative measurement of scientific communication. Innovation measurement in scientific communication is a challenging and cutting-edge direction in Informetrics. It is interdisciplinary, requiring considering the characteristics of different disciplines and different types of scientific outcomes to establish a comprehensive evaluation metrics system. On the other hand, metadata and content features should be considered to reflect the innovation of scientific works objectively and comprehensively. This workshop focuses on the discussion and produces enlightening outcomes. We will engage broad audiences to share their ideas and pre-productions, enabling an interdisciplinary approach to exploring frontier areas. This workshop consists of keynotes, oral presentations, and poster sessions and would attract interest from academic researchers, librarians, and decision-makers from governments and practical sectors.
Joint Workshop of the 4th Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE2023) and the 3rd AI + Informetrics (AII2023)

Chengzhi Zhang
Nanjing University of Science and Technology

Yi Zhang
University of Technology Sydney

Philipp Mayr
GESIS - Leibniz-Institute for the Social Sciences

Wei Lu
Wuhan University

Arho Suominen
Tampere University

Haihua Chen
University of North Texas

Ying Ding
University of Texas at Austin
Duration: Full Day
Website: https://eeke-workshop.github.io/2023/
Submission page: https://easychair.org/cfp/EEKE-AII2023
Delivery method: Hybrid
Abstract: The 4th Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE 2023) and the 3rd AI + Informetrics (AII2023) will be held in New Mexico, USA and online at the ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2023. The goal of these two workshop series (https://eeke-workshop.github.io/ and https://ai-informetrics.github.io/) are to engage the related communities in open problems in the extraction and evaluation of knowledge entities from scientific documents and the modeling and applications of AI + Informetrics for broad interests in science of science, science, technology, & innovation (ST&I), etc. This joint workshop consists of keynotes, oral presentations, and posters, and will attract not only academic researchers and librarians but also decision makers from governments and practical sectors.
2nd International Workshop on Digital Language Archives

Oksana L. Zavalina
University of North Texas

Shobhana L. Chelliah
Indiana University Bloomington
Duration: Full Day
Website: https://jcdl2023dla.ci.unt.edu
Submission page: https://easychair.org/cfp/LangArc2023
Delivery method: Online Only
Abstract: 2nd International Workshop on Digital Language Archives (LangArc 2023) virtual workshop on digital language archives – digital libraries that preserve, curate, and provide online access to language data – continues (after the initial LangArc 2021 workshop) addressing the growing need. LangArc 2023 will explore a broad scope of issues related to digital language archives. This includes challenges and opportunities, strategies and solutions for: facilitating depositing and improving access; information organization, architecture, and retrieval; quality assurance; usability; ethical issues; ways of encouraging reuse of deposited data in research and education; and coursework and other training for information professionals that will develop and maintain digital language archives. This workshop is expected to support interdisciplinary collaboration among information professionals, linguists, educators, representatives of language communities (including indigenous and other underrepresented), students, and other interested audiences.
Persistent Identifiers & US Federal Agency Policies

Brian Minihan
ORCID

Sara Bowman
Crossref

Carly Robinson
Department of Energy, Office of Science

Amanda French
Research Organization Registry (ROR)
Duration: Half-Day
Delivery method: Hybrid
Abstract: In recent years the US Government has issued two policy guidance memos that will likely have a large impact on academic research. This workshop seeks to walk through the 2021 National Security Presidential Memo 33 (NSPM-33) ,the 2022 OSTP Public Access Memo and share how use of persistent identifiers (PIDs) are key to ensuring compliance. We’ll share PID aspects of the implementation guidance provided for NSPM-33, timelines and next steps for the Public Access Memo, and implementation options for both memos. Moving to the technical side, we’ll discuss how the various persistent identifiers function in ensuring trustworthy metadata by from other systems via the API, specifically affiliation metadata (ROR), publication and grant metadata (Crossref) and author and contributing author metadata (ORCID). Finally, we’ll ensure time for interaction and discussion to ensure that research institutions, research administrators, repository managers, are familiar with how their landscape as well as that of funders and publishers will change and how to ensure their community is well-prepared.
FAIR Data for Large Research Facilities

David Butcher
National High Magnetic Field Laboratory

Don Brower
University of Notre Dame
Duration: Half-Day
Delivery method: In-Person
Abstract: Major Research Facilities and large research institutions have complex cyberinfrastructure and workflow needs. There are increasing calls for data openness and sharing, such as with the Nelson memo from last fall, and this is balanced by the complexity to store and manage the this amount of research data. This workshop will facilitate discussion on the problems faced in the research data management of these organizations, and approaches to dealing with these issues through processes, technical tools, or policy.
Web Archiving and Digital Libraries (WADL) 2023

Mat Kelly
Drexel University

Brenda Reyes Ayala
University of Alberta

Zhiwu Xie
Virginia Tech Libraries

Edward Fox
Virginia Tech
Duration: Full Day
Website: https://fox.cs.vt.edu/wadl2023.html
Delivery method: Hybrid
Abstract: The 2023 edition of the Workshop on Web Archiving and Digital Libraries (WADL) will explore the integration of web archiving and digital libraries. The workshop aims at addressing aspects covering the entire life cycle of digital resources, including such as social media, scholarly and government datasets, 3D objects, and digital online art.creation/authoring, uploading/publishing, crawling, indexing, exploration, and archiving. It will also explore areas such as archiving processes and tools for "non-traditional" resources such as social media, scholarly and government datasets, 3D objects, and digital online art.
Tutorials
The Solid Protocol and Its Potential for Open Science

Patrick Hochstenbach
Ghent University Library IT
& IMEC-IDLab
Duration: Half-Day
Delivery method: In-Person, Hybrid Possible
Abstract: This tutorial provides a high-level overview of the rationale of Solid, an introduction to the protocol suite, and the data description approach using linked data. The participants will get a general introduction to the need for decentralization and the rationale of Solid. They will get a hands-on introduction to the capabilities of the Solid protocol. They will learn about the differences with other Linked Data Platform solutions such as Fedora Commons. Demonstrations will be provided on how to protect online resources and a discussion about Solid's potential for open science.
Up and running with ARK persistable identifiers

Donny Winston
Polyneme LLC

John Kunze
Duration: Half-Day
Website: https://arks.org/about/ark-training/
Delivery method: Hybrid
Abstract: In this 3-hour workshop we will introduce you to ARKs (Archival Resource Keys), which can serve as persistent identifiers, or stable, trusted references for information objects (eg, web addresses that don’t return 404 Page Not Found errors). For more than two decades, 8.2 billion ARKs have been created by over 1000 organizations — libraries, data centers, archives, museums, publishers, government agencies, and vendors. We will cover:
- Why ARKs — non-paywalled, decentralized, flexible
- Use cases — Smithsonian, French National Library, Internet Archive Metadata for early and ongoing object development
- How to get started — one form to fill out
- Minting and assigning ARK identifiers
- Resolvers, resolution, redirection
- Object types — digital, physical, conceptual
- Persistence considerations
- Available tools
From Digital Records to Digital Cultural Landscapes: Beyond Digital Library boundaries with DSpace

Andrea Bollini
4Science

Claudio Cortese
4Science

Immacolata Scancarello
4Science

Federico Verlicchi
4Science
Duration: Half-Day
Delivery method: Hybrid
Abstract: In a Digital Library environment, we can define Digital cultural landscapes as “virtual ecosystems” in which digital cultural heritage subsets are related with entities such as people, places, events, fonds, etc., according to different visions and interpretations, in a pluralism generating new knowledge and opening up new perspectives. These virtual ecosystems today can be easily structured by cultural institutions, using a popular application such as DSpace, the world's most widely used open-source Digital Asset Management System.
The proposed 3 hours online tutorial will show how to manage digital cultural contents in DSpace to model digital cultural landscapes either as Networks, as Paths, or as Aggregations.
However, even if the illustrated workflow, the examples and the use cases will be related to DSpace, the proposed methodological approach can be implemented in any system that has sufficiently granular and consistent metadata and an extensible data model.