Category: linked data

July 13, 2017

Powerful tool to quickly create items for publications in Wikidata

This is a great example of why I love the WikiCite community. At WikiCite 2017, a group of people decided to write a Zotero translator for the Wikidata community.

Last week I had the opportunity to learn about a data archive at the Institution for Social and Policy Studies (ISPS) at Yale University. The archive is well curated and has rich metadata about the data files it houses, such as supporting data sets and replication materials related to papers and books published by ISPS-affiliated scholars.

This week I read about the Zotero translator and I wanted to try it out. Thank you very much, zotkat! This translator meets the needs of people who want a semi-automated way to quickly create items for publications of many kinds.

If you run this query on the Wikidata Query Service, you can explore the items for these publications and follow the links to the supporting data files stored at ISPS.

March 13, 2017

Windham-Campbell Prizes

The Windham-Campbell Prizes are awarded in Fiction, Nonfiction, Drama and Poetry. This year 8 awards were made.

The 2017 recipients of Windham-Campbell Prizes are Erna Brodber, Andre Alexis, Marina Carr, Carolyn Forche, Ali Cobby Eckermann, Ike Holter, Maya Jasanoff, and Ashleigh Young.

For each recipient who already had an item in Wikidata I added a statement using property P166 “award received” to connect them to the prize.

I was then able to write queries about the set of prize recipients.

Who are all the winners of the Windham-Campbell Prizes in history? Try the query here on the Wikidata Query Service.
How many recipients do we have images for in Wikimedia Commons? Try the query here on the Wikidata Query Service.
Return a list of all winners of the Windham-Campbell Prize along with the geocoordinates of their birthplaces. Try the query here on the Wikidata Query Service.
Return a list of winners with their dates of birth and plot them on a timeline. Try the query here on the Wikidata Query Service.
Winners of the Windham-Campbell Prize listed with all other awards received. Try the query here on the Wikidata Query Service.
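
The first and third questions above can be sketched as one SPARQL query assembled in Python. This is a minimal sketch and not the query linked from this post. The prize QID is a hypothetical placeholder (look up the actual item for the Windham-Campbell Prizes in Wikidata), and the helper name build_recipients_query is invented for illustration. P166 (award received), P19 (place of birth) and P625 (coordinate location) are Wikidata properties.

```python
import re

# Hypothetical placeholder: replace with the real QID of the prize item.
PRIZE_QID = "Q00000000"


def build_recipients_query(prize_qid: str, with_birthplace: bool = False) -> str:
    """Build a SPARQL query listing people who have an
    'award received' (P166) statement pointing at prize_qid."""
    if not re.fullmatch(r"Q\d+", prize_qid):
        raise ValueError(f"not a Wikidata QID: {prize_qid!r}")

    select_vars = ["?person", "?personLabel"]
    patterns = [f"?person wdt:P166 wd:{prize_qid} ."]

    if with_birthplace:
        # Follow place of birth (P19) to its coordinate location (P625).
        select_vars.append("?coords")
        patterns.append("?person wdt:P19 ?birthplace .")
        patterns.append("?birthplace wdt:P625 ?coords .")

    # Ask the query service to fill in English labels for ?personLabel.
    patterns.append(
        'SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }'
    )

    body = "\n  ".join(patterns)
    return f"SELECT {' '.join(select_vars)} WHERE {{\n  {body}\n}}"


print(build_recipients_query(PRIZE_QID, with_birthplace=True))
```

The printed query can be pasted into the Wikidata Query Service, which predefines the wd:, wdt:, wikibase: and bd: prefixes. Recipients without a birthplace or coordinates in Wikidata will not appear when with_birthplace is set, because those patterns are required rather than optional.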

I think this is a helpful example of how library bibliographic metadata could further enhance Wikidata. I would like to be able to see the metadata for each of the works created by these authors, but this data is not yet in Wikidata. Imagine what we could build for library users, or what library users could build themselves, if we could also provide bibliographic metadata from Wikidata!

February 27, 2017

Library of Congress Digital Preservation using Wikidata URIs!

The Library of Congress Digital Preservation team recently updated their inventory of Format Description Documents to include Wikidata URIs. The Library of Congress has detailed descriptions of more than 400 file formats on their website.

Wikidata QID on the TIFF format description document — This is an excerpt from the format description document for TIFF, Revision 6.0 showing the Wikidata ID.

The purposes of these format descriptions are listed on their website:

To support strategic planning regarding digital content formats, in order to ensure the long-term preservation of digital content by the Library of Congress, and
To provide an inventory of information about current and emerging formats, including the identification of tools and detailed documentation that are needed to ensure that the Library of Congress can manage content created or received in these formats through the content life cycle, and
To identify and describe the formats that are promising for long-term sustainability, and develop strategies for sustaining these formats including recommendations pertaining to the tools and documentation needed for their management.
To identify and describe the formats that are not promising for long-term sustainability, and develop strategies for sustaining the content they contain.
The overall analysis is part of the execution of the Library of Congress Digital strategic planning goal pertaining to the management and sustenance of digital content.

I’m looking forward to seeing many additional cultural heritage institutions and organizations using Wikidata URIs in the future.

Wikidata is already serving as a crosswalk between identifiers. Here is a SPARQL query for the Wikidata endpoint showing all of the items in Wikidata for which we have IDs from the Library of Congress, PRONOM, and the Just Solve Wiki.
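
A crosswalk query of this kind can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the query linked above: the three property IDs are my assumption for the Library of Congress, PRONOM, and Just Solve identifiers and should be checked on Wikidata, and the helper name build_crosswalk_query is invented for illustration.

```python
import re
from typing import Mapping

# Assumed property IDs; verify each one on Wikidata before relying on it.
ID_PROPERTIES = {
    "loc": "P3266",        # Library of Congress Format Description Document ID
    "pronom": "P2748",     # PRONOM file format identifier
    "justsolve": "P3381",  # File Format Wiki page ID (Just Solve)
}


def build_crosswalk_query(properties: Mapping[str, str]) -> str:
    """Build a SPARQL query returning items that carry a value
    for every identifier property in the mapping."""
    select_vars = ["?item", "?itemLabel"]
    patterns = []

    for var_name, prop in properties.items():
        if not re.fullmatch(r"[A-Za-z_]\w*", var_name):
            raise ValueError(f"bad variable name: {var_name!r}")
        if not re.fullmatch(r"P\d+", prop):
            raise ValueError(f"not a Wikidata property ID: {prop!r}")
        select_vars.append(f"?{var_name}")
        # One required triple per identifier, so only items
        # holding all of the identifiers are returned.
        patterns.append(f"?item wdt:{prop} ?{var_name} .")

    patterns.append(
        'SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }'
    )

    body = "\n  ".join(patterns)
    return f"SELECT {' '.join(select_vars)} WHERE {{\n  {body}\n}}"


print(build_crosswalk_query(ID_PROPERTIES))
```

Each result row pairs one Wikidata item with its identifier in each of the three systems, which is what makes Wikidata usable as a lookup table between them.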

UPDATE: I updated this post on March 15, 2017 with new links to the Library of Congress websites.

February 10, 2017

#loveyourdata week 2017

Logo for Love Your Data Week

#LYD17

Similar to Open Access Week, the purpose of the Love Your Data (LYD) campaign is to raise awareness and build a community to engage on topics related to research data management, sharing, preservation, reuse, and library-based research data services. We will share practical tips, resources, and stories to help researchers at any stage in their career use good data practices.

I created a series of 5 SPARQL queries that highlight Wikidata and collections at Yale University Library, and that express how I relate to #loveyourWIKIdata.

January 3, 2017

Oral history at the Computer History Museum

The mission statement of the Computer History Museum is “to preserve and present for posterity the artifacts and stories of the Information Age.”

The CHM has conducted hundreds of oral history interviews, transcribed them, and made them available from their website. This set of oral histories is very rich with information and I imagine that many people interested in the history of computing might like to read the transcripts of these oral histories.

I was curious to see what data about the people who have oral histories at the CHM might be in Wikidata. You might recognize this bubble chart from my post on 11/11/2016. Well, there is a new bubble on the chart now!

This bubble chart is a visualization of the number of archival materials held by each institution. The CHM is now represented by the second-largest bubble!

I found many of the people who contributed oral histories to the CHM in Wikidata. For those who already had items in Wikidata, I added a link to the transcript of their oral history. Now we can ask questions about these people as a group.

Using the Wikidata Query Service, I wrote a few SPARQL queries to find out more about these pioneers of computing history.

Map with the birthplaces of those who contributed oral histories to CHM.

The ability to ask questions about this group of people demonstrates the benefits of linked open data. With a few queries, we unearth all of the data that editors have been contributing about these people.

Future work:

Create items for all of the people who have contributed an oral history who are not yet in Wikidata.
Create statements for all of these people to make their items more complete. Sourcing statements to these oral histories themselves will help us enrich the data.
Add links in Wikipedia to content from CHM since many humans read Wikipedia and fewer humans read Wikidata.

December 14, 2016

OCLC Linked Data Survey

I just had the opportunity to watch the video of Karen Smith-Yoshimura’s presentation of “Linked Data Implementations-Who, What and Why”. Video available here.

Karen Smith-Yoshimura’s profile at OCLC Research
http://www.oclc.org/research/people/smith-yoshimura.html

The video presents results from two surveys, one conducted in 2014 and one in 2015. OCLC Research has shared the data here.

Wikidata is mentioned a few times as a source of linked open data that is consumed by some of the projects discussed in the presentation.

I found this video to be a great overview of linked data projects across libraries and museums.