pymarc PEP-8 cleanup
pymarc v2.0 was released yesterday afternoon. I’m mentioning it here to give a big tip of the hat to Gabriel Farrell (gsf on #code4lib) who spent a significant amount of time cleaning up the code to be...
View Articlejson vs pickle
in python JSON is faster, smaller and more portable than pickle … At work, I’m working on a project where we’re modeling newspaper content in a relational database. We’ve got newspaper titles, issues,...
View ArticleLibraryThing Ubuntu Screen Saver
I read about the LibraryThing Mac Screensaver and of course wanted the same thing for my Ubuntu workstation at $work. Naturally, I’m supposed to be working on some high-priority tickets on a tight...
View Articleflickr, digital curation and the web
The Library of Congress has started to put selected content from Chronicling America into Flickr as part of the Illustrated Newspaper Supplements set. More details on the rationale and process involved...
View ArticleNew York Times Topics as SKOS
Serves 23,376 SKOS Concepts INGREDIENTS Text editor: Vim, Emacs, TextMate, etc Python BeautifulSoup rdflib Internet connection DIRECTIONS Open a new file using your favorite text editor. Instantiate an...
View Articledata.gov.uk and rdfa
The recent public release of the UK Government’s data.gov.uk site got picked up by the press last week in articles at The Guardian, Prospect Magazine and elswhere. These have been supplemented by some...
View Articleversion control and digital curation
For some time now I have been meaning to write about some of the issues related to version control in repositories as they relate to some projects going on at $work. Most repository systems have a...
View Articlebad xml smells
I’m used to refactoring code smells, but sometimes you can catch a bad whiff in XML too. Before: < ?xml version="1.0" encoding="UTF-8"?> <mets...
View Articletriadomany
I fully admit that there is not uncommon craze for trichotomies. I do not know but the psychiatrists have provided a name for it. If not, they should … it might be called triadomany. I am not so...
View Articlediving into VIAF
Last week saw a big (well big for library data nerds) announcement from OCLC that they are making the data for the Virtual International Authority File (VIAF) available for download under the terms of...
View Articleviaf ntriples
I had a few requests for the Virtual International Authority File ntriples file I wrote about earlier. Having the various flavors of VIAF data available is great, but if an RDF dump is going to be made...
View Articlearchiving wikitweets
Earlier this year I created a little toy webapp called wikitweets that uses the Twitter streaming API to identify tweets that reference Wikipedia, which it then displays realtime in your browser. It...
View Articlelevel 0 linked archival data
Depósito del Archivo de la FundaciónSierra-Pambley TLDR; lets see if we can share structured archival data better by adding HTML <link> elements that point at our EAD XML files. A few weeks ago I...
View Articlepython heal thyself
.@adriarichards is currently getting doxed & threatened w/ violence. Search Twitter for her name & report abuse: bit.ly/Y82Ntx — Gina Trapani (@ginatrapani) March 21, 2013 After seeing Gina’s...
View ArticleGendered Archivist
Over the past few years I’ve been trying to deepen my understanding of the literature of and about archives. My own MLIS education was heavy on libraries and light on archives; so I was really quite...
View Article
More Pages to Explore .....