<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jerome</title>
	<atom:link href="http://jerome.blogs.lincoln.ac.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://jerome.blogs.lincoln.ac.uk</link>
	<description>Reimagining Your Library</description>
	<lastBuildDate>Mon, 28 Nov 2011 11:29:26 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>It’s the end of Jerome as we know it (but I feel fine)</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/11/28/its-the-end-of-jerome-as-we-know-it-but-i-feel-fine/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/11/28/its-the-end-of-jerome-as-we-know-it-but-i-feel-fine/#comments</comments>
		<pubDate>Mon, 28 Nov 2011 11:29:26 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[#ukdiscovery]]></category>
		<category><![CDATA[Andy McGregor]]></category>
		<category><![CDATA[Cambridge University]]></category>
		<category><![CDATA[Cambridge University Library]]></category>
		<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[CLOCK]]></category>
		<category><![CDATA[CUL]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[Ed Chamberlain]]></category>
		<category><![CDATA[follow-up]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[Jerome blog]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Owen Stephens]]></category>
		<category><![CDATA[phone]]></category>
		<category><![CDATA[projects]]></category>
		<category><![CDATA[#CULComet]]></category>
		<category><![CDATA[COMET]]></category>

		<guid isPermaLink="false">http://pstainthorp.blogs.lincoln.ac.uk/?p=2139</guid>
		<description><![CDATA[The University of Lincoln&#8217;s&#160;Jerome project finished in August with the successful release of more than 240,000 openly-licensed bibliographic records, available over developer APIs, and a joint hack day with Cambridge University Library&#8216;s COMET project. Now, encouraged by positive JISC feedback, both institutions&#8212;Cambridge and Lincoln jointly&#8212;have applied for follow-up project funding under the project title CLOCK. [...]]]></description>
				<content:encoded><![CDATA[<p>The University of Lincoln&#8217;s <strong><a href="http://jerome.blogs.lincoln.ac.uk/">Jerome</a></strong> project finished in August with the successful release of more than 240,000 openly-licensed bibliographic records, available over developer APIs, and a <a href="http://lncn.eu/q34">joint hack day</a> with <a href="http://www.lib.cam.ac.uk/">Cambridge University Library</a>&#8216;s <strong><a href="http://cul-comet.blogspot.com/">COMET</a></strong> project.</p>
<p>Now, encouraged by positive JISC feedback, both institutions—Cambridge and Lincoln jointly—have applied for follow-up project funding under the project title <strong>CLOCK</strong>. If our bid is successful, the new project will run between December 2011–July 2012, employing a web developer based at the University of Lincoln, and distilling the work of both institutions into the development of new innovative library metadata discovery services for the scholarly community.</p>
<p>You can read the project proposal for CLOCK at <a id="internal-source-marker_0.28438019496388733" href="http://lncn.eu/ijt4">http://lncn.eu/ijt4</a> – the introductory section is below.</p>
<blockquote><p>The University of Lincoln and Cambridge University Library both delivered successful projects (<a href="http://jerome.blogs.lincoln.ac.uk/">Jerome</a> and <a href="http://cul-comet.blogspot.com/">COMET</a>) for the JISC Infrastructure for Resource Discovery Programme in 2011. This is a proposal for the continuation of and elaboration upon the work of both projects, via a programme of development work shared between the two institutions.</p></blockquote>
<blockquote><p>Throughout both projects (COMET-Jerome), parallel approaches in technology and data structure were noted and commented upon. A ‘mash day’ workshop event <a href="http://lncn.eu/q34">held in Cambridge in August</a> aimed to explore these differences as well as areas of potential synergy. Here project members identified several points of interest to take forward.</p></blockquote>
<blockquote><p>Both projects produced outputs of interest to researchers, students, librarians, developers, and designers of bibliographic discovery environments. The CLOCK project will harness the success of these two complementary initiatives and investigate new approaches to data creation and discovery in the library domain. In particular, it will investigate, propose, and develop new, web-based bibliographic tools/APIs which will make it easier for developers, academic libraries and library end-users (esp. researchers) to find Open Bibliographic Data and incorporate that data into systems and workflows.</p></blockquote>
<blockquote><p>This project is an opportunity to [1] exploit through real-world applications the significant amount of data released openly by Cambridge University Library; [2] apply the Jerome database architecture, iterative development methodology, and API framework to a bibliographic dataset an order of magnitude greater than the University of Lincoln’s; and [3] to build and enable a new set of tools and demonstrator services which will enable the future development of public Open Bib Data web applications of practical utility to libraries and end-users.</p></blockquote>
<blockquote><p>The project will be supported by library consultant <a href="http://www.ostephens.com/">Owen Stephens</a>, who will help to put the work into a national context, relating CLOCK to the wider movement toward <a href="http://obd.jisc.ac.uk/">Open Bib Data</a> and the work of the <a href="http://discovery.ac.uk/">JISC Discovery initiative</a>. It will take place in an environment (Lincoln/Cambridge) where a culture of developer inquiry and experimentation is encouraged and nurtured. It is also endorsed by senior library management at both universities.</p></blockquote>
<blockquote><p>Both universities are involved in complementary development work which will  both inform and be informed by CLOCK: at Cambridge, Ed Chamberlain is guiding the development of the JISC <a href="http://openbiblio.net/p/jiscopenbib2/">Open Bibliography 2</a> project; in Lincoln, Paul Stainthorp is lead researcher on the #jiscmrd <a href="http://orbital.blogs.lincoln.ac.uk/about/">Orbital</a> project, which is investigating the management of research data, with some areas of overlap.</p></blockquote>
<blockquote><p>CLOCK will operate as part of the wider JISC Digital Infrastructure: Information and library infrastructure: Resource discovery, and support the recent concerted effort to move toward openly licensed library discovery in UK Higher Education and beyond.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/11/28/its-the-end-of-jerome-as-we-know-it-but-i-feel-fine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Jerome/COMET hack day: Fun in the Fens</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/08/10/jeromecomet-hack-day-fun-in-the-fens/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/08/10/jeromecomet-hack-day-fun-in-the-fens/#comments</comments>
		<pubDate>Wed, 10 Aug 2011 16:46:14 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[#ukdiscovery]]></category>
		<category><![CDATA[Alex Bilbie]]></category>
		<category><![CDATA[Cambridge University]]></category>
		<category><![CDATA[CARET]]></category>
		<category><![CDATA[Chris Keene]]></category>
		<category><![CDATA[Chris Leach]]></category>
		<category><![CDATA[Dan Sheppard]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[Ed Chamberlain]]></category>
		<category><![CDATA[ideas]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[Jerome blog]]></category>
		<category><![CDATA[Nick Jackson]]></category>
		<category><![CDATA[Open University]]></category>
		<category><![CDATA[OU]]></category>
		<category><![CDATA[Phillip Heels]]></category>
		<category><![CDATA[SALDA]]></category>
		<category><![CDATA[synthesis]]></category>
		<category><![CDATA[The Fens]]></category>
		<category><![CDATA[Tony Hirst]]></category>
		<category><![CDATA[University of Sussex]]></category>
		<category><![CDATA[#CULComet]]></category>
		<category><![CDATA[Cambridge]]></category>
		<category><![CDATA[COMET]]></category>

		<guid isPermaLink="false">http://pstainthorp.blogs.lincoln.ac.uk/?p=2113</guid>
		<description><![CDATA[Here&#8217;s a photo of the CARET (Centre for Applied Research in Educational Technologies) offices at the University of Cambridge, where we held our log-awaited joint&#160;Jerome/COMET hack day, on Monday 8 August. Actually, in the end, it&#160;turned out to be a kind of&#160;Jerome/COMET/SALDA/synthesis/OUseful mashup-AH! In attendance (for the record): Alex Bilbie (University of Lincoln) Ed Chamberlain [...]]]></description>
				<content:encoded><![CDATA[<p>Here&#8217;s a photo of the <a href="http://www.caret.cam.ac.uk/">CARET</a> (Centre for Applied Research in Educational Technologies) offices at the University of Cambridge, where we held our log-awaited joint <strong><a href="http://jerome.library.lincoln.ac.uk/">Jerome</a>/<a href="http://cul-comet.blogspot.com/">COMET</a> </strong>hack day, on Monday 8 August. Actually, in the end, it turned out to be a kind of Jerome/COMET/<a href="http://blogs.sussex.ac.uk/salda/">SALDA</a>/synthesis/OUseful mashup-<a href="http://en.wikipedia.org/wiki/Mark_E._Smith">AH</a>!</p>
<p><a title="Jerome/COMET by [Uploading archive snaps] Paul Stainthorp, on Flickr" href="http://www.flickr.com/photos/pstainthorp/6025581031/"><img src="http://farm7.static.flickr.com/6143/6025581031_165f79c023_m.jpg" alt="Jerome/COMET" width="240" height="160" /></a></p>
<p>In attendance (for the record):</p>
<ul>
<li><a href="http://alexbilbie.com/">Alex Bilbie</a> (University of Lincoln)</li>
<li><a href="http://arcadiaproject.lib.cam.ac.uk/people/chamberlain.html">Ed Chamberlain</a> (University of Cambridge)</li>
<li><a href="http://lincoln.nickjackson.me/">Nick Jackson</a> (University of Lincoln)</li>
<li><a href="http://www.nostuff.org/words/">Chris Keene</a> (University of Sussex)</li>
<li>Phillip Heels (University of Lincoln)</li>
<li><a href="http://blog.ouseful.info/">Tony Hirst</a> (The Open University)</li>
<li><a href="http://arcadiaproject.lib.cam.ac.uk/people/jones.html">Huw Jones</a> (University of Cambridge)</li>
<li>Chris Leach (University of Lincoln)</li>
<li><a href="http://arcadiaproject.lib.cam.ac.uk/people/sheppard.html">Dan Sheppard</a> (University of Cambridge)</li>
<li><a href="http://paulstainthorp.com/">Paul Stainthorp</a> (University of Lincoln)</li>
<li><a href="http://www.meanboyfriend.com/overdue_ideas/">Owen Stephens</a> (Owen Stephens Consulting)</li>
<li>Laura Waldoch (University of Cambridge)</li>
<li><a href="http://arcadiaproject.lib.cam.ac.uk/people/lihuazhu.html">Lihua Zhu</a> (University of Cambridge)</li>
</ul>
<p>Train mayhem aside (in the end the Lincoln contingent didn&#8217;t arrive until nearly midday), it was a really useful day and well worth doing. Particular thanks to Ed Chamberlain and his colleagues for hosting the event and for arranging the food and refreshments. Thanks also to everyone who travelled from afar for no other reason than they love a good mashup.</p>
<p>Typically, the ever-prolific <a href="http://blog.ouseful.info/">Tony Hirst</a> has <em>already</em> managed to write up not one, but <strong>two</strong> blog posts about ideas that came out of the day:</p>
<ul>
<li><a href="http://blog.ouseful.info/2011/08/09/getting-library-catalogue-searches-out-there/">Getting Library Catalogue Searches Out There…</a></li>
<li><a href="http://blog.ouseful.info/2011/08/09/open-data-processes-the-open-metadata-laundry/">Open Data Processes: the Open Metadata Laundry</a> (N.B. this one relates specifically to Jerome – in particular, our notion of &#8216;scrubbing&#8217; dodgy MARC records by taking only the identifiers plus the bare citation-only fields, and using that minimal set to grab additional free and Open data from the web, automatically creating new full versions of records that are inherently Open. &#8216;Metadata laundry&#8217;, me like.)</li>
</ul>
<p>Here are three more <a href="http://www.flickr.com/photos/pstainthorp/6026137896/">ideas</a>/conversations we had in Cambridge that I thought were going somewhere interesting. Yeah, we might get around to actually doing these, sometime…</p>
<p><strong>1. Using COMET data to enhance Jerome</strong></p>
<p><a title="The idea by [Uploading archive snaps] Paul Stainthorp, on Flickr" href="http://www.flickr.com/photos/pstainthorp/6026137896/"><img class="alignright" style="margin-left: 10px; margin-right: 10px;" src="http://farm7.static.flickr.com/6148/6026137896_e235c9a15a_m.jpg" alt="The idea" width="160" height="240" /></a>Similar to the &#8216;metadata laundry&#8217;, above, and to the way Jerome already uses data from the <a href="http://openlibrary.org/">Open Library</a>, <a href="http://www.journaltocs.hw.ac.uk/">JournalTOCs</a>, <a href="http://www.librarything.com/">LibraryThing</a>, <em>etc.</em>, to enhance its book records with additional metadata. Jerome constructs a URL in the form <em>http://data.lib.cam.ac.uk/isbn/_______</em>, with the ISBN from the Jerome record dropped in at the end. COMET responds with a link to an open record in RDF and/or JSON, which Jerome gladly sucks in, adding any additional fields to its original source record. Enrichment ensues.</p>
<p><strong>2. Using Jerome search to &#8216;skin&#8217; COMET</strong></p>
<p>I called this one &#8221;Jerome Scholar&#8221; <img src='http://paulstainthorp.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  …we make use of the search aspects of Jerome (in particular, the speed of <a href="http://sphinxsearch.com/">Sphinx</a>, the &#8216;<a href="http://jerome.blogs.lincoln.ac.uk/2011/03/27/mixing-it-up/">mixing desk</a>&#8216; idea, the neat record presentation, to provide a really smooth way of interacting with the <strong>much more well-structured</strong> (hence &#8220;Scholar&#8221;) data that resides in COMET.</p>
<p><strong>3. Using the <span style="text-decoration: underline;">differences</span> between the two datasets to tell us something interesting</strong></p>
<p>I have a notion that there&#8217;s something inherently useful about being able to compare two versions of a record for the &#8216;same&#8217; object. If we could use Jerome+COMET to generate a web application/data feed – one that <em>other discovery services could themselves consume</em>, we&#8217;d have ways of &#8216;sparking off&#8217; whole new avenues of discovery: from misspelled names, variant titles, different subject terms assigned by different cataloguing practices, <em>etc.</em> Like <a href="http://www.worldcat.org/affiliate/webservices/xisbn/app.jsp">xISBN</a>, but for non-standardised data(?). All right, that&#8217;s the fuzziest of the three ideas. And as the eminiently sensible Owen Stephens kept asking me, &#8220;…what&#8217;s the use case?&#8221;.</p>
<p>And then we went to the pub.</p>
<p><a title="And then we went to the pub. by [Uploading archive snaps] Paul Stainthorp, on Flickr" href="http://www.flickr.com/photos/pstainthorp/6025581629/"><img src="http://farm7.static.flickr.com/6205/6025581629_9e955f26f3_m.jpg" alt="And then we went to the pub." width="240" height="160" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/08/10/jeromecomet-hack-day-fun-in-the-fens/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>It&#8217;s the &#8216;Final&#8217; Blog-Post</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/08/01/its-the-final-blog-post/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/08/01/its-the-final-blog-post/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 09:49:33 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[CC-BY]]></category>
		<category><![CDATA[CC0]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[deliverables]]></category>
		<category><![CDATA[final]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[JournalTOCs]]></category>
		<category><![CDATA[licensing]]></category>
		<category><![CDATA[recommendation]]></category>
		<category><![CDATA[records]]></category>
		<category><![CDATA[report]]></category>
		<category><![CDATA[un-project]]></category>

		<guid isPermaLink="false">http://jerome.blogs.lincoln.ac.uk/?p=377</guid>
		<description><![CDATA[I&#8217;ve put &#8216;final&#8217; in inverted commas in the title of this blog post (which should be sung—of course—to the tune of this song) – because while the JISC-funded Jerome project has indeed come to an end, Jerome itself is going nowhere. We&#8217;ll continue to tweak and develop it as an &#8220;un-project&#8221; (from whence it came), and—we sincerely hope—Jerome [...]]]></description>
				<content:encoded><![CDATA[<p><a title="sunset cowboy by mtx, on Flickr" href="http://www.flickr.com/photos/matttinsley/247503725/"><img class="alignright" style="margin-left: 10px; margin-right: 10px;" src="http://farm1.static.flickr.com/87/247503725_8b73c4e6e0_m.jpg" alt="sunset cowboy" width="240" height="179" /></a></p>
<p>I&#8217;ve put &#8216;final&#8217; in inverted commas in the title of this blog post (which should be sung—of course—to the tune of <a href="http://www.youtube.com/watch?v=EG7wB3G-xp0">this</a> song) – because while the <a href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/infrastructureforresourcediscovery/jerome.aspx">JISC-funded Jerome project</a> has indeed come to an end, <a href="http://jerome.library.lincoln.ac.uk/">Jerome itself</a> is going nowhere. We&#8217;ll continue to tweak and develop it as an &#8220;<a href="http://paulstainthorp.com/tag/un-projects/">un-project</a>&#8221; (from whence it came), and—we sincerely hope—Jerome will lead in time, in whole or in part, to a real, live university library service of awesome.</p>
<p>Before we get started, though, thanks are due to the whole of the Jerome project team: Chris Leach, Dave Raines, Tim Simmonds, <a href="http://elif.blogs.lincoln.ac.uk/">Elif Varol</a>, <a href="http://joss.blogs.lincoln.ac.uk/">Joss Winn</a>, <strong>developers <a href="http://lincoln.nickjackson.me/">Nick Jackson</a> and <a href="http://alexbilbie.com/">Alex Bilbie</a></strong> times a million, and also to people outside the University of Lincoln who have offered support and advice, including <a href="http://arcadiaproject.lib.cam.ac.uk/people/chamberlain.html">Ed Chamberlain</a>, <a href="http://www.meanboyfriend.com/overdue_ideas/">Owen Stephens</a>, and our JISC programme manager, <a href="http://www.jisc.ac.uk/contactus/staff/andrewmcgregor.aspx"><strong>Andy McGregor</strong></a>.</p>
<p>Ssssooooo…</p>
<h3>Just what exactly have we produced?</h3>
<ol>
<li>A public-facing search portal service available at: <strong><a href="http://jerome.library.lincoln.ac.uk/">http://jerome.library.lincoln.ac.uk/</a></strong>
<ul>
<li>Featuring <a href="http://jerome.library.lincoln.ac.uk/search?q=libraries">search</a>, <a href="http://jerome.library.lincoln.ac.uk/browse/subject/libraries">browse</a>, and <a href="http://jerome.library.lincoln.ac.uk/catalogue/8854">bibliographic record</a> views.</li>
<li>Search is provided by <a href="http://sphinxsearch.com/docs/1.10/extended-syntax.html">Sphinx</a>.</li>
<li>A &#8216;<a href="http://jerome.blogs.lincoln.ac.uk/2011/03/27/mixing-it-up/">mixing desk</a>&#8216; allows user control over advanced search parameters.</li>
<li>Each record is augmented by data from <a href="http://openlibrary.org/">OpenLibrary</a> (licenced under <a href="http://creativecommons.org/about/cc0">CC0</a>) to help boost the depth and accuracy of our own catalogue. Where possible, OpenLibrary also provides our book cover images.</li>
<li>Bibliographic work pages sport <a href="http://jerome.blogs.lincoln.ac.uk/2011/04/15/jerome-coins-and-zotero-all-i-really-want-is-data-in-my-pocket/">COinS metadata</a> and links to previews from <a href="http://books.google.com/">Google Books</a>.</li>
<li>Item data is harvested from the Library Management System.</li>
<li>Social tools allow sharing of works on Facebook, Twitter, <em>etc.</em></li>
</ul>
</li>
<li>Openly licensed bibliographic data, available at <strong><a href="http://data.lincoln.ac.uk/documentation.html#bib">http://data.lincoln.ac.uk/documentation.html#bib</a></strong>, and including:
<ul>
<li>170,000 <a href="http://jerome.library.lincoln.ac.uk/catalogue/">library catalogue</a> records, released under a <a href="http://creativecommons.org/choose/zero/">CC0</a> licence</li>
<li>3,100 <a href="http://jerome.library.lincoln.ac.uk/repository/">repository</a> records (<a href="http://creativecommons.org/choose/zero/">CC0</a>)</li>
<li>92,000 <a href="http://jerome.library.lincoln.ac.uk/journal/">e-journal</a> records (<a href="http://creativecommons.org/choose/zero/">CC0</a>)</li>
<li>XXX,XXX (<span style="color: #ff0000;">we still don&#8217;t know yet! It hasn&#8217;t finished counting…</span>) journal tables of contents derived from <a href="http://www.journaltocs.hw.ac.uk/">JournalTOCs</a>, available under <a href="http://creativecommons.org/licenses/by/2.0/">CC-BY</a></li>
<li>See our <a href="http://data.lincoln.ac.uk/licence.html">licensing</a> page for more information.</li>
</ul>
</li>
<li><a href="http://data.lincoln.ac.uk/documentation.html#bib">Attractive, documented, supported APIs for all data</a>, with a timeline of <a href="http://jerome.library.lincoln.ac.uk/about">data refresh cycles</a>. The APIs will provide data in the following formats:
<ol>
<li><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a>/XML</li>
<li><a href="http://en.wikipedia.org/wiki/JSON">JSON</a></li>
<li><a href="http://en.wikipedia.org/wiki/RIS_(file_format)">RIS</a></li>
<li><span style="color: #ff0000;">The <em>potential</em> for MARC</span></li>
</ol>
</li>
<li>Source code for Jerome will be made Open and publicly available (after a shakedown) on <a href="https://github.com/">GitHub</a>.</li>
<li>While the user interface, technical infrastructure, analytics and machine learning/personalisation aspects of Jerome have been <a href="http://jerome.blogs.lincoln.ac.uk/">discussed fairly heavily on the project blog</a>, you&#8217;ll have to wait a little while for formal case studies.</li>
<li>Contributions to community events. We presented/discussed Jerome at:
<ul>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/03/01/jisc-rdtf-meeting-birmingham/">JISC Infrastructure for Resource Discovery start-up meeting</a>, Birmingham, March</li>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/04/15/jerome-at-cpd25mash/">M25 Consortium CPD event on mashups and open data</a>, London, April</li>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/04/20/notes-from-my-%E2%80%98personal-pitch%E2%80%99-rdtf-in-manchester/">JISC/RLUK &#8216;Opening Data – Opening Doors&#8217; event</a>, Manchester, April (<a href="http://jerome.blogs.lincoln.ac.uk/2011/04/20/an-elastic-bucket-down-the-data-well-rdtf-in-manchester/">report</a>)</li>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/05/17/how-commercial-next-generation-library-discovery-tools-have-nearly-got-it-right/">CILIP UC&amp;R Yorkshire &amp; Humberside training event</a>, Huddersfield, May</li>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/05/26/in-the-background-at-discovery-event/">JISC/RLUK Discovery launch event</a>, London, May (<a href="http://jerome.blogs.lincoln.ac.uk/2011/06/08/launch-event-for-discovery/">report</a>)</li>
<li>Post-project <a href="http://jerome.blogs.lincoln.ac.uk/2011/03/28/linking-arms-across-the-fens-jeromecomet/">Jerome/COMET</a> open-data mashup day, Cambridge, August!</li>
<li>We are also submitting a magazine article to <em>SCONUL Focus</em> about Jerome in the context of next-generation library resource discovery (&#8220;OPAC2.0&#8243;) services, to be published later in 2011.</li>
</ul>
</li>
</ol>
<p><span style="font-size: 15px; font-weight: bold;">What ought to be done next?</span></p>
<ol>
<li>There&#8217;s a lot more interesting work to be done around the use of activity/recommendation data and Jerome. We&#8217;re using the historical library loan data both to provide user recommendations (&#8220;<em>People who borrowed X…</em>&#8220;), and to inform the search and ranking algorithms of Jerome itself. However, there are lots of other measures of implicit and explicit activity (<em>e.g.</em> use of the social sharing tools) that <em>could</em> be used to provide even more accurate recommendations.</li>
<li>Jerome has concentrated on data at the <em>bibliographic</em>/<em>work</em> level. But there is potentially even more value to be had out of aggregating and querying library <em>item</em> data (<em>i.e.</em> information about a library&#8217;s physical and electronic holdings of individual copies of a bibliographic work) – <em>e.g.</em> using geo-lookup services to highlight the nearest available copies of a work. This is perhaps the next great untapped sphere for the open data/Discovery movement.</li>
<li>Demonstrate use of the APIs to do cool stuff! Mashing up library data with other sets of institutional data (user profiles, mapping, calendaring data) to provide a really useful &#8216;portal&#8217; experience for users. Also: tapping into Jerome for reporting/administrative purposes; for example identifying and sanitising bad data!</li>
</ol>
<h3>Has Jerome&#8217;s data actually been <em>used</em>?</h3>
<p>Probably not yet. We were delighted to be able to offer something up (in the form of an early, <a href="http://jerome.blogs.lincoln.ac.uk/2011/06/28/is-that-a-jerome-open-data-api-i-spy/">bare-bones</a> Jerome bibliographic API) to the <strong><a href="http://jerome.blogs.lincoln.ac.uk/2011/07/05/discodev-worldwide-software-development-competition-using-open-library-data/">#discodev</a></strong> Developer Competition, where we still hope to see it used. Also, we are holding a post-project hack day (on 8 August 2011) with the <a href="http://cul-comet.blogspot.com/">COMET</a> project in Cambridge to share data, code, and best practices around handling Open Data. We certainly intend to make use of the APIs internally to enhance the University of Lincoln&#8217;s own library services. If you&#8217;re interested in making use of the Jerome open data, please <a href="mailto:pstainthorp@lincoln.ac.uk">email me</a> or leave a comment here.</p>
<h3>What skills did we need?</h3>
<p>At the University of Lincoln we have been experimenting with a new (for us) way of managing development projects: the <a href="http://en.wikipedia.org/wiki/Agile_software_development">Agile</a> method, using shared tools (<a href="http://www.pivotaltracker.com/">Pivotal Tracker</a>, <a href="https://github.com/">GitHub</a>) to allow a distributed team of developers and interested parties to work together. On a practical level, we&#8217;ve had to come to terms with matching a schemaless database architecture with traditional formats for describing resources… Nick and Alex have learned more about library standards and cataloguing practice (*cough*<a href="http://jerome.blogs.lincoln.ac.uk/2011/07/31/the-jerome-resource-model/">MARC</a>*cough*) than they may have wished! . There are also now plans to extend <a href="http://www.mongodb.org/">MongoDB</a> training to more staff within the ICT Services department.</p>
<h3>What did we learn along the way?</h3>
<p>Three things to take away from Jerome:</p>
<ol>
<li><a href="http://jerome.blogs.lincoln.ac.uk/2011/04/21/three-quarks-for-muster-marc/">MARC is evil</a>. But still, perhaps, a necessary evil. Until there&#8217;s a critical mass of libraries and library applications using newer, more sane languages to describe their collections, developers will just have to bite down hard and learn to parse MARC records. Librarians, in turn, need to accept the limitations of MARC and actively engage in developing the alternative&lt;/lecture over&gt;.</li>
<li>Don&#8217;t battle: use technology to find a way around licensing issues. Rather than spending time negotiating with third parties to release their data openly, Jerome took a different approach, which was to release openly those (sometimes minimal) bits of data which we know are free from third-party interest, then to use existing open data sources to enhance and extend those records.</li>
<li>Don&#8217;t waste time trying to handle every nuance of a record. Whilst it&#8217;s important from a catalogue standpoint, people really don&#8217;t care if it&#8217;s a main title, subtitle, spine title or any other form of title when they&#8217;re searching. Perfection is a goal, but not a restriction. Releasing 40% of data and working on the other 60% later is better than aiming for 100% and never releasing anything.</li>
</ol>
<p>Thanks! It&#8217;s been fun…</p>
<p><a href="http://paulstainthorp.com/">Paul Stainthorp</a><br />
July, 2011</p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/08/01/its-the-final-blog-post/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Jerome Resource Model</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/31/the-jerome-resource-model/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/31/the-jerome-resource-model/#comments</comments>
		<pubDate>Sun, 31 Jul 2011 11:08:20 +0000</pubDate>
		<dc:creator>Nick Jackson</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[format]]></category>

		<guid isPermaLink="false">http://jerome.blogs.lincoln.ac.uk/?p=420</guid>
		<description><![CDATA[One of the things which I&#8217;ve come to realise whilst working on Jerome is that library storage formats generally suck. They&#8217;re either so complex you need a manual to work out what they mean (such as my good friend MARC) or lack sufficient depth to elegantly handle what you&#8217;re trying to do without resorting to [...]]]></description>
				<content:encoded><![CDATA[<p>One of the things which I&#8217;ve come to realise whilst working on Jerome is that library storage formats generally suck. They&#8217;re either so complex you need a manual to work out what they mean (such as my good friend <a href="http://marc-must-die.info/">MARC</a>) or lack sufficient depth to elegantly handle what you&#8217;re trying to do without resorting to epic kludges. This is mostly down to the fact that people have assumed a storage format must also function as an exchange format which can encompass the entirety of knowledge about a resource, a foolish notion which has led to all kinds of mayhem in the past. We have catalogues to know things like the date at which the special edition bound with human skin was printed, who edited it and the colour of his eyes. A discovery service doesn&#8217;t care, and neither (I strongly suspect) do people who just want access to our data.</p>
<p>As a result Jerome uses its own internal format for storing data which is built around what Jerome needs to know to provide the necessary search and discovery services. Since we&#8217;re storing objects in MongoDB I&#8217;ve opted for a nice clean JSON-based object model with very clear field names and structure. It throws away all kinds of &#8216;useful&#8217; data from a detailed metadata standpoint purely because it&#8217;s irrelevant to discovery. The vast majority of fields we store are completely optional (since not all resources have everything we care to know about). In a nutshell, we are capturing and storing the following data:</p>
<ul>
<li>A resource&#8217;s unique Jerome ID.</li>
<li>Information on the collection the resource belongs to, including its unique ID within that collection.</li>
<li>Title, secondary title and edition.</li>
<li>An array of author names and secondary author names.</li>
<li>A date of publishing split into year, month and day.</li>
<li>An abstract or synopsis.</li>
<li>Keywords and subject categories.</li>
<li>A periodical name, volume number and issue number.</li>
<li>Availability dates, including start and stop of availability.</li>
<li>Publisher name and location.</li>
<li>ISBN or ISSN.</li>
<li>A URL to access the resource, and optionally a label for that URL.</li>
<li>A URL for direct access to the full text of the resource.</li>
<li>Name and URL for the licence information of the resource metadata.</li>
</ul>
<p>As part of the magic of this approach we completely disassociate the individual type of resource from its representation within Jerome. We can represent pretty much any collection of resources within the same framework with no having to muck about writing custom code to turn one format into another for representation.</p>
<p><code></code></p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/31/the-jerome-resource-model/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Modular Discovery</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/30/modular-discovery/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/30/modular-discovery/#comments</comments>
		<pubDate>Sat, 30 Jul 2011 15:00:27 +0000</pubDate>
		<dc:creator>Nick Jackson</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[APIs]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[resources]]></category>

		<guid isPermaLink="false">http://jerome.blogs.lincoln.ac.uk/?p=423</guid>
		<description><![CDATA[Jerome is a system which is modular by design. It comprises of a variety of distinct modules which handle data collection, formatting, output, search, indexing, recommendation and more. It&#8217;s also fairly unique (as far as I can tell) in that different types of resource also occupy a modular &#8216;slot&#8217; rather than being interwoven with Jerome [...]]]></description>
				<content:encoded><![CDATA[<p>Jerome is a system which is modular by design. It comprises of a variety of distinct modules which handle data collection, formatting, output, search, indexing, recommendation and more. It&#8217;s also fairly unique (as far as I can tell) in that different types of resource also occupy a modular &#8216;slot&#8217; rather than being interwoven with Jerome itself – it has no differentiation at the code level between books, ebooks, dissertations, papers, journals, journal entries, websites or any other &#8216;resource&#8217; which people may want to feed it.</p>
<p>As a result of this approach we can use Jerome as a true multi-channel resource discovery tool. All that&#8217;s required for anybody to add resources to Jerome and immediately make them searchable and recommendable is for a &#8216;collection&#8217; to be created and for them to write a bit of code which can make the following API calls as necessary:</p>
<ul>
<li><strong>Create</strong> a new resource as part of the collection, telling us as much about it as they can.</li>
<li><strong>Update</strong> an existing resource when it changes.</li>
<li><strong>Delete</strong> a resource which is no longer available.</li>
<li>Optionally <strong>record a use</strong> of a resource against a user&#8217;s account to help build our recommendations dataset.</li>
</ul>
<p>That&#8217;s it. Got a collection of awesome lecture slides you want to feed into Jerome and instantly make known as a resource? You can do that.</p>
<p>We&#8217;ll have your API documentation up soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/30/modular-discovery/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What did it cost and who benefits?</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/27/what-did-it-cost-and-who-benefits/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/27/what-did-it-cost-and-who-benefits/#comments</comments>
		<pubDate>Wed, 27 Jul 2011 14:59:49 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[APIs]]></category>
		<category><![CDATA[benefits]]></category>
		<category><![CDATA[checking]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[COMET]]></category>
		<category><![CDATA[costs]]></category>
		<category><![CDATA[CWD]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[development]]></category>
		<category><![CDATA[enhancement]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[library]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[operational]]></category>
		<category><![CDATA[platform]]></category>
		<category><![CDATA[resource discovery]]></category>
		<category><![CDATA[sanitising]]></category>
		<category><![CDATA[staff]]></category>
		<category><![CDATA[time]]></category>
		<category><![CDATA[travel]]></category>

		<guid isPermaLink="false">http://jerome.blogs.lincoln.ac.uk/?p=374</guid>
		<description><![CDATA[This is going to be one of the hardest project blog posts to write… The costs of getting Jerome to this stage are relatively easy to work out. Under the Infrastructure for Resource Discovery programme, JISC awarded us the sum of £36,585 which (institutional overheads aside) we used to pay for the following: Developer staff [...]]]></description>
				<content:encoded><![CDATA[<p>This is going to be one of the hardest project blog posts to write…</p>
<p>The <strong>costs</strong> of getting Jerome to <a href="http://jerome.library.lincoln.ac.uk/">this stage</a> are relatively easy to work out. Under the <a href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/infrastructureforresourcediscovery/jerome.aspx">Infrastructure for Resource Discovery</a> programme, JISC awarded us the sum of £36,585 which (institutional overheads aside) we used to pay for the following:</p>
<ul>
<li>Developer staff time: 825 hours over six months.</li>
<li>Library and project staff time: 250 hours over six months.</li>
<li>The cost of travel to a number of programme events and relevant conferences at which we presented Jerome, including <a href="http://jerome.blogs.lincoln.ac.uk/2011/03/01/jisc-rdtf-meeting-birmingham/">this one</a>, <a href="http://jerome.blogs.lincoln.ac.uk/2011/04/15/jerome-at-cpd25mash/">this one</a>, <a href="http://jerome.blogs.lincoln.ac.uk/2011/04/20/notes-from-my-%E2%80%98personal-pitch%E2%80%99-rdtf-in-manchester/">this one</a>, <a href="http://jerome.blogs.lincoln.ac.uk/2011/05/17/how-commercial-next-generation-library-discovery-tools-have-nearly-got-it-right/">this one</a> and <a href="http://jerome.blogs.lincoln.ac.uk/2011/05/26/in-the-background-at-discovery-event/">this one</a>.</li>
</ul>
<p>As all the other aspects of Jerome—hardware, software <em>etc.—</em>either <a href="http://jerome.blogs.lincoln.ac.uk/2010/07/20/dont-call-it-a-monkey/">already existed</a> or were free to use, that figure represents the <em>total</em> cost of getting Jerome to its current state.</p>
<p>The <strong>benefits</strong> (<a href="https://docs.google.com/Doc?docid=0AXDuUd71vXr4ZGM0YzlycmNfMjUyanZ4OWdoZnM&amp;authkey=CI2KhZkB">see also section 2.4 of the original bid</a>) of Jerome are less easily quantified financially, but we ought to consider these operational benefits:</p>
<p>1. The potential for using Jerome as a &#8216;production&#8217; <strong>resource discovery system</strong> by the University of Lincoln. As such it could replace our <a href="http://www.library.lincoln.ac.uk/">current OPAC web catalogue</a> as the Library&#8217;s primary public tool of discovery. The Library ought also to consider Jerome as a viable alternative to the purchase of a commercial, hosted next-generation resource discovery service (<a href="http://paulstainthorp.com/?s=%22notes+on%22">which it is currently reviewing</a>), with the potential for replacing the investment it would make in such a system with investment in developer time to maintain and extend Jerome. In addition, the <a href="http://thecwd.blogs.lincoln.ac.uk/">Common Web Design</a> (on which the Jerome search portal is based) is inherently mobile-friendly.</p>
<p>2. Related: even if the Jerome search portal is not adopted <em>in toto</em>, there&#8217;s real potential for using <a href="http://data.lincoln.ac.uk/documentation.html#bib">Jerome&#8217;s APIs</a> and code (open sourced) to <strong>enhance our existing user interfaces</strong> (catalogues, student portals, <em>etc.</em>) by &#8216;hacking in&#8217; additional useful data and services via Jerome (similar to the Talis <a href="http://blogs.talis.com/panlibus/archives/2009/04/juice-up-your-opac.php">Juice</a> service). This could lead to cost savings: a modern OPAC would not have to be developed in isolation or tools bought in. And these enhancements are as available to other institutions and libraries as much as to Lincoln.</p>
<p>3. The use of Jerome as an <strong>operational tool</strong> for checking and sanitising bibliographic data. Jerome can already be used to generate lists of &#8216;bad&#8217; data (<em>e.g.</em> invalid ISBNs in MARC records); this intelligence could be fed back into the Library to make the work of cataloguers, e-resources admin staff, <em>etc.</em>, easier and faster (efficiency savings) and again to improve the user experience.</p>
<p>4. Benefits of <strong>Open Data</strong>: in releasing our bibliographic collections openly Jerome is adding to the UK&#8217;s academic resource discovery &#8216;<a href="http://discovery.ac.uk/">ecosystem</a>&#8216;, with benefits to scholarly activity both in Lincoln and elsewhere. We are <a href="http://jerome.blogs.lincoln.ac.uk/2011/03/28/linking-arms-across-the-fens-jeromecomet/">already working with the COMET team</a> at <a href="http://www.lib.cam.ac.uk/">Cambridge University Library</a> on a cross-Fens spin-off miniproject(!) to share data, code, and best practices around handling Open Data. Related to this are the &#8216;fuzzier&#8217; benefits of associating the University of Lincoln&#8217;s name with <a href="http://lncd.org/">innovation in technology for education</a> (which is a stated aim in the University&#8217;s draft institutional strategy).</p>
<p>5. Finally, there is the potential for the university to use Jerome as a <strong>platform for future development</strong>: Jerome already sits in a &#8216;suite&#8217; of interconnecting innovative institutional web services (excuse the unintentional alliteration!) which include the <a href="http://thecwd.blogs.lincoln.ac.uk/">Common Web Design</a> presentation framework, <a href="http://blog.totalrecal.org/">Total ReCal</a> space/time data, <a href="http://lncn.eu/">lncn.eu</a> URL shortener and link proxy, a university-wide <a href="http://data.lincoln.ac.uk">open data platform</a>, and the <a href="http://nucleus.blogs.lincoln.ac.uk/">Nucleus</a> data storage layer. Just as each of these (notionally separate) services has facilitated the development of all the others, so it&#8217;s likely that Jerome will itself act as a catalyst for further innovation.</p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/27/what-did-it-cost-and-who-benefits/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Following in Jerome&#8217;s footsteps</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/27/following-in-jeromes-footsteps/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/27/following-in-jeromes-footsteps/#comments</comments>
		<pubDate>Wed, 27 Jul 2011 11:36:00 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[abilities]]></category>
		<category><![CDATA[approaches]]></category>
		<category><![CDATA[bibliographic data]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[developers]]></category>
		<category><![CDATA[footsteps]]></category>
		<category><![CDATA[IPR]]></category>
		<category><![CDATA[iteration]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Mac Mini]]></category>
		<category><![CDATA[MARC]]></category>
		<category><![CDATA[MARCXML]]></category>
		<category><![CDATA[mashup]]></category>
		<category><![CDATA[methodology]]></category>
		<category><![CDATA[OAI-PMH]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[orangutan]]></category>
		<category><![CDATA[Pivotal Tracker]]></category>
		<category><![CDATA[rapid innovation]]></category>
		<category><![CDATA[RIS]]></category>
		<category><![CDATA[third-party]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://jerome.blogs.lincoln.ac.uk/?p=376</guid>
		<description><![CDATA[As the formal end of the Jerome project looms, this blog post is aimed at other people who might want to take a similar approach to breaking data out of their library systems. From the start, Jerome took an approach that can be summed up as: Use the tools and abilities that you have to [...]]]></description>
				<content:encoded><![CDATA[<p>As the formal end of the Jerome project looms, this blog post is aimed at other people who might want to take a similar approach to breaking data out of their library systems.</p>
<p>From the start, Jerome took an approach that can be summed up as:</p>
<ul>
<li>Use the tools and abilities that you have to hand. Go for low-hanging fruit and demonstrate value early on.</li>
<li>Follow the path of least resistance. If it works, it&#8217;s fine. Route around problems rather than fighting against them.</li>
<li>Consider the benefits of a &#8216;minimally invasive&#8217; approach to library systems. Use the web to passively gather copies of your data without having to make changes to your exisiting root systems (such as <a href="http://en.wikipedia.org/wiki/Integrated_library_system">LMS</a>es).</li>
</ul>
<p><em>What tools would people need to take this approach?</em></p>
<p>1. Technical ability. Jerome would have got nowhere without the coding abilities of University of Lincoln developers <a href="http://lincoln.nickjackson.me/">Nick Jackson</a> and <a href="http://alexbilbie.com/">Alex Bilbie</a>, and the approach they take to development (see below). It&#8217;s fair to say that their involvement has been something of a culture-shift for the Library: they have brought a fresh approach to dealing with our systems and data.</p>
<p>2. An agile, <a href="http://www.archive.org/details/WhatIsArapidInnovationProgrammeMethodology">rapid</a>, <a href="http://en.wikipedia.org/wiki/Iterative_and_incremental_development">iterative</a> project methodology and a suite of (often free; always web-based) <a href="http://joss.blogs.lincoln.ac.uk/2011/05/16/technology-for-education-a-new-group/">software tools to support that way of working</a>. This probably can&#8217;t be overstated: a &#8216;traditional&#8217; project management methodology just wouldn&#8217;t have worked for Jerome.</p>
<p>3. An understanding of where data resides in your current systems. We&#8217;ve had to become uncomfortably <em>au fait</em> with the structure of <a href="http://www.sirsidynix.com/products/horizon">SirsiDynix</a>&#8216;s internal data tables, <a href="http://jerome.blogs.lincoln.ac.uk/2011/04/21/three-quarks-for-muster-marc/">MARCXML</a>, OAI-PMH, RIS and all sorts of other unpleasantness. Also an awareness of the rich ecosystem of third-party <a href="http://jerome.blogs.lincoln.ac.uk/2010/07/21/linking-university/">library/bibliographic/other</a> useful data that exists on the web.</p>
<p>4. Related: a willingness to try different approaches to getting hold of this data (the absolutely-anything-goes, mashup-heavy approach): APIs: great. Screen-scraping: yeah, if we must. SQL querying, .csv dumps, proprietary admin interfaces: all fine. Don&#8217;t be precious about finding a way in. By far the most important thing is to provide the open data service in the first place. Things can always be tidied up, rationalised, at a later date.</p>
<p>5. A box to run things on. This doesn&#8217;t have to be a large institutional server: we&#8217;ve <a href="http://jerome.blogs.lincoln.ac.uk/2010/07/20/dont-call-it-a-monkey/">successfully run Jerome on a Mac Mini</a>.</p>
<p>6. Finally, the use of blogs—such as this one—and social media to engage a (self-selecting, admittedly) community of potential users and fellow-travellers.</p>
<p><em>Priorities:</em></p>
<ul>
<li>Use tools such as <a href="http://www.pivotaltracker.com/">Pivotal Tracker</a> and <a href="http://jerome.blogs.lincoln.ac.uk/2011/04/20/the-jerome-mind-map/">mind maps</a> to capture requirements and turn them into a plan for development.</li>
<li>Meet regularly to review the plan and push the ideas on to the next stage.</li>
<li>Decide what&#8217;s &#8216;out of scope&#8217; early on so you can concentrate on maximum value.</li>
<li>Realise that there&#8217;s value in releasing even <em>part</em> of your data—for example, a list of ISSNs—which you can exploit immediately without having to worry about issues (<em>e.g.</em> third-party copyright in records) that might affect your complete dataset.</li>
<li><a href="http://jerome.blogs.lincoln.ac.uk/">Blog about</a> what you&#8217;re doing. Little and often is the way to go.</li>
</ul>
<p><em>What to avoid:</em></p>
<p>While it&#8217;s important to be aware of the bigger picture, don&#8217;t get <em>too</em> distracted by the way things are being done elsewhere.</p>
<p>A lot of the open data movement seems to be closely tied up with providing access to large quantities of <a href="http://linkeddata.org/">Linked Data</a> <em>(the dreaded RDF triple store!)</em>, and initially I was worried that because we were <em>not</em> taking that approach we were somehow out of step. (In fact, I think that our not concentrating on Linked Data has allowed Jerome to explore the open data landscape in a different and valuable way, <em>i.e.</em> the provision of open bibliographic data services via API – see Paul Walk&#8217;s <a href="http://www.slideshare.net/paulwalk/technical-standards-the-rdtf-vision-some-considerations">slides on providing data <em>vs.</em> providing a data service</a>. I know which side my bread&#8217;s buttered: Open Data ≠ merely open; Open Data = open <strong>and usable</strong>.)</p>
<p>Similarly, while a lot of the discussion around third-party intellectual property rights in data has been phrased in terms of negotiating with those third parties to release the data openly (or taking a risk-assessment approach to releasing it anyway!), Jerome took a different approach, which has been first to release openly those (sometimes minimal) bits of data which we know are free from third-party interest, then to use existing open data sources to enhance the minimal set: what we end up with are data that may differ from the original bibliographic record, but which are inherently open. It&#8217;s not a <em>better</em> approach, just a different one.</p>
<p><em>Who you will need to have &#8216;on side&#8217;:</em></p>
<ul>
<li>Your systems librarian, or at least someone who has a copy of the manual to your library catalogue!</li>
<li>A cataloguer, to <del datetime="2011-07-27T11:05:15+00:00">excuse</del> explain the intricacies of MARC.</li>
<li>A university librarian / head of service who is willing to take risks and sanction the release of open data.</li>
<li>A small-but-thriving community of developers, mashup-enthusiasts and <a href="http://www.shambrarian.org/">shambrarians</a> (see above), including people <a href="http://twitter.com/search/%23ukdiscovery">on Twitter</a> who can act as a sounding board for ideas.</li>
<li>Ideally, your project team should include as many people as possible who do not have a vested interest in the way libraries have historically got things done. Jerome has been valuable in arising from an approach that&#8217;s different from the library norm.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/27/following-in-jeromes-footsteps/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Re-Architecting of Jerome</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/12/the-re-architecting-of-jerome/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/12/the-re-architecting-of-jerome/#comments</comments>
		<pubDate>Tue, 12 Jul 2011 10:26:12 +0000</pubDate>
		<dc:creator>Nick Jackson</dc:creator>
				<category><![CDATA[database]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[APIs]]></category>
		<category><![CDATA[collections]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[recommendation]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://me2inict.blogs.lincoln.ac.uk/?p=586</guid>
		<description><![CDATA[Over the past few days I&#8217;ve been doing some serious brain work about Jerome and how we best build our API layer to make it simultaneously awesomely cool and insanely fast whilst maintaining flexibility and clarity. Here&#8217;s the outcome. To start with, we&#8217;re merging a wide variety of individual tables1 &#8211; one for each type [...]]]></description>
				<content:encoded><![CDATA[<p>Over the past few days I&#8217;ve been doing some serious brain work about Jerome and how we best build our API layer to make it simultaneously awesomely cool and insanely fast whilst maintaining flexibility and clarity. Here&#8217;s the outcome.</p>
<p>To start with, we&#8217;re merging a wide variety of individual tables<sup><a href="http://lincoln.nickjackson.me/the-re-architecting-of-jerome/#footnote_0_586" id="identifier_0_586" class="footnote-link footnote-identifier-link" title="Strictly speaking Mongo calls them Collections, but I&rsquo;ll stick with tables for clarity">1</a></sup> – one for each type of resource offered – into a single table which handles multiple resource types. We&#8217;ve opted to use all the fields in the RIS format as our &#8216;basic information&#8217; fields, although obviously each individual resource type can extend this with their own data if necessary. This has a few benefits; first of all we can interface with our data easier than before without needing to write type-specific code which translates things back to our standardised search set. As a byproduct of this we can optimise our search algorithms even further, making it far more accurate and following generally accepted algorithms for this sort of thing. Of course, you&#8217;ll still be able to fine-tune how we search in the Mixing Deck.</p>
<p>To make this even easier to interface with from an admin side, we&#8217;ll be strapping some APIs (hooray!) on to this which support the addition, modification and removal of resources programmatically. What this means is that potentially anybody who has a resource collection they want to expose through Jerome can do, they just need to make sure their collection is registered to prevent people flooding it with nonsense that isn&#8217;t &#8216;approved&#8217; as a resource. Things like the <a href="http://diverse.blogs.lincoln.ac.uk/">DIVERSE</a> research project can now not only pull Jerome resource data into their interface, but also push into our discovery tool and harness Jerome&#8217;s recommendation tools. Which brings me neatly on to the next point.</p>
<p>Recommendation is something we want to get absolutely right in Jerome. The amount of information out there is simply staggering. Jerome already handles nearly 300,000 individual items and we want to expand that to way more by using data from more sources such as journal table of contents. Finding what you&#8217;re actually after in this can be like the proverbial needle in a haystack, and straight search can only find so much. To explore a subject further we need some form of recommendation and &#8216;similar item engine. What we&#8217;re using is an approach with a variety of angles.</p>
<p>At a basic level Jerome runs term extraction on any available textual content to gather a set of terms which describe the content, very similar to what you&#8217;ll know as tags. These are generated automatically from titles, synopses, abstracts and any available full text. We can then use the intersection of terms across multiple works to find and rank similar items based on how many of these terms are shared. This gives us a very simple &#8220;items like this&#8221; set of results for any item, with the advantage that it&#8217;ll work across all our collections. In other words, we can find useful journal articles based on a book, or suggest a paper in the repository which is on a similar subject to an article you&#8217;re looking for.</p>
<p>We then also have a second layer very similar to Amazon&#8217;s &#8220;people who bought this also bought&#8230;&#8221;, where we look over the history of users who used a specific resource to find common resources. These are then added to the mix and the rankings are tweaked accordingly, providing a human twist to the similar items by suppressing results which initially seem similar but which in actuality don&#8217;t have much in common at a content level, and pushing results which <em>are</em> related but which don&#8217;t have enough terms extracted for Jerome to infer this (for example books which only have a title and for which we can&#8217;t get a summary) up to where a user will find them easier.</p>
<p>Third of all in recommendation there&#8217;s the &#8220;people on your course also used&#8221; element, which is an attempt to make a third pass at fine-tuning the recommendation using data we have available on which course you&#8217;re studying or which department you&#8217;re in. This is very similar to the &#8220;used this also used&#8221; recommendation, but operating at a higher level. We analyse the borrowing patterns of an entire department or course to extract both titles and semantic terms which prove popular, and then boost these titles and terms in any recommendation results set. By only using this as a &#8216;booster&#8217; in most cases it prevents recommendation sets from being populated with every book ever borrowed whilst at the same time providing a more relevant response.</p>
<p>So, that&#8217;s how we recommend items. APIs for this will abound, allowing external resource providers to register &#8216;uses&#8217; of a resource with us for purposes of recommendation. We&#8217;re not done yet though, recommendation has another use!</p>
<p>As we have historical usage data for both individuals and courses, we can throw this into the mix for searching by using semantic terms to actively move results up or down (but never remove them) based on the tags which both the current user and similar users have actually found useful in the past. This means that (as an example) a computing student searching for the author name &#8220;J Bloggs&#8221; would have &#8220;Software Design by Joe Bloggs&#8221; boosted above &#8220;18th Century Needlework by Jessie Bloggs&#8221;, despite there being nothing else in the search term to make this distinction. As a final bit of epic coolness, Jerome will sport a &#8220;Recommended for You&#8221; section where we use all the recommendation systems at our disposal to find items which other similar users have found useful, as well as which share themes with items borrowed by the individual user.</p>
<ol class="footnotes"><li id="footnote_0_586" class="footnote">Strictly speaking Mongo calls them Collections, but I&#8217;ll stick with tables for clarity</li></ol>]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/12/the-re-architecting-of-jerome/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>#discodev: worldwide software development competition using open library data</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/07/05/discodev-worldwide-software-development-competition-using-open-library-data/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/07/05/discodev-worldwide-software-development-competition-using-open-library-data/#comments</comments>
		<pubDate>Tue, 05 Jul 2011 14:37:55 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[#discodev]]></category>
		<category><![CDATA[#ukdiscovery]]></category>
		<category><![CDATA[competitions]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[Jerome blog]]></category>
		<category><![CDATA[prizes]]></category>
		<category><![CDATA[web development]]></category>
		<category><![CDATA[open data]]></category>

		<guid isPermaLink="false">http://pstainthorp.blogs.lincoln.ac.uk/?p=1941</guid>
		<description><![CDATA[Copied verbatim (and under licence!)&#160;from the UK Discovery website: UK Discovery&#160;(http://discovery.ac.uk/)&#160;and the&#160;Developer Community Supporting Innovation (DevCSI)&#160;project based at UKOLN are running a global Developer Competition throughout July 2011 to build open source software applications / tools, using at least one of our 10&#160;open data sources&#160;collected from libraries, museums and archives. &#8230;and one of the 10 [...]]]></description>
				<content:encoded><![CDATA[<p>Copied verbatim (and under <a href="http://discovery.ac.uk/legal/">licence</a>!) from the <a href="http://discovery.ac.uk/developers/competition/">UK Discovery</a> website:</p>
<blockquote><p><strong><a href="http://paulstainthorp.com/files/2011/07/logo.gif"><img class="alignnone size-medium wp-image-1943" title="logo" src="http://paulstainthorp.com/files/2011/07/logo-300x141.gif" alt="Discovery logo" width="300" height="141" /></a></strong></p>
<p><strong>UK Discovery</strong> (<a title="http://discovery.ac.uk/" href="http://discovery.ac.uk/">http://discovery.ac.uk/</a>) and the <a title="http://devcsi.ukoln.ac.uk/" href="http://devcsi.ukoln.ac.uk/">Developer Community Supporting Innovation (DevCSI)</a> project based at UKOLN are running a global Developer Competition throughout July 2011 to build open source software applications / tools, using at least one of our 10 <a title="http://discovery.ac.uk/packages/api/demos/paclist.php" href="http://discovery.ac.uk/packages/api/demos/paclist.php">open data sources</a> collected from libraries, museums and archives.</p></blockquote>
<p>…and one of the 10 open data sources is the <a href="http://jerome.blogs.lincoln.ac.uk/2011/06/28/is-that-a-jerome-open-data-api-i-spy/">Jerome API we announced last week</a>!</p>
<blockquote><p>Enter simply by blogging about your application and emailing the blog post URI to <strong><a title="mailto:joy.palmer@manchester.ac.uk" href="mailto:joy.palmer@manchester.ac.uk">joy.palmer@manchester.ac.uk</a></strong> by the deadline of 2359 (your local time) on Monday <strong>1 August 2011.</strong></p>
<p>Full details of the competition, the data sets and how to enter are at <strong><a title="http://discovery.ac.uk/developers/competition/" href="http://discovery.ac.uk/developers/competition/">http://discovery.ac.uk/developers/competition/</a></strong></p></blockquote>
<p>Follow <a href="http://twitter.com/search/%23discodev">#discodev</a> on Twitter to see what people are up to.</p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/07/05/discodev-worldwide-software-development-competition-using-open-library-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is that a Jerome open data API I spy?</title>
		<link>http://jerome.blogs.lincoln.ac.uk/2011/06/28/is-that-a-jerome-open-data-api-i-spy/</link>
		<comments>http://jerome.blogs.lincoln.ac.uk/2011/06/28/is-that-a-jerome-open-data-api-i-spy/#comments</comments>
		<pubDate>Tue, 28 Jun 2011 17:44:27 +0000</pubDate>
		<dc:creator>Paul Stainthorp</dc:creator>
				<category><![CDATA[bibliographic data]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[Jerome]]></category>
		<category><![CDATA[Jerome blog]]></category>
		<category><![CDATA[JSON]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[open data]]></category>

		<guid isPermaLink="false">http://pstainthorp.blogs.lincoln.ac.uk/?p=1901</guid>
		<description><![CDATA[Yes. Yes, it is. http://data.online.lincoln.ac.uk/documentation.html#bib This is only the initial, bare-bones JSON-only service. A complete (and fully-documented)&#160;API will be released in stages over the next month, providing data in a range of output formats. We&#8217;re keeping all API and open institutional data documentation in the one place, on our open data site.]]></description>
				<content:encoded><![CDATA[<p>Yes. Yes, it is.</p>
<p><a href="http://data.online.lincoln.ac.uk/documentation.html#bib"><strong>http://data.online.lincoln.ac.uk/documentation.html#bib</strong></a></p>
<p>This is only the initial, bare-bones JSON-only service. A complete (and fully-documented) API will be released in stages over the next month, providing data in a range of output formats. We&#8217;re keeping all API and <a href="http://data-ac-uk.ecs.soton.ac.uk/">open institutional data</a> documentation in the one place, on our open data site.</p>
]]></content:encoded>
			<wfw:commentRss>http://jerome.blogs.lincoln.ac.uk/2011/06/28/is-that-a-jerome-open-data-api-i-spy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
