<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Seeing the picture &#187; PicsYes</title>
	<atom:link href="http://blog.lib.uiowa.edu/hardinmd/category/picsyes/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.lib.uiowa.edu/hardinmd</link>
	<description>Thoughts while working on Hardin MD on digitization &#38; libraries</description>
	<lastBuildDate>Wed, 18 Nov 2009 22:16:00 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google Books Integrated into Google Search Results</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/10/30/google-books-integrated-into-google-search-results/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/10/30/google-books-integrated-into-google-search-results/#comments</comments>
		<pubDate>Fri, 30 Oct 2009 19:50:39 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4548</guid>
		<description><![CDATA[
The recent controversy about the Google Book Search Settlement seems to have taken up peoples&#8217; Google-watching attention so much that advances in the way GBS actually works have been getting overlooked. Several notable improvements were made during the summer, for example, that got very little recognition. Another change that seems to have gotten little recognition [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.google.com/search?q=ted+wheeler+iowa+track+1972"><img class="alignnone size-full wp-image-4604" style="padding-right: 16px; padding-top: 3px; padding-bottom: 3px;" title="gsearch14_701" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/11/gsearch14_701.jpg" border="0" alt="" width="478" height="381" align="left" /></a></p>
<p>The recent controversy about the Google Book Search Settlement seems to have taken up peoples&#8217; Google-watching attention so much that advances in the way GBS actually works have been getting overlooked. <a href="http://booksearch.blogspot.com/2009/06/new-features-on-google-books.html">Several</a> <a href="http://booksearch.blogspot.com/2009/06/explore-book-in-10-seconds.html">notable</a> <a href="http://booksearch.blogspot.com/2009/07/new-ways-to-search-within-book.html">improvements</a> were made during the summer, for example, that got very little recognition. Another change that seems to have gotten little recognition is that Google web searches have begun to include links to books in GBS in the last 1-2 years (as in the example at left). Particularly in searching for historical topics, I&#8217;ve been seeing searches recently in which the majority of the first 10 hits are from GBS &#8212; A great advance, I think, for historical research. Up to now, my experience has been that history has been a fairly weak subject on the Web &#8212; Locked away in books, not on Web pages.</p>
<p><a href="http://books.google.com/books?id=9004n0ljGfYC&amp;pg=PA80&amp;lpg=PA80&amp;dq=ted+wheeler+iowa+track+1972&amp;source=bl&amp;ots=e8QDyeZHdh&amp;sig=cGGW4qFkRhgMrVIv_4nfBr6k8WU&amp;hl=en&amp;ei=O_niSqm4BY2d8AbZ7_H3AQ&amp;sa=X&amp;oi=book_result&amp;ct=result&amp;resnum=5&amp;ved=0CBwQ6AEwBA#v=onepage&amp;q=ted%20wheeler%20iowa%20track%201972&amp;f=false"><img class="alignnone size-full wp-image-4549" style="padding-right: 16px; padding-top: 3px; padding-bottom: 3px;" title="wheeler1plus2_80" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/wheeler1plus2_80.jpg" border="0" alt="" width="366" height="867" align="left" /></a></p>
<p>I had occasion to take advantage of the newly accessible books from GBS recently, when I was least expecting it, while having a discussion with my son David, who&#8217;s a long-distance runner, about track runners of the past at the University of Iowa. I remembered that one particular runner on the team, Ted Wheeler, ran on the US Olympic team in the 1950&#8217;s, and that he later went on to become the coach for the UI track team (I especially knew about him because while he was the coach he married Sheila Creth, the University Librarian at the University of Iowa Libraries, where I work). David knew that Wheeler had been in the Olympics, and thought that he had been an assistant coach at Iowa, rather than the head coach. So &#8230; of course I turned to Google to settle the &#8220;discussion.&#8221; It turned out to be a surprisingly difficult search. I assumed that it would be fairly easy to find records of recent track coaches at a large, Big Ten program like Iowa. But it wasn&#8217;t &#8212; I tried several search terms without success before &#8212; <a href="http://www.google.com/search?q=ted+wheeler+iowa+track+1972">Bingo!</a> &#8212; I finally hit upon the combination that turned up the <a href="http://books.google.com/books?id=9004n0ljGfYC&amp;pg=PA80&amp;lpg=PA80&amp;dq=ted+wheeler+iowa+track+1972&amp;source=bl&amp;ots=e8QDyeZHdh&amp;sig=cGGW4qFkRhgMrVIv_4nfBr6k8WU&amp;hl=en&amp;ei=O_niSqm4BY2d8AbZ7_H3AQ&amp;sa=X&amp;oi=book_result&amp;ct=result&amp;resnum=5&amp;ved=0CBwQ6AEwBA#v=onepage&amp;q=ted%20wheeler%20iowa%20track%201972&amp;f=false">page shown here</a>, establishing that Wheeler was, indeed, the UI track coach from 1978 to 1996 &#8212; with the added benefit of a great picture!</p>
<p>The point of this little story: I think integrating GBS links into Google web search is a great advance, and deserves more attention. As I said above, there&#8217;s been so much negative press for Google in recent discussions of the Settlement that everything they do is interpreted negatively &#8212; I saw a link in the last couple of weeks, that I unfortunately didn&#8217;t keep track of, decrying Google&#8217;s putting GBS links in Web search results because someone thought Google was trying to unfairly boost their own content. Really?? I think there&#8217;s such a treasure in old books that the world will benefit from Google&#8217;s making them more accessible. There are questions, certainly, about the algorithm used by Google to determine which books are included in Web search results, and I hope Google will say more about that. But it&#8217;s not only Google that&#8217;s saying little on the subject &#8212; I haven&#8217;t seen much discussion at all by anybody on the integration of GBS books in Google web search results &#8211;  If anyone can find it, please add a comment or contact me by Twitter or Email.</p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/10/30/google-books-integrated-into-google-search-results/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Book Search a Hot Topic? NOT!</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/10/28/google-book-search-a-hot-topic-not/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/10/28/google-book-search-a-hot-topic-not/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 22:06:32 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4527</guid>
		<description><![CDATA[Searching for talk on Google Books and the Settlement since Judge Denny Chin delayed the decision on October 7, I&#8217;ve been finding very little &#8212; What had been a stream of chatter in Twitter searches has turned into a trickle. I found a little example reflecting this today that I think is worth recording &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p>Searching for talk on Google Books and the Settlement since Judge Denny Chin delayed the decision on October 7, I&#8217;ve been finding very little &#8212; What had been a stream of chatter in Twitter searches has turned into a trickle. I found a little example reflecting this today that I think is worth recording &#8212; The first seven hits in a Twitter search for <a href="http://search.twitter.com/search?q=%23gbs">#GBS</a>, going back a day, are in German. &#8230; You can pretty much tell when NO ONE in the US is talking about a subject when you search in Twitter and find that the last day&#8217;s tweets are NOT IN ENGLISH! &#8230; I&#8217;d predict that <a href="http://mediadecoder.blogs.nytimes.com/2009/10/07/judge-sets-nov-9-deadline-for-revised-google-book-settlement/?hp">in a couple of weeks</a> there WILL be a bit of discussion in English!</p>
<p style="text-align: center;"><a href="http://search.twitter.com/search?q=%23gbs"><img class="size-full wp-image-4528 aligncenter" title="twittersearch1" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/twittersearch1.jpg" border="0" alt="" width="633" height="713" /></a></p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/10/28/google-book-search-a-hot-topic-not/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Apple &amp; Google Win &#8211; And Libraries Don&#8217;t</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/10/22/why-apple-google-win-and-libraries-dont/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/10/22/why-apple-google-win-and-libraries-dont/#comments</comments>
		<pubDate>Thu, 22 Oct 2009 19:46:31 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Library Catalog]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4397</guid>
		<description><![CDATA[
Many possible takes on this picture. What comes to my mind first is the idea of the Attention Economy &#8211;The idea that in the days of the traditional library, before the Internet, information was a limited resource. Libraries could afford to work under the assumption that &#8220;we&#8217;ve got the good stuff, and our users have [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.scottmonty.com/2009/10/why-apple-google-win-and-your-company.html"><img class="alignnone size-full wp-image-4418" title="whyapple3_601" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/whyapple3_601.jpg" border="0" alt="" width="668" height="150" /></a></p>
<p>Many possible takes on <a href="http://www.scottmonty.com/2009/10/why-apple-google-win-and-your-company.html">this picture</a>. What comes to my mind first is the idea of the <a href="http://www.webpages.uidaho.edu/~mbolin/bridges3.htm">Attention</a> <a href="http://74.125.95.132/search?q=cache:4zp44BurMqAJ:https://www.sla.org/Documents/conf/toronto/Makani.doc+%22attention+economy%22+librarians&amp;cd=4&amp;hl=en&amp;ct=clnk&amp;gl=us">Economy </a>&#8211;The idea that in the days of the traditional library, before the Internet, information was a limited resource. Libraries could afford to work under the assumption that &#8220;we&#8217;ve got the good stuff, and our users have to to come to us to get it.&#8221; There was little motivation to improve overly-complicated search interfaces like the picture on the right above, because users had no choice. In the new environment of the Internet, however, the limiting factor is not information, but attention. The problem of users now is not finding information, but being flooded by too much information. In this environment, users naturally gravitate to the easiest information to find, which, of course, Apple, Google et al are glad to provide.</p>
<p>Another take on this is the high cost of Simplicity &#8211;The simple interfaces of Apple and Google are just the tip of the iceberg, built upon the costly labor of armies of engineers. Libraries just can&#8217;t afford to compete with this sort of juggernaut. Personally, I consider myself lucky, as a librarian, to be working in a medical library &#8212; Medical libraries have a long history of generous federal support, in the interests of the country&#8217;s health, which has enabled the creation of tools to streamline access to medical information, from Index Medicus to PubMed. For libraries generally, however, it&#8217;s still hard to compete with the resources of dotcom information providers. To end on a hopeful note &#8212; It&#8217;s encouraging to see that libraries are <a href="http://chronicle.com/article/After-Losing-Users-in/48588/">increasingly realizing</a> the importance of providing Google-like interfaces for their catalogs, to gain back the attention from users that they&#8217;ve lost in recent few years.</p>
<p>The picture above, and the title of this post, are adapted from <a href="http://www.scottmonty.com/2009/10/why-apple-google-win-and-your-company.html">an article</a> by <a href="http://twitter.com/ScottMonty">Scott Monty</a> &#8212; Thanks!</p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/10/22/why-apple-google-win-and-libraries-dont/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>MedlinePlus Needs a New Name</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/10/21/medlineplus-needs-a-new-name/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/10/21/medlineplus-needs-a-new-name/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 14:36:27 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[MedlinePlus]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4336</guid>
		<description><![CDATA[Google Health OneBox is a boost for NLM&#8217;s MedlinePlus &#8212; As discussed previously, though, a few tweaks could make it an even bigger boost. A problem not discussed in the previous article is the &#8220;MedlinePlus&#8221; name &#8212; It has little user recognition, and therefore gets considerably less traffic than it might with a better name. [...]]]></description>
			<content:encoded><![CDATA[<p>Google Health OneBox is a boost for NLM&#8217;s MedlinePlus &#8212; As discussed <a href="http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/">previously</a>, though, a few tweaks could make it an even bigger boost. A problem not discussed in the previous article is the &#8220;MedlinePlus&#8221; name &#8212; It has little user recognition, and therefore gets considerably less traffic than it might with a better name. In the NLM Update at the recent Midwest Chapter/Medical Library Association meeting, NLM staffer Paula Kitendaugh said some people at NLM are aware of this, and that a different name would likely do better in Google OneBox, but that so far bureaucratic inertia has prevented a name-change.</p>
<p><a href="http://www.google.com/#hl=en&amp;q=asthma"><img class="alignnone size-full wp-image-4358" style="padding-right: 16px; padding-top: 3px; padding-bottom: 3px;" title="onebox5_90" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/onebox5_90.jpg" border="0" alt="" width="468" height="223" align="left" /></a></p>
<p>Realizing how slowly the wheels turn in a large organization like NLM, then, a better name for MedlinPlus is probably unlikely to happen soon. But how about a quick fix for the name of the link in Google OneBox, to take advantage of the fire-hose of potential traffic from Google? My idea for a simple change, that I think would draw more traffic, as shown in the enhanced<span style="color: #e51932;"><strong> </strong></span>screen shot here, is to change the link name from &#8220;<a href="http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/">Medline Plus</a>&#8221; to &#8220;<strong><span style="color: #e00000;"><span style="color: #e51932;">Natl Lib Med</span></span>.</strong>&#8221; I think this simple abbreviation would be recognized and respected by users, and boost clicks to NLM.</p>
<p>As far as a new name for MedlinePlus, I don&#8217;t have any ideas so far. If anyone else does, please make a comment, or send to me via email or Twitter.</p>
<p>Accompanying article: <a href="http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/">MedlinePlus &amp; Google Health OneBox</a></p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/10/21/medlineplus-needs-a-new-name/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>MedlinePlus &amp; Google Health OneBox</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 18:15:58 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[MedlinePlus]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4233</guid>
		<description><![CDATA[
In August, Google launched Google Health OneBox (left). This puts the National Library of Medicine&#8217;s Medline Plus right at the top of the search results, and is potentially a valuable new source of traffic for NLM.
There are factors, however, that work against MLP &#8212; The three prominent links on the left, which are likely to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.google.com/#hl=en&amp;q=asthma"><img class="alignnone size-full wp-image-4235" style="padding-right: 16px; padding-top: 3px; padding-bottom: 3px;" title="onebox2_80" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/onebox2_80.jpg" border="0" alt="" width="408" height="242" align="left" /></a></p>
<p>In August, Google launched Google Health OneBox (left). This puts the National Library of Medicine&#8217;s <strong>Medline Plus</strong> right at the top of the search results, and is potentially a valuable new source of traffic for NLM.</p>
<p>There are factors, however, that work against MLP &#8212; The three prominent links on the left, which are likely to get the bulk of OneBox clicks (<strong>Asthma</strong>, <strong>Google Health</strong>, &amp; thumbnail) go to the Google Health Topics page (below). This has the same text and pictures as the MLP Encyclopedia/ADAM page that&#8217;s linked from the OneBox <strong>Medline Plus</strong> link. But there&#8217;s an important difference &#8212; The Google Health version of ADAM has <strong>Symptoms</strong> as the first section after <strong>Overview</strong>. The MLP version of ADAM, on the other hand (see further down on this page) has <strong>Causes</strong> as the first section. &#8230;</p>
<p><a href="https://www.google.com/health/ref/Asthma"><img class="alignnone size-full wp-image-4238" style="padding-top: 3px; padding-bottom: 3px;" title="googlehealth21_70" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/googlehealth21_70.jpg" border="0" alt="" width="588" height="316" /></a></p>
<p>This may seem to be a minor difference. But I&#8217;ve learned &#8212; through long experience with Hardin MD and brief experience with the short-lived Medical Library Association-Google Health Coop project &#8212; that symptoms are a very popular, heavily searched topic for users (which Google certainly knows!). So I suspect that users who try out the Google Health and Medline Plus OneBox links will quickly learn to prefer Google Health because it features the symptoms information they&#8217;re looking for. It IS a positive for NLM that the Google Health page has a prominent link to MLP. But it&#8217;s rather surprising that there&#8217;s no clear credit given to ADAM as the original provider of the information &#8212; ADAM is credited only at the bottom of the page, where few users will see it (and I suspect many will consider it copyright-free, since they&#8217;ll presume that it&#8217;s from a government site.)</p>
<p><a href="http://www.nlm.nih.gov/medlineplus/ency/article/000141.htm"><img class="alignnone size-full wp-image-4256" title="mlp2_70" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/10/mlp2_70.jpg" border="0" alt="" width="598" height="402" /></a></p>
<p><strong>MedlinePlus &amp; Google Health OneBox &#8212; How NLM can boost traffic</strong></p>
<p>Change the order of sections on ADAM Encyclopedia pages, to put Symptoms at the top, as Google does. This would make the pages more interesting to most users.</p>
<p>Surprisingly, MLP Encyclopedia pages, which is what Google OneBox links to, have no links to equivalent MLP Health Topic pages (Example: there is no link between the Asthma pages in the <a href="http://www.nlm.nih.gov/medlineplus/ency/article/000141.htm">Encyclopedia</a> and in <a href="http://www.nlm.nih.gov/medlineplus/asthma.html">Health Topics</a>) &#8212; After all, it&#8217;s the MLP Health Topic pages that NLM staff creates and maintains, so how about making links to them from Encyclopedia pages, so the surging clickers from Google OneBox can find them!</p>
<p>See follow-up article: <a href="http://blog.lib.uiowa.edu/hardinmd/2009/10/21/medlineplus-needs-a-new-name/">MedlinePlus Needs a New Name</a></p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/10/15/medlineplus-google-health-onebox/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>More Metadata Problems in Google Books?: Word Clouds</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/09/30/more-metadata-problems-in-google-books-word-clouds/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/09/30/more-metadata-problems-in-google-books-word-clouds/#comments</comments>
		<pubDate>Wed, 30 Sep 2009 15:22:00 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Train Wreck]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[eBooks]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4134</guid>
		<description><![CDATA[A month ago Geoff Nunberg wrote two articles that got much attention on Google Book Search&#8217;s &#8220;metadata trainwreck,&#8221; relating to incorrect dating of books. I discovered another metadata-ish sort of problem, as I read Lorcan Dempsey&#8217;s recent article on GBS word clouds, and the value of their &#8220;glancability&#8221; for getting a quick overview of the [...]]]></description>
			<content:encoded><![CDATA[<p>A month ago Geoff Nunberg wrote <a href="http://languagelog.ldc.upenn.edu/nll/?p=1701">two</a> <a href="http://chronicle.com/article/Googles-Book-Search-A/48245/">articles</a> that got much attention on Google Book Search&#8217;s &#8220;metadata trainwreck,&#8221; relating to incorrect dating of books. I discovered another metadata-ish sort of problem, as I read Lorcan Dempsey&#8217;s recent <a href="http://orweblog.oclc.org/archives/002010.html">article</a> on GBS word clouds, and the value of their &#8220;glancability&#8221; for getting a quick overview of the contents of a book.</p>
<p>I was actually thinking of taking Dempsey&#8217;s thought a step further, and proposing the idea of including Google&#8217;s word clouds in library catalogs. But when I started looking more closely at GBS word clouds I found problems &#8212; The first thing I noticed in the cloud for <em><strong>Origin of Species</strong></em> (below and <a href="http://books.google.com/books?id=TCwLAAAAIAAJ&amp;dq=%22origin+of+species%22&amp;ei=yFjCSo3ZE4vUNKzJzeQD">here</a> at GBS [scroll down to <strong>Common terms and phrases</strong>]) is that it has the plant-related words &#8220;seeds,&#8221; &#8220;pistil,&#8221; and &#8220;pollen,&#8221; but does not have the word &#8220;plant(s).&#8221; Hmm, that&#8217;s odd &#8212; So I searched for &#8220;plants&#8221; and found that there are in fact 100 occurrences of it in the book. Then I clicked some of the terms in the cloud shown below, and found that the number of results often does not correlate well with the font size of the word (which is what&#8217;s supposed to happen in a word cloud) &#8230;</p>
<p style="text-align: center;"><a href="http://books.google.com/books?id=TCwLAAAAIAAJ&amp;dq=%22origin+of+species%22&amp;ei=yFjCSo3ZE4vUNKzJzeQD"><img class="size-full wp-image-4136 aligncenter" title="originspecies1_62" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/09/originspecies1_62.jpg" border="0" alt="" width="599" height="379" /></a></p>
<p>Note that the words &#8220;admit,&#8221; &#8220;cause,&#8221; and &#8220;male,&#8221; which are in the smallest font, have more occurrences than other terms with larger fonts &#8212; &#8220;Asa Gray&#8221; and &#8220;pistil&#8221; in particular.</p>
<p>I tried several books, and found similar results in all of them  &#8212; The font size of terms in the word cloud does not show much correlation with the number of occurrences of words in the books. In Snippet view books (as at least <a href="http://books.google.com/books?id=orbgAAAAMAAJ&amp;dq=libraries+networks+osi&amp;ei=kmLCStLqHYSyNPXQneQD">one of the books</a> in Dempsey&#8217;s article is) the problem is not apparent because the number of search results is limited to three links in the book, making it impossible to determine how many occurrences of the term there are.</p>
<p>I suspect that the GBS word cloud problem has not been noticed more because the word clouds are rather “buried” — Not on the default <a href="http://books.google.com/books?id=TCwLAAAAIAAJ&amp;printsec=frontcover">Read (Front cover)</a> page, but inconspicuously down in the middle of the <a href="http://books.google.com/books?id=TCwLAAAAIAAJ&amp;dq=%22origin+of+species%22&amp;ei=yFjCSo3ZE4vUNKzJzeQD">Overview</a> page, probably not seen by the vast majority of users.</p>
<p>We need more documentation about word clouds in GBS — How are they derived? What exactly are they intended to mean? Google has said about other metadata problems that they are working on them, and that they’ll slowly get fixed. Hopefully, that will apply to word clouds also. Maybe Google thinks of word clouds as still being “in beta” — they were, after all, <a href="http://booksearch.blogspot.com/2009/06/explore-book-in-10-seconds.html">only launched</a> in July — and that’s why they’re giving them a low profile.</p>
<p>Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/09/30/more-metadata-problems-in-google-books-word-clouds/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Tagging in Hardin MD</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/09/25/tagging-in-hardin-md/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/09/25/tagging-in-hardin-md/#comments</comments>
		<pubDate>Fri, 25 Sep 2009 14:05:11 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Hardin MD]]></category>
		<category><![CDATA[Library Catalog]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=4032</guid>
		<description><![CDATA[
All Hardin MD (HMD) pages  have tags at the bottom, to make them more visible for search engines i.e. Google. We have been doing tagging in HMD since 2000, and it works very well. As shown in the example to the left, the tags are for variant spellings (measels), variant terms (rubeola), and words [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.lib.uiowa.edu/hardin/md/measlespictures.html"><img class="alignnone size-full wp-image-4037" style="padding-right: 16px;padding-top: 3px;padding-bottom: 3px" title="measles1_35_4" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/09/measles1_35_4.jpg" border="0" alt="" width="291" height="426" align="left" /></a></p>
<p>All <a href="http://www.lib.uiowa.edu/hardin/md/index.html">Hardin MD</a> (HMD) pages  have tags at the bottom, to make them more visible for search engines i.e. Google. We have been doing tagging in HMD since 2000, and it works very well. As shown in the example to the left, the tags are for variant spellings (measels), variant terms (rubeola), and words and word combinations relating to pictures (we have found that the word &#8220;pictures&#8221; is especially favored by Google).</p>
<p>One of the things that has made HMD fun has been applying longstanding practices of librarianship to a web-based system. Having been a cataloger for a brief time early in my library career, it seemed natural to put tags at the bottom of the web page, just like subject headings are at the bottom of cards in the card catalog. Including mis-spellings in the tags to help users find the page seemed natural, too &#8212; As a cataloger, I had been taught to put x-ref cards in the catalog for variant ways that patrons might look for a book, and following the same principle on web pages, it became possible to apply it on a much larger scale.</p>
<p>It continues to surprise me that this simple idea &#8212; Putting tags on web pages &#8212; has not been more widely applied. I have seen very few cases of it at other sites. I suspect part of the reason for this is that people have tended to think the hidden meta keyword field was the place to put tags, rather than &#8220;cluttering up&#8221; their pages by putting them on the page. Google&#8217;s announcement  a few days ago that they ignore meta keywords finally puts an end to that idea. But many SEO people have thought meta keywords were ineffective for a long time, and it was certainly our experience &#8212; Around the time we began putting tags on pages in 2000, we compared meta field tagging and on-page tagging, and found that meta field tagging seemed to be ignored by Google.</p>
<p>Another factor that may have discouraged people from putting tags inconspicuously at the bottom of the page is that SEO people generally say that words need to be in a prominent place on the page, preferably near the top, to be found by Google. That&#8217;s no doubt true for common words that have a lot of competition, but for relatively uncommon words, like variant spellings of medical diseases, placement at the bottom of the page works well. (One proviso: Our pages with HMD are relatively small, usually no more than two screens. Putting tags at the bottom of larger pages may not work as well.)</p>
<p>I suspect a reason that people don&#8217;t think more of experimenting with tagging and Google visibility is that it is a lengthy process. Google&#8217;s not going to see new words on your page right away. It may take several weeks or even months. So it requires careful record-keeping, to note when words are added, and having a regular schedule of Google checking to see if your pages are starting to appear in search results.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/09/25/tagging-in-hardin-md/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Metadata About Metadata: Library Catalog Fail</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-about-metadata-library-catalog-fail/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-about-metadata-library-catalog-fail/#comments</comments>
		<pubDate>Thu, 03 Sep 2009 22:06:41 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[Library Catalog]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Train Wreck]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=3861</guid>
		<description><![CDATA[David Weinberger&#8217;s book Everything is Miscellaneous: The Power of the New Digital Disorder is fascinating &#8212; I&#8217;m especially enjoying his many original comments on metadata. So, trying out Weinberger&#8217;s ideas, I search in local library catalogs for david weinberger metadata &#8212; I get: NO ENTRIES FOUND &#8230; Hmmm &#8230; How does Google Book Search compare? [...]]]></description>
			<content:encoded><![CDATA[<p>David Weinberger&#8217;s book <a href="http://www.worldcat.org/oclc/122291427">Everything is Miscellaneous: The Power of the New Digital Disorder</a> is fascinating &#8212; I&#8217;m especially enjoying his many original comments on metadata. So, trying out Weinberger&#8217;s ideas, I search in local library catalogs for <em><strong>david weinberger metadata</strong></em> &#8212; I get: NO ENTRIES FOUND &#8230; Hmmm &#8230; How does Google Book Search compare? I do the same search in GBS, and Bingo &#8211;There it is, at the top of the list &#8230;</p>
<p><a href="http://books.google.com/books?q=david+weinberger+metadata"><img class="alignnone size-full wp-image-3864" title="gbsweinberger1_67" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/09/gbsweinberger1_67.jpg" border="0" alt="" width="596" height="250" /></a></p>
<p>Google, of course, puts the book at the top of the list because its deep metadata indicates that <strong>metadata</strong> is an important topic, and PageRank likely indicates that other people also value Weinberger&#8217;s discussion of the topic.</p>
<p>So, why don&#8217;t library catalogs find the book? &#8212; The problem is the subject headings assigned by the Library of Congress, and used in most all library catalogs:</p>
<p style="padding-left: 90px;">Knowledge management.<br />
Information technology &#8212; Management.<br />
Information technology &#8212; Social aspects.<br />
Personal information management.<br />
Information resources management.<br />
Order.</p>
<p>Even though the book discusses metadata at length and on many pages, it&#8217;s not deemed important enough to be a heading &#8212; The problem is that the traditional catalog is what Weinberger calls a &#8220;second order&#8221; resource, being limited to the small number of subject headings that will fit on a card in the (bygone) catalog. Given resources to assign a larger number of subject headings, no doubt <strong>metadata</strong> would be included.</p>
<p>So &#8230; Librarians can&#8217;t afford to be smug about metadata &#8212; Google has problems (as discussed in Geoff Nunberg articles linked below). But libraries have their own problems. In many ways the traditional library catalog lacks metadata features that have become common in Google, Amazon, and other sites.</p>
<p>Hope for Libraries &#8212; WorldCat does find the book with the <a href="http://www.worldcat.org/search?qt=worldcat_org_all&amp;q=david+weinberger+metadata">david weinberger metadata</a> search (#2 in results), because it has additional tags listed in its &#8220;<a href="http://www.worldcat.org/oclc/122291427">Abstract</a>&#8221; (scroll down) which include <strong>metadata</strong> &#8212; Sooner or later, maybe libraries will add the WorldCat Abstract to their catalogs to &#8220;enrich their metadata.&#8221;</p>
<p>Context: Recent articles by Geoff Nunberg:<br />
<a href="http://languagelog.ldc.upenn.edu/nll/?p=1701">Google Books: A Metadata Train Wreck</a>, Language Log blog<br />
<a href="http://chronicle.com/article/Googles-Book-Search-A/48245/">Google&#8217;s Book Search: A Disaster for Scholars</a>, Chronicle of Higher Education</p>
<p>Eric Rumsey is on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-about-metadata-library-catalog-fail/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>&#8220;Metadata Train Wreck&#8221;: Librarians Should Tread Lightly</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-train-wreck-librarians-should-tread-lightly/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-train-wreck-librarians-should-tread-lightly/#comments</comments>
		<pubDate>Thu, 03 Sep 2009 22:01:44 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[Library Catalog]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Train Wreck]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=3837</guid>
		<description><![CDATA[There&#8217;s been much buzz among librarians, and others, on recent articles by Geoff Nunberg (UC Berkeley School of Information) on the &#8220;Train Wreck&#8221; state of Metadata in Google Book Search (See article references below). Nunberg certainly makes some good points. But we librarians are far from perfection in the metadata realm &#8212; Look at the [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s been much buzz among librarians, and others, on recent articles by Geoff <span id="msgtxt3738753963" class="msgtxt en">Nunberg</span> (UC Berkeley School of Information) on the &#8220;<strong>Train Wreck</strong>&#8221; state of <strong>Metadata</strong> in Google Book Search (See article references below). <span id="msgtxt3738753963" class="msgtxt en">Nunberg</span> certainly makes some good points. But we librarians are far from perfection in the metadata realm &#8212; Look at the good ol&#8217; Card Catalog &#8212; Problems abound, as described in an <a href="http://www.youtube.com/watch?v=AIxx3haKo-I">amusing YouTube piece</a> by librarian Brian Mathews (@<a href="http://twitter.com/brianmathews">brianmathews</a>). He uses Georgia Tech as an example, but the same sorts of problems exist in many catalogs. It&#8217;s especially appropriate, in light of Nunberg&#8217;s emphasis of date problems in GBS, that one of the examples highlighted by Mathews (below) is author dates &#8230; Born by the metadata, die by the metadata &#8230;</p>
<p style="text-align: center;"><a href="http://www.youtube.com/watch?v=AIxx3haKo-I"><img class="size-full wp-image-3840 aligncenter" title="cataloganomalies2_62" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/09/cataloganomalies2_62.jpg" border="0" alt="" width="576" height="213" /></a></p>
<p><a href="http://twitter.com/ericrumsey"></a></p>
<p>Context: Recent articles by Geoff Nunberg:<br />
<a href="http://languagelog.ldc.upenn.edu/nll/?p=1701">Google Books: A Metadata Train Wreck</a>, Language Log blog<br />
<a href="http://chronicle.com/article/Googles-Book-Search-A/48245/">Google’s Book Search: A Disaster for Scholars</a>, Chronicle of Higher Education</p>
<p>Eric Rumsey is on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/09/03/metadata-train-wreck-librarians-should-tread-lightly/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Metaphorical Marginalia as Metadata</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2009/08/18/metaphorical-marginalia-as-metadata/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2009/08/18/metaphorical-marginalia-as-metadata/#comments</comments>
		<pubDate>Tue, 18 Aug 2009 15:13:57 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Digitization]]></category>
		<category><![CDATA[Marginalia]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[eBooks]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=3581</guid>
		<description><![CDATA[Marginalia &#8212; writing notes in the margins of books &#8212; is not an exact fit for digital books. But the concept has been getting bantered about in a metaphorical sense to denote any kind of user annotation in digital texts. In my June article on Cathy Marshall&#8217;s studies of user annotations is this quote from [...]]]></description>
			<content:encoded><![CDATA[<p>Marginalia &#8212; writing notes in the margins of books &#8212; is not an exact fit for digital books. But the concept has been getting bantered about in a metaphorical sense to denote any kind of user annotation in digital texts. In my <a href="http://blog.lib.uiowa.edu/hardinmd/2009/06/08/crowdsourcing-annotations-for-books-and-ebooks/">June article</a> on Cathy Marshall&#8217;s studies of user annotations is this quote from <a href="http://blogs.zdnet.com/Ratcliffe/?p=403">Mitch Ratcliffe</a>: &#8220;Creating <strong>marginalia</strong> is an art made for the era of &#8216;crowdsourcing&#8217;.” Ratcliffe&#8217;s article is a long technical commentary that sounds very &#8220;metadata-ish,&#8221; although he doesn&#8217;t actually use the meta-word. So it wasn&#8217;t surprising to find that he has made the connection between user annotation/marginalia and metadata, in a <a href="http://booksahead.com/?p=509">July article</a> [boldface in all quotes added]:</p>
<blockquote><p>Readers will be the creators of the most important <strong>metadata</strong> describing books. Period. There is no second-guessing that conclusion, which has been proved again and again in every hypertext environment in human history. Defining the problem of book <strong>metadata</strong> without treating the reader as the fulcrum of the process is missing the point.</p></blockquote>
<p>Poking around a bit more to find connections between &#8220;metadata&#8221; and the &#8220;metaphorical marginalia&#8221; of user annotations, I found interesting commentary in 2005 around David Weinberger&#8217;s article <a href="http://www.boston.com/ae/books/articles/2005/11/13/crunching_the_metadata/">Crunching the metadata</a> (Excerpt from Weinberger, boldface added):</p>
<blockquote><p>We&#8217;re going to need massive collections of <strong>metadata</strong> about each book. Some of this <strong>metadata</strong> will come from the publishers. But much of it will come <strong>from users</strong> who write reviews, add comments and annotations to the digital text, and draw connections.</p>
<p>As the digital revolution continues, and as we generate more and more ways of organizing and linking books&#8211;integrating information from publishers, libraries and, most radically, other readers&#8211;all this <strong>metadata</strong> will not only let us find books, it will provide the context within which we read them. &#8230; The real challenge to traditional publishing today comes not from the digitizing of books, then, but from the very nature of the Web itself. Using <strong>metadata</strong> to assemble ideas and content from multiple sources, <strong>online readers become not passive recipients of bound ideas but active</strong> librarians, reviewers, anthologists, editors, commentators, even (re)publishers.</p></blockquote>
<p>Weinberger doesn&#8217;t use the word &#8220;marginalia.&#8221; But the word IS used in Ben Vershbow&#8217;s articles commenting on Weinberger. In <a href="http://www.futureofthebook.org/blog/archives/2005/11/the_book_in_the_network_masses.html">Vershbow&#8217;s first article</a> (tagged with &#8220;marginalia&#8221;) he says:</p>
<blockquote><p>The book in the network is a barnacled spirit, carrying with it the sum of its various accretions. Each book is also its own library by virtue not only of what it links to itself, but of what its readers are linking to, of what its readers are reading.</p></blockquote>
<p>And in <a href="http://www.futureofthebook.org/blog/archives/2005/11/online_retail_influencing_libr.html">another article by Vershbow</a> that continues on the same theme, Daniel Anderson, in a comment, is reminded of Billy Collins&#8217; poem, &#8220;<em><strong>Marginalia</strong></em>,&#8221; which, he says, &#8220;points to the conversations that take place as readers jot their reactions in the margins of books.&#8221; Excerpt from the poem, as quoted by Anderson:</p>
<p><img class="alignnone size-full wp-image-3670" style="padding-right: 12px;padding-top: 3px;padding-bottom: 2px" title="vershbow1" src="http://blog.lib.uiowa.edu/hardinmd/files/2009/08/vershbow2.jpg" alt="" width="285" height="478" align="left" />Sometimes the notes are ferocious,<br />
skirmishes against the author<br />
raging along the borders of every page<br />
in tiny black script.<br />
If I could just get my hands on you,<br />
Kierkegaard, or Conor Cruise O’Brien,<br />
they seem to say,<br />
I would bolt the door and beat some logic into your head. &#8230;</p>
<p>Yet the one I think of most often,<br />
the one that dangles from me like a locket,<br />
was written in the copy of Catcher in the Rye<br />
I borrowed from the local library<br />
one slow, hot summer . . .</p>
<p>A few greasy looking smears<br />
and next to them, written in soft pencil-<br />
by a beautiful girl, I could tell,<br />
whom I would never meet-<br />
&#8220;Pardon the egg salad stains, but I&#8217;m in love.&#8221;</p>
<p>****************</p>
<p>The Talmud page with marginalia at left is from Vershbow&#8217;s article <a href="http://www.futureofthebook.org/blog/2005/05/web_marginalia.html">Web marginalia</a>.</p>
<p>Eric Rumsey is on Twitter <a href="http://twitter.com/ericrumsey">@ericrumsey</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2009/08/18/metaphorical-marginalia-as-metadata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
