<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Seeing the picture &#187; Pattern recognition</title>
	<atom:link href="http://blog.lib.uiowa.edu/hardinmd/category/pattern-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.lib.uiowa.edu/hardinmd</link>
	<description>Thoughts while working on Hardin MD on digitization &#38; libraries</description>
	<lastBuildDate>Wed, 18 Nov 2009 22:16:00 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Intelligent Thumbnails</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2008/08/19/intelligent-thumbnails/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2008/08/19/intelligent-thumbnails/#comments</comments>
		<pubDate>Tue, 19 Aug 2008 18:05:58 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[ContentDM]]></category>
		<category><![CDATA[Flickr]]></category>
		<category><![CDATA[Hardin MD]]></category>
		<category><![CDATA[Human input]]></category>
		<category><![CDATA[Pattern recognition]]></category>
		<category><![CDATA[PicsYes]]></category>
		<category><![CDATA[Thumbnails]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/?p=186</guid>
		<description><![CDATA[&#8220;Flickr takes the sun out of the sunset&#8221; &#8212;  The picture to the left  from Flickr shows the full picture and its square thumbnail, in the inset. Thumbnails like these are generated automatically by Flickr and other photo management systems. They work by taking a portion from the center to make the thumbnail. [...]]]></description>
			<content:encoded><![CDATA[<p><img style="padding-right: 15px;padding-top: 2px" src="http://blog.lib.uiowa.edu/hardinmd/files/2008/08/flickr26_85.jpg" alt="Flickr takes the sun out of the sunset" align="left" />&#8220;Flickr takes the sun out of the sunset&#8221; &#8212;  The picture to the left  from Flickr shows the full picture and its square thumbnail, in the inset. Thumbnails like these are generated automatically by Flickr and other photo management systems. They work by taking a portion from the center to make the thumbnail. This works well if the center has the most important subject in the picture. But if the picture is relatively wide or tall, and its main subject is not in the center, as in the example at left, with the sun being to one side, the thumbnail misses it. Looking at this example (<a href="http://www.flickr.com/photos/davidwhlloyd/sets/72157601516657085/">Long Beach Sunset</a>) in Flickr, note that the first thumbnail on the Flickr page (top left) is the one for the larger picture (that&#8217;s shown on our page with the thumbnail in yellow-outlined inset).</p>
<p>In large mass-production systems like Flickr, automatic thumbnails are unavoidable, and my point is not that they should never be used. Instead, my point is that, on many levels, <a href="http://blog.lib.uiowa.edu/hardinmd/2008/07/11/think-different-pictures/">pictures require more human input</a> than text to make them optimally usable. Pattern recognition &#8212; the simple observation that the thumbnail of a picture of a sunset SHOULD CONTAIN THE SUN &#8212; is something that the human brain does easily, but this does not come naturally for a computer.</p>
<p><img style="padding-right: 15px;padding-top: 12px" src="http://blog.lib.uiowa.edu/hardinmd/files/2008/08/fullsize32_70.jpg" alt="" align="left" /><br />
Another sort of problem in automatic production of thumbnails is making a thumbnail by simply reducing the size of the large picture. If the main subject of the picture is relatively small, it is not visible in a small thumbnail.</p>
<p>The picture to the left is from the Hardin Library ContentDM collection. The inset in the upper right shows the <a href="http://digital.lib.uiowa.edu/cdm4/results.php?CISOOP1=all&amp;CISOBOX1=holbein+plate+viii+12a&amp;CISOFIELD1=CISOSEARCHALL&amp;CISOROOT=%2Fjmrbr">thumbnail</a> that&#8217;s generated automatically by the system, which does a poor job of showing details of the picture. The lower inset shows a <strong><span style="color: #cc0000;">thumbnail</span></strong> made manually, which gives a much more clear view of the central image in the picture.</p>
<p>Cropping of a picture to produce a thumbnail, as done here, takes more subtle human judgement than the case with the Flickr picture in the first example, where the weakness of automatic production is obvious. With cropping, there&#8217;s inevitably a trade-off between showing the whole picture in the thumbnail or showing the most important subject of the picture. In cases such as this one from ContentDM, where most all of the detail in the picture will be lost in a small thumbnail, it seems better to focus on a central image that will show up in the thumbnail.</p>
<p>Finally, a few examples from <a href="http://blog.lib.uiowa.edu/hardinmd/2008/07/22/hardin-md-gallery-collections/">Hardin MD</a>, below, show how we have done cropping to improve the detail in our thumbnails. The thumbnails on the left in each of the three pairs are made by simply reducing the size of the full picture. On the right in each pair are the thumbnails we use, that we have made by cropping the full picture before making the thumbnail.</p>
<p><img class="alignnone size-full wp-image-198" src="http://blog.lib.uiowa.edu/hardinmd/files/2008/08/composite4.jpg" alt="" width="500" height="100" /></p>
<p>The biomedical, scientific pictures that we work with in Hardin MD are fairly easy to make thumbnails for, because they generally have a well-defined focus, that&#8217;s usually captured well by automatically-generated thumbnails. More artistic, humanities-oriented pictures, such as the ones discussed here from Flickr and ContentDM, however, often have more subtle subjects, that benefit from the human intelligent touch in the production of thumbnails.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2008/08/19/intelligent-thumbnails/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Think Different : Pictures</title>
		<link>http://blog.lib.uiowa.edu/hardinmd/2008/07/11/think-different-pictures/</link>
		<comments>http://blog.lib.uiowa.edu/hardinmd/2008/07/11/think-different-pictures/#comments</comments>
		<pubDate>Fri, 11 Jul 2008 19:29:40 +0000</pubDate>
		<dc:creator>Eric Rumsey</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Hardin MD]]></category>
		<category><![CDATA[Human input]]></category>
		<category><![CDATA[Image Search]]></category>
		<category><![CDATA[Pattern recognition]]></category>
		<category><![CDATA[PicsNo]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.lib.uiowa.edu/hardinmd/2008/07/11/think-different-pictures/</guid>
		<description><![CDATA[As computers have become more powerful, many of the aspects of handling text that were formerly done by humans have been taken over by computers. Pictures, however, are much more difficult to automate &#8212; Recognizing patterns remains a task that humans do much better than computers. A human infant can easily tell the difference between [...]]]></description>
			<content:encoded><![CDATA[<p>As computers have become more powerful, many of the aspects of handling text that were formerly done by humans have been taken over by computers. Pictures, however, are much more difficult to automate &#8212; Recognizing patterns remains a task that humans do much better than computers. A human infant can easily tell the difference between a cat and a dog, but it&#8217;s difficult to train a computer to do this.</p>
<p>In pre-Google days, the task of finding good lists of web links needed the input of smart humans (and Hardin MD was on the cutting edge in doing this). Now, though, Google Web Search gives us all the lists we need.</p>
<p>Pictures are another story &#8212; on many levels, pictures require much more human input than text.</p>
<p>The basic, intractable problem with finding pictures is that they have no innate &#8220;handle&#8221; allowing them to be found. Text serves as its own handle, so it&#8217;s easy for Google Web Search to find it. But Google Image Search has a much more difficult task. It still has to rely on some sort of text handle that&#8217;s associated with a picture to find it, and is at loss to find pictures not associated with text.</p>
<p>The explosive growth of Hardin MD since 2001 (page views in 2008 are over 50 times larger) has been strongly correlated with the addition of pictures. This time period has also gone along with  the growing presence of Google, with its page-rank technology, and this has come to make old-style list-keeping, as had been featured in Hardin MD, less important.</p>
<p>Though Google has accomplished much in the retrieval of text-based pages, it&#8217;s made little progress in making pictures more accessible. Google Image Search is the second most-used Google service, but its basic approach has changed little over the years.</p>
<p>The basic problem for image search is that pictures don&#8217;t have a natural handle to search for. Because of this it takes much more computer power for the Google spider to find new pictures, and consequently it takes much longer for them to be spidered, compared to text pages (measured in months instead of days).</p>
<p>Beyond the problem of identifying pictures there are other difficult-to-automate problems for image search:<br />
• How to display search results most efficiently to help the user find the what they want &#8212; Do you rank results according to picture size, number of related pictures at a site, or some other, more subjective measure of quality?<br />
• What&#8217;s the best way to display thumbnail images in search results?<br />
• How much weight should be given to pictures that have associated text that helps interpret the picture?</p>
<p>So &#8212; <strong>Good news for picture people!</strong> &#8212; I would suggest that pictures are a growth sector of the information industry, and a human-intensive one. I would predict that text-based librarians will continue to be replaced, as computers become more prominent. But there will continue to be a need for human intelligence working in all areas relating to pictures, from indexing/tagging to designing systems to make them more accessible.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.lib.uiowa.edu/hardinmd/2008/07/11/think-different-pictures/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
