With the lack of progress in figuring out the Settlement, Google Book Search has been out of the news recently. So I was glad to see that Hugh McGuire, in his recent article about the line between the book and the Internet disappearing, in the second half of the article, gives credit to GBS as helping to lay the foundation for this (see comments below). In the first part of his paper, McGuire has a lucid description of the future world of books on the Internet that’s being foreshadowed by GBS:
A book properly hooked into the Internet is a far more valuable collection of information than a book not properly hooked into the Internet. … The false battles between ebooks and print books continue to ignore the real, though as-yet unknown, value that comes with books being truly digital; not the phony, unconnected digital of our current understanding of “ebooks.” … We still consider that books live outside of the Internet. There is massive and untapped (and unknown) value to be discovered once books are connected.
McGuire says our idea of what an eBook should be still carries a lot of baggage from our idea of what a book is — We still think of eBooks as being separate from the Web — In particular, eBooks, like print books, are not searchable or linkable, and text can’t be copy-pasted. But full view Google Books are searchable, linkable and deep–link–able, and text can be copy-pasted.
Being able to find books in a Web search is a key to removing the barrier between books and websites, as McGuire says, and Google has been bringing that capability to reality, as they steadily work with libraries to scan their books. Searching for books within Book Search has been part of GBS since the beginning. In the last three years or so, books have also been increasingly integrated into Web Search results. This is a big step, I think, in connecting books to the Web — Searching Google is how we find out about things these days, so to be able to retrieve books and web pages in the same search is a extremely valuable — A great advance that’s gotten little attention.
Not surprisingly, the people who have especially learned to value bringing books onto the Web are historical researchers — With pre-1923 books being out of copyright, and freely available in GBS full-view searches, historians have quickly taken advantage — In a recent forum of historians on GBS, it’s described with words like indispensable … enlivening.
While historians have expressed their valuing of the connected books in Google Books, there are certainly many non-academics with little voice who are also learning to use it. Google engineer Dan Clancy says that half of the out-of-copyright books in GBS have at least 10 pages viewed per month — A large portion of this use must surely be from the general non-academic public. One commenter to McGuire’s article (who apparently doesn’t know about GBS) expresses the voice of this little-heard group: “If books/entire texts were *searchable* on the internet, people searching the web could more easily find exact phrases/specific information in books. To me, this is HUGE!” Indeed — and what a fine, pithy statement of McGuire’s thesis!
To conclude, a plug for metadata as the key to connecting books to the Web, and why it’s people like the geeks at Google who are doing it before the traditional publishing industry — In the words of Mike Cane:
All of this hidden information — exploded out, made explicit — turns an ebook from a dumb object into a smart object. … Google is staffed by geeks who juggle information with an expertise that print publishers lack. … Google makes information do things. … Print publishing freezes information into a static object — An object that stands alone, disconnected, unable to do anything. … The information geeks are new publishers for a new age … Metadata has value. As new connections are formed, and new data is added, its value increases exponentially.
Bringing the world’s books to digital life on the Web, whether it’s done by Google or someone else, is a story that’s just beginning. For now though, Google Book Search gives us a start in exploring the new world of the book and the Internet.
Eric Rumsey is at: eric-rumsey AttSign uiowa dott edu and on Twitter @ericrumsey