How to forget, when you’ve remembered to do so

Posted August 26, 2009 by dbawden
Categories: Uncategorized

vanish-logoIn a previous post, I wrote about the need to ‘forget’ the cloud of digital information which we all seem obliged to create nowadays, and which may come back to haunt us forever. I mentioned the idea of having some of this information ’self-destruct’ after an appropriate time. Those clever people (Hank Levy, Tadayoshi Kohno, Roxana Geambasu and Amit Levy) in the Computer Science Department at the University of Washington in Seattle have now arranged for just this to happen, with their aptly named Vanish software.

Amit Levy and Roxana Geambasu

Amit Levy and Roxana Geambasu


This allows you to specify that a document will irreparably disappear after eight hours. At present, it works with emails, blog postings, Google Docs documents, and Facebook messages, and only with the Firefox web browser. The creators are planning to extend the range of information entities to be vanished, and to allow a wider range of life times. Of course, a recipient can always make a non-vanishing copy, and, as the creators have noted, government agencies will no doubt have ways round it. But it’s a certainly a nice start in enabling us to leave the past behind.

Remembering to forget

Posted August 14, 2009 by dbawden
Categories: Uncategorized

Tags: , , ,

memory2Most of us have problems with remembering things at time. Memory problems usually go one way – we can’t remember things. Drastic loss of memory has been a theme of many books (I can, just about, remember Asimov’s Currents of Space as being the first with this theme that I read) and movies (the Bourne series, I understand, not having seen any of them, has this theme). The lack of incentive for memorisation has been lamented over thousands of years, from the increasing use of printed books reducing the need for the ‘memory arts’ of the ancient world, which lasted well into the Renaissance, to current laments about the disappearance of ‘pub quiz style’ general knowledge in the face of Google and Wikipedia.
google gen kno

Remembering always seems to be the problem. The idea that forgetting might also be good at times has been slower to catch on. Some analogy, I suppose, to the way in which the need to filter incoming information to avoid overload has, until recently, been much less recognised than the need to find good information in the first place. Deliberate forgetting has in some cases been recognised as a good thing, and again science fiction has led the way. Spock used his Vulcan mental powers on occasion to make others forget, while the characters of Greg Bear’s Eon series had ‘good forgetings’ as birthday presents.

The idea seems now to be creeping into information management. James Harkin, in an article in the London Sunday Times, points up a number of horror stories about how old material on social networking sites came back to haunt its originators. He gives this as the most dramatic example of how digital information can remain active indefinitely, not merely cluttering up our ever-expanding storage systems and confusing our not-expanding-at-all brains, but doing real harm.

deleteA possible solution is proposed by Viktor Mayer-Schonberger in a new book Delete: the virtue of forgetting in the digital age. He argues for a mode of ‘digital forgetting’, by setting expiry dates on each and every digital document, photograph, message, etc. that we create. This will he suggests. make our digital memories more like their brainware equivalents.

Or perhaps we should take a more literary view of things, and look to a new era of memory arts, closely associated with poetry, as David Barber nicely suggests, in his essay Does Memory have a Future.

Whichever way it turns out, I think that we will see ideas of memory, and forgetting, as gaining a new importance as the digital transition engulfs us. Who knows, perhaps a resourceful professor, looking for a new course to market, will come up with a degree in forgetfulness.

Naming of parts and other things

Posted July 26, 2009 by dbawden
Categories: Uncategorized

Tags: , , , , ,

To-day we have naming of parts. Yesterday,
We had daily cleaning. And to-morrow morning,
We shall have what to do after firing. But to-day.
To-day we have naming of parts. Japonica
Glistens like coral in all of the neighbouring gardens,
And to-day we have naming of parts.
(Henry Reed, 1942)

smallbl logo_cmyk
talksciencebanner
Attending a talk at the British Library last week led me to think about how we name things; and to write a version of these thoughts as an editorial in Journal of Documentation. During 2008 and 2009, the British Library ran a series of evening discussions under the heading TalkScience; as the name implies, these were informal lecture / discussion / networking events devoted to scientific topics. As one might expect from the venue, they were all associated with scientific information In some way; publication, communication, taxonomy, bioinformatics, etc. The debate can be followed at Twitter hash-tag #BLTS and at the British Library FriendFeed.

The session on 22 July 2009 (which I attended) featured John Wilbanks from Science Commons talking about ‘Scientific findings in a digital world: what is the genuine article ?’. Its main theme was the survival, or otherwise, of the article in a scientific journal as the main way of communicating new scientific information. The talk, and discussion, touched on many issues: open access; intellectual property; what does peer review really do; can ‘common knowledge’ wikis replace introductions and literature reviews in articles; how to handle multimedia information within an ‘article’ framework; the need to decide which data to throw away; the likelihood that in a world of ‘open science’ funders will support only work which is replicable and falsifiable (as a good Popperian I warmed to the last); and so on. Much food for thought for everyone, both scientists and information specialists since the role of libraries was a recurring theme; perhaps not surprising, considering the venue. (The audience, by the way, did not seem to include many library/information people, other than the BL hosts; one tweeter described it as ‘funders, publishers and academic trouble-makers’.)

I was particularly struck by one point, which did not attract very much in the way of comment at the time, perhaps because everyone considered it self-evidently true. In his presentation of the need for annotation of digital reporting of scientific findings, Wilbanks commented simply that we need to call the same thing by the same name; this makes possible the semantic linking of information and data, the creation of ontologies, and so on, without which it will not be possible to share information across disciplinary and sub-disciplinary silos. He exemplified this by examples by simple – the various names for coffee in different languages – and complex – the variant terminology used in hundreds of datasets relating to polar climate change, and in over a thousand related to genomics.

There was another aspect to this point. What we call an information object in the digital world – DOIs and all the rest – is also fundamental; if we do not call these digital objects the same thing, we will have great difficulty in finding them.

Which all led me to think. Is this not one of the central themes of the information sciences being replayed. Given consistent names to information resources, through rules for cataloguing and resource description, and to the content of such resources, through indexing and classifying, have been a major activity of information practitioners for as long as these disciplines have existed, and – though regarded now as a little old-fashioned – have formed the focus for much information research. True, in the context in which the BL discussion was framed, they generally appear dressed in the semantic web clothes of RDF (Resource Description Framework) and OWL (Web Ontology Language – yes, I know, but what sort of acronym is WOL?), but the principles are surely the same.

Should we be pleased that ‘our’ concerns seem again to be at centre stage ? Or distraught that we still have no accepted solutions to problems that have been around for so long ? At all events, we can perhaps be quietly (or perhaps grimly) confident that these issues will continue to feature in the information sciences literature, for the foreseeable future.

Henry Reed by the Albert Bridge

Henry Reed by the Albert Bridge

By the way, the stanza which introduced this post comes from a poem which was imprinted in my mind when I had to study it in depth at school. Henry Reed is regarded as the only significant English poet of the 1939-45 war. A copy of the full poem, writing over a period of decades, together with links to far more criticism than you would ever want (unless you are also studying it in depth at school) can be found here.

ALA Chicago

Posted July 25, 2009 by dbawden
Categories: Uncategorized

Tags: , , , , ,

Lake Michigan sunrise, compensation for jet-lag

Lake Michigan sunrise, compensation for jet-lag

Put the world’s largest library/information conference – that of the American Library Association – in the city with the greatest concentration of skyscrapers, if not any longer the world’s tallest building – Chicago, of course – add a bit of jet-lag, and the result is a bit overwhelming. Never mind, ALA was very much worth the effort. My colleague, Lyn Robinson, has written more than adequately about the exhibition, the buildings, the food, and the handbags, so these are just some addenda.

Much of the conference material can be found here, and there was way to much to try to summarise. One highspot for me was a session with five science fiction and fantasy writers – Robert Charles Wilson, John Brown, Eric Flint, Margaret Weiss and Ken Scholes – all arguing for the value of this kind of, rather marginalised, literature in promoting literacy and sparking imagination. I was a little sorry that they didn’t do more on the stated theme of examining how SFF can promote ideas about the future of information but never mind; a bag of free books more than made up :)

Joan Lippincott tells us about the millenials

Joan Lippincott tells us about the millenials

Having an interest in the information needs and information-related behaviour of the various ‘generations’ – baby-boomers, Gen X, millenials, and the like – and having, indeed, recently given a workshop on the topic, I enjoyed to the session on supporting millenials in graduate school. Joan Lippincott, Barbara Dewey and Susan Gibbons analysed this group, finding them as a whole to be creative, highly social learners, always connected, and addicted to multimedia; though such generalisations are often nuanced by the academic discipline, which makes a big difference. One key to supporting them well seems to be to get the right balance between information spaces; physical and virtual, solitary and social.

Away from the conference, the Chicago Architecture Foundation does excellent tours, which not only show you the buildings of this city, but provide a kind of ‘Modern Architecture 101′ course. cun imageAnd no visitor to the city can fail to notice how much the the natural world is welcomed into what could a forbidding environment; very nicely written up in the book “Chicago’s Urban Nature” by Sally Anderson Chappell.

Codex Siniaticus; the good side of digital

Posted July 8, 2009 by dbawden
Categories: Uncategorized

Tags:

cs imageA very impressive example of the power of digitisation to support ‘book culture’ comes from the creation of the digital version of the Codex Siniaticus, the earliest version of the Christian Bible, dating from the fourth century. No complete version of the original parchment manuscript exists anywhere; the 800 pages are scattered in museums and libraries throughout the world. Now, thanks to a collaboration between between institutions in Britain, Egypt, Germany and Russia, a digital version of all the existing pages and page fragments has been created. The quality is sufficiently good that edits and changes to the manuscript over time can be followed.

A very nice example of the creation of digital information to create something which could realistically not have been realised in physical form (although it’s still reassuring to know that the physical manuscript is still there). At a time when many commentators are seeing a turning away from digital, and back to real things – see for instance a recent article in the London Evening Standard “Enough of digital fun, we want the real thing now” – this is an example of the good side of digital.

Impact factors and half-lives

Posted July 7, 2009 by dbawden
Categories: Uncategorized

Tags: ,

It’s that time of year when academic journal editors nervously look at the Web of Science data to see how they’ve done compared with their competitors, as assessed by the magic bibliometrics that are now the basis of any assessment of how ‘good’ a journal is. In my case, of course, I was looking to see how Journal of Documentation had done.

The most obvious thing that strikes on on looking at the ‘top twenty’ library / information journals in the WoS ranking is that most of them aren’t library / information in any sense that we know it. I would only recognise six of them as LIS journals; the rest are mainly management information systems and health informatics, presumably without any other home to go to.

Of the ‘real’ LIS contingent, and assessed by the ‘impact factor’ which tends to be the most quoted quality measure, Annual Reviews of Information Science and Technology is not surprisingly at the top, as befits the only review journal in the field. Of ‘normal’ journals, Journal of the American Society for Information Science and Technology leads, followed by Information Processing and Management, and then Journal of Documentation, ahead of Journal of Information Science and Journal of the Medical Library Association. Pretty much the same as last year.

Of course, any journal editor should be able to find a measure that shows their baby is the prettiest in the show, and for us that is the ‘half life’ measure, an indication of the length of time that the journal’s material retains its value. JDoc not only has the highest half-life of any of the five LIS journals quoted, but is also ahead of ARIST; notably so, in that review articles generally have the longest periods of value. A nice confirmation for me and the editorial board that we are succeeding in avoiding ephemera, and publishing material that stays relevant for a long time.

Transferred illusions

Posted July 7, 2009 by dbawden
Categories: Uncategorized

I’ve just written a review of Marilyn Deegan and Kathryn Sutherland’s Transferred lllusions: digital technology and the forms of print for the journal Alexandria. This is a modified version:
Deegan
The so-called ‘digital transition’ has been at the forefront of the minds of many library / information specialists over the past few years, as the certainties of the world of print give way to the changes and complexities of the digital age. Deegan and Sutherland do an excellent job of guiding us through some of these issues, in this scholarly but accessible book.

It comprises six essentially discrete chapters, each of which could be read alone as a primer on its topic. It is well-written, with a clear flow illustrated by numerous asides and examples. It takes a historical perspective, relating ‘new’ issues to what has gone before, and is well-referenced; it would be an excellent source book for academic courses in this area. The authors state the aim that it would contain both ‘informational’ and critical/reflective writing, and generally succeed well.

The book is, as the authors say, “about the forms and institutions of print – newspapers, books, scholarly editions, publishing, libraries – as they relate to and are changed by the emergent forms and institutions of our present digital age”. It deliberately does not deal with blogs, social networking sites, and the like. Some may consider this a weakness, given the often-stated potential of such media to complement, or even compete with, the more traditional forms of communication. However, it does give the book a clear rationale, and allows a logical argument to develop.

The first chapter, ‘After Print’ considers the survival, surprising to some, of print-on-paper, and reflects on how long it will survive, and what may replace it. The authors foresee the disappearance of printed reference works, lament the World Wide Web as a degraded from of Ted Nelson’s vision for true hypertext, and conclude that ‘far from dying, in its old age print is casting a long shadow across the digital generation’.

Chapter two deals with news sources, citing the commonly held fears of the death of news in print, and noting that during the 1991 Gulf War less than 9% of Americans kept up with the issues primarily through newspapers, compared with ten times that proportion at the end of the second world war. They suggest that using personalised newsfeeds, approximating to the long-heralded ‘Daily Me’, is perhaps not such a great deviation from reading a newspaper chosen to reflects one’s own social standing, political views, etc., and lament the problems involved in the replacement of hard-copy files of historic newspapers with microform or digital equivalents.

The third chapter focuses on editing, described as a “cultural work”, and the way it is changed in a digital environment. Ranging widely, it touches on metadata, text markup and encoding, and other new issues which concern the digital editor.

The next chapter is concerned with new modes of publishing: such things as electronic journals, self-archive repositories, e-books, print-on-demand books, niche publishing and self-publishing, and so on. The authors’ decision to avoid include blogs, micro-blogs, social networking and the like means that this chapter has a distinctly calming air, compared with more radical reflections on this topic.

Chapter five, entitled ‘The Universal Library’, gives an excellent potted history of the development of libraries, and reflects on how they will adapt to the digital age. Along the way, we get such treats as the estimate that the recorded output of humanity from the days of the Sumerians comprises “at least 32 million books, 750 million articles and essays, 25 million songs, 500 million images, 500,000 movies, 3 million videos, TV shows and short films, and 100 billion public Web pages”. The authors remark that this is a ‘woeful underestimate” of the numbers of books at least, and conclude that that there are 15-20 billion pages of printed book materials available now. Digitisation projects are pondered and criticised, with few punches pulled: of one digitisation exercise, the authors comment that “the daftness or ignorance that led to the suppression of such vital material from this particular … project is staggering. It is also a useful warning about putting our faith in digital libraries”.

They are also concerned about how order, one of the major benefits that libraries bring to the enormous scope of written material, is lost in mass digitisation efforts. Quoting a tart definition of the Internet as “a library assembled piecemeal by packrats and vandalised nightly”, they clearly worry that the same fate awaits a library which converts itself to digital form using current best practice. While users are supposed to wonder “why can’t libraries be more like the Internet, filled with cool information that we can all have for free ?”, the authors think that the more appropriate question is “why can’t the Internet be more like libraries, organised, classified, and with powerful filters in place ?”

The final chapter ‘Durable Futures’, considers preservation issues, and laments how much has already been lost through lack of appropriate archiving, as well as technical problems. From this, they move on to considering how the ways in which material is preserved influence how it is used, and summary some concerns, now becoming familiar, about preferences for skimming and annotating, rather than sustained reading and argument.
cp cover
Overall, although most readers will find something to disagree with, this is an excellent book. It sits well alongside the volume edited by Cope and Phillips’ The future of the book in the digital age as an insightful account of where recorded information is going as the digital transition takes hold.

Ray Bradbury, libraries and the Internet

Posted June 26, 2009 by dbawden
Categories: Uncategorized

Tags: ,

According to a New York Times story, science fiction icon and general guru, is fighting to save his local public library in Ventura County, California. Bradbury is, of course, most famous for his depiction of book burning in his novel Fahrenheit 451, but the NYT reminds us that some of his other books have significant library scenes. He is says he believes in libraries, though not in colleges or universities, and certainly not in the internet, regarding the web as ‘distracting, meaningless, not real, in the air somewhere’.

(Thanks for Andy MacFarlane for pointing out this article)

Occasional tweeting

Posted June 24, 2009 by dbawden
Categories: Uncategorized

After resisting for a long while, I have finally joined Twitter. I image that my tweeting will be even more occasional than my blogging, but for anyone interested it’s @david_bawden. Yes, the underscore does matter.

Inforum proceedings

Posted June 22, 2009 by dbawden
Categories: Uncategorized

Tags: , , , ,

The proceedings of the Prague Inforum conference, which I mentioned in a previous post, are now available online here. They are of interest to anyone concerned with the use of digital information sources, particularly though not entirely in a Central European context. My own paper, ‘Library / information prospects: three views of the future’, can be found here.