New year, old idea ?

Posted January 28, 2010 by dbawden
Categories: Uncategorized

Tags: , , , , ,

A new year always provokes thoughts of what has gone and what is to come. The news media, feeding our liking for the comfort of the repetition of the annual cycle, devote much space in January to this kind of reflection, which often seems not to change much from one year to the next. I admit to rarely finding anything uplifting, or even convincing, in this sort of ruminating. Did any of them spot the credit crunch coming, and tell us to take evasive action ? Or advise that all the world would twitter ? I think not.

Richard Watson

But this year it was a little different. The commentaries seemed to me to me a bit more insightful, less wowed by technology. And, I suppose, more in line with my own prejudices. My favourite, popular in nature of course, but none the worse for that, was a ‘brief history of 2010‘ by Penny Wark in the London Times, based on the views of the futurist Richard Watson.

Of the various points raised in Watson’s ‘history’ of the coming year, the most striking to me, not least because it is very much in line with what others have suggested, is his idea of a “flight to the physical”. This builds on his previous comments about about the paradoxical isolation caused by ubiquitous digital communication – “we can get instant news and tweets throughout the day, but we don’t know our neighbours’ names” – and a consequent hunger for shared experiences in a common physical space. Watson writes “If virtual connection can never match its physical equivalent, this is partly because we associate digital with speed, being disposable and therefore of low value, and partly because we like to hold and touch real things… Downloading a video is easy and efficient, but it’s a soulless experience compared with going to a good video shop, having a chat with a movie buff, and looking at row upon row of illustrated titles”. The he moves on to the – by now increasingly common but still welcome – pean to an unfashionable institution: ” the public library, feared moribund in recent years, is in its element because it’s about much more than books. It’s a quiet and safe community space, an experience that enables you to access expertise from commercially uncorrupt resources, and that’s both ethical and resource friendly”.

A similar note is struck by Fleur Britten in her Sunday Times magazine article on what’s new for 2010. Quoting the trendspotter Marian Salzman, she notes the need for ‘emotional spaces .. to retreat from the modern buzz to ‘safe spaces’ .. get low key”. Although the L-word is not mentioned, the message seems similar.

is digital the future of paper?

Of course, no two futurologists could be expected to agree on much. Britten quotes authorities who tell her that “the next decade will be video, video video .. prepare to move from words to images”, and that “print won’t die, it will be become electronic, with the arrival of [digital paper]“. On the other, more restrained hand, Watson tells Wark that “paper is not dead, and … while news will mostly be delivered online, serious comment and analysis and novels will largely stay on paper”.

On the whole, though I make no claim to be much of a futurologist, I’m with Watson. Particularly because of his idea that “we’ll see phrases such as slow media emerge as people realise that if you read things on paper you are more relaxed, you register more, you reflect and see the big picture”. As one of my PhD students is researching the idea of slow information, I feel comfortably ahead of the trend.

Documenting Babel – languages in information science

Posted November 28, 2009 by dbawden
Categories: Uncategorized

Tags: , , ,

Tower of Babel - Pieter BreugelMusing on the continuing place of language issues in both research and practice in the information sciences, following my participation in the Zagreb InFuture meeting, I wrote an editorial on the topic for the Journal of Documentation. This post is an amended version.

Languages, in one guise or another, have been a constant feature of the landscape of the information sciences for many years.

There are, for example, the various artificial ‘languages’ – more usually thought of as notations, nomenclatures or ontologies – which have been devised to describe such things as chemical structures and reactions, medical diagnoses and treatments, and the burgeoning data-rich fields of modern biology. There is the presence of linguistics as a subject of seemingly perpetual potential relevance to the information sciences. As Sparck Jones and Kay (1973, p. 1) put it in their seminal textbook: “linguistics and information science are natural bedfellows … but there has been relatively little contact between the two fields”: the situation has not changed much in the intervening decades. There is the now ubiquitous searching of ‘full text’ databases, requiring a greater or lesser amount of ‘intelligent’ processing of the natural languages in which the content of such databases are couched.

But primarily, there is the continued need for handling communication of information in all of the world’s languages. Neither the earnest advocacy of ‘universal’ or ‘auxiliary’ languages, from Leibnitz’ logic-based characteristica universalis to Esperanto, nor the long-anticipated advent of English as a de facto global language (Crystal 2003), has reduced the demand for support for national and local languages, as the provision for 23 official languages in the European Union testifies.

This naturally has consequences for research and practice in the information sciences. A facility with languages other than one’s own has always been one of the requirements of the practising librarian and information officer, even in the traditionally language-averse United Kingdom. Sadly, the requirement for some facility with two languages other than English, a requirement when I studied information science at Masters level, has long gone from the UK, though an equivalent requirement is still largely present in Continental Europe. This manifested itself in a variety of detailed language tools for the information professions, Allen’s 1975 Manual of European Languages for Librarians, being a typical example.

In research terms, language issues have stimulated work on a variety of topics. An early example was the study of the value of ‘cover-to-cover’ translations of scientific journals, particularly from the Russian language following the shock to the Western scientific complacency caused by the Sputnik satellite of 1957 (Tybulewicz 1970). Other long-standing concerns, in the English-speaking world at least, were focused on the ‘language barrier’, the belief that valuable information, particularly in scientific, technical and medical subjects, was being missed because it was not published in the English language (see, for instance, Hutchins, Pargeter and Saunders 1971, Chan 1977, Thorpe, Schur, Bawden and Joice 1988). More recently, attention has been focused on such topics as the information practices of translators, natural language processing and cross-language information retrieval; Some examples of recent Journal of Documentation articles reporting such research, as an indication of its variety, are shown below.

These thoughts were stimulated by my attending the INFuture conference in Zagreb, Croatia, in November. A substantial proportion of this conference, which dealt with the future of information science, was devoted to language technologies – including machine-aided translation and natural language processing – and to languages issues in general. The topics covered included the European Union’s CLARIN (Common Language Resources and Technology Infrastructure) project, which aims to compile a series of digital archives with data sources for language-based materials (text and speech corpuses, dictionaries, etc.) together with language and speech technology tools. Particularly aimed at academic users in the arts and social sciences, CLARIN adopts the philosophy that all languages – irrespective of the number of speakers or of their commercial importance – are of equal importance.

It seems clear that the predictions, or fears, of the adoption of artificial languages, and of the ubiquitous adoption of any single one, are very far from fulfilment. We may expect that these issues will be an important feature of the information research agenda for the foreseeable future.

References

Allen, C.G. (1975), A manual of European languages for librarians, London: Bowker

Chan, G.K.L. (1977), Mushroom poisoning, thioctic acid and the foreign language barrier, Aslib Proceedings, 29(6), 237-240

Crystal, D. (2003), English as a global language (2nd edn.), Cambridge: Cambridge University Press

Hutchins, W.J., Pergeter, L.J. and Saunders, W.L. (1971), University research and the language barrier, Journal of Librarianship, 3(1), 1-25

Sparck Jones, K. and Kay, M. (1973), Linguistics and Information Science, New York NY: Academic Press

Thorpe, R.A., Schur, H., Bawden, D. and Joice, J.R. (1988), The foreign language barrier: a study among pharmaceutical research workers, Journal of Information Science, 14(1), 17-24

Tybulewicz, A. (1970), Cover-to-cover translations of Soviet scientific journals, Aslib Proceedings, 22(2), 55-62

Examples of recent language-related papers in Journal of Documentation

White, M.D., Matteson, M. and Abels, E.G. (2008), Beyond dictionaries: understanding information behaviour of professional translators. Journal of Documentation, 64(4), 576-601

Pinto, M. and Sales, D. (2008), INFOLITRANS: a model for the development of information competences for translators, Journal of Documentation, 64(3), 413-437

Airio, E. (2008), Who benefits from CLIR (cross-language information retrieval) in web retrieval, Journal of Documentation, 64(5), 760-778

Peng, F. and Huang, X. (2007), Machine learning for Asian text classification, Journal of Documentation, 63(3), 378-397

Talvensaari, T., Lauriiaka, J., Järvelin, K., and Juhola, M. (2006), A study on automatic creation of a comparable document collection in cross-language information retrieval, Journal of Documentation, 62(3), 372-378

INFuture2009, Zagreb

Posted November 21, 2009 by dbawden
Categories: Uncategorized

I attended the second of the INFuture conferences, organised by the Department of Information Sciences at the University of Zagreb, Croatia, in early November. The general theme of the conference series is ‘The Future of Information Sciences’, and the focus for this conference was ‘Digital Resources and Knowledge Sharing’. By comparison with the first of the conferences, held two years ago, this one had many more papers, with parallel sessions the norm, and a larger and much more internationally diverse audience.

Palace Hotel, Zagreb


Most of the conference was held in the renovated grandeur of the Palace Hotel, set in the ‘Green Horseshoe’ of parks in Central Zagreb. As before, the organisers had arranged a nice social programme, culminating in the conference dinner amid the Art Deco surroundings of the Zagreb journalists’ club.

With over 70 papers presented – the main themes being digitisation and preservation, knowledge management, language technologies, cultural heritage, and the use of web 2.0 and virtual environments – picking out highlights is difficult. My own presentation, on possible future scenarios for the library / information environment, provoked some interesting discussions, and attracted the attention of the Croatian TV network, who interviewed me immediately after the talk. It may yet be on YouTube…

Art Deco glass, Palace Hotel


A number of presentations particularly interested me, across the whole range of topics covered.

A presentation by Steven Krauwer on the development of the CLARIN project, which aims to promote language tools within the EU, looked forward to retrieval systems which could answer questions such as: find a video clip of a person speaking German with a Spanish accent; find articles on a specific topic from a French newspaper, and summarise them in Polish; find negative remarks about football in the proceedings of the German Parliament. We are not there yet, but progress is being made.

Kia Ng, from Leeds University, gave a fascinating talk on the i-Maestro system for recording the whole of a cultural performance, in multimedia form. He gave examples for the recording of dance, conducting, performance art, and violin playing. In the latter, as well as the image of the player, and the sound recording, the system recorded the detailed movements of the bow and the pressure on each string. While this was designed mainly as a teaching aid, it is potentially searchable by many parameters, and could in principle be used to recreate a performance…. this way to the Star Trek Holodeck.

Conference dinner


A presentation by Senka Drobac of the Ruder Boskovic Institute in Zagreb outlined the EU’s Dariah project, aimed at building an ICT infrastructure to support the arts and cultural sector, in the same way that grid computing is supporting the physical and biological sciences. This particular perspective showed how such initiatives can be valuable in smaller countries such as Croatia.

I also liked the contribution of the UK information consultant, Bob Bater. Speaking generally of knowledge management issues, he reminded us that, according to a study by the UK National Archives, 80% of information lives ‘in the wild’: in unstructured and uncatalogued environments such as emails, blogs, wikis, and twitter messages. Bob also reminded us that the 17th century English savant Francis Bacon is often quoted as saying that “Knowledge is power”, whereas what he actually said that “But mere knowledge is not power: it is only possibility. Action is power; and its highest manifestation is when it is directed by knowledge”. Not really the same thing at all.

A very good conference overall, and the organisers should feel pleased with themselves. The full materials will be available soon on the conference website.

The House of Wisdom

Posted October 11, 2009 by dbawden
Categories: Uncategorized

Tags:

A new book by Jonathan Lyons, The House of Wisdom: how the Arabs transformed Western civilisation, has a few surprising insights on developments in the recording and transmission of knowledge in the period.
2009-02-02-HouseofWisdomcover2
Lyons focuses on the contribution of Abbasid rulers of Baghdad, from the founding of the dynasty in 762 to its overthrow by the Mongols in 1258, and on the ways in which Arab knowledge was introduced to the West, by such travelling scholars as Adelard of Bath in the twelfth century. The usual Western image of such transmission portrays the Arabs as faithful and benevolent, if rather unimaginative, guardians and custodians of Greek learning, saving the classic texts by translating them into Arabic, and then returning them to Europe just in time to start the Renaissance. Lyons shows that the Arab contribution was a much more active and creative one, with interpretation and discovery in many fields, from astronomy and physics, to medicine and agriculture.

The ‘House of Wisdom’ of the title was the early mechanism for this process: a combination of library, book depository, and translation bureau, supporting a active community of scholars. Libraries appeared in many environments, the larger ones typically holding tens of thousands of volumes. Elsewhere in the Arab world, the library of the caliphs in Cordoba was generally reckoned to have a collection of nearly half a million volumes, comparable to to a substantial university library today. In general, as Lyons shows the contribution of Arab learning and recording of knowledge is still underestimated in the West.

The position of the Abbasid realms, spreading onto three continents, enabled them to engage on a grand-scale in what would later be termed knowledge transfer. Their links with the Mediterranean world, and in particular the Hellenistic cities which came under their control and with the still-powerful Byzantine empire, gave them access to the classical literature. Their contacts with the civilisations of the Indian sub-continent provided what we still call Arabic numerals, though they are in truth Indian. And their spread into central Asia brought them into contact with the Chinese empire, from which they learnt the making of paper. At a time when Europe was still reliant on parchment for recording information, the Arab world was paper-based. The cheapness, convenience and ease of storage of paper played a great part in the development of a ‘book culture’, which drew the attention of those Europeans who came into contact with it. Intriguingly, and a point which Lyons does not speculate on, the Arabs never adopted that other Chinese innovation, printing. If they had done so, the Arab world might have developed a print-on-paper information environment five hundred years before Europe managed to do so. The consequences of that make intriguing food for thought for those who like to speculate on alternative histories.

Finally, Lyons has a cautionary tale for those who believe the power of collections of recorded information to pass on, by their very existence, the knowledge of a culture or a civilisation. The Arabs, he points out, had, from a very early stage, accurate translations of Ptolemy’s astronomical works. But they lacked understanding of the astronomical concepts, and could make no use of them. It is not, apparently, enough just to store the information; understanding has to kept alive along with it, or painfully rediscovered.

English public libraries; needing new leadership ?

Posted October 10, 2009 by dbawden
Categories: Uncategorized

Tags: ,

A report on governance and leadership in the English public library service has been published, following an enquiry by the grandly-named All-Party Parliamentary Group on Libraries, Literacy and Information Management. Cynics have pointed out that this group is not quite the impartial assembly of the great and the good that its title might suggest; it is sponsored and supported by CILIP, who largely funded the inquiry which led to this report. Nonetheless, having five MPs and one Lord taking an intelligent interest in library matters is something of a novelty. The Group was chaired initially by Labour MPs Lyn Brown, initially, and Lynda Waltho, latterly.

Lyn Brown MP

Lyn Brown MP


Among fourteen recommendations in the report, arguably the most address the woeful lack of coherent leadership of the service from the top. One recommendation is that a single government department should be responsible for English public libraries, and should provide funding for them. This should be supported by a new Library Development Agency for England. It is also recommended that there should be a clear definition of what the current legal requirement to provide a “comprehensive and efficient service” actually means, and hence what level of service users are entitled to expect.
Lynda Waltho MP

Lynda Waltho MP


Given the lack of official clout of the Group, and especially in view of the current uncertain political situation, it seems highly unlikely that these recommendations will be acted on, any time soon. Even so, the report is a useful reproach to those who have allowed the English library service to drift into its current rather sad state. Its provisions could help to avoid the kind of thing seen with the recent Wirral controversy.

The full report may be found here, and CILIP’s commentary here.

Wirral public libraries saved ?

Posted October 3, 2009 by dbawden
Categories: Uncategorized

Tags: , ,

In a previous post, I wrote about the plans by the local authority in Wirral, in the North West of England, to close half their public library branches, and turn the rest into multi-purpose community centres. A government enquiry into whether this action would break the law which requires the authority to provide an adequate public library service has been completed, but the findings have not been made public.
FloralLibraryProtests1.jpg.display
The latest news is that the authority has now unilaterally changed its mind, and cancelled all its plans for library closures. The government enquiry is to be extended for three weeks, to consider the change of mind as new evidence. CILIP are, of course, very pleased.

Most commentators assume that the enquiry has found the authority to be in the wrong, and they have changed their plans before being forced to do do by the government. There are, however, concerns that the plans for library closures will be back next year, perhaps slightly modified.

Magic searching

Posted August 26, 2009 by dbawden
Categories: Uncategorized

Tags: ,

I have written a review of a new book Magic Search: getting the best results from your catalog and beyond, which will appear in due course in Journal of Documentation. Here’s a flavour of the review of what proved to be of much more interest than we have any right to expect from a book about the sub-divisions of the Library of Congress Subject Headings.

MagicSearch(L)This is not, it has to be said, an immediately gripping book, despite the best attempts of the authors, or perhaps the publisher, to present it with a Harry Potter-like title . Subtitled ‘Subdivide and conquer with LC subdivision!’, it would seem to be a good candidate for one of those competitions for most obscure or nerdish titles of the year. Is there really a book to be written, not even about the Library of Congress Headings – themselves perhaps not the most immediately accessible or popular topic – but just about the sub-headings.

Such cynicism should be stilled, for this short book is interesting in a number of ways.

Most obviously, it will be of value to those – and the books is pretty clearly aimed at library professionals – who search databases indexed with LC subject headings. It gives advice on the best ways if using the sub-headings as a means to cutting down lengthy results lists, and giving just the kind of results needed: as a means of improving precision without unduly sacrificing recall in fact, although this terminology is not used. Of over 3,500 available sub-divisions, the authors pick about 500 which they judge of most value, and show how to sue them to best effect. So, if you want resources with images of people, ‘PORTRAITS or PICTORIAL WORKS’ will do the trick. To find how an artist of writer has been received, you need ‘APPRECIATION or INFLUENCE’. To get at primary sources on your topic, the magic search (sic) involves ‘ARCHIVES or SOURCES or DIARIES or CORRESPONDENCE or NARRATIVES or INTERVIEWS or FACSIMILES’. And so on, for 19 short chapters on different subjects and topics.

The book then will certainly be useful in improving searching practice. But its interest goes beyond this, in that it illustrates some of the current issues in bibliographic retrieval today. For one thing, there is a strong Google influence throughout the book; perhaps surprising considering that the LC subject indexing methods might be considered the epitome of the ‘librarianly’ approach. The authors, however, point out two interesting facts. Google Book Search, which – whether we like it or not – is going to gain a very significant position over the next few years, will be making use of WorldCat metadata, and hence be searchable by LC headings, and their magic subdivisions. And the authors show how the subheadings can be used to form Google-like searches, providing, in a sense, the best of both worlds.

Of course, this happy situation depends on a number of factors, as the authors remind us. It relies on the continued practice of intellectual indexing using controlled vocabularies, and a powerful and well-argued plea is made for this. It relies on the Library of Congress getting their vocabulary right, and some good recommendations are made. And, of course, it relies on searchers knowing what they are doing, and having appropriate interfaces, to catalogues and databases, to support them. When a book of this sort has to be researched and written to remind the professional library / information community of the right way to use these tools, one wonders how much use is being made of them by most users of catalogues and other bibliographic collections. This book is a very good start, but a major task of consciousness raising awaits.

Are important years information years ?

Posted August 26, 2009 by dbawden
Categories: Uncategorized

Tags:

GutenbergAn article in Intelligent Life magazine for summer 2009 tried to decide which was ‘The most important year ever‘. The feature writer, Andrew Marr, argued for 1776, with the American Declaration of Independence, or 1945, with its world changing events; he was duly rebuked for US-centrism by website commentators. His five guests chose arguably more interesting dates:

  • 5BC as the likely date of birth of Jesus Christ
  • 1204 for the Crusader sack of Constantinople, and division of the Eastern and Western Christian world, and origins of Muslim power in the Middle East
  • 1439 for the introduction of the printing press to Europe
  • 1791 for the origin of the telegraph
  • 1944 as an alternative to 1945 for the new political structure, and also for the writing of seminal texts on what would later be seen as post-modernism and as Thatcherite / Reaganite politics
  • Web commentators have added some dates of ancient battles, some significant dates for Asia in particular, 1492 (a bit US-centric again), 1919 for the Treaty of Versailles, 1940 for the Battle of Britain; and, interesting, 1958 for the origin of the integrated circuit and the basis of modern computers.

    So .. if we discount ‘political’ and ‘military’ dates, whose importance seems to vary dramatically according to your national original and political views, and ‘religious’ dates, important (presumably) only if you are an adherent of the religion in question … then we seem to be left with printing, the telegraph, the integrated circuit… ‘Information dates’ seem to have a stronger hold over the imagination that the dates of, say, medical advances, or developments in transportation or agriculture. Reality or perception, I wonder ?

    mcflynnMy vote for most important year ? I suppose I would go with the introduction of printing, since we don’t have a good date for the start of writing. But, more parochially, and as I’ve written in an editorial for the Journal of Documentation (2009, vol. 65, no. 3) , I have a fondness for 1759. Establishment of the British Museum, start-up of Kew Gardens as a serious botanical institution; and, according to Frank McFlynn, when the Brits took over the world [McLynn, F., 1759: The year Britain became master of the world, London: Jonathan Cape, 2004].

    How to forget, when you’ve remembered to do so

    Posted August 26, 2009 by dbawden
    Categories: Uncategorized

    vanish-logoIn a previous post, I wrote about the need to ‘forget’ the cloud of digital information which we all seem obliged to create nowadays, and which may come back to haunt us forever. I mentioned the idea of having some of this information ‘self-destruct’ after an appropriate time. Those clever people (Hank Levy, Tadayoshi Kohno, Roxana Geambasu and Amit Levy) in the Computer Science Department at the University of Washington in Seattle have now arranged for just this to happen, with their aptly named Vanish software.

    Amit Levy and Roxana Geambasu

    Amit Levy and Roxana Geambasu


    This allows you to specify that a document will irreparably disappear after eight hours. At present, it works with emails, blog postings, Google Docs documents, and Facebook messages, and only with the Firefox web browser. The creators are planning to extend the range of information entities to be vanished, and to allow a wider range of life times. Of course, a recipient can always make a non-vanishing copy, and, as the creators have noted, government agencies will no doubt have ways round it. But it’s a certainly a nice start in enabling us to leave the past behind.

    Remembering to forget

    Posted August 14, 2009 by dbawden
    Categories: Uncategorized

    Tags: , , ,

    memory2Most of us have problems with remembering things at time. Memory problems usually go one way – we can’t remember things. Drastic loss of memory has been a theme of many books (I can, just about, remember Asimov’s Currents of Space as being the first with this theme that I read) and movies (the Bourne series, I understand, not having seen any of them, has this theme). The lack of incentive for memorisation has been lamented over thousands of years, from the increasing use of printed books reducing the need for the ‘memory arts’ of the ancient world, which lasted well into the Renaissance, to current laments about the disappearance of ‘pub quiz style’ general knowledge in the face of Google and Wikipedia.
    google gen kno

    Remembering always seems to be the problem. The idea that forgetting might also be good at times has been slower to catch on. Some analogy, I suppose, to the way in which the need to filter incoming information to avoid overload has, until recently, been much less recognised than the need to find good information in the first place. Deliberate forgetting has in some cases been recognised as a good thing, and again science fiction has led the way. Spock used his Vulcan mental powers on occasion to make others forget, while the characters of Greg Bear’s Eon series had ‘good forgetings’ as birthday presents.

    The idea seems now to be creeping into information management. James Harkin, in an article in the London Sunday Times, points up a number of horror stories about how old material on social networking sites came back to haunt its originators. He gives this as the most dramatic example of how digital information can remain active indefinitely, not merely cluttering up our ever-expanding storage systems and confusing our not-expanding-at-all brains, but doing real harm.

    deleteA possible solution is proposed by Viktor Mayer-Schonberger in a new book Delete: the virtue of forgetting in the digital age. He argues for a mode of ‘digital forgetting’, by setting expiry dates on each and every digital document, photograph, message, etc. that we create. This will he suggests. make our digital memories more like their brainware equivalents.

    Or perhaps we should take a more literary view of things, and look to a new era of memory arts, closely associated with poetry, as David Barber nicely suggests, in his essay Does Memory have a Future.

    Whichever way it turns out, I think that we will see ideas of memory, and forgetting, as gaining a new importance as the digital transition engulfs us. Who knows, perhaps a resourceful professor, looking for a new course to market, will come up with a degree in forgetfulness.