Grabbing the long tail of search engines

A revamped search engine from Heriot-Watt University in the UK makes the most of the long tail allowing searchers to drill way down into the most obscure, but nevertheless useful, pages across dozens of technology databases and sites.

“It’s a prerequisite for any successful search service in technology
subjects to have a ‘Long Tail’ (or large inventory), Heriot-Watt’s Roddy MacLeod explains, “This is because the majority of search queries made by technologists, or by others seeking information in technology-related subjects, tend to be very specific. It’s in the nature of the subjects, and the real information retrieval needs of those involved in these subjects, for granularity to be important.”

MacLeod heads up the team behind TechXtra http://www.techxtra.ac.uk/. This search engine aggregates content from a vast array of databases with technology-related content, and provides meta results that are essentially invisible to general search engines. “A search of TechXtra will search across more than 4 million records of various kinds – articles, technical reports, digital theses and dissertations, books, eprints, news items, job announcements, video, learning & teaching resources, key websites, and more – most of which relate to technology subjects,” MacLeod adds.

You can restrict searches to a particular format (technical reports, or
articles, or books, and so on), or select only specific databases among those listed with the ubiquitous Advanced Search option.

Among the most interesting of the databases for the Youtube generation is perhaps the Open Video Project, which is a growing repository of digitized video. And serious tech research can provide the full text of thousands of theses, eprints from arXiv, earthquake engineering
technical reports from Caltech, and almost half a million articles in computer and information science from CiteSeer. In addition many articles in Digital Open Access Journals (DOAJ), are available.

I’ve added a TechXtra link to the Sciencebase science search toolbox and will upgrade that to a fully-fledged search link in the next few days.

One thing which is a fairly recent development is the number of freely available full text digitised theses that TechXtra can now access. “These theses are the results of considerable research,” MacLeod told me, “they can be excellent resources on specialised topics.

HW University also produces the superb Internet Resources Newsletter.