Topic Detection and Tracking

project: Moving Manchester / Mediating Marginalities: How the experience of migration has informed the work of writers in Greater Manchester

Moving Manchester (formerly 'Mediating Marginalities') has spent the past four years (2006-2010) investigating the ways in which the experience of migration has impacted upon contemporary writing in the city and, by extension, the ways in which these multicultural publications and performances have impacted upon the urban population's view of itself as well as the wider perception of Manchester as a British city. [read more]

tool: Solr

Purpose: 

Solr is an open source enterprise search platform from the Apache Lucene project. It operates as a standalone full-text search server within an appropriate servlet container, such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language.

Features: 

• May be tailored to many types of application with minimal programming knowledge
• Extensive plug-in support
• Full-text indexing and search

A&H use case 1 description: 
The “British Cartoon Archive Digitisation (BCAD)” project has used Solr to deliver the search results and metadata.
Creator: 
CNET Networks
Publisher: 
Apache Software Foundation
Software/programming languages used: 
Suite: 
Data structuring and enhancement: 
Alternate tool(s): 

Sphynx

Licence: 
lifecycleStage: 
Platform: 

tool: Lucene

Purpose: 

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

Features: 

• Scalable, high-performance indexing
• Powerful, accurate and efficient search algorithms
• Cross-platform solution

A&H use case 1 description: 
The “Freeze Frame – Historic Polar Images 1845-1960” project has used Lucene for advanced search of photographs from both Arctic and Antarctic expeditions.
Creator: 
Doug Cutting
Publisher: 
Apache Software Foundation
Software/programming languages used: 
Suite: 
Data structuring and enhancement: 
Alternate tool(s): 

InQuira, Verity, dtSearch, ISYS

Licence: 
lifecycleStage: 
Platform: 

tool: CONTENTdm

Purpose: 

CONTENTdm is digital collection management software that allows for the upload, description, management and access of digital collections.

CONTENTdm is mostly used by libraries, archives, museums, government agencies, universities, corporations, historical societies, and other organizations that wish to host a digital collection.

Features: 

• supports numerous industry standards including Unicode, Z39.50, Qualified Dublin Core, VRA, XML, JPEG2000 and OAI-PMH
• flexible and fully customisable
• integration with OCLC products
• not limited by format

A&H use case 1 description: 
The “First World War Poetry Digital Archive” project has used CONTENTdm to form the backend of the archive of highly valued primary material from major poets of the First World War period.
Creator: 
CISO, Center for Information Systems Optimization
Publisher: 
OCLC
Software/programming languages used: 
Specifications: 
Data structuring and enhancement: 
Alternate tool(s): 

Fedora Commons, DigiTool, MetaStar

Licence: 
lifecycleStage: 
Suite: 

tool: MantisBT

Purpose: 

MantisBT is a free popular web-based bugtracking system written in the PHP scripting language.

The most common use of MantisBT is to track software defects. However, MantisBT is often configured by users to serve as a more generic issue tracking system and project management tool.

Features: 

• event-driven-plug-in system
• works with MySQL, MS SQL, PostgreSQL, SQLite, Oracle and IBM DB2 databases
• RSS Feeds
• Customisable workflow
• Wiki integration
• Chat integration

A&H use case 1 description: 
The “First World War Poetry Digital Archive” project has used MantisBT to track development and the use cases developed were used as reference points for successful development prior to user testing.
Creator: 
Kenzaburo Ito and Victor Boctor
Publisher: 
Futureware Pty Ltd
lifecycleStage: 
Specifications: 
Alternate tool(s): 

JIRA, Trac, Bugzilla

Software/programming languages used: 

tool: Concordance

Purpose: 

A software tool for performing concordance – the analysis of a set of words within its immediate context - on a body of text. The tool performs full concordance, reading and analysing each and every word in a text. It was initially written for the analysis of English texts, but has since been extended to cater for other Western languages. Limited support is also provided for text in East Asian scripts, such as Chinese and Korean.

Features: 
  • Index and word list creation
  • Word frequency count
  • Word usage comparison
  • Keyword analysis
  • Phrase and idiom discovery
A&H use case 1 description: 
The Historical Corpus of the Welsh Language 1500-1850 project used Concordance to analyse samples of Welsh text of different stylistic levels and varying geographic provenance that were created between 1500-1850.
Creator: 
R.J.C. Watt
Publisher: 
R.J.C. Watt
Specifications: 
Data capture: 
Software/programming languages used: 
Discipline: 
Data structuring and enhancement: 
Alternate tool(s): 
Licence: 
lifecycleStage: 

project: Person Data Repository of the 19th Century

The project “Construction of a repository for biographical data on historical persons of the 19th century” – short form: Person Data Repository – enhances the existing approaches to data integration and electronically supported research in biographies. It investigates connecting and presenting heterogeneous information on persons of the “long nineteenth century” (1789–1914). The project's aim is to provide a de-central software system for research institutions, universities, archives, and libraries that allows combined access on biographic information from different data pools. [read more]

project: JainPedia

JainPedia will be a free world-leading resource on the web. It offers translations and transcriptions of selected texts and a wealth of contextual information about the Jain religion and its host society in India. The JainPedia team is leading the digitisation of approximately 4,000 pages of the thousands of jain manuscripts and Jain objects in the United Kingdom. The involvement of eminent academics and volunteers from the Jain community in the project highlights how the expertise and enthusiasm of different groups can work together to produce a valuable resource for all. [read more]

project: Schenker Documents Online

The twentieth century's leading theorist of tonal music, Heinrich Schenker produced a series of innovative studies and editions between 1903 and 1935 and left behind a voluminous archive of correspondence, diaries and lessonbooks. Edited in near-diplomatic transcription and with English translations, these materials form the core of the edition, supported by additional documents relating to his life, and a set of "profiles" of people, places and organizations with which he came into contact. [read more]

project: Nineteenth Century Serials Edition

A three year Arts and Humanities Research Council (AHRC) funded project, ncse seeks to achieve two key objectives: First the ncse project responds to the pressing need to republish these fragile printed items in ways which maintain their integrity. As physical collections are often incomplete, and deteriorating quality hampers access, electronic editions offer new opportunities to re-present such material in a way that is, for the first time online, comprehensive and freely available meaning that the material can be used in entirely novel ways. [read more]

Pages