job: Research Associate in Information Extraction from Scanned Literature, OU

Application deadline has expired!
Application Deadline: 16/10/2008

The Open University and the Natural History Museum are seeking a postdoctoral researcher to work on concept extraction from scanned taxonomic literature.

Scanned texts contain errors introduced by imperfect OCR and other sources, so techniques are required that are robust in the face of such errors. The successful applicant will develop techniques that use typographical and contextual cues to identify and tag relevant document content.

You will have a PhD (or equivalent experience), and experience in one or more of the following:

- natural language processing/information extraction/information retrieval, in particular from noisy data;
- image analysis and feature extraction;
- document layout (reverse-engineering a DTD);
- XML for mark-up and term annotation;
- broad familiarity with biological systematics.

Good programming skills are essential, as is the ability to learn quickly. Applications from candidates with a background in the biological sciences who can demonstrate appropriate computing skills are encouraged.

Based in Milton Keynes. 12-month contract.

£23,002 – £33,780

For enquiries about the research project, please contact Dr David Morse, d.r.morse@open.ac.uk.
