Automatic Biodiversity Literature Enhancement (ABLE): a project overview
Morse, D.R.; Dil, A.; King, D.J.; Willis, A.G.; Roberts, D.M.; Lyal, C. [s.d.]. Automatic Biodiversity Literature Enhancement (ABLE): a project overview. [S.n.]: [s.l.]. 6 pp.
| |
| Authors | | Top |
- Morse, D.R.
- Dil, A.
- King, D.J.
|
- Willis, A.G.
- Roberts, D.M.
- Lyal, C.
|
|
| Abstract |
We introduce the ABLE project, which aims to enhance access to collections of scanned documents from the biological taxonomy literature. Searching this literature needs to be robust to errors introduced by Optical Character Recognition and other sources. Biological knowledge, especially taxonomic knowledge, is often presented in a stylised form, generally using typographical clues to its meaning. This project aims to use typographical information and other contextual clues to identify and tag document content by its type. We describe some of the difficulties encountered in interpreting these scanned texts, and briefly discuss some methods of dealing with these issues. |
|