High-Throughput Workflow for Computer-Assisted Human Parsing of Biological Specimen Label Data

The Texas Center for Digital Knowledge (TxCDK) at the University of North Texas and the Botanical Research Institute of Texas (BRIT) are conducting fundamental research with the goal of identifying how human intelligence can be combined with machine processes for effective and efficient transformation of textual museum specimen label information into high-quality machine-processible parsed data. This two-year project, which we call Apiary, will advance understanding of the workflow and processes best able to increase access to and use of digitized biological collection metadata within the stakeholder communities comprised of biologists, natural history museum collections managers, biodiversity standards groups, and the library and information science community.
 
The Apiary Project is funded by Institute of Museum and Library Services National Leadership Grant # 06-08-0079-08. The Institute of Museum and Library Services is the primary source of federal support for the nation’s 122,000 libraries and 17,500 museums. The Institute's mission is to create strong libraries and museums that connect people to information and ideas.