I want to subscribe to any message that purports to be about duplicates for anything in a certain taxonomic group of the collection. (curatorial case) I'm interested in taxon T because I'm working in the collection. There's a special form for sending the name of the taxon as a subscription to a genus field. This subscription will return to you a list of specimen records and annotations that relate to the query. How do we order the results? Hierarchy of specimens and then annotations as subcategories? I want to restrict results to records from a particular institution (maybe).
(Note that there may be several levels of related records: a specimen record may be directly retrieved, and then an annotation associated with that specimen, and an annotation of the annotation...)
There are other scenarios related to research as opposed to quality control or data entry. A researcher may want to know about outliers as opposed to matches. Is an outlier of scientific interest, or is it noise? Is it noise related to observation, or observation tools.
Research is not just taxonomic; important information may be in geographic data, for example.
Logo: it's a tortoise!
We should have a story. Why a tortoise?
Entomology: Paul says FP duplicate detection might apply to entomology as well. Specimens collected in the same place at the same time are likely to have the same label data applied to them.
Searching databases in Google: Google provides tools to allow you to provide data and pages for their indexer.
Wolfram has an analytical tool for providing analyzed search results. Current terms of service say that they own the results.
James did some looking for how often the phrase "fuzzy matching" comes up in discussions of biodiversity: a lot. Maybe what we're doing is better described as intelligent mapping. Our fuzzy matching is applied in some narrow cases, more as knowledge of sets of field numbers associated with collectors.
There is important negative data involved in biodiversity research: "went collecting and didn't find specimens here," maybe FP can help make that data available.
What's the meaning of voucher? Does electronic data count (example of video of ivory billed woodpecker)? How are specimens related to observations? An examined voucher can provide information about data quality.