2012Nov14

From Filtered Push Wiki
Jump to: navigation, search


Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2012Nov14

Agenda

  • Annotations
    • Progress on: Rewriting examples/rules from AO/AOD to OA
  • ApplePie
    • What Constitutes ApplePie
    • NEVP DigitizationApparatus to Symbiota ingest mapping draft.
  • Embedded Kepler
    • Hello world
    • Source(s) for data for QC
  • Drivers (API documentation, MCZbase).
  • Progress on pubsubhubub implementation test.

Non-Tech

  • Annotations
    • Annotation MS
    • Task Group for Applicability Statement on OA
  • Collaborations
    • Specify/Symbiota
    • SCAN TCN
    • NEVP TCN

Reports

  • Jim: Harvard has renewed FP subcontract with UC Davis.
  • Jim: Draft NIBA Implementation Plan is available at <http://blogs.aibs.org/niba/>. Comments welcome.
  • Maureen: put Solr and a Lucene index of sample HUH specimen data on fp3

Notes

Present: Bob, David, Maureen, Paul, James, Jim, Bertram, Tianhong, Heather

  • Annotations
    • Progress on: Rewriting examples/rules from AO/AOD to OA

David: Annotation generator generating annotations in OA for identifications, response, georeference. Updated sparql queries (e.g. used in symbiota annotation tab). Mostly involves changing configuration - Lei has configuration to load rule set. Change over close to complete.

Bob: Tests to validate - make sure that if we load an example in the endpoint and run the relevant query, we get the example back in the query results. Should repeat test for each rule defined annotation type (two rounds, one synthetic example, another using generated on the fly annotation documents).

  • ApplePie
    • What Constitutes ApplePie

James: Two sides: FP-Medium? What is missing - duplicates, quality control. Second, who are the people.

Paul: Probably FP-Medium with duplicate detection, qc with Kepler, and annotations.

Bob: Should specify what we expect to deliver with respect to interests. On research grounds demonstrate with reasoning (e.g taxonomic hierarchies, perhaps geographic hierarchies, a few type hierarchies).

James: NEVP - CNH logical and good. Nice to include a few outsiders to demonstrate function. Good to include university of florida, CA consortium. Key issue is support for deployment.

Maureen: List of administrative use cases, what do participants expect. What support is needed?

James to compile list of the people we should be talking to for ApplePie from the proposal.

Paul and Bob to have conversation with Jim Beach about direction of specify and integration with specify. Improvements on duplicate detections (Australian's, others..)

Bertram: UCD starting to look into "million records challenge", i.e., stress-test first Kepler with some simple curation workflows and see whether/where it breaks. Then compare with an early Kurator/P prototype (having Tianhong work on that after we're done with some version of "embedded Kepler")

    • NEVP DigitizationApparatus to Symbiota ingest mapping draft.

Paul: Have draft, need to circulate.

  • Embedded Kepler
    • Hello world
    • Source(s) for data for QC

Tianhong: Able to deploy code to VM. Working on getting Kepler to run on the VM. Issues with tests.

Bertram: looking for large test data set. Have list of actors to run (three curation steps).

Paul: Need to extract data from GBIF cache.

  • Drivers (API documentation, MCZbase).
  • Progress on pubsubhubub implementation test.

Maureen: Working on driver interface some more, mostly specify driver. Lucene index and solar on VM used by UCDavis. (10k-100k records).

Non-Tech

  • Annotations
    • Annotation MS
    • Task Group for Applicability Statement on OA
  • Collaborations
    • Specify/Symbiota
    • SCAN TCN
    • NEVP TCN