2013Apr24
Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2013Apr24
Agenda
- Report from Bio-IT meeting
- Specify on Postgresql
- Demonstration of Launch of Kepler QC workflow from Annotation Processor UI - "Quality Control My Data".
- Upcoming Meetings
- SPNHC ( [1] ) ApplePie
- Registration by May 15, Abstracts due May 15
- SPNHC ( [1] ) ApplePie
- Kepler Georeference QC Actor.
- Annotations
- Progress on rewriting dwFP, OAD, and example annotations.
Non-Tech
- Annotations
- Annotation MS
- Collaborations
- Specify/Symbiota
- SCAN TCN
- Niko will be looking for a FP/SCAN update in the next few weeks.
- NEVP TCN
For Future meetings
- Upcoming Meetings
- OA East coast rollout meeting.
- Rebuild of demonstration (and video)
- MCZbase Driver
- Specify Integration
- Prospective meetings, development targets.
- TDWG (late October) http://www.tdwg.org/homepage-news-item/article/tdwg-2013-call-for-symposia-and-workshops/
- CNH: meeting will include NEVP. Workshop to get feedback from the botanists present. In Vermont, July ApplePie
- Duplicate Finding
- Embedded Kepler - Duplicate finding, More QC Tasks.
- Task Group for Applicability Statement on OA
Reports
- Paul
- Working with Bob on Revisions of data annotation paper.
- Revised more examples from AO/AOD/TdwgOnt to OA/OAD/dwcFP. Examples 1-6 now expressed as OA/OAD, 7 and 8 remain as do some non-numbered examples which may be obsoleted by examples in the test suite.
- Maureen
- attended Bio-IT convention in Boston Apr 9-11
- was on vacation the week after that
- submitted a patch to the Specify development list that contains an alternate database setup wizard; this new wizard allows the choice of HSQLDB or PostgreSQL rdbms in addition to MySQL; patch also contains enough code changes to allow the thick client to run on the non-MySQL rdbms
Notes
FilteredPush Team Meeting 2012 Apr 24
Present: Bertram, Maureen, Bob, Paul, David, James, Tianhong, Jim, Heather, Joel
- Report from Bio-IT meeting
Maureen: Learned of some tools we may be able to use, and some things about visualization, and somethings about data processsng in the cloud.
- Specify on Postgresql
Maureen: Exploring an approach to tree issues, got a working patch for Specify to run over Postgreql instead of MySQL.
- Demonstration of Launch of Kepler QC workflow from Annotation Processor UI - "Quality Control My Data".
BL: Instead of tens of thousands of records, we use a very small number (a handful?) to create a reasonable demo response time. Question: how does what we show relate to a more realistic use case? For long-running workflows, a different MoC/UI paradigm is needed!? E.g. asynchronous / call-back mechanisms? Or the user might receive an email once the result is available? Another (or additional) way is to have Kepler "push out" status/progress info (e.g. showing for each actor the # of records processed so far) and then display that info on a FPush "dashboard".
MK: we should have a full set of mockups for all the control flows through the software before we bring a dedicated UI person on.
MK: are results kept forever or how/when are they disposed of?
- Upcoming Meetings
- SPNHC ( [2] ) ApplePie
Abstract based on this demo.
- Registration by May 15, Abstracts due May 15
- Kepler Georeference QC Actor.
Tianhong: Working on implementation (architecture of actor ok, working with shapefile loading). What dataset to run on?
Paul: Good to work from a small dataset with synthetic problems - known errors where the actor should produce a known solution.
Bertram: Any Java shapefile libraries.
Paul: Look at OpenJump http://www.openjump.org/
Tianhong: got it
- Annotations
- Progress on rewriting dwFP, OAD, and example annotations.
Bob: Driven by what is needed by manuscript. AnySuchResource a pending decision that may affect examples.
Bob: Need complete examples for the fragments in the paper.
Paul: new dwcFP:DwCTripletSelector.
David: Implications for configuration, probably just need to reconfigure type.
Non-Tech
- Annotations
- Annotation MS
Bob: Aim to have a just about finalized draft to circulate on Friday.
- Collaborations
- Specify/Symbiota
Thanks for Maureen doing the port to Postgresql.
- SCAN TCN
- Niko will be looking for a FP/SCAN update in the next few weeks.
- NEVP TCN
- SCAN TCN
Paul: Ed signed off on the code for ingesting new OccurrenceAnnotations.
James: Data quality discussion on taxacom worth memorializing on wiki, some interesting ideas from folks, and interesting expectations.
See: http://mailman.nhm.ku.edu/pipermail/taxacom/2013-April/thread.html "Data Quality in Aggregated Datasets thread"