2013Apr24

From Filtered Push Wiki
Jump to: navigation, search


Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2013Apr24

Agenda

  • Report from Bio-IT meeting
  • Specify on Postgresql
  • Demonstration of Launch of Kepler QC workflow from Annotation Processor UI - "Quality Control My Data".
  • Upcoming Meetings
    • SPNHC ( [1] ) ApplePie
      • Registration by May 15, Abstracts due May 15
  • Kepler Georeference QC Actor.
  • Annotations
    • Progress on rewriting dwFP, OAD, and example annotations.

Non-Tech

  • Annotations
    • Annotation MS
  • Collaborations
    • Specify/Symbiota
    • SCAN TCN
      • Niko will be looking for a FP/SCAN update in the next few weeks.
    • NEVP TCN

For Future meetings

  • Upcoming Meetings
    • OA East coast rollout meeting.
    • Rebuild of demonstration (and video)
  • MCZbase Driver
  • Specify Integration
  • Prospective meetings, development targets.
  • Duplicate Finding
  • Embedded Kepler - Duplicate finding, More QC Tasks.
  • Task Group for Applicability Statement on OA

Reports

  • Paul
    • Working with Bob on Revisions of data annotation paper.
    • Revised more examples from AO/AOD/TdwgOnt to OA/OAD/dwcFP. Examples 1-6 now expressed as OA/OAD, 7 and 8 remain as do some non-numbered examples which may be obsoleted by examples in the test suite.
  • Maureen
    • attended Bio-IT convention in Boston Apr 9-11
    • was on vacation the week after that
    • submitted a patch to the Specify development list that contains an alternate database setup wizard; this new wizard allows the choice of HSQLDB or PostgreSQL rdbms in addition to MySQL; patch also contains enough code changes to allow the thick client to run on the non-MySQL rdbms

Notes

FilteredPush Team Meeting 2012 Apr 24

Present: Bertram, Maureen, Bob, Paul, David, James, Tianhong, Jim, Heather, Joel

  • Report from Bio-IT meeting

Maureen: Learned of some tools we may be able to use, and some things about visualization, and somethings about data processsng in the cloud.

  • Specify on Postgresql

Maureen: Exploring an approach to tree issues, got a working patch for Specify to run over Postgreql instead of MySQL.

  • Demonstration of Launch of Kepler QC workflow from Annotation Processor UI - "Quality Control My Data".

BL: Instead of tens of thousands of records, we use a very small number (a handful?) to create a reasonable demo response time. Question: how does what we show relate to a more realistic use case? For long-running workflows, a different MoC/UI paradigm is needed!? E.g. asynchronous / call-back mechanisms? Or the user might receive an email once the result is available? Another (or additional) way is to have Kepler "push out" status/progress info (e.g. showing for each actor the # of records processed so far) and then display that info on a FPush "dashboard".

MK: we should have a full set of mockups for all the control flows through the software before we bring a dedicated UI person on.

MK: are results kept forever or how/when are they disposed of?

  • Upcoming Meetings
    • SPNHC ( [2] ) ApplePie

Abstract based on this demo.

      • Registration by May 15, Abstracts due May 15
  • Kepler Georeference QC Actor.

Tianhong: Working on implementation (architecture of actor ok, working with shapefile loading). What dataset to run on?

Paul: Good to work from a small dataset with synthetic problems - known errors where the actor should produce a known solution.

Bertram: Any Java shapefile libraries.

Paul: Look at OpenJump http://www.openjump.org/

Tianhong: got it

  • Annotations
    • Progress on rewriting dwFP, OAD, and example annotations.

Bob: Driven by what is needed by manuscript. AnySuchResource a pending decision that may affect examples.

Bob: Need complete examples for the fragments in the paper.

Paul: new dwcFP:DwCTripletSelector.

David: Implications for configuration, probably just need to reconfigure type.

Non-Tech

  • Annotations
    • Annotation MS

Bob: Aim to have a just about finalized draft to circulate on Friday.

  • Collaborations
    • Specify/Symbiota

Thanks for Maureen doing the port to Postgresql.

    • SCAN TCN
      • Niko will be looking for a FP/SCAN update in the next few weeks.
    • NEVP TCN

Paul: Ed signed off on the code for ingesting new OccurrenceAnnotations.

James: Data quality discussion on taxacom worth memorializing on wiki, some interesting ideas from folks, and interesting expectations.

See: http://mailman.nhm.ku.edu/pipermail/taxacom/2013-April/thread.html "Data Quality in Aggregated Datasets thread"