2011Aug23

From Filtered Push Wiki
Jump to: navigation, search


Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2011Aug23


Agenda

  • Connecting Harvard and UCD grant admins
  • The Mapper
  • Agenda for this Friday's Meeting

Reports

  • Paul
    • Reviewing CVs of 25 Candidates for Project Programmer position.
  • Lei
    • For the TDWG demo, made a dataset based on last year’s BioBlitz data and completed the outlier identification workflow
  • Tim
    • Investigating GBIF's Integrated Publishing Toolkit. Will be installing and playing with its data mapping features this week.
    • Installed Specify 6 to look at it's features.
  • James
    • Spent three days working on AppleCore in Montreal with Peter Desmet, Patrick Sweeney, and some help from Luc Broulliet. Made it through most of the relevant fields in the Darwin Core. There are several challenges associated with botanical specimens for which there are more than one way to deal with. We favored remaining flat with the exception of the identification field. If interested see: http://code.google.com/p/applecore/wiki/TableOfContents?tm=6
    • I am also working on the TDWG Annotation abstract and one for an AppleCore lightening talk
  • Maureen
    • Got a Kepler web service and Lei's workflow running with the BioBlitz data


Notes

FilteredPush Team Meeting 2011 Aug 23 Present: Maureen, Paul, Bertram, James H., Lei, Tim, James M. Agenda:

  • Connecting Harvard and UCD grant admins
  • The Mapper
  • Agenda for this Friday's Meeting

(Nico Franz on the schedule after 30min!?)

  • TDWG demo

Notes

  • Connecting Harvard and UCD grant admins

Seems to be progressing.

  • The Mapper

Tim did a demo a few weeks ago and got a little bit of feedback, but at this stage we should evaluate where we are and where we're going. Do we need to do more investigation, or do we do engineering? Tim is asking for people with insight to think about what we're doing, what will work and what won't, and give feedback.

Similar technologies being investigated are GBIF's IPT, Specify 6.

Would our mapping tool be something that could be packaged as part of these other technologies?

IPT is scaling back from "publish data + get quality control annotations" to "publish data."

Canadensys is using IPT 2.

The general mapping problem is more extensive than what might be "allowed" by IPT 2.

IPT mapping is done by a db admin who makes the schema flat for matching with DWC.

One of FP's additional requirements is to receive annotations and have them inserted. IPT on the other hand, provides an interface for querying data.

IPT 1 had some code for analyzing data quality and showing the user some report based on that analysis. The user would then change the data or the mappings.

Over the next week, we can start working on a charter for engineering.

  • Agenda for this Friday's Meeting

(Nico Franz on the schedule after 30min!?)

Friday meeting: we will have guests. Nico Franz from ASU, and another from Northern Arizona. Entomology collections forming a thematic coordinating network for a round of funding this fall. We've proposed they might have use for FP to connect their collections and use quality control for their taxonomic data. This meeting occurs during the normal tech call, so the non-usual tech call participants will call in to the tech meeting in progress, at which point we will switch to that topic.

We should ask how we will fit into the hub. We should ask the hub and some program officers at nsf to clarify.

Entomology collections have a similar duplicate problem (as botany) though the duplication has more to do with the label and the locality than the specimen itself.

  • TDWG demo

Lei's made a dataset based on TDWG 2010 BioBlitz data, and has constructed a workflow (outlier identification). She and Maureen should work together to make the whole process run. Would Lei be taking a laptop to TDWG with everything running on the laptop?

Would it be Maureen doing the demo, as this demo is supposed to be FP-centric rather than Kepler-centric (cf. SPHNC) ?

We'd like to be able to quickly start with a fresh copy of initial data: "reset."

(What's the schedule again for the demo, i.e., by when do we plan to have a first version running?)

Three weeks before the TDWG meeting.

Other notes:

James M. is working on the abstract for the annotation meeting.