2013May15

From FilteredPush
Jump to: navigation, search


Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2013May15

Agenda

  • Progress on SPNHC demonstration
    • Kepler QC Actors.
    • Annotation processor
    • Driver support for ingest of insert and update georeference annotations
    • Results and Provenance
  • Upcoming Meetings
    • SPNHC ( [1] ) ApplePie
      • Registration by May 15, Abstracts due May 15 (today)
  • Annotations
    • Progress on rewriting dwcFP, OAD, and example annotations.
  • MCZbase Driver
  • Duplicate Finding

Non-Tech

  • Annotations
    • Annotation MS
  • Collaborations
    • Specify/Symbiota
    • SCAN TCN
      • Niko will be looking for a FP/SCAN update in a week.
    • NEVP TCN

For Future meetings

Reports

Notes

Present: David, Paul, Maureen, James, Bob, Tianhong, Jim

  • Progress on SPNHC demonstration
    • Kepler QC Actors.

Tianhong: Workflows are working and generating annotations

Paul: Actors?

Tianhong: Scientific name validator, flowering time validator, Modify georeference.

On agenda for Friday: Update scientific name validator to use current GBIF services and to have a QC process for dealing with nomenclatural/taxonomic issues, e.g. homonyms. Friday: Set up framework for questions to ask domain users.

Tianhong: Currently working on use of data quality library in development by Canadensis (coordinating with David Shorthouse).

    • Annotation processor

David: Added display for "solve with more data". Added a spreadsheet view to the launch analysis page, can view results of analysis. Can export as csv from that view (first sheet in example spreadsheet of results). Working with Tianhong and Sven on changes to result set (largely separating styling from data), have some added changes to propose.

Demo: Spreadsheet view of JSON data as stored in MongoDB, loaded into a backing bean. Applying styling to show images and colors on the far right (record status, check, delta, x).

James: David highligting: User chose quality control on scientific name, complication is that X shows more than just yes/no, should we report the inconsistency.

Jim: Like the direction here, in particular the colors - having colors is intuitive. Red category may be correlated with many different reasons - beware of biting off too much at this time. Focus on correcting defects in the data - think of fittness for use for particular research purposes as more advanced.

Bob: Analogy to content negotiation: consumer asks for something, producer can respond with what it has available (whatever it has a set of rules for). Main thing may be that the annotation originaor pushes an annotation into the network and the conumer says can you give it to me in form x, consumer needs to be able to undersand what they need for x. We can probably do some simple examples where everyone is onboard with the domain vocabulary.

David: Is there a requirement for transitive reasoning on the taxon heirarchy, if not, we can just use the mongo query?

Paul: Something we need to worry about, but not for the demo.

David: Should be all on the UI end, Kepler should just run the query supplied.

    • Driver support for ingest of insert and update georeference annotations

Maureen: Have been working on removing the chain of servlets involved in driver invocation. Have separated out the J2EE container bits from the buisness logic. Need to look at this more closely on Friday - getting to run the annotation procesor + driver in one JVM instead of three different ones. Will be small step from there to doing FilteredPush interactions with the driver.

    • Results and Provenance

Tianhong: How do we do the second level of the spreadsheet?

David: Data is all there, just UI.

Tianhong: Link to next sheet (or mouseover on problematic data).

David: Need to look at available widgets.

  • Upcoming Meetings
    • SPNHC ( [2] ) ApplePie
      • Registration by May 15, Abstracts due May 15 (today)

Paul: Abstracts due today. Democamp abstract in good shape. Please send any comments right away. Sent out first draft of second (presentation abstract) please comment.

James, Heather, Paul are attending.

  • Annotations
    • Progress on rewriting dwcFP, OAD, and example annotations.

Bob: Checking that all the examples use consistent namespaces, domain vocabularies, etc. Ancilary materials for paper.

  • MCZbase Driver

Maureen: Awating firewall issues from RC, DB rebuild from Brendan.

  • Duplicate Finding

Non-Tech

  • Annotations
    • Annotation MS

Bob: Narative in good shape. Edits from previous revisons incorporated. Might make some changes to order of sections. Will submit soon. Working on getting all ancilary materials in shape and checking through reviewer's issues.

  • Collaborations
    • Specify/Symbiota
    • SCAN TCN
      • Niko will be looking for a FP/SCAN update in a week.
    • NEVP TCN


James: Bob had something we should submit to?

Bob: International semantic web conference (in Sydney), includes workshop on semantic web applications in the sciences (annotations? Paolo as keynote speaker). Deadline in June. Would need someone to go to Australia. Definitely budgetary issue.

Bob: Went to monthly semantic web meetup in Cambridge (at MIT), Paolo was speaking. Some interest in the sort of things we are doing.