2013Jun12
Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2013Jun12
Reminder: Change of meeting time effective Sept 4: (12-1 Eastern 9-10 Pacific).
Agenda
- Project/Package Refactoring: Developer Documentation
- Annual NSF project report: update GoogleDocs doc by June 24th.
- SPNHC Demonstration
- Walkthrough of current state.
- Status of not yet included pieces.
- Example data and workflow.
- Annotations
- DwC RDF Guide
- MCZbase Driver
- Kepler
- Akka Akka_Analysis_Engine
- Taxon name cleaning Embedding_Kepler#Scientific_Name_Validator
- Provenance and rendering
- Duplicate Finding Find_Duplicates
Non-Tech
- Third Project Programmer, Burndown.
- Recent Contacts
- Collaborations
- Specify/Symbiota
- SCAN TCN
- NEVP TCN
For Future meetings
- Prospective meetings, development targets.
- TDWG (late October) http://www.tdwg.org/homepage-news-item/article/tdwg-2013-call-for-symposia-and-workshops/
- CNH: meeting will include NEVP. Workshop to get feedback from the botanists present. In Vermont, July ApplePie
- Task Group for Applicability Statement on OA
Reports
- Paul
- Got AnnotationProccessor installed on laptop, connected to access point on fp3.
- Added more error handling and reporting to annotation processor, particularly for configuration problems and missing network services.
- Some work on user management functionality in the annotation processor.
Notes
FilteredPush Team Meeting 2013 June 12
Present: James, Paul, Jim, Maureen, Bob, David, Tianhong
- Project/Package Refactoring: Developer Documentation
David: Minor updates from things found missing in last weeki.
- Annual NSF project report: update GoogleDocs doc by June 24th.
James: Will start into soon.
- SPNHC Demonstration
- Walkthrough of current state.
Paul: Using local specify and FP3 annotation processor, walked through current state of code.
Jim: Thinking ahead, underscores need to develop user interface and documentation to support less technologically aware users. From experience at SPNHC last year, audience is very heterogeneous, many are not informatics savy - important to provide background and context for that part of the community - so that they can understand what is going on during the demonstration.
Jim: What would be an answer to a question about timeline for deployment to production use?
Discussion: Likely timscale, about 6 months, still need to refine the workflow actors, and UI elements for dealing with annotations and workflows.
- Status of not yet included pieces.
Deployment - issues with current revision.
- Specify Driver: Maureen working on connecting to current annotation processor. Paul to get copy of specify data set to Maureen.
- Showing Workflow results. Need to get latest Kepler jar working, deployment issues with its build.
- Ingest of update georeference annotations. Working on driver side, still some work in annotation processor.
- Tuning workflow and example data. Paul and Tianhong by email.
- Switch of Access Point services to port 80. Access Point and Annotation Generator.
- CSS issue with icon sets in primefaces.
- Show RDF: Styling issue with pre tag, need to change tag.
- Plan B deployment. Paul hasn't tried yet.
- Example data and workflow.
Add 6 more Non NAU Curculionidae records to set available in Mongo for Analysis.
Remove Flowering time validator from workflow.
Change taxon name validator from IPNI to GBIF class.
For after next week. (issues to be resolved before deployment)
- How to deal with Client Identity
- Matching users to local data sources, datasource access management
- Scheduling harvests, finding harvestable datasources
- Where does harvested data go? Mongo? Is Mongo a staging area for Workflows, or a repository of harvested data? If we harvest into Lucene, and have workflows get data from Lucene, then we have an easy way to do faceted search on the data. Just use Mongo for workflow output.
- Purposes of harvest: (1) providing data for reasoning (e.g. taxon heirarchies). (2) providing data for analysis - quality control and duplicate finding.
- Identifier resolution for harvested data
- Annotations
- DwC RDF Guide
Bob is working on a message about the guidance document.
Steve not yet looking for feedback on the RDF dwc representation, that's the time for us to raise parallels in dwcFP.
- MCZbase Driver
Brendan hasn't gotten time to do the database dump/build yet.
- Kepler
- Akka Akka_Analysis_Engine
- Taxon name cleaning Embedding_Kepler#Scientific_Name_Validator
Call, James, Paul, Tianhong today.
Need some more examples and test cases for Tianhong.
- Provenance and rendering
- Duplicate Finding Find_Duplicates
Non-Tech
- CNH Meeting
James: Give this demonstration, solicit feedback.
Paul: Overview perhaps:
- Give context and a demonstration
- Breakout groups with tasks to solicit feedback on current ui elements.
- Breakout groups tackling large numbers of annotations.
- Third Project Programmer, Burndown.
Posted.
- Recent Contacts
- Collaborations
- Specify/Symbiota
- SCAN TCN
- NEVP TCN