2014Oct07
Etherpad for meeting notes: http://firuta.huh.harvard.edu:9000/FP-2014Oct07
Agenda
Non-Tech
- Publications
- Progress
- Paul: Collection Objects
- Bob: Refactoring Dup finding cluster analysis
- Progress
Tech
- QC for SCAN
- Feedback from Neil/Paul Heinrich
- QC work
- Adding agent authority file to Symbiota - harvest to solr index - use in actor.
- Firuta server move.
- Cleanup of artifacts in Archiva - complete?
- Update deployed apps in Tomcat and Apache (Symbiota, Morphbank, Annotation Processor)
- Distribution upgrade
- Status of Mongo on FP2
- Deployments
- Access point updates
- Bringing Annotation Processor up-to-date
- Deploy and re-run harvest of occurrence records
- Status of fp2 and fp3
- SVN re-organization/cleanup
Reports
- Paul
- Working on enhanced agent tables in Symbiota to allow maintenance of scientist authority lists.
Notes
Present: Bertram, David, Tianhong, Paul, Jim, Bob
Non-Tech
- Bertram: FYI: Joined a project kick-off (SKOPE: Synthesized Knowledge of Past Environments) at ASU (w/ Keith Kintigh .. archaeology/anthropology); themes: data integration, workflows, provenance: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1439603
Also met with Nico Franz over lunch: opening of their new biodiversity center: http://taxonbytes.org/impressions-alameda-grand-opening/
- Publications
- Progress
- Paul: Collection Objects
- Progress
Paul: More work on draft, haven't circulated yet.
- Bob: Refactoring Dup finding cluster analysis
Bob: Refactoring code to the point it can be run on real data - needs a hadoop cluster, setting that up.
Tech
- QC for SCAN
- Feedback from Neil/Paul Heinrich
David: Planning for Paul to come in on the tech call tomorrow. Good for Paul to get some more context in order to get useful feedback.
- QC work
- Adding agent authority file to Symbiota - harvest to solr index - use in actor.
Paul: Got started on needed schema changes (to support authorities for agents curated by symbiota projects that we can harvest for QC services).
- Firuta server move.
- Cleanup of artifacts in Archiva - complete?
David: Looks like this is complete - added artifacts and read-only access,.
David: FP-CurationServices has a georeferencing library issue (open geo), Tim had to make a modification to the pom file.
- Update deployed apps in Tomcat and Apache (Symbiota, Morphbank, Annotation Processor)
David: Symbiota and Morphbank deployed, not configured yet. Haven't deployed annotation processor or access point yet.
Paul: I Put a copy of the Specify-HUH database in place there to back the annotation processor.
David: I Grabbed copy for local machine as well.
- Distribution upgrade
Paul: Done on Firuta.
- Status of Mongo on FP2
David: Has been stable for last few weeks since the cleanup and space increase. Biggest issue was the result set collections containing incorrectly repeated values.
- Deployments
- Access point updates
David: Access Point still needs to be updated. Working with Bob on client helper and deployment of client helper.
- Bringing Annotation Processor up-to-date
David: Have been looking at Maureen's driver code, looking at applying a spring data configuration to the hybernate layer - then coupling that to business logic from the NEVP ingest - getting back to how the driver and annotation processor should communicate.
- Deploy and re-run harvest of occurrence records
David: Still needs the work on the bash scripts to do the harvest.
- Status of fp2 and fp3
David: Symbiota 4 has client helper for NEVP (not yet configured) and SCAN. FP2 has an about 2 week old access point deployed. Bug in sparql query on returing annotations - returns too many. FP3 has all of the supporting software installed (except akka), needs updates to configuration and redeployment of access point. Would be good to have analysis running on a separate VM, both Tianhong and I have been running analysies on local machines.
Paul: FP1?
David: With a RAM increase.
- SVN re-organization/cleanup
David: FP-Deployments, removed old subprojects, ready to tag FP-JavaSOA (pre camel javaEE code can be tagged and removed, next after that will be FP-Tools (lots of driver projects there to trim down).
Agenda for Thursday: Benchmarking and perfomance issues in the Akka workflows. Tim to join starting week after.