Who am I?

Education and Work

Undergrad: Virginia Tech:

  1. Mathematics
  2. Computational Modeling and Data Analytics.

Work:

2018-2021: Software engineer with an emphasis on machine learning and computer vision in 3D microscopy at Janelia Research Campus. I worked with the cardona lab, funke lab, mouselight project team, and scicomp.

2021-present: Machine Learning Consultant. I continued to work with teams at Janelia: cellmap as well as a few other labs and companies including E11 BIO, Uri Manor at the salk and Kate McDole at Cambridge

Most of my work has been in running machine learning projects for large scale image processing in massive 3D volumes of neurons and other cell types throughout tissue samples ranging from drosophila neural tissue to mouse liver, kidney, and heart. I have worked with a wide variety of microscopy data from many different microscope setups such as cryo-em, fib-sem, lightsheet, confocal, and spinning disk.

Code

I am very interested in the intersection of software engineering and scientific computing. I have quite a few libraries that I have contributed to and/or maintain related to the field of large scale segmentation.

  1. gunpowder, a package that helps build data processing pipelines in a compute graph structure. This package is extremely helpful for generating training data for volumetric image processing models.
  2. daisy, a tool for blockwise parallelized processing of large volumes with robust logging and error handling. It has some very neat tools for handling the parallelization of tasks that have complicated block boundary interactions.
  3. funlib.*, a collection of packages containing helpful generalized solutions for simple tasks such as creating and opening datasets with the appropriate metadata, querying our data with appropriate units, and visualization code with neuroglancer.
    • funlib.geometry - Provides Coordinate and Roi classes as helpers for working with spatial arrays.
    • funlib.persistence - Provides crud (create/read/update/delete) interfaces for large scale arrays and graphs along with convenient api calls to read/write with spatial queries using Coordinate and Roi objects.
    • funlib.math - Some reusable helper functions such as n dimensional cantor pairing function along with its inverse.
    • funlib.show.neuroglancer - Provides a nice command line interface for spinning up a neuroglancer instance for the visualization of local n dimensional zarr arrays.
    • funlib.learn.torch - Provides custom torch modules and loss functions commonly used in deep learning applications in microscopy environments.
  4. dacapo, a framework designed to help manage configure and apply top machine learning methods on volumetric microscopy data. Used extensively by the openorganelle project to manage the hundreds of networks we trained for various organelles and datasets.
  5. mwatershed, a rust implementation of mutex watershed that is easily pip installable.
  6. tems, a library that provides models that are particularly convenient for blockwise processing.