Developing large-scale algorithms for sequencing data using Python (NumPy, matplotlib, pandas, sklearn, etc.). Applications
generally involve fitting high-dimensional data and make use of multiprocessing and distributed computing.
Examples include DemuxHMM and Wasserstein Principal Curves.