Project Description
Methods and tools for exascale analysis of microbiome data, including metagenome assembly, protein clustering, and comparative metagenome analysis. Some analysis of large scale plant genomes or pan genomes may also be included. This is part of the Exascale Computing Project and will also stress various architectural features of the system, including injection rate, one-sided communication, GPUs, and collective communication. The algorithms and data structures include hash tables, histograms, sparse (unstructured) matrices, and graphs as well as machine learning methods (in particular Graph Neural Networks).
Testbed
Iris and Yarrow