ECP 2.2.4.04 ExaBiome

PI Name Katherine Yelick
PI Institution Lawrence Berkeley National Laboratory
Collaborating ANL Division Leadership Computing Facility (LCF)
Project Description

Methods and tools for exascale analysis of microbiome data, including metagenome assembly, protein clustering, and comparative metagenome analysis. Some analysis of large scale plant genomes or pan genomes may also be included. This is part of the Exascale Computing Project and will also stress various architectural features of the system, including injection rate, one-sided communication, GPUs, and collective communication. The algorithms and data structures include hash tables, histograms, sparse (unstructured) matrices, and graphs as well as machine learning methods (in particular Graph Neural Networks).

Testbed

Iris and Yarrow