Our Approach

We build open infrastructure for science at scale, from community-driven roadmaps to collaborative living datasets and transparent public benchmarks. Everything we create is shared openly to accelerate discovery.

Our Roadmap

Imagine a world with abundant data.

The goal of our datasets is to enable scientists to create powerful predictive models that can advance the life sciences field.

The OPEN DATASETS INITIATIVE

A new paradigm for funding, collecting, and sharing large,
high-fidelity datasets in biology.

We are working to accelerate community-driven use of automated labs to pioneer robust data collection methods with the goal of curating high-fidelity, AI-ready biological datasets. Our goal is to enable scientists to create powerful predictive models that can advance the life sciences field.

Why we need predictive models.

Life science is on the cusp of a monumental transformation. “AI capabilities are growing rapidly, and now is the time to develop broader predictive models that can provide answers to unanswered questions at every size scale of biology: from molecules, to whole cells, to the behavior of cells at the macroscale… the next century will resemble a coordinated, whole-field effort to divide biology into a series of prediction tasks and then solve those tasks, one-by-one.” Read more perspectives from our Co-Founder Erika in her essay What Biology Can Learn from Physics.

Interested in getting involved?