GROQ-seq

Gathering the data needed to link protein sequence to function

Overview

GROQ-seq is Align’s flagship experimental platform for measuring protein function at scale. It lets researchers test up to 500,000 protein variants per experiment across diverse biological contexts, producing the high-quality, standardized datasets needed to train next-generation predictive models in biology. By systematically mapping how protein sequence drives function, we’re building the foundation for models that can unlock discoveries across the life sciences.

Strategy

Sequence-to-Function: GROQ-seq

At its core, GROQ-seq is a pooled growth assay that links a protein’s function to cell fitness. Each library of protein variants is barcoded, transformed into E. coli, and grown under selective pressures such as antibiotics or environmental conditions. Barcode sequencing at multiple time points allows us to track the growth dynamics of each variant. A calibration ladder of reference variants ensures that measurements are quantitative and reproducible across experiments and labs.
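To make the idea of tracking growth dynamics from barcode counts concrete, here is a minimal sketch of how a per-variant fitness score could be estimated. It is an illustration under stated assumptions, not Align's actual analysis pipeline: the function names, the pseudocount, the toy read counts, and the choice of a single reference barcode (standing in for the calibration ladder) are all hypothetical. The sketch converts read counts to log frequencies at each time point, subtracts the reference, and fits a straight line over time; the slope is the variant's growth rate relative to the reference.

```python
import math

def log_frequencies(counts_by_time, pseudocount=0.5):
    """Convert read counts to log frequencies at each time point.

    counts_by_time: list of dicts (barcode -> read count), one per time point.
    The pseudocount (an assumed value) avoids log(0) for dropped-out barcodes.
    """
    result = []
    for counts in counts_by_time:
        total = sum(counts.values()) + pseudocount * len(counts)
        result.append({bc: math.log((n + pseudocount) / total)
                       for bc, n in counts.items()})
    return result

def least_squares_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

def relative_fitness(times, counts_by_time, reference):
    """Slope of log(variant frequency / reference frequency) over time.

    Assumes every barcode, including the reference, appears at every time point.
    """
    logf = log_frequencies(counts_by_time)
    fitness = {}
    for bc in counts_by_time[0]:
        log_ratio = [lf[bc] - lf[reference] for lf in logf]
        fitness[bc] = least_squares_slope(times, log_ratio)
    return fitness

# Toy data (hypothetical): three barcodes sampled at 0, 2, and 4 hours.
times = [0.0, 2.0, 4.0]
counts_by_time = [
    {"ref": 1000, "fast": 1000, "slow": 1000},
    {"ref": 1000, "fast": 2000, "slow": 500},
    {"ref": 1000, "fast": 4000, "slow": 250},
]
fitness = relative_fitness(times, counts_by_time, reference="ref")
for barcode, score in sorted(fitness.items()):
    print(f"{barcode}: {score:+.3f} per hour")
```

In this toy data the "fast" barcode doubles relative to the reference every two hours, so its slope comes out close to ln(2)/2 per hour, and the "slow" barcode gives roughly the negative of that. A real analysis would use the full calibration ladder instead of one reference and would need to handle sequencing noise and low-count barcodes.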

Assay Capabilities

GROQ-seq assays are:

  • Scalable — Supporting 300,000–500,000 variants per run
  • Low-cost — ~$0.05 per variant (excluding DNA synthesis)
  • Extensible — Adaptable to a wide range of protein functions including enzymes, transcription factors, RNA polymerases, and more
  • Standardized — Compatible with automation across labs, with data served via a unified ontology and accessible API

These capabilities make GROQ-seq a powerful tool for building generalizable models of protein function, accelerating advances in protein engineering, synthetic biology, and therapeutic design.

Timeline

2023

Held Protein Sequence to Function Workshop

2024

Methods Development

2025

Start Platform Expansion & Onboarding New Users

2026

Finish Platform Expansion & Onboarding New Users

Technical Program Manager

Dana Cortade

Dana Cortade holds a PhD from Stanford, where she focused on developing medical diagnostic devices. She has expertise in bioengineering, optics, and materials science. Dana’s experience across academic and nonprofit settings allows her to drive Align’s mission of enabling global collaboration through accessible technology platforms.

Explore the dataset

  • Transcription Factors
  • Proteases
  • Aminoacyl tRNA Synthetases
  • Single-Chain Antibody Fragments
  • Histidine Kinases

Interested in getting involved?