bioRxiv, / 2025
Evolution simultaneously and combinatorially explores complex genetic changes across perturbation classes, including gene knockouts, knockdowns, overexpression, and the creation of new genes from existing domains. Separate technologies are capable of genetic perturbations at scale in human cells, but these methods are largely mutually incompatible. Here we present CRISPR-All, a unified genetic perturbation language for programming of any major type of genetic perturbation simultaneously, in any combination, at genome scale, in primary human cells. This is enabled by a standardized molecular architecture for each major perturbation class, development of a functional syntax for combining arbitrary numbers of elements across classes, and linkage to unique single cell compatible barcodes. To facilitate use, CRISPR-All converts high level descriptions of desired complex genetic changes into a single DNA sequence that can rewire genomic programs within a cell. Using the CRISPR-All language allowed for head-to-head functional comparisons across perturbation types in a comprehensive analysis of all previously identified genetic enhancements of human CAR-T cells. Combining CRISPR-All programs with single cell RNA sequencing revealed a greater diversity of phenotypic states, including improved functional performance, only accessible through distinct perturbation classes. Finally, CRISPR-All combinatorial genome scale screening of up to four distinct perturbations simultaneously revealed additive functional improvements in human T cells accessible only through iterative multiplexing of modifications across perturbation classes. CRISPR-All enables exploration of a combinatorial genetic perturbation space, which may be impactful for biological and clinical applications.
Current methods for the precise integration of DNA sequences into the genome of human T cells predominantly target exonic regions, which limits the choice of integration site and requires complex cell-selection strategies. Here we show that non-viral intron knock-ins for incorporating synthetic exons into endogenous introns enable efficient gene targeting and selective gene knockout in successfully edited cells. In primary human T cells, the knock-in of a chimaeric antigen receptor (CAR) into the T-cell receptor alpha constant locus facilitated the purification of more than 90% CAR+ T cells via the negative selection of T-cell-receptor-negative cells. The method is scalable, applicable across intronic sites, as we show for introns within four distinct endogenous surface-receptor genes, and supports the integration of large synthetic exons (longer than 5 kb), of alternative splicing architectures that preserve endogenous gene expression, and of synthetic promoters allowing for endogenous or user-defined gene regulation. Non-viral intron knock-ins expand the range of targetable genomic sites and provide a simplified and high-throughput strategy for selecting edited primary human T cells.
Protein-coding genes in the human genome evolved via modular rearrangement of domains from ancestral genes1. Here, we develop a scalable, evolutionarily guided method to assemble novel protein-coding genes from constituent domains within a protein family, termed DESynR (Domain Engineered via Synthesis and Recombination) genes. Using primary human chimeric antigen receptor T cells as a model system, we find that the expression of DESynR Activator Protein-1 (AP-1) transcription factors (TFs) significantly outperforms the overexpression of natural AP-1 TFs in multiple functional assays in vitro and in vivo. Top DESynR AP-1 TFs exhibit non-intuitive architectures of constituent domains, including from TFs that are not canonically expressed in T cells. DESynR AP-1 TFs induce broad transcriptional and epigenetic reprogramming of T cells and, in some cases, lead to the development of non-natural T cell states, engaging gene expression modules from disparate human cell types. Taken together, we demonstrate that novel configurations of existing protein domains may uncover non-evolved genes that program cell states with therapeutically relevant functions.
Cell therapies have yielded durable clinical benefits for patients with cancer, but the risks associated with the development of therapies from manipulated human cells are understudied. For example, we lack a comprehensive understanding of the mechanisms of toxicities observed in patients receiving T cell therapies, including recent reports of encephalitis caused by reactivation of human herpesvirus 6 (HHV-6)1. Here, through petabase-scale viral genomics mining, we examine the landscape of human latent viral reactivation and demonstrate that HHV-6B can become reactivated in cultures of human CD4+ T cells. Using single-cell sequencing, we identify a rare population of HHV-6 ‘super-expressors’ (about 1 in 300-10,000 cells) that possess high viral transcriptional activity, among research-grade allogeneic chimeric antigen receptor (CAR) T cells. By analysing single-cell sequencing data from patients receiving cell therapy products that are approved by the US Food and Drug Administration2 or are in clinical studies3-5, we identify the presence of HHV-6-super-expressor CAR T cells in patients in vivo. Together, the findings of our study demonstrate the utility of comprehensive genomics analyses in implicating cell therapy products as a potential source contributing to the lytic HHV-6 infection that has been reported in clinical trials1,6-8 and may influence the design and production of autologous and allogeneic cell therapies.