Technologies

Paving the Way for New Discoveries and New Ideas

We develop high-throughput & interdisciplinary technologies that can measure, perturb, predict, or interpret cellular and genomic features. Traditionally, biology has been grounded in iterations of observation, individual perturbation, and measurement. The efficiency and scalability are limiting our ability to address many fundamental questions. We are developing and applying predictive biology technologies as well as high-throughput experimental techniques to address these challenges, aiming to accelerate discoveries in fundamental biology

Next Generation Genetics: Artificial Intelligence (AI) & In Silico Genetic Screen (ISGS)

For decades, forward and reverse genetic screens have been central in functional studies of genes and beyond. Forward genetic screening starts with phenotypes and aims to determine the genetic basis responsible for a given phenotype, while reverse genetic screening starts from known genes or more broadly, DNA sequences and assays the effect of each gene upon perturbation. However, traditional genetic screening typically requires a large-scale setup and is strongly limited by the available resources and experimental feasibility. 


We recently proposed and developed the in silico genetic screen (ISGS) research framework as a next-generation approach to genetic discoveries. The ISGS framework integrates advanced machine learning / artificial intelligence models and high-throughput in silico genetic perturbation. Similar to the experimental genetic screen, the ISGS framework interrogates the effect of perturbations through accurate computational prediction in an ultra-high-throughput scenario. We recently developed C.Origami, a deep neural network that performs de novo prediction of cell type-specific chromatin organization with optimal performance. Coupling the C.Origami model with the ISGS framework, we systematically analyzed how individual DNA elements affect chromatin organization across the genome. We continue developing novel predictive biology technologies and high-throughput in silico genetic screen platforms to enable more fundamental discoveries in genome sciences.

Chemical Genomic Technologies

Chemical genomics, at the intersection of chemistry and genomics, plays an important role in the advanced understanding of the genome through technological innovation. Chemical approaches for studying dynamic biological interactions such as protein-DNA and protein-protein interactions set the foundational knowledge for genome regulation. Bisulfite salt treatment to DNA bases has provided fundamental insights into the epigenetic status of the genome. We embrace all chemical magics that can enable genomic discovery and cell fate engineering.


We have pioneered the development of several chemical genomic technologies for DNA epigenetic modifications. DNA methylation (5mC) and TET-protein-mediated DNA methylation-derivatives (5hmC, 5fC, and 5caC) represent one major part of epigenetics. The critical information to understand the function of an epigenetic factor is to profile its genome-wide distribution pattern. Bisulfite reaction and sequencing used to be a golden standard for DNA methylation, but failed to distinguish these new epigenetic bases. We have developed several unique and robust chemical methods to analyze the genome distribution map of 5fC and 5hmC. Represented by 'fC-CET' and 'CLEVER-seq', these methods demonstrated the concept of 'bisulfite-free & base-resolution' analysis of DNA epigenetic modifications. We used these methods to analyze the epigenomes in embryonic stem cells and single cells of the early developing embryo. These technologies will help us understand the molecular basis of epigenetic gene expression regulation and how these chemical modifications – and their modifier and reader proteins – affect mammalian development.