A systematic guide for identifying transcription factors that directly regulate the expression of a gene of interest

Andrew D. Bates; Dawid Grzela; Maciej Studzian; Louise Brennan; Moli Williams; Conor Fawcett; Beth Hammond; Manreen Grewal; Marcin Ratajewski; Lukasz Pulaski; Urszula L. McClurg

doi:10.1101/gr.281154.125

A systematic guide for identifying transcription factors that directly regulate the expression of a gene of interest

¹Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, United Kingdom;
²Institute of Medical Biology, Polish Academy of Sciences, 93-232 Lodz, Poland;
³Bio-Med-Chem Doctoral School of the University of Lodz and Lodz Institutes of the Polish Academy of Sciences, 90-237 Lodz, Poland;
⁴Medical University of Lodz, 90-419 Lodz, Poland;
⁵Department of Oncobiology and Epigenetics, University of Lodz, 90-237 Lodz, Poland

↵6 These authors contributed equally to this work.

Corresponding authors: urszula.mcclurg{at}liverpool.ac.uk, mratajewski{at}cbm.pan.pl, lpulaski{at}uni.lodz.pl

Next Section

Abstract

Transcriptional regulation lies at the heart of cellular identity and function, hinging on the precise binding of transcription factors (TFs) and cofactors to gene regulatory elements such as promoters and enhancers. Although it is relatively routine to profile genome-wide DNA binding landscapes of proteins, identifying the specific proteins that bind to, and regulate the transcription of, a particular gene of interest (GOI) remains a persistent experimental and conceptual challenge. This gene-centric question is complicated by the multilayered regulatory environment in which each gene resides, comprising 3D chromatin structure, enhancer–promoter looping, DNA accessibility, histone modifications, and cell state–dependent protein dynamics. In this review, we dissect the strengths, limitations, and biological relevance of current approaches for studying direct protein–DNA interactions, distinguishing between protein-centric and DNA-centric methodologies. We introduce a conceptual matrix of biological relevance, integrating the origin of DNA and protein elements (cis and trans) to evaluate false-positive and false-negative risks across experimental systems. Moreover, we explore how perturbation strategies—gain and loss of function—can complement steady-state profiling to establish causality in gene regulation. By critically examining both established tools and emerging techniques such as genome editing, synthetic chromosomes, and high-resolution imaging, we provide a practical framework for investigators seeking to uncover direct regulators of specific genes. Our goal is to guide the design of experiments that balance biological relevance, sensitivity, and interpretability to ultimately answer a deceptively simple question: What TFs directly regulate the expression of my GOI?

Cellular processes are regulated by transcription machinery binding the promoters and enhancers of target genes to activate gene expression. This is followed by a cascade of RNA processing and protein translation, allowing for the newly expressed protein to execute its function, which in turn may contribute to cell signaling. Consequently, one of the main questions that often arises in research is what transcription factors (TFs) regulate the expression of my gene of interest (GOI) directly on its promoter. However, answering this is not straightforward. Genes exist within multilayered microenvironments of proximal and distal promoter–enhancer interactions, DNA accessibility, histone post-translational modifications (PTMs), and DNA looping and high-order chromatin structure (Panigrahi and O'Malley 2021; Karr et al. 2022; Chen et al. 2024b). Furthermore, the DNA interactome at a particular gene locus is controlled by the availability of the transcription machinery, its correct localization, transcriptional complex formation, and the PTM profile of regulatory proteins. To investigate direct DNA–protein interactions, scientists can either isolate a protein of interest and identify all of the DNA sequences capable of binding to it or, conversely, use a DNA sequence of interest to identify proteins that are capable of binding to it. These two approaches answer fundamentally different questions: The first one allows us to discover the DNA binding profile of a protein of interest; the second one is aimed at identifying proteins that may directly regulate a GOI.

Protein-centric approaches were initiated by the development of chromatin immunoprecipitation (ChIP) in 1984 followed by ChIP-qPCR and ChIP-seq, which allowed scientists to probe the full palette of DNA molecules bound by proteins of interest. A ChIP experiment begins by using formaldehyde to chemically cross-link DNA and protein molecules (Das et al. 2004). Nuclei are isolated, and the chromatin (which now contains fixed protein–DNA complexes) is sonicated, or another method of shearing it into fixed-average-length DNA fragments is applied. DNA fragments bound to proteins are incubated with antibodies specific to a protein of interest and antibody–protein complexes are precipitated using beads. To focus on individual candidate genes predicted to be bound by the protein of interest, precipitated DNA can be analyzed using qPCR with primers designed specifically against the regulatory sequences of the GOIs. Alternatively, in ChIP-seq following precipitation, cross-links are reversed, and the released DNA can be sequenced to identify all DNA sequences bound to the protein of interest using high-throughput platforms (Park 2009). Standard controls include comparing this immunoprecipitation to DNA binding by nonimmunized IgG raised in the same species as the protein targeting antibody, using qPCR primers designed against nonregulatory regions of the gene when enrichment is not expected, and ensuring DNA shearing with fragments that are not too long in order to prevent nonspecific GOI identification. ChIP has had a revolutionary impact on our understanding of biology and has spearheaded clinical translation. Complete genome-wide ChIP-seq for a TF can be mapped within the cells and tissues in an efficient and timely manner with novel technologies.

However, complications arise when we pose the DNA-centric question: which proteins are bound to, and directly regulate, a GOI. Initial approaches consisted of electrophoretic mobility shift assays (EMSA) (Hellman and Fried 2007). In an EMSA, labeled probes corresponding to the DNA of interest are synthesized or isolated and are consequently incubated with a purified protein of interest or with a mixture of proteins (e.g., a nuclear extract). To determine if direct binding of DNA and protein has occurred, reaction is separated on a nondenaturing agarose or polyacrylamide gel to study if there has been a shift in DNA mobility caused by increased size owing to protein binding. To confirm the identity of DNA-bound protein, mobility supershift by specific antibodies may be tested. However, EMSA lacks the majority of biological context. Many proteins are capable of binding naked DNA; however, within the cellular environment they might not be localized to the nucleus, preventing this binding. Furthermore, in the cell the DNA sequence of interest might not be accessible owing to chromatin compaction or PTMs as well as competition from other binding proteins. Consequently, EMSA can be applied to rule out DNA–protein interactions: If a protein is not able to bind the sequence of interest in an EMSA assay, it is unlikely that this interaction occurs in nature; however, positive binding in an EMSA requires further confirmation of interaction within the cellular environment.

When selecting a method to study DNA–protein interactions, it is crucial to consider various experimental parameters. In molecular genetics, DNA is referred to as the cis-element, whereas the protein that binds to it is the trans-element. The biological relevance of these elements must be evaluated individually. Analyzed DNA can be a synthetic, naked DNA sequence of in vitro origin; an artificial chromosome introduced into a cell (exogenous DNA); or endogenous DNA within the cell's genome. Exogenous DNA loses characteristics such as genome compaction and accessibility, whereas endogenous DNA allows for in cellulo analysis under physiological conditions. Similarly, proteins can be purified and studied in vitro, expressed exogenously from a plasmid, or endogenously produced within the cell from genomic DNA. Although plasmid-based expression allows for controlled experiments, it lacks physiological regulation of protein levels. Using endogenous protein ensures a physiological setting for studying DNA–protein interactions. These variations create a biological relevance matrix, in which cis-elements and trans-elements can range from fully artificial to completely physiological (Fig. 1). This choice of system affects the likelihood of false positives and false negatives. In vitro experiments carry a high risk of false positives, as binding may occur under artificial conditions but might not be reproduced in a cellular environment owing to factors like protein compartmentalization or DNA inaccessibility. However, false negatives are less likely. Endogenous cellular experiments have a lower risk of false positives owing to the physiologically relevant context. However, they are more prone to false negatives, as protein–DNA interactions may vary depending on the cell type, cell cycle stage, or environmental stressors and can be missed depending on the experimental setup. It is impossible to account for all these variables in a single experiment.

View larger version:

Download as PowerPoint Slide

Figure 1.

Matrix of biological relevance in DNA–protein interaction studies. Experimental approaches to study DNA–protein interactions vary in their use of cis-elements (DNA) and trans-elements (proteins). The matrix illustrates combinations of DNA substrates (in vitro naked DNA, exogenously introduced DNA, and endogenous chromatin) and protein sources (purified, plasmid-expressed, or endogenously produced) arranged by increasing physiological relevance along the diagonal from bottom right to top left. Cells are color-coded to reflect biological relevance, ranging from low (red) to high (green). Systems using purified proteins and naked DNA are prone to false positives, whereas fully endogenous systems, although biologically informative, carry a higher risk of false negatives owing to contextual dependencies such as chromatin state, cell cycle stage, and transcriptional noise. It is critical to carefully select experimental configurations to balance control, sensitivity, and physiological accuracy when investigating transcription factor–DNA interactions, although of course no method is free of risk; for example, EMSA can also generate false negatives (e.g., false negative for binding when a cofactor would be required in the cell environment).

Beyond biological relevance, it is also important to distinguish between methods that study the steady state of the cell versus those that introduce experimental perturbations for comparative analysis. Some techniques rely on genetic modifications to reveal functional differences. Gain-of-function approaches introduce new elements, such as exogenous proteins, activating mutations, plasmids, artificial chromosomes, or knock-in modifications. Conversely, loss-of-function approaches remove specific factors using mutations, siRNA silencing, or gene-editing techniques. These modifications can be applied to either the DNA or the protein, further influencing experimental outcomes.

Although well-established approaches have been developed to determine DNA binding patterns for a protein of interest, because of a lack of an established, and universally accepted, methodology for identifying direct regulators of a GOI, there is a risk of wasting time and money by taking false or indirect routes that will not answer the question at issue. In this review, we discuss advances that have been made to address this methodological challenge, as well as technical and biological limitations of existing technologies. We compare existing methods, provide a guide for considerations that need to be made during study design, and highlight novel approaches that are becoming possible with recent developments in genome editing as well as high-resolution imaging.

To answer which proteins are bound to and directly regulate the transcription of a GOI, we need to know the following:

– What are the regulatory elements, especially promoters and enhancers, of my GOI?
– What proteins bind to the regulatory elements of my GOI?
– Which of the proteins bound to my GOI directly regulate its expression?

Previous Section Next Section

Computational prediction of regulatory sites and TF binding sites

A eukaryotic gene typically consists of regulatory elements such as distal enhancers, which can act over long distances; a proximal promoter containing key TF binding sites; a core promoter surrounding the transcription start site (TSS) where the preinitiation complex assembles, followed by the coding region (exons and introns); and ending with a terminator and polyadenylation signals that ensure proper transcript processing and stability. Before we can determine proteins that directly regulate GOI expression, we must identify the regulatory DNA sequences relevant to the GOI such as its promoter(s) and enhancer(s) (Zhang et al. 2022). Although many tools exist for this purpose, newcomers should understand the benefits and limitations of in silico approaches. With advanced, often algorithmically opaque, machine learning tools now available, a cautious approach can save time and effort. It is critical to appreciate both the biochemistry of TF-DNA binding specificity and the probabilistic nature of tools that predict it. TF binding depends on a number of non-sequence-dependent factors in addition to the local DNA sequence information that prediction tools are based on, leading to an inordinately high level of false-positive predictions that the user must expect and take into account. Sequence-independent factors include DNA accessibility, competition from other binders, and TF localization regulation. This knowledge helps biologists generate testable hypotheses without overrelying on potentially misleading predictions.

Prediction of potential regulatory sites

This review focuses on gene-specific transcriptional regulation via direct TF binding to regulatory DNA sites. The initial task is thus to locate candidate DNA regions potentially involved in TF binding, and this starts with identifying the TSS, the RNA polymerase recruitment point. TF binding sites near the TSS form the promoter, whereas more distant ones are typically called enhancers (Bateman and Johnson 2022). Some genes have multiple functional TSSs, which complicates regulatory analysis, so alternative promoters must be considered (Nepal and Andersen 2023). For well-studied organisms like humans, genome-wide data are accessible and can assist with TSS identification. For lesser-known genomes, predictive tools use core promoter sites to infer probable TSS locations (Adato et al. 2024). These predictions should be verified by experimental data, with attention to possible alternative TSSs in introns or distant upstream regions (Alfonso-Gonzalez and Hilgers 2024). Widely-used tools for TSS prediction include the trainable ab initio software TSSFinder (de Medeiros Oliveira et al. 2021) tailored for broad, nonselective application in any eukaryotic organism, TSSPlant (Shahmuradov et al. 2017) designed for plant TSS prediction, and machine learning-enhanced options like DeePromoter (Oubounyt et al. 2019) designed exclusively for mammalian TSSs. Newer CamoTSS specifically calls TSS from actual RNA-seq data (Hou et al. 2023).

Most regulatory events occur at TF binding sites some distance from the TSS, and predicting these is crucial in computational genetics. TF binding operates within a topologically associating domain typically upstream of the TSS but sometimes within the gene. However, predicting the boundaries of this domain in silico is challenging; researchers often analyze an arbitrary sequence length surrounding the TSS or rely on database-mined experimental data. Sequence conservation analysis can help identify important regulatory sequences (Stark et al. 2007).

At least 40%–50% of human genes are subject to some degree of regulation by DNA methylation, depending on factors such as cell type, developmental stage, and environmental conditions (Gardiner-Garden and Frommer 1987; Jaenisch and Bird 2003; Illingworth and Bird 2009; Loyfer et al. 2023). CG-rich sequences are predominantly found in the 5′-flanking regions of genes and are bound by TFs such as Sp/Kruppel-like factor (Kaczynski et al. 2003; Suske et al. 2005). Methylation of these sequences prevents the binding of TFs, leading to the recruitment of proteins associated with methyl-CpG binding (Du et al. 2015), which in turn inhibits transcription. Several tools are available for predicting the presence of CpG islands in DNA sequences. Examples include the CpGplot tool from the European Molecular Biology Open Software Suite (EMBOSS) (Rice et al. 2000), the CpG Island Searcher (Takai and Jones 2003), CpGProD (Ponger and Mouchiroud 2002), and CpGPAP (Chuang et al. 2012).

Predicted regulatory sites can be validated experimentally using reporter gene assays. In this assay, the transcriptional activity of a predicted promoter, or an enhancer, of a specific GOI can be investigated using plasmids that include the predicted regulatory sequence upstream of a reporter gene (for promoter testing) or upstream of a known promoter linked to a reporter gene (for enhancer testing), encoding an easily assayed protein such as GFP, LacZ, or luciferase. Reporter assays involve DNA transcription within cells to evaluate their regulatory activity (Romanov et al. 2021), if the regulatory sequence is functional, the reporter gene is expressed/induced and the cellular concentration of the gene product is measured (Smale 2010).

TF binding prediction

TFs bind DNA in a sequence-specific manner based on motifs, with each binding site having a consensus sequence that indicates prerequisites of functional binding. Binding affinity is usually degenerate, with multiple sequence variations binding equally well (Stormo and Zhao 2010). Many TFs, especially in eukaryotes with larger genomes, have low specificity, leading to high false positives in binding site predictions. Short, degenerate DNA sequences matching a TF motif often occur by chance; consequently, most motif matches in the genome are not functional binding sites (Wasserman and Sandelin 2004). This highlights that a sequence matching a position weight matrix (PWM) does not necessarily indicate actual TF binding in vivo. Functional binding depends on additional context, including chromatin accessibility, the presence of cofactors, 3D genome architecture, cell type, and other regulatory factors—not just the DNA sequence itself. Consequently, predictive tools must tackle the challenge of filtering out nonfunctional sites, a task still evolving, with new algorithms frequently published after four decades of research (Stormo et al. 1982).

The classical method for dealing with motif degeneracy is to compare the regulatory sequence in a GOI to a PWM, which scores TF binding potential at each site based on experimentally derived probabilistic data (Mathelier and Wasserman 2013). PWMs, often displayed as sequence logos, which are bar graphs with bars made of nucleotide symbol letters, help illustrate binding motif conservation. Energy matrices, an alternative to PWMs, account for binding energy contributions. Still, PWMs remain dominant owing to early statistical approaches preceding the availability of structural data. PWM-based predictions depend on databases like TRANSFAC (Matys et al. 2006) and JASPAR (Khan et al. 2018). Modern high-throughput techniques that experimentally identify protein–DNA binding, such as ChIP-seq, SELEX, and protein-binding microarrays, have significantly improved database quality (Jolma et al. 2013). However, the human genome has nearly 2000 transcription regulators, many of which share consensus sites, adding complexity to predictions (Lambert et al. 2018).

Despite improvements, PWM-based predictions suffer from high false positives owing to oversimplified assumptions, such as treating nucleotides as independent. Dinucleotide matrices and k-mer approaches address some limitations (Siddharthan 2010), but factors like overlapping TF binding sites, cooperative binding, and DNA shape further complicate predictions (Srivastava and Mahony 2020). The chromatin context also affects TF binding, as many consensus sites may be inaccessible owing to histone-bound heterochromatin (Zhu et al. 2018). Chromatin openness strongly correlates with TF binding potential (Thurman et al. 2012), and DNA methylation and local DNA shape also influence binding (Yin et al. 2017). Lastly, it is important to remember that TFs can regulate genes without directly binding DNA, through complexes with other TFs (tethering) (Yamada et al. 2019). This effect must be experimentally verified to fully understand GOI regulation.

A summary of TF binding prediction for nonadvanced users

Starting with established PWM-based tools like MatInspector (Cartharius et al. 2005) can be effective. More advanced options, such as PSCAN (Zambelli et al. 2009) and MEME (Bailey et al. 2015), improve reliability by analyzing binding site enrichment. Tools like REUNION (Yang and Pe'er 2024), Pando (Fleck et al. 2023), and TRIPOD incorporate chromatin data, enhancing specificity. TFBShape and CRPTS account for DNA shape influences (Yang et al. 2014). Given the inherent challenges, modern prediction tools often incorporate machine learning to enhance traditional methods. Early tools like hidden Markov models and support vector machines have given way to neural networks, such as convolutional and recurrent models, which power tools like DeepBIND (Alipanahi et al. 2015) and DeepTF (Bao et al. 2019). Although user-friendly classical tools remain popular, newer models like DeepReg (Ledesma-Dominguez et al. 2024) and DeepGRN (Chen et al. 2021a) offer improved specificity by integrating chromatin accessibility.

Recent deep learning tools offer incremental improvements and may appeal to advanced users. However, for practical GOI analysis, especially in laboratories with limited IT resources, classical algorithms are often sufficient. Combining tools with awareness of their limitations allows researchers to design experiments with high-confidence hypotheses. However, one of the major steps when trying to identify functional regions of a gene involves systematic perturbation of its regulatory DNA through deletion, linker-scanning, or site-directed mutagenesis. These approaches enable precise mapping of essential cis-elements, such as promoters, enhancers, and TF binding sites, by directly testing the impact of sequence alterations on transcriptional activity.

Previous Section Next Section

Data mining genome-wide studies

Although computational prediction alone cannot explain the transcriptional regulation of your GOI, this does not mean that you have to do all of the “wet-laboratory” experiments yourself: Data mining is a crucial step in developing your hypotheses. Proliferation of easily accessible, large-scale, high-throughput studies provides fertile ground for analyses. The sources of useful data for inferring TFs that might regulate a GOI include

– Proximity to experimentally proven TSSs.
– Chromatin landscape indicating open chromatin or, more specifically, epigenetic features known to be linked with enhancers.
– TF binding to DNA sequences within the GOI locus, preferably within suspected regulatory regions.
– Positive correlation between expression of regulators (TFs) and the GOI product at the protein level; this can be determined by comparing different tissues/cell types or by observing fluctuations at the single-cell level.
– Instances when disruption, or perturbation, of the regulator (TF) causes disruption/perturbation of the GOI product: This may mean correlating the GOI expression with downregulation or disruption of TF, experimental disruption of TF binding site in the regulatory element, or phenotype (e.g., disease)-linked noncoding variants in regulatory sequences.
– Conservation of regulatory mechanisms in GOI orthologs in other organisms.

Identifying regulatory regions

To determine TSS(s) by data mining, whole-genome 5′-RACE, CAGE, or RNA-seq experiments are readily available either through genome browsers or through dedicated databases like FANTOM5 (Kawaji et al. 2014) or DBTSS (Suzuki et al. 2015). However, for genes expressed at a low level, whole-genome experiments might not have enough coverage to reliably pinpoint TSS(s).

Several databases provide comprehensive DNA methylation data derived from high-throughput sequencing, including single-base resolution methylation levels for individual CpG sites across the genome. These resources also annotate hypomethylated regions commonly linked to gene promoters, as well as allele-specific methylation patterns relevant to genomic imprinting. Notable examples include MethBase (Song et al. 2013), MethBank (Zhang et al. 2023), EWAS Atlas (Li et al. 2019b), Human Epigenome Atlas (Milosavljevic 2010), and iMethyl (Hachiya et al. 2017).

The euchromatic signatures present in enhancer-rich stretches of chromosomes include increased DNA accessibility to nucleases and modifying enzymes (e.g., measured by DNase I hypersensitivity), high nucleosomal turnover (e.g., high levels of rapid-turnover histones H2A.Z and H3.3), specific epigenetic markers (e.g., high ratio of H3K4me1 and H3K4me2 to H3K4me3, high ratio of 5hmC to 5mC, enrichment of H3K27ac), chromatin loops in direct contact with the core promoter (enrichment of cohesin complexes, peaks in chromatin proximity assays such as Hi-C or ChIA-PET), and the appearance of randomly, bidirectionally transcribed nonpolyadenylated cognate eRNA (Long et al. 2016). Most accessible repositories for the entry-level molecular geneticist with a limited number of GOIs are those that collate and integrate many different sources of data and databases because ease of use and reliability are key.

Three types of tools should be applied in parallel:

– Genome browsers with multiple annotation tracks, especially the unrivalled University of California at Santa Cruz (UCSC) browser (Perez et al. 2025)
– Nonspecialized hyperlinked databases of genetic and genomic information, such as the broad-scope Harmonizome (Rouillard et al. 2016), which is intuitive in use, or GeneCards (Stelzer et al. 2016), which is more limited with regards to gene regulation but abundant in information about gene function and links to practical research tools
– Tools oriented specifically toward gene regulation, which allow reliable data mining, such as the excellent Gene Transcription Regulation Database (Kolmykov et al. 2021), the main advantage of which is seamless integration of ChIP-seq, ChIP-exo, DNase-seq, and MNase-seq chromatin status with ChIP-derived data regarding specific TF binding.

Most large-scale data come from two major, multidecade projects, ENCODE (Luo et al. 2020) and FANTOM (Abugessaisa et al. 2021), both of which make their data available directly, but accessing it is much easier via the above-mentioned browsers and databases. If data for your GOI turn out to be limited or if there is reason to suspect cell-specific epigenetic factors at play, more focused chromatin accessibility databases should be studied, such as deepBlue (Albrecht et al. 2016), which is unfortunately available only via GitHub, or i-cis Target, which has a useful option of selecting the types of experiments to browse (e.g., ChIP-seq, FAIRE-seq, or ATAC-seq) (Verfaillie et al. 2015). Additionally, ATACdb, is specific to ATAC-seq data (Wang et al. 2021), whereas the 3D Genome Browser (Wang et al. 2018) compiles chromosome conformation capture (3C), HiC, and ChIA-PET data on chromatin loops.

Direct TF binding

There are two main original sources of data on specific TF binding to the regulatory sequences of your GOI: major genome-wide programs (such as the above-mentioned ENCODE or FANTOM, which are especially rich in data on ncRNAs) and individual studies that concentrate either on a limited number of TFs or on specific groups of genes. Once again, the best practical path to easily accessing these types of data is via curated integrating databases, especially Harmonizome and GTRD. It is important to remember that although these are experimental data that detected actual TF binding to a specific locus or site, this does not necessarily mean that the TF has an important function in GOI regulation, especially not in your favorite cell type/condition/treatment. However, this is probably the strongest hypothesis-generating tool available. More specialized tools for this approach include the Peak Browser at ChIP Atlas (Oki et al. 2018) or browsing ChIPBase (Huang et al. 2023), which is especially strong for ncRNAs but can also be used for mRNA genes. The Eukaryotic Promoter Database ExPASy has the useful Mass Genome Annotation Archive, which includes ChIP-seq TF annotation (Dréos et al. 2018).

Secondary sources that suggest regulators of your GOI

Correlating expression of your GOI at the RNA level with known TFs or other target genes, for which regulatory pathways have previously been elucidated, is an important way to generate testable hypotheses. Cscan is a simple tool for finding common regulators of several GOIs (Zambelli et al. 2012). COXPRESdb (Obayashi et al. 2019) and the somewhat more cumbersome SEEK (Zhu et al. 2015) were created to study coexpression relationships across a broad spectrum of data, including various model organisms, making it possible to search for a similar function. The richest source of potentially unexpected perturbation-based data comes from correlating genetic variability in candidate regulatory regions with known phenotypes. The GTEx Consortium et al. (2020) has analyzed variations in potential regulatory sequences and their association with expression quantitative trait loci (eQTLs) to form the most comprehensive resource of this type, accessible through the UCSC Genome Browser. If your GOI and/or candidate TF are known to be expressed in human cancer, two useful tools that are relevant are DepMap, which allows regulator/target coexpression analysis (Tsherniak et al. 2017), and the Xena Browser (Goldman et al. 2015), which can also be used to correlate variation with expression (both of GOI and of candidate TF). It is, however, important to remember that DepMap does not validate successful knockout and is susceptible to splice variant “blindness.” Human cancer is especially rich in variations, including coding region mutations of TF genes; these features can sometimes offer similar insights as knockdown/knockout experiments. The Eukaryotic Promoter Database has the SNP2TFBS subdatabase linking genetic variants to known, or potential, TF binding sites (Kumar et al. 2017).

One should not discard more remote sources to infer potential TFs that could regulate your GOI: Orthologous genes in better-studied organisms are often a goldmine of information for designing initial experiments. Analyze databases of regulatory regions in model organisms that combine data and predictions: The best-known (and available via many integrator tools) is ReMap (Hammal et al. 2022), but an interesting alternative is CistromeDB (Zheng et al. 2019). There are also time-honored genomic resources for specific model organisms, such as the Saccharomyces Genome Database (Cherry 2015), Arabidopsis Gene Regulatory Information Server (Yilmaz et al. 2011), WormBase (Sternberg et al. 2024), and FlyBase (Jenkins et al. 2022). Finally, sometimes little known publications contain gems of prepublished experimental data that should not be overlooked; carefully curated databases compile as many of them as possible, including JASPAR (Rauluseviciute et al. 2024) and ORegAnno (Lesurf et al. 2016), both available via the USCS Genome Browser. MSigDB contains curated gene sets from the literature; the most useful ones for the purpose described here are the regulatory target gene set and the chemical and genetic perturbation gene set (Liberzon et al. 2015). Among modern AI-based tools, TRRUST finds co-occurrences of mentions of terms (e.g., TFs and GOIs) in published literature (Han et al. 2015).

Practical gene regulation data mining for nonadvanced user

When you first approach the question of what regulates your GOI transcriptionally, you should start by compiling available information on chromatin landscape (especially regulatory region hallmarks) and TF binding from genome browsers (e.g., UCSC Genome Browser) and user-friendly databases (e.g., Harmonizome). For most genes and organisms, this should provide plenty of starting points for planning confirmatory experiments. If the available data appear insufficient, contradictory, or unsuitable, it may be necessary to delve into specialized databases or explore additional sources, such as perturbation studies, coexpression patterns, or homology analyses with other organisms. It is crucial to keep in mind that all in silico approaches, both predictive and based on data mining, are never sufficient to determine GOI regulators owing to the complexity of transcriptional regulation and intricacies of protein–DNA binding. Their main use is for generating experimentally testable hypotheses for further research.

Previous Section Next Section

Experimentally validating TFs by measuring transcription from the GOI

When dissecting transcriptional regulation, the first experimental point of interest is the process of transcription itself and its immediate product, the RNA transcript. The ability to detect whether the GOI is being transcribed in a cell or in a group of cells provides a direct readout that can be tested by introducing experimental perturbations to TF level or activity. Directed modifications to genomes in experimental models are becoming essential for elucidating how specific genes, their regulatory elements, and other genome-related components influence physiological processes. Gene editing encompasses various methodologies and involves the targeted insertion, deletion, modification, or replacement of DNA sequences at precise locations within the genome. In the simplest terms, if we can accurately detect transcription of a GOI by comparing transcript levels between wild-type (WT) cells and cells depleted of a predicted TF for the GOI, we can validate if the TF in question regulates the GOI. Similarly, we can compare transcription of the GOI between different cellular conditions such as hypoxia, starvation, DNA damage, etc., to determine the cellular context required for GOI expression.

Originally the nuclear run-on technique was developed to detect the emergent transcripts that are in the process of being produced from the GOI. In this approach, intact nuclei are flash-isolated from cells, depriving them of ribonucleotide substrates for transcription. This stalls RNA polymerases on genes that were in the process of being transcribed. Subsequently, transcription is restarted and allowed to run to completion by adding exogenous-labeled ribonucleotides that are incorporated into completed transcripts and allow for their identification, for example, by hybridization-based techniques with radiolabeling (Smale 2009). This approach was recently improved by including modified nucleotides that allow for enrichment of the emergent transcripts by affinity isolation. Some examples of modified nucleotides include bromouridine, which can be immunoprecipitated by specific antibodies (global run-on assay) (Core and Lis 2008), or biotinylated cytosine, which can be bound to streptavidin-based binders (precision run-on assay) (Mahat et al. 2016). The resulting transcripts are usually subjected to RNA-seq, but transcription from an individual GOI can also be quantified by qPCR. This method is compatible with experimental manipulations which can identify cellular conditions and/or TFs that are capable of switching GOI transcription on or off.

The challenge of run-on analysis lies in the need to isolate nuclei from their cellular environment. Alternative methods, which allow the relative quantification of nascent transcripts (freshly produced RNA) in cellulo, have emerged. Initial approaches concentrated on stopping transcription with pharmacological inhibitors of RNA polymerase (such as actinomycin D). Currently, the mainstream approaches include the use of click chemistry to isolate transcripts with incorporated modified nucleotides provided as a pulse to cells (Palozola et al. 2017) as well as those that use fluorescent in situ hybridization (FISH) and advanced imaging to visualize emergent transcripts at the single-cell (or sometimes single-GOI) level (Salataj et al. 2023). Recently, a sophisticated method of quasi-immediate visualization of transcribed genes has been developed in the form of nascent RNA tagging. This method requires the GOI modification (endogenous or exogenously introduced, e.g., on an artificial chromosome) by insertion of an aptamer-like sequence (e.g., MS2 or PP7), which upon transcription folds into a tertiary structure that binds a fluorogenic dye with high affinity (Chubb et al. 2006). This powerful technique can be combined with superresolution and multicolor imaging to provide single-allele resolution of GOI expression (Hocine et al. 2013). However, as mentioned above, modification of cell state is required for this type of observational approach to provide answers regarding potential GOI regulators.

Alternatively, changes in TF activity and transcriptional regulation can be inferred from experimentally measured changes in chromatin accessibility at carefully selected gene loci. For this purpose, Förster resonance energy transfer combined with fluorescence in situ hybridization (FRET-FISH) was recently developed (Mota et al. 2022). This method takes advantage of the sensitivity of FISH and the spatial resolution of FRET. FRET-FISH uses DNA probes labeled with donor and acceptor fluorophores that are designed to bind to specific DNA sequences. In close proximity, energy transfer occurs from the donor to the acceptor, which can be quantified to assess the distance between the two probes. This distance is inversely related to chromatin compaction; a high FRET efficiency indicates close proximity (and thus compacted chromatin), whereas low FRET efficiency indicates a more relaxed chromatin state. FRET-FISH can measure chromatin compaction at selected gene loci, providing insights into the accessibility of a GOI. Chromatin compaction measured by FRET-FISH was shown to correlate strongly with chromatin accessibility data obtained from ATAC-seq (Mota et al. 2022). FRET-FISH can also detect changes in chromatin compaction in response to drug treatments, during different phases of the cell cycle, and as cells age. Beyond chromatin compaction, FRET-FISH could in the future be adapted to study enhancer–promoter interactions and chromatin loop organization with greater resolution compared with straightforward FISH or aptamer-tagging techniques used so far (Nazarova and Sexton 2026), making it a versatile tool in single-cell genomics.

Traditionally, steady-state mRNA levels have been assessed using qPCR or RNA-seq, but these approaches capture both primary and secondary responses and are influenced by RNA stability. To address this limitation, a variety of nascent RNA profiling techniques have been developed that monitor transcription as it occurs. Pioneering work introduced methods such as global run-on sequencing (GRO-seq) (Core and Lis 2008) and precision nuclear run-on sequencing (PRO-seq) (Kwak et al. 2013), which measure the location and activity of engaged RNA polymerase II (Pol II) genome-wide. These approaches provide a high-resolution view of transcription initiation, pausing, and elongation, enabling rapid detection of transcriptional changes following TF perturbation.

Other methods survey Pol II–associated chromatin-bound RNAs, such as chromatin-associated RNA-seq (ChrRNA-seq) (Bell et al. 2018) or transient transcriptome sequencing (TT-seq) (Schwalb et al. 2016), which enrich for nascent transcripts and thereby sharpen the link between TF activity and transcriptional outcome. Together, these nascent RNA methodologies offer a more direct readout of transcriptional regulation, making them particularly powerful for validating whether a TF truly governs expression of a given gene.

Previous Section Next Section

Experimentally validating TFs by measuring GOI transcript levels

Because the first outcome of a TF activating expression from the GOI is transcription, which leads to production of cognate RNA, measuring steady-state RNA levels is an obvious proxy for gene expression. One of the most direct approaches to assess whether a candidate TF regulates a GOI is to perturb TF abundance and measure downstream transcriptional consequences. Several complementary technologies exist for TF depletion, each with distinct strengths and caveats.

RNA interference (RNAi), typically in the form of short interfering RNAs (siRNAs) or short hairpin RNAs (shRNAs), has been widely used to reduce TF levels by promoting degradation of the corresponding mRNA. Although inexpensive and relatively easy to implement, RNAi often results in incomplete knockdown and can suffer from off-target effects, which complicate interpretation of transcriptional changes.

Genome editing approaches, such as CRISPR–Cas9-mediated knockout (Nakamura et al. 2021), enable permanent removal of a TF. Knockouts are powerful for defining essential regulatory roles but can be confounded by compensatory adaptation during cell culture or development, particularly when TFs are essential for cell survival or identity. CRISPR interference (CRISPRi), which uses a catalytically inactive Cas9 fused to a repressor domain, allows reversible transcriptional silencing of the TF gene without DNA cleavage, providing a tunable alternative.

Conditional protein depletion systems have become increasingly popular for dissecting TF function with higher temporal resolution. Degron-based strategies, such as the auxin-inducible degron (AID) (Nishimura et al. 2009), the dTAG system (Nabet et al. 2018), or Halo- (Tovell et al. 2019) and SMASh-tags (Chung et al. 2015), allow acute and reversible depletion of TF proteins in response to a small molecule. These methods are particularly valuable because they bypass transcriptional adaptation and permit rapid TF loss, thereby minimizing indirect or secondary effects. Compared with RNAi or CRISPR knockout, degrons can thus reveal more immediate transcriptional dependencies on the TF of interest.

In practice, combining depletion strategies with the precise transcriptional readouts discussed in this section provides a robust framework for validating direct TF–GOI relationships. Acute depletion approaches, especially degron systems, are now considered state-of-the-art for verifying TF function because they best preserve the causal connection between factor presence and gene regulation.

Northern blotting (NB) is a well-established technique that enables the detection and quantification of RNA transcripts of interest. This method involves separating RNA molecules by size using gel electrophoresis, followed by immobilization onto a membrane for hybridization with a labeled sequence-specific probe (He and Green 2013). Despite its reliability, NB has notable limitations, including low sensitivity, labor-intensive protocol, and the requirement for relatively large amounts of RNA. Importantly, NB does not provide insights into the spatial localization of transcripts or post-transcriptional modifications.

RNase protection assay (RPA) offers a more sensitive approach with the concomitant ability to determine TSSs by using sequence-specific DNA probes together with single-strand-specific nucleases to preserve hybridized RNA fragments while degrading unprotected single-stranded RNAs (Zhao et al. 2020). This method enables precise quantification of RNA abundance and mapping initiation sites, crucial for identifying promoter regions and TF binding sites. However, RPA is technically demanding, requiring careful probe design and optimization to minimize background noise. Additionally, it is less suitable for high-throughput analysis.

Historically microarrays were widely used to quantify transcript levels across the genome by hybridizing fluorescently labeled cDNA to thousands of predefined DNA probes immobilized on a solid surface. Each probe was designed to be complementary to a specific transcript, allowing its abundance in a sample to be inferred from the intensity of the fluorescent signal. However, the reliability of the microarray data was highly dependent on probe design; poorly designed probes could lead to cross-hybridization, in which nontarget transcripts bound to probes with partial sequence similarity, thereby generating false-positive or ambiguous signals.

Because probe sequences had to be predetermined, microarrays could only detect transcripts for which corresponding probes were included on the array. This meant that novel or poorly annotated genes were often completely missed, especially in early versions of microarrays based on incomplete genome annotations. Moreover, differential splicing, overlapping transcripts, or polymorphisms could further complicate interpretation, as probes were not always isoform specific.

Expression profiles generated by microarrays were thus limited to what the array was “designed to see,” making it a hypothesis-limited rather than truly discovery-based method. Additionally, probe performance could vary significantly between transcripts, necessitating careful normalization and often validation by qRT-PCR or NB. Despite these limitations, microarrays enabled high-throughput transcriptomic comparisons across conditions and were instrumental in defining early gene expression signatures in development and disease, whereas they have been largely retired following high-throughput RNA-seq development. However, mining existing databases of microarray studies following candidate perturbation can generate valid hypothesis regarding GOI regulators, but it is critical to keep in mind the limitations of this technique.

RT-qPCR can be employed to validate in silico predictions and determine whether the suspected TF plays a role in the transcription of the GOI. Total mRNA is first reverse-transcribed into complementary DNA (cDNA); this cDNA template is used for the quantitative PCR or real-time PCR reaction (qRT-PCR). Specific target fluorescent DNA-binding probes or fluorescent dsDNA-binding dyes (nonspecific used with target-specific primers) are incorporated during the PCR reaction, so that the increase in emitted fluorescence proportionally correlates with the GOI transcript quantity (Nolan et al. 2006; Green and Sambrook 2018).

Digital droplet PCR (ddPCR) is a highly sensitive method used to precisely quantify transcripts by partitioning a PCR reaction into thousands of nanoliter-sized droplets, each functioning as an individual PCR microreactor. Similar to qRT-PCR, total RNA is first reverse-transcribed into cDNA, which is mixed with primers, a fluorescent probe, and PCR reagents before being emulsified into droplets. Following thermal cycling, each droplet is analyzed to determine whether the target sequence was successfully amplified within that droplet. The absolute number of target molecules is calculated using Poisson statistics, bypassing the need for standard curves and making ddPCR advantageous for low-abundance targets or small fold-changes that may fall below the detection limits of qRT-PCR. Because each amplification occurs in isolation, ddPCR is less susceptible to primer–dimer formation or amplification bias caused by reaction kinetics.

RNA-seq leverages high-throughput sequencing to capture the entire transcriptome, without requiring predesigned probes (Wang et al. 2009; McGettigan 2013). Consequently, this method is intended for studying cell-wide impact of experimental conditions of interest rather than to validate, or identify, direct regulators of a GOI. However, as described above, databases collating RNA-seq data can be very useful in predicting potential regulators of a GOI as they allow levels of the GOI transcript to be correlated with other genes, which can be combined with data obtained from open-question experiments that map protein–DNA landscape.

All of the above-listed transcript quantification methods work by isolating RNAs, usually from large cell populations. Consequently, the spatial resolution of gene expression regulation is lost. Spatial resolution can be studied by microscopic imaging techniques. Recent advancements in both staining and imaging technologies have enhanced our ability to visualize gene activity at unprecedented resolution and through various modalities. Although imaging is not generally used to formulate hypotheses about what regulates your GOI, it is indispensable for validating potential regulators in a physiologically relevant context of the cell or organism. Specifically, fluorescence microscopy has emerged as a leading technique for studying gene regulation owing to its unique capabilities that allow researchers to visualize and analyze gene expression in real time at high resolution (Hickey et al. 2021).

Fluorescence microscopy enables the observation of gene expression at the single-cell level, or even single-molecule level with superresolution, providing detailed spatial information about gene activity within cells and tissues. This is crucial for understanding the dynamics of gene regulation during cell growth, differentiation, and responses to stimuli (Zou and Bai 2019). When staining protocols do not require cells to be fixed, fluorescence microscopy imaging is not destructive, allowing researchers to observe live cells without significant perturbation. This is vital when studying dynamic biological processes in real-time, as it minimizes the impact on the system being studied. Techniques such as light-sheet microscopy or spinning disk confocal microscopy further enhance this advantage by reducing photo damage, enabling longer observation times while maintaining high image quality (Eismann et al. 2020). FISH is a powerful imaging-based technique that provides spatial resolution by mapping RNA localization within cells and tissues. It uses fluorescently labeled probes that hybridize to target RNA sequences, enabling detection in a cellular context, down to the level of a single molecule when combined with modern imaging techniques (Young et al. 2020). Despite its limited multiplexing capability owing to the small number of available fluorescence detection channels, imaging is crucial for distinguishing regulatory mechanisms at a GOI that are differentially regulated in different cell types in a complex tissue. A key highlight of FISH is that it provides spatial resolution and the ability to focus on specific transcripts, dissecting transcriptional regulation from potential post-transcriptional regulation mechanisms.

Multiplex hybridization techniques, such as QuantiGene, further enhance RNA detection through a branched DNA (bDNA) system enabling the detection of low-abundance mRNA molecules. Instead of amplifying the target RNA, this system amplifies the detection signal, allowing for the direct quantification of RNA transcripts with exceptional accuracy (Lombardelli et al. 2025). Multiplex hybridization can be followed by quantification of isolated RNA in a tube or in cellulo by a FISH-like approach.

Lately, because of the advances in cellular engineering, it became possible to visualize mRNA molecules in living cells by tagging them with genetically encoded fluorescent RNAs, RNA aptamers that can bind fluorogenic substrates, allowing GOI expression to be monitored in living cells. These aptamers, usually denoted by developers with plant-derived names for easier recognition (Spinach, Broccoli, and Corn in the first generation; Pepper and Clivia in the second), have revolutionized RNA imaging techniques in living cells (Bereiter and Micura 2025). These tools can be used to quantify mRNA levels in individual cells, as well as imaging genomic loci using CRISPR display, real-time tracking of protein–RNA interactions, and superresolution imaging.

Other recent advances include techniques such as proximity ligation of RNA (PLAYR), which allows simultaneous quantification of GOI mRNA and the resulting protein at a single-cell resolution. Unlike other techniques such as RNA-seq and FISH, PLAYR avoids the loss of spatial and contextual information (Duckworth et al. 2019), offering an integrated view of gene and protein expression. PLAYR's compatibility with flow cytometry, mass cytometry, and imaging allows for scalable and high-resolution analysis of regulatory networks in a single assay by using in situ and proximal ligation of target-specific PLAYR probes. These predesigned pairs of hybridization probes bind in close proximity on the transcript of interest. The positioning of the probe pairs creates a platform for the DNA oligonucleotide insert and backbone to anneal. The insert and backbone DNA is then continuously synthesized through rolling circle amplification (RCA) by phi29 polymerase, amplifying the target sequence. Specific detection probes bind to the amplified insert DNA, allowing visualization of transcript-specific PLAYR signal detected via fluorescently labeled oligonucleotides (flow cytometry) or metal tags (mass cytometry) as small punctate dots (Duckworth et al. 2024). Alongside RNA detection, PLAYR also includes antibody staining to visualize GOI protein within cells. Furthermore, designing hybridization pairs to target both spliced and prespliced mRNA expands PLAYR's scope to not only quantify RNA expression but also study RNA dynamics and spatial arrangements during transcription and translation. This spatial and temporal insight into RNA and protein interactions is crucial for understanding GOI regulation in its native cellular context.

However, PLAYR has notable limitations. Although the design of hybridization probe pairs minimizes off-target effects, it increases the overall length of the probes. This poses a challenge for studying small genes, as there may be insufficient space for the required four pairs of hybridization probes to bind effectively (Duckworth et al. 2019). Furthermore, the RCA process introduces potential biases, particularly in signal quantification of expressed genes. These biases may lead to over- or underestimation of RNA abundance, limiting the accuracy in detecting subtle changes. Most importantly, similarly to other methods discussed in this section, PLAYR is an observational method. Although it provides valuable descriptive data, such as RNA and protein localization and abundance, it does not directly establish causation between a regulator and a GOI. To address this, complementary approaches like CRISPR and siRNA libraries need to be used to determine and/ or validate potential regulatory factors of a GOI.

Critically, evaluating changes in transcript abundance upon changes to candidate TF levels using RNA-centric approaches does not clarify whether the TF regulates gene expression directly or indirectly. These approaches also cannot identify the specific regulatory site or the mechanism of action for the TF. To identify the regulators of a GOI, it is crucial to consider all layers of gene regulation. A change in GOI transcript levels can be caused at the level of transcription by TFs and chromatin modifiers. However, GOI transcript levels can also be regulated post-transcriptionally by RNA-binding proteins, as well as microRNAs, and translational modifications or PTMs. Understanding these mechanisms requires tools that can simultaneously capture RNA expression and protein interactions, as these components work together to regulate gene expression dynamically. To gain an understanding of how these factors modulate the GOI, additional experiments discussed later in this article are necessary.

Previous Section Next Section

Experimentally validating TFs by measuring GOI protein product

Reporter assays

Expression from the GOI can be measured by quantifying the levels of proteins regulated by GOI promoter. To allow for high-throughput analysis, reporter assays integrate GOI promoters with proteins that are easy to detect and quantify such as luciferase. The typical approach utilizes two luciferase reporters, allowing simultaneous measurement of GOI transcription alongside a control, but other types of reporter genes may be used both for the GOI or reference promoter. The most commonly used luciferase enzymes include those from firefly (FLuc or more commonly simply Luc), Renilla (RLuc), Cypridina (CLuc), Gaussia (GLuc), or Oplophorus (NanoLuc) (Shifera and Hardin 2010; England et al. 2016). The disadvantage of FLuc and RLuc is the requirement for cell lysis to measure transcriptional activity. Secreted CLuc and GLuc reporters allow cells to remain intact so they can be used in further assays or for long-term monitoring (Wu et al. 2007). NanoLuc is the most versatile system owing to the high sensitivity and low molecular size of the enzyme as well as high stability of the substrate.

Although reporter assays are valuable tools for investigating transcriptional regulation, they have notable limitations. These assays measure the reporter gene's expression controlled by predicted regulatory elements of a GOI. In this artificial expression system, enhancers and/or promoters are inserted into a vector in which DNA methylation is minimal and higher-order DNA structures are absent; consequently, the promoter is always accessible to the transcription machinery. Therefore, although vector-driven reporter assays offer useful insights into how TF could regulate GOI expression (e.g., by using mutated vectors), they do not reflect the complexity of gene regulation in the genomic context. To address this, tags can also be incorporated through knock-in strategies. This allows for endogenous tagging of a GOI, or candidate TFs, a technique that preserves the native regulatory elements. Fluorescent protein (FP) knock-ins can be used for imaging of GOI expression dynamics, observation of TF binding, and chromatin remodeling (Chen et al. 2018; Lionnet and Wu 2021; Viushkov et al. 2022).

Similarly to methods discussed above, luciferase assays simply quantify expression from GOI promoter. Once a list of candidate TF regulators is compiled, we can assess their potential impact on the regulatory element of the GOI through reporter assays. This approach will quantify the effect of a TF on the activation of the GOI promoter and transcription of the reporter gene. However, to evaluate candidate TF impact on GOI expression, reporter assays have to be combined with approaches that alter TF levels. We can cotransfect (1) vectors carrying a control reporter, (2) vectors carrying the experimental reporter, along with (3) a vector encoding the candidate TF. Alternatively, comparison can be made between reporter gene expression in WT cells versus cells that underwent candidate TF depletion.

In vivo imaging of reporter genes

In vivo bioluminescence imaging (BLI), just like the typical gene reporter assays described previously, captures light produced by a chemical reaction catalyzed by an enzyme (luciferase) and its substrate (luciferin, which is injected prior to imaging). Unlike fluorescence imaging, BLI does not rely on external light sources, which results in minimal background noise and reduces the risk of phototoxicity. Although the amount of light generated by a bioluminescent reaction is usually too faint to be seen by the naked eye, highly sensitive charge-coupled-device (CCD) cameras can detect and capture this light from tissues or organs in small animals like mice (Keyaerts et al. 2012). In vivo reporter assays use transgenic animals and genome editing techniques, allowing GOI expression reporters to be localized to specific tissues or cellular processes (Yan et al. 2013; Kimura et al. 2014; Serganova and Blasberg 2019). Although technically challenging and costly, this approach provides several advantages over simple cellular models, making it the ultimate choice for validation of GOI regulators (Kvon 2015). Transgenic animal models maintain the complexity of whole organisms, including tissue architecture, systemic responses, and response to the environmental influences. This allows us to study GOI expression regulation in the context of natural biological conditions (Li et al. 2018b). Moreover, in vivo reporter assays enable the assessment of GOI activity across various tissues and organs simultaneously, which is particularly useful to understand how GOI is regulated under different physiological conditions (Maguire et al. 2013). Finally, because reporters used for in vivo studies are designed to be imaged by noninvasive techniques, changes in gene expression can be monitored dynamically over time (Buckley et al. 2015; He et al. 2019).

Detecting bioluminescence beyond a few centimeters remains challenging, as light signals must travel through tissue that absorbs, attenuates, and scatters their emissions (Curtis et al. 2011). Another common shortcoming of BLI assays is the hardship of standardization. The signal depends on many components like the concentration and stability/availability of luciferin, luciferase, and cofactors. Factors like oxygen levels, pH, temperature, and the choice of substrate and administration method affect the BLI signal in vivo. Standardization and validation are crucial for accurate data interpretation, and continuous technological advancements help in overcoming some of these challenges (Badr 2014). As BLI becomes more widely used in biological research, new luciferase genes and their mutant variants are being developed, alongside structural modifications to both the luciferases and their substrates (Xu et al. 2016; Mezzanotte et al. 2017; Saito-Moriya et al. 2021). The aim is to increase photon emission, expand the range of emission wavelengths for multispectral imaging, and shift emissions to longer wavelengths (>620 nm) to improve tissue penetration in small-animal models.

Alternatively to BLI, in vivo fluorescent imaging (FLI) can be used. It utilizes FPs, probes or ligands, which emit light upon excitation by an external light source. FLI stands out among other imaging methods owing to its minimal invasiveness, ability for real-time and multiplexed imaging, and relatively low cost. Because of the high number of photons generated, FLI typically requires very low concentrations of fluorophores (pico- to femtomolar) to produce high-contrast images, which helps lower both the cost of probe production and the risk of toxicity (Refaat et al. 2022). However, autofluorescence from tissues can interfere with the signal, reducing both contrast and accuracy. Additionally, photobleaching can occur with prolonged light exposure, causing fluorescent molecules to lose their ability to emit light and compromising image quality over time, which limits the potential for long-term kinetic studies. Furthermore, similarly to BLI, the main limitation of FLI is tissue penetration, as photon absorption and scattering reduce visible light intensity by about 10-fold for every centimeter of tissue (Sadikot and Blackwell 2005). To overcome this, advances have been made to develop reporters emitting in near-infrared (NIR) (Richie et al. 2017; Ding et al. 2018). At NIR wavelengths (650–1700 nm), light is absorbed and scattered by biological tissues to a lesser extent, whereas background autofluorescence decreases and is virtually nonexistent in the 1500- to 1700-nm spectral band (Diao et al. 2015).

In addition to the most commonly used BLI and FLI reporters, deep-tissue reporters have been developed to make use of medical imaging techniques. Some efforts have also been made to develop reporters perceptible with techniques better suited for bigger animals. These include PET reporters (Nerella et al. 2023), SPECT reporters (Wu et al. 2013), and MRI contrast reporters (Cheng et al. 2017; Concilio et al. 2021). Among various types of reporter genes, each has its own set of advantages and limitations depending on the imaging technique used. Fused reporter genes or single reporter genes detectable by multiple modalities can help overcome drawbacks while enhancing the benefits of each imaging method (Li et al. 2018a). These multimodal reporter genes include dual FP reporters (Camino et al. 2020), dual fluorescence and luminescence reporters (Choi et al. 2016), dual luminescence and radionuclide reporters (Gaspar et al. 2024), dual luminescence and MRI reporters (Rossi et al. 2016), or dual radionuclide and MRI reporters (Patrick et al. 2014).

Functional assays

Unlike gene reporter assays, which often involve introducing “naked” DNA constructs with reporter genes under control of artificial promoters, functional assays assess gene regulation within the natural chromatin environment. As a result, functional assays can provide an understanding of how gene regulation impacts biological processes, offering insights that are closely aligned with the gene's natural regulatory mechanisms. By combining these assays with tools like CRISPR-Cas9 or rapid protein depletion, researchers can directly manipulate gene expression, observing how these changes influence protein function, cell behavior, or organismal traits in a context that mirrors biological conditions. These insights are invaluable for understanding the mechanisms of gene regulation and for validating computational data experimentally.

One key challenge of functional assays is that they measure the outcome of gene regulation, such as protein levels, enzyme activity, or cellular behavior, without identifying the specific regulatory level at which changes occur. For example, an increase in protein expression observed through a functional assay might be a result of enhanced transcription, but it is just as likely to be caused by increased mRNA stability or by more efficient translation; however, the assay itself will not distinguish between these possibilities. This ambiguity makes it impossible to determine if observed changes are caused by direct regulation of GOI expression.

Measuring the protein product

One indirect way of assessing changes in GOI expression is by measuring the levels of protein product encoded by the GOI upon changes in candidate TF levels. Assays that measure protein quantity and /or activity include western blotting, ELISA, enzyme activity assays, and receptor-binding assays. For example, ELISA quantifies protein expression by detecting specific antigens. Enzyme activity assays indirectly assess how gene regulation impacts the amount of enzyme capable of catalytic function (Vimalraj 2020); however, this relies on a simplistic assumption that there is a direct correlation between the amount of the enzyme and its activity level, which is not always true, for example, when excess inhibitors might be present. Similarly, receptor binding assays examine if GOI-regulated receptor expression is changed upon TF increase/decrease. These assays collectively offer a “quick and dirty” method to validate candidate TFs, although it is critical to remember that changes in protein quantity or activity might not be a direct result of GOI expression. It is just as likely that processes such as splicing, translation, protein stability, PTM, or chaperone levels have been affected causing a change in the readout levels.

Measuring GOI protein levels using fluorescent tagging

Tagging the GOI with FPs is another method to track GOI expression, as well as to follow TF-driven regulatory events. Time-lapse imaging enhances this approach by enabling collection of data over extended periods, facilitating comprehensive analysis of gene expression luminesce fluctuations (Liu and Tjian 2018). In the past two decades hundreds of new FPs covering the visible light spectrum have been engineered and become indispensable for the visualization and quantification of gene expression (Specht et al. 2017; Lukyanov 2022). Moreover, many new variants of FPs (photoactivatable, photoswitchable, reversibly photoswitchable) use advanced imaging techniques and allow time-dependent analysis and real-time tracking of transcriptional processes in single cells (Wu et al. 2011). As a result, a variety of computational techniques have emerged to quantitatively reconstruct promoter dynamics from time-lapse measurements (Lichten et al. 2014; Zulkower et al. 2015; Kannan et al. 2018), lately even taking into account FP maturation dynamics (Pavlou et al. 2022). Another way of taking advantage of distinct kinetics of fluorophore maturation in FPs is the use of tandem FP timers (tFTs), which are versatile reporters of protein dynamics (Khmelinskii and Knop 2014). When combined with high-throughput genome editing, tFTs can be used to identify regulators of GOI that differently impact the rate of protein synthesis, maturation, and degradation (Fung et al. 2022).

Cell-based assays

Cell-based assays can also be used as a “quick and dirty” way to validate the impact of candidate TF on the GOI expression, providing that GOI is a regulator of cell division, cell death, metabolism, or another easy-to-measure process. For GOIs that regulate cellular proliferation, assays such as colony formation, cell counting, or cell cycle profiling can be used upon candidate TF level manipulation. Similarly, migration assays, including chamber-based methods, evaluate cellular motility and can be used to validate TFs of GOIs that control these processes. By integrating cell-based assays with other molecular methods like reporter assays, researchers can try to define the effect of candidate TFs on GOIs in a quick, technically easy, and cost-effective way.

To summarize, in vitro and in vivo reporter assays as well as fluorescent tagging of endogenous GOI loci or GOI product detection allow us to quantitate expression of the GOI; however, these methods alone are not able to identify, or even validate, GOI regulators. Perturbation of candidate TF levels is required to make these quantitative observations informative.

Previous Section Next Section

Experimentally validating if candidate TF binds to the GOI

To identify direct regulators of GOI expression, it is critical to confirm their ability to bind to the GOI. Usually a candidate TF regulator of a GOI is identified in silico first, but this relationship has to be validated by confirming the binding between the TF and the GOI regulator DNA sequence. Methods outlined in this section targeted to specific proteins work well for validating regulators of the gene in question, rather than for the discovery of novel GOI regulators, as they rely on study-specific fusion proteins.

The basic method for identifying DNA–protein binding in the molecular geneticist's repertoire has already been mentioned: supershift EMSA, which analyses the capacity of individual TFs extracted from cells’ nuclear lysate to bind to a labeled DNA probe. EMSA's fundamental limitation is the reliance on naked DNA, lacking all chromatin context. To overcome this, experiments can be coupled with targeted ChIP, which is currently the entry-level golden standard in supporting direct TF-GOI binding (Kim and Dekker 2018). Chromatin is reversibly cross-linked with formaldehyde, sheared, and immunoprecipitated with antibodies against the TF to be tested. The precipitate is de-cross-linked, and the abundance of DNA derived from regulatory sequences that were bound to the TF in question, as a measure of its relative occupancy of the response elements in the cell population, is assayed. TF-bound DNA can be quantified by simple qPCR, with nonregulatory sequences known not to bind the TF, or regulatory sequences in which the binding of investigated TF is known not to be influenced by the treatment in question, used for reference. The main drawbacks of this technique are the need for large cell numbers, as well as artifacts related to antibody quality, chromatin fixation, and shearing, combined with risk of false-negative results when an incorrect location in the gene promoter/enhancer is selected for detection.

Alternatively, DNase I footprinting offers a classical biochemical approach to map exact protein–DNA interaction sites at base pair resolution. Radiolabeled or fluorescently labeled DNA is incubated with a purified DNA-binding protein or nuclear extract and then subjected to limited digestion by DNase I. Regions of the DNA protected from cleavage by bound proteins are visualized as “footprints” compared with a control digestion in the absence of protein. This method provides detailed structural information about binding sites and can distinguish between specific and nonspecific DNA–protein interactions. However, it is technically demanding and low-throughput and requires relatively large amounts of purified protein and clean DNA, limiting its use.

When reliable and quantitative data are a priority, rather than the cellular context, binding between the candidate TF and GOI regulatory sequence can be assayed by direct biophysical techniques. These approached require high-quality purified TFs but give definite answers on a TF's absolute ability to bind a response element and the kinetics of such binding. Although spectroscopic techniques detecting conformation change, such as circular dichroism, have been successfully applied for this task, the current technique of choice is surface plasmon resonance (SPR), in which one interacting partner (DNA or protein) is immobilized on the surface of a flow cell, and this surface is kinetically exposed to a solution of the other partner. Upon binding, the average molecular mass of complexes at the surface increases, changing the properties of plasmons (energy vibrations induced in interface matter by light upon total internal reflection) and allowing optical detection by monitoring the attenuated total internal reflection angle. This powerful technique has been successfully applied to detect weak, transient TF–DNA interactions or binding facilitated by other interaction partners (e.g., coactivators) (Moyroud et al. 2009; Aditham et al. 2021).

Microscopic imaging of TF–GOI interactions

When identification of physiologically occurring TF–GOI interaction is of paramount importance, imaging techniques are the go-to approach (Lee and Myong 2021). Their basic application can be divided to simple colocalization or proximity based. For colocalization analysis, binding partners, which are usually the candidate TF and other proteins already known to bind to the characterized GOI promoter sequence (stained, e.g., by DNA-FISH), are fluorescently labeled (e.g., by immunofluorescence) and visualized at highest possible resolution to determine the proportion of pixels when both co-occur. Similarly to functional assays, these simple interaction-detection approaches are an easy and cheap way of verifying GOI candidate regulators (Chaumeil et al. 2013; Gilbonio et al. 2023). A positive result does not confirm binding but is a strong indication of a possible interaction especially when superresolution microscopy is employed (Dai and Dai 2012). Unfortunately, these techniques suffer from drawbacks including poor signal-to-background ratio and limited specificity and spatial resolution. Furthermore, because high-throughput widefield fluorescence microscopy images cannot resolve crowded molecular targets, the amount of extractable biological information is often limited. One way to overcome some of these limitations is to apply advanced computational methodologies of image deconvolution (Wernersson et al. 2024). However, simply localizing DNA and protein in one place is not enough to prove regulatory mechanisms as elegantly outlined by the Hahn group (Mahendrawada et al. 2025). To quantify DNA–TF interactions based on molecular distance Foerster resonance energy transfer (FRET) between two fluorophore molecules attached to interacting partners, DNA–TF, can be detected by fluorescence microscopy (Chen et al. 2021b; Wang et al. 2022). Alternatively, in bioluminescence resonance energy transfer (BRET), the energy donor is not a fluorophore but a light-emitting enzyme, for example, luciferase (Vickers and Crooke 2016).

High-resolution imaging of TF–GOI interactions

Superresolution fluorescence microscopy (SRM) technologies developed over the past two decades have pushed the resolution limit of fluorescently labeled molecules into the nanometer range and revolutionized our understanding of gene regulation by providing unprecedented insights into the spatial organization and dynamics of chromatin and transcriptional machinery at the nanoscale (Xu et al. 2020; Parteka-Tojek et al. 2022). These advanced imaging techniques enhance resolution by refining illumination and detection strategies, including methods like stimulated emission depletion (STED) microscopy, structured illumination microscopy (SIM), and single-molecule localization microscopy (SMLM) approaches such as PALM and STORM (Birk 2019). The ability to directly detect individual fluorescent molecules in living cells enhanced our comprehension of how gene expression is regulated within the cellular environment and with close-to-physiological cellular context (Uphoff 2016). High-throughput superresolution imaging can now reveal 3D chromatin organization, resolving prior uncertainties, challenging existing models of regulatory specificity, and suggesting new roles for chromatin structure in transcription (Brandão et al. 2021; Hafner and Boettiger 2023). Although SRM techniques usually rely on fluorescent labeling of proteins of interest (like TFs), an interesting alternative is inverse imaging of unlabeled proteins bound to DNA. This technique relies on DNA-binding fluorophores that transiently label bare but not protein-bound DNA, making it possible to observe unlabeled protein patches bound to isolated strands of DNA with a resolution limit estimated between 5 and 15 nm in size (Meijering et al. 2020). Despite its powerful capabilities, SRM requires expensive and time-consuming sample preparation, robust imaging protocols, and access to advanced microscopic equipment and software to ensure accurate and reproducible results (Chen et al. 2024a).

Alternatively, expansion microscopy (ExM) enhances the resolution of light microscopy by physically enlarging biological samples, allowing for detailed studies of gene regulation at the nanoscale. ExM employs a hydrogel embedding process in which fixed and immunolabeled cells are treated to swell isotropically, effectively increasing the distance between labeled molecules. This expansion allows conventional microscopes to resolve structures that were previously below the diffraction limit of light microscopy, achieving lateral resolutions of ∼70–80 nm (Wen et al. 2023).

ExM has been used to map epigenetic interactions (Acke et al. 2022), investigate the nanoscale organization of chromatin during transcription (Pownall et al. 2023), analyze nuclear structures in cultured cells, and facilitate the examination of spatial relationships between proteins involved in gene regulation at a subnuclear scale (Holsapple et al. 2023; Mäntylä et al. 2023). This allowed researchers to observe interactions between different nuclear components and their roles in gene expression (Faulkner et al. 2022). Although ExM provides remarkable insights, it also presents significant challenges. The process can be time-consuming and requires careful optimization of hydrogel chemistry and sample preparation to ensure isotropic expansion and minimize artifacts. Variations in expansion factors across different cellular structures can complicate data interpretation, necessitating rigorous controls and validation (Gaudreau-Lapierre et al. 2021).

In summary, although useful in many cases, one major drawback of using visualization-based approaches is that they require prior sample fixation and thus necessitate mindful planning of experimental data points. Furthermore, as discussed above, chromatin accessibility and DNA–protein colocalization are simply prerequisites for binding and expression regulation.

DNA adenine methyltransferase identification

DNA adenine methylase identification (DamID) can be used to validate, or identify, the DNA binding sites of candidate proteins and can be seen as an alternative to ChIP (Fig. 2A,B). A fusion product made up of the candidate TF and an Escherichia coli DNA adenine methyltransferase (Dam) is generated. When candidate TF binds to the DNA, the attached Dam methylates nearby adenines at their N6 positions in dsDNA within GATC sequences (Barras and Marinus 1989). Because adenine methylation does not occur naturally in most eukaryotes, this provides a unique tagging system, marking the binding sites of candidate TF. After expression of the Dam fusion protein, methylated adenine regions are cleaved by DpnI, which selectively recognizes G^mATC sites (van Steensel and Henikoff 2000), and amplified using a methyl-specific PCR reaction (Greil et al. 2003). Amplified fragments of DNA that were bound by candidate TF can be detected using qPCR, Southern blotting (Greil et al. 2006), or DamID-seq (Wu et al. 2016). Because DamID uses cloning resulting in expression of the fusion protein inside a living cell, no cross-linking is required, reducing the risk of artifacts.

View larger version:

Download as PowerPoint Slide

Figure 2.

Comparison between select protein focused protein–DNA interaction analysis methods. (A) DamID (Greil et al. 2006; Aughey and Southall 2016), ChEC (Schmid et al. 2004), and CUT & Tag (Kaya-Okur et al. 2019) workflows following candidate protein identification. (B) DamID-sequencing. DamID-fusion with candidate TF binds to DNA and proceeds to methylate nearby adenines at the N6 position within GTAC motifs. Following DNA isolation, DpnI selectively cleaves DNA at GmATC positions. Cleaved DNA is ligated to primers for PCR amplification. After sufficient amplification, DNA is sent for sequencing. (C) ChEC-sequencing. MNase-candidate TF fusion binds to DNA. Cells are permeabilized, and Ca²⁺ is added, causing MNase to cleave nearby DNA. Cleaved DNA is isolated and ligated to primers for PCR amplification. After sufficient amplification, DNA is sent for sequencing. (D) CUT&Tag. Cells are isolated and permeabilized, with primary antibody added to bind to target protein and secondary antibody added to increase yields. A Protein-A–Tn5 fusion protein is added, binding to the secondary antibody. Mg²⁺ is added to activate the Tn5. Tn5 cleaves adjacent DNA and ligates sequencing adapters to DNA. SDS-PAGE is added to the reaction to release DNA fragments, ready for sequencing.

Because of methylation-based detection, DamID lacks precision; the spread of methylation may be broader than the binding site, and unbound fusion protein will inevitably diffuse through the nucleus and, because of Dam's high affinity for GATC sequences, produce considerable levels of background methylation (Greil et al. 2006). Another limitation is that chromatin structure can restrict access of the Dam–TF fusion protein to certain regions of the DNA, leading to potentially incomplete or biased mapping of binding sites; however, this improves biological relevance (Gottschling 1992). A further drawback of this method is that it is dependent on the availability, as well as the frequency, of GATC sequences within the GOI; Dam methylation can occur up to ∼5 kb from a binding site (Greil et al. 2006). However, by generating a fusion protein of candidate TF and comparing it to nonspecific Dam fusion, it allows experimental validation of TF binding within the GOI regulatory sequences. It is important to point out that candidate protein is introduced artificially as a fusion. Furthermore, just like ChIP-based methods, DamID characterizes DNA binding profile of a TF of interest and hence cannot help in predicting regulators of a GOI if these are not known.

Chromatin endogenous cleavage

Chromatin endogenous cleavage (ChEC) (Schmid et al. 2004) is another ChIP alternative; it involves generating a fusion protein, consisting of the candidate TF and a C-terminal micrococcal calcium–dependent endonuclease (MNase) (Fig. 2C). When candidate TF binds to its target sites on the DNA, MNase is activated through the artificial increase of calcium ion concentration. Once activated, MNase cleaves DNA nearby to the binding site of the candidate TF (Babl et al. 2015). Cleaved DNA fragments can be identified at specific, single loci, through the use of Southern blotting and indirect end-labeling (Schmid et al. 2004), although these methods have been surpassed by newer developments. More recently, ChEC-seq has been developed to incorporate high-throughput sequencing (Zentner et al. 2015), DNA fragments are isolated through negative selection using size-selecting magnetic beads; DNA ends are repaired and subsequently ligated to Illumina TruSeq adapters for sequencing. Additionally, ChEC can be used to provide data on the frequency of the protein–DNA interactions; the degree of DNA degradation by the MNase fusion protein can be monitored in time course experiments (Merz et al. 2008). Similarly to DamID, thanks to the production of a fusion protein, cross-linking is not required, reducing the risk of artifacts.

However, when highly abundant proteins are studied, MNase has the potential to produce excessive cleavage, which makes it more challenging to confirm specific binding events. It is important to consider that observed cleavage may not represent direct binding of the protein to specific DNA sequences, as some cleavage may occur owing to spatial proximity rather than direct interactions. However, background signal caused by nonspecific cleavage can be addressed through the recently developed bioinformatic filtering, DoubleChEC, which produces robust identification of TF binding motifs and target genes (VanBelzen et al. 2024). Another drawback of the original ChEC-seq is that proteins that sparsely interact with DNA yield relatively few DNA fragments. However, this has been addressed in ChEC-seq2 (VanBelzen et al. 2024), in which free ends of the genomic fragments are marked through ligation to a custom adapter, which is followed by Tn5 transposase-mediated library construction. These fragments are then specifically amplified with the Nextera index primers, resulting in a DNA library compatible with Illumina sequencing. Similarly to DamID, a candidate protein is introduced artificially as a fusion.

Cleavage under targets and tagmentation

Cleavage under targets and tagmentation (CUT&Tag) is based upon the same immunotethering principles as ChIP, which are described above (Kaya-Okur et al. 2019). However, in CUT&Tag, after antibody incubation, instead of immunoprecipitation, chromatin shearing and library preparation are performed (Fig. 2D).

First, cells are isolated and immobilized on magnetic beads, permeabilized, and treated with an antibody-specific to the candidate TF, along with a secondary antibody to amplify the assay yields. A fusion protein made up of Protein A–Tn5 (transposase), which binds to the antibody-labeled chromatin, is added to the reaction. The Tn5 comes preloaded with sequencing adapters for high-throughput sequencing. Mg²⁺ is added to the reaction to activate Tn5, which cleaves the adjacent chromatin and ligates the sequencing adapters to the DNA. SDS buffer is added to the solution, releasing the tagmented DNA and allowing for it to be sequenced (Kaya-Okur et al. 2019).

CUT&TAG is simpler and cheaper than ChIP-seq; all the steps are compatible with a single test tube workflow and take just 2 days. Costs are lower because there is no separate library preparation step (Kaya-Okur et al. 2019; Henikoff et al. 2020). Critically, CUT&Tag works with very low cell numbers and even single cells (Kaya-Okur et al. 2019).

However, CUT&Tag can give high background owing to Tn5 binding open chromatin and/or mitochondrial DNA. Nonspecific noise caused by mitochondrial DNA binding can be avoided by using nuclei rather than whole cells. CUT&Tag requires high-salt washes to remove nonspecific or unbound fusion-protein before tagmentation to help reduce background noise; however, these high-salt washes may impact weaker chromatin interactions, potentially affecting biological relevance of the data when mapping proteins that do not bind to the DNA as strongly. It is important to consider that, similarly to ChIP-seq, DamID, ChEC, and CUT&Tag are much better suited for identifying genes bound to a protein of interest rather than for validating candidate protein binding to a GOI. These methods involve significant bioinformatic analysis of a large number of sequences identified for each protein of interest, and this richness of information is simply not needed to confirm if protein X binds to a single GOI (Fig. 2A; Table 1).

View this table:

Table 1.

Comparisons of ChIP, DamID, ChEC, and CUT&Tag

When assessing whether a TF regulates a GOI using data sets, genome-wide data offer key advantages over single-locus measurements. Techniques such as ChIP-qPCR or reporter assays can provide sensitive, targeted information, but they are inherently limited by primer design, locus selection, and assumptions in enrichment calculations. In contrast, genome-wide approaches such as ChIP-seq, DamID-seq, ChEC-seq CUT&RUN, or CUT&Tag generate comprehensive profiles of TF binding that can be rigorously normalized and benchmarked across positive and negative controls. These data sets also allow for the discovery of enriched DNA motifs and cobinding partners, providing broader biological context that single measurements cannot capture. Importantly, genome-wide assays make it possible to distinguish genuine TF binding events from technical artifacts and to evaluate binding specificity across the genome. Thus, although locus-specific assays remain useful for validation, genome-wide strategies provide a more robust and controlled framework for inferring direct TF–gene regulatory relationships.

In parallel with experimental approaches, computational prediction of TF targets using deep learning is rapidly gaining traction (Choi et al. 2025). Models can infer regulatory relationships directly from DNA sequence, epigenomic features, or large-scale perturbation data sets such as Perturb-seq (Dixit et al. 2016), offering the potential to nominate candidate TF–gene interactions at scale. Although these predictions are powerful, they should be interpreted cautiously: Model accuracy depends on training data quality, and predicted interactions require experimental validation. With the rise of large language models (LLMs) and other AI approaches, the use of in silico predictions is likely to proliferate, but they should complement, not replace, empirical measurements when establishing causal TF–gene relationships.

How to make an ideal cell-based reporter assay

An ideal gene regulation assay consists of two components: an effective and well-researched TF or treatment and a reliable, consistent detection method. In the simplest terms, the yeast one-hybrid (Y1H) assay provides a genetic approach to identify TFs that bind to a specific DNA sequence in vivo. The DNA bait, GOI promoter, is integrated into the yeast genome upstream of a reporter gene. Yeast is then transformed with libraries expressing potential TFs fused to a transcriptional activation domain. Binding of a factor to the bait DNA drives reporter expression, revealing functional interactions. Y1H is especially useful for screening large cDNA libraries to uncover novel regulators. However, its utility is constrained by the heterologous yeast system, which may not recapitulate all aspects of human chromatin structure or protein folding, and by the possibility that certain interactions require cofactors absent in yeast. Additionally, as with all artificial systems, false positives and negatives are a known limitation, necessitating downstream validation in the native cellular context. However, this approach can be improved through the use of CRISPR-Cas9 technology, which enables the incorporation of gene reporters directly into the human genome. By knock-in, a reporter tag can be attached to a specific gene and expressed stably, thus creating a cell line with a permanent reporter expression (Fig. 3A; Damhofer et al. 2021; Tamura and Kamiyama 2023; Deleuze et al. 2024). Contrary to traditional plasmid systems, CRISPR knock-in preserves a GOI's native promoter, enhancers, and chromatin context. Real-time changes in GOI expression can be observed in response to stimuli, TFs or chromatin modifications, directly identifying key regulators.

View larger version:

Download as PowerPoint Slide

Figure 3.

Generating a reporter cell line of the endogenous locus. (A) Using CRISPR-Cas9 to tag a gene of interest (GOI) with a fluorescent tag at the C or N terminus. (B) Following transfection of the CRISPR-Cas9 plasmids and selection of the tagged GOI cell line, GOI expression from its endogenous locus can be quantified by measuring fluorescent signal.

This method has been commonly used with a NanoLuc reporter incorporated into the genome, allowing for luciferase-based quantification of expression (Oh-Hashi et al. 2016; Li et al. 2019a). Similarly, FPs can be used, allowing product visualization in a range of cell lines, including brain neurons, hiPSCs, and HeLas (Uemura et al. 2016; Koch et al. 2018; Sharma et al. 2018). Incorporation of a fluorescent tag enables efficient and effective fluorescence-activated cell sorting (FACS), allowing for isolation of distinct regulatory states for further analysis with RNA-seq or ChIP-seq.

Once generated, a stably tagged knock-in cell line is a reliable cell-based reporter that can be assayed on a large scale (Fig. 3B). However, creation of the cell line poses several challenges. Consideration must be made for which end of the protein is tagged (predictive web tools can determine if functional regions or folding will be perturbed) (Käll et al. 2004; Roy et al. 2010), the importance of subcellular localization and trafficking (Teufel et al. 2022), and the assurance of minimal disruption of protein interactions (Omasits et al. 2014), as these are critical for accurate detection at protein level. Alternatively, a promoter can be tagged with a fluorescent sequence downstream; this answers if the promoter is activated but the FP is produced on its own, without compromising the target product (Ijaz and Ikegami 2021). This approach uncouples gene expression from transcription, allowing separate quantification of translational and transcriptional activity in a single system by tagging both the promoter and target gene with differing tags. However, tagging the promoter is challenging as some promoters have multiple regulatory elements; this makes tagging the GOI easier to interpret in the context of its natural regulation.

Another consideration is what to tag with. Fluorescent tags are relatively bigger than other knock-in options and so can pose challenges experimentally. Smaller tags such as mNeonGreen and mScarlet, which are compact but bright FPs, may prove easier to both insert and detect (Shaner et al. 2013; Bindels et al. 2017). However, these are newer and less tested with knock-in applications (Crowe and Yue 2019; Morrow et al. 2021). Both eGFP and mCherry are more widely used, photostable options with established knock-in protocols (Lee et al. 2022; Kim et al. 2023, 2024a; Tamura and Kamiyama 2023; Jiang et al. 2024). If you want to tag multiple genes in one system, mCherry and eGFP are also an excellent combination for flow cytometric analysis (Kleeman et al. 2018). Further gene tagging requires a FACS machine capable of blue excitation; for this, mTagBFP pairs well (Subach et al. 2011; Perez-Leal et al. 2021). Lastly, as with most CRISPR-Cas9 experiments, care must be taken to screen for unwanted mutations and produce a homozygous cell line to reduce noise from expression level differences (Koch et al. 2018; Hunt et al. 2023).

Users must consider the time and care needed to create a stable knock-in cell line for a GOI and balance this against its advantages, including the preserved chromatin context and the dependable nature of the system for long-term investigations. The time-consuming nature of producing a cell-based reporter line can be mitigated by using the latest advances in CRISPR knock-in techniques. One example is the “HiTag” technique, which can incorporate a tag into primary cells, which are difficult to clonally select, as well as postmitotic cells, with a single electroporation step (Zeng et al. 2020). This method has been used successfully with mCherry to produce a C-terminal tagged protein library (Kim et al. 2024b). Another method is targeted knock-in with templates (TKIT); this approach uses two-guide RNA and donor templates (Fang et al. 2021). TKIT was used successfully to tag synaptic proteins with mNeonGreen in mouse cortical neurons.

Successful creation of a homozygous tagged cell line offers a method for systematically identifying what regulates a GOI. The endogenous reporter for a GOI line can be treated with specific TFs through plasmid delivery or direct protein addition (Del'Guidice et al. 2018) as a direct measure of gene activation or repression. Further, cell populations or individual cells can be sorted based on fluorescence levels using FACS and kept for downstream applications such as RNA-seq (Grolmusz et al. 2016) or techniques described in the previous sections, imaging using the cells genetically encoded fluorescence, further flow cytometry for additional markers, ChIP-seq, and western blot. CRISPR knock-in can also be combined with the already established CRISPR knockout libraries to identify GOI regulators (Sanson et al. 2018). Through introduction of knockouts in predicted regulatory regions, fluorescence changes representing altered GOI expression can be quantified, revealing factors that control GOI expression. This high-throughput technique could reveal upstream regulators, pathways, and chromatin modifications involved in GOI control.

Using a CRISPR-Cas9 knock-in tag to determine what regulates a GOI offers a powerful approach that has significant advantages over traditional methods. This technique provides a permanent biosensor integrated into the genome at one time point, allowing for consistent long-term expression without the need to incorporate a detection method at each assay, thus increasing experimental reproducibility and reducing technical inconsistencies.

Summary

Targeted protein centric methods that analyze known candidate protein binding to the DNA using fusion proteins or immunodetection, although powerful for identifying DNA bound to a protein of interest, are not good tools for validating candidate proteins suspected of binding to a GOI. Generating fusion proteins with the TF of interest is often required, and the process can be resource-intensive when dealing with a large candidate list. Therefore, prioritizing candidates through complementary approaches or bioinformatics can streamline the validation process. Furthermore, it is critical to remember that not all of these methods explore if candidate protein binding has any effect on GOI expression, and as such, they cannot confirm a regulatory role. Similarly, methods based on visualizing chromatin or DNA–protein colocalization give no insight into a regulatory role and need to be coupled with other assays to identify, or even validate, candidate TFs of a GOI. However, recent developments in genome editing allow for smart and ambitious protocols to be implemented.

Previous Section Next Section

Identification of proteins bound to the regulatory elements of the GOI within the cell

To discover what regulates the expression of a GOI, we need to be able to map which proteins bind to its regulatory elements on the DNA. There has been a lot of effort to address this challenge, resulting in reverse ChIP and in proteomics of isolated chromatin segments (PICh) development. Both of these methods leverage MS/MS analysis to identify proteins bound to the DNA of interest; however, reverse ChIP does this in vitro, whereas PICh allows for cell-based analysis. The principle is that if the expression of a GOI is increased when cells are subjected to condition X, by comparing which proteins are bound to the GOI regulatory element upon treatment X, in contrast to an untreated control, we can determine proteins that drive GOI expression, at least in that condition. This offers a significant step up from methods that rely on artificially introduced fusion proteins and, as such, are not suited for testing if candidate protein binding to TF affects GOI expression.

Reverse chromatin immunoprecipitation

For both reverse ChIP (r-ChIP), also known as the DNA pull-down assay (Wen et al. 2020; Wen and Wang 2024), and PICh (Déjardin and Kingston 2009), the DNA sequence of interest, such as the GOI promoter or enhancer, needs to be determined first as described above. Once this sequence is known in reverse ChIP, the sequence needs to be synthesized with tags that will allow pull-down such as biotin. Cells are grown under conditions of interest followed by lysis and fractionation to isolate the DNA-bound proteins. Chromatin fraction undergoes sonication to shear the DNA, leaving the proteins that were bound to the DNA. The resulting protein solution is incubated with the synthesized GOI DNA–biotin construct. Following washes and pull-down, or magnetic bead enrichment, proteins that are specifically bound to the DNA of interest are isolated and can be identified using MS/MS. However, there are few set backs to this approach. Controls need to be carefully designed, including scrambled DNA sequence, beads alone, etc., to weed out nonspecific binders. Construct length needs to be carefully considered, taking into account binding motifs as well as specificity. Furthermore, careful consideration needs to be made when deciding which end of the DNA should be tagged as that might affect protein binding. Similarly, there can be an advantage in cross-linking protein–protein complexes to ensure their competency to bind the DNA; however, increased binding can significantly increase the amount of nonspecific hits. The most important complication arises from the design of this experiment in which proteins are exposed to, in essence, a dsDNA break. Consequently, there is recruitment of DNA ds break repair machinery, which is not specific to the GOI sequence and competes with specific binders. Of course, the use of scrambled DNA as a control is meant to combat this, but in practice, this is a major obstacle. Some studies have attempted to further enhance specificity by using probes with single-base mutations at sites known to play a critical role for TF binding; however, if this level of knowledge about the GOI, and its TF binding, already exists, the whole experiment is probably not needed as the GOI regulator is already well known. Importantly, in reverse ChIP, DNA is synthesized; as such, there is no chromatin structure or accessibility constraints, and this will inevitably generate false binding.

The PICh approach

PICh aims to resolve the technical problems of r-ChIP. In PICh, affinity purification relies on the specific hybridization of nucleic acid probes to selected targets and thus can be directly used to purify endogenous targets. Critically, PICh is not based on the generation of a transgenic target or the expression of adaptor proteins and does not require prior knowledge about the identity of the bound proteins (Ide and Dejardin 2015). This ensures that chromatin accessibility and structure are accounted for. Chromatin with the GOI-protein binding intact is cross-linked and broken down into smaller fragments. Next, the locus of interest is targeted with desthiobiotinylated oligonucleotide probes containing 50% of locked nucleic acids (LNAs) to improve the stability of probe–chromatin interactions. Modified nucleotides are used owing to their high melting temperature, as well as stable and specific hybridization. Hybridized probes are subsequently pulled down with streptavidin-coated beads. Because the probes are tagged with desthiobiotin, a biotin analog that binds less strongly to streptavidin, the probe–hybrid complex is easily released into solution by biotin elution, and nonspecific chromatin fragments remain bound to the beads, significantly reducing background. The final product can be analyzed by western blotting for candidate TF or MS for all GOI bound proteins (Khmelinskii and Knop 2014). PICh can be combined with stable isotope labeling by amino acids in cell culture (SILAC) for quantitative analyses (Kan et al. 2017).

Although PICh can identify proteins interacting with specific chromatin regions, there are notable limitations to consider, including the need for large amounts of cell material to obtain sufficient quantities of purified DNA-bound protein for MS, difficulties in avoiding hybridization with other genomic regions, and challenges related to the stability of TF–DNA binding, as well as reliance on repeat regions. PICh provides an enrichment factor of up to 10,000-fold; however, it is not well suited for studying the composition of single-copy small loci (∼3 kb) in mammalian genomes. Recent improvements include development of end-targeting PICh (ePICh), which allows for targeting of less abundant genomic loci by using a restriction digest to mark chromatin segment ends (Ide and Dejardin 2015). However, further development is needed for PICh to be compatible with the identification of proteins bound to a promoter of a GOI. In the future, PICh could be improved by a dCas9-based approach similar to recent work in plants (Wang et al. 2023). Enhancing cross-linking methods and developing multiplexing capabilities could also increase its efficiency and broaden its applicability for large-scale and high-resolution studies.

Artificial chromosomes

Conventional gene expression assays often lack the native chromatin environment required to faithfully study transcriptional regulation. Plasmid-based reporters, in vitro DNA probes, or transient overexpression systems frequently fail to recapitulate higher-order chromatin structure, limiting their physiological relevance. Artificial chromosomes overcome this limitation by mimicking the behavior of endogenous chromosomes. These constructs contain the essential elements of chromosomal maintenance—replication origins, centromeres, and telomeres—enabling stable replication and segregation in host cells (Nierman and Feldblyum 2001). This architecture allows the insertion of large genomic regions, including full-length genes with their endogenous promoters, enhancers, and boundary elements, offering a substantial experimental advantage over minimal vector systems. Artificial chromosomes provide more cellular material than endogenous loci and maintain chromatin features absent from naked DNA or plasmid vectors, making them a powerful complement to methods such as reporter assays, ChIP-seq, or proteomic analysis of chromatin segments.

Yeast artificial chromosomes (YACs) were originally developed for yeast genome mapping (Burke et al. 1987) and later adapted for use in mammalian cells via cell fusion (Huxley and Gnirke 1991). However, their large size renders them difficult to transfect. Although transfection efficiency can be improved by codelivery with adenovirus (Chen et al. 1997), challenges remain with YAC isolation. Techniques such as pulsed-field gel extraction are time-consuming and low-throughput and yield limited quantities of intact YAC DNA (Larionov et al. 1996), whereas alkaline extraction methods offer modest improvements but remain low-yield (Devenish and Newlon 1982).

Human artificial chromosomes (HACs) have since been developed using either top-down approaches (truncation of native chromosomes) or bottom-up strategies (synthetic construction) (Ponomartsev et al. 2022). HACs display favorable experimental properties, including mitotic stability, episomal maintenance, and low copy number, which reduce integration-associated artifacts and make them suitable for long-term studies (Kazuki et al. 2013). HACs exhibit stable tissue-specific expression in mice and can transmit transgenes into the second generation, albeit at a modest efficiency (∼50%) (Ikeno and Hasegawa 2020). Although HAC transfer methods such as microcell-mediated chromosome transfer, whole-cell fusion, or metaphase chromosome transfer require careful optimization (Ponomartsev et al. 2022), once introduced, HACs remain extrachromosomal and avoid the risks of insertional mutagenesis or immune activation (Thomas et al. 2003).

The ability to incorporate large DNA fragments, including entire gene loci and their extended regulatory landscapes, makes artificial chromosomes uniquely valuable for interrogating gene regulation in a near-native context. They allow integration of gene reporter assays to validate regulatory sequences (e.g., promoters or enhancers) with improved control, particularly when combined with CRISPR-Cas9-mediated mutagenesis of specific elements (Nakamura et al. 2021). This has been used to study distant regulatory elements of the CFTR gene (Mogayzel and Ashlock 2000). In addition, the platform is amenable to standard molecular techniques: ChIP-seq can be used to map TF binding sites and chromatin accessibility within the artificial construct (Nakato and Sakata 2021), whereas PICh enables identification of chromatin-associated protein complexes bound to specific loci (Kan et al. 2017).

Artificial chromosomes also support three-dimensional chromatin conformation studies using techniques such as 3C, allowing researchers to evaluate long-range DNA interactions and chromatin architecture (Akgol Oksuz et al. 2021). Moreover, they can be used to investigate epigenetic modifications—such as histone PTMs, DNA methylation, and R-loop formation—that influence transcription. These features can be experimentally manipulated using CRISPR-based epigenetic editing tools (Nakamura et al. 2021). HACs have been used to model centromeric function through the integration of satellite DNA and the study of histone mark deposition, transcription, and kinetochore formation (Molina et al. 2016), underscoring their utility in exploring the chromatin-based regulation of any GOI.

Artificial chromosomes provide a scalable and chromatinized platform for elucidating the molecular logic of transcriptional control. By supporting high-throughput, physiologically relevant interrogation of regulatory networks at a GOI, they bridge the gap between conventional vector systems and the complex regulatory landscape of the native genome.

CRISPR-assisted proximity labeling with APEX2

CAS9-catalyzed editing enables the modification of nucleotide sequences within the genome, facilitating a better comprehension of gene function, as well as their promoters and regulatory sequences (e.g., TF binding sites). Nevertheless, the application of the CRISPR-Cas9 system in transcriptional regulation studies goes well beyond mere DNA editing. The potential of Cas9 to inhibit transcription at various genomic locations has been investigated, given its ability to bind to any complementary sequence in the genome. Mutations were introduced into the two critical catalytic domains, RuvC and HNH, to eliminate the endonuclease activity. The resulting “dead” Cas9 (dCas9) protein still tightly binds to double-stranded DNA despite lacking the catalytic function. This inactive form of Cas9 has proven useful in studies involving programmable DNA binding (Qi et al. 2021). Through the use of customized sgRNAs, dCas9 can be directed to transcription initiation sites of any loci, where it can interfere with RNA polymerase at promoters, effectively halting transcription. Additionally, dCas9 can be targeted to the coding region of loci in a way that inhibits RNA polymerase during the elongation phase of transcription (Bikard et al. 2013).

Understanding gene expression regulation requires mapping the proteins that bind to the regulatory elements of a GOI. Traditional approaches to studying protein–DNA interactions, such as reverse ChIP and PICh, have provided valuable insights into chromatin-associated protein networks. However, these methods have limitations, particularly in detecting transient or weak interactions, which are often lost during immunoprecipitation or chromatin isolation steps. Furthermore, they rely on specific capture probes or antibodies, making them less suitable for unbiased identification of chromatin-bound proteins at a given genomic locus (Déjardin and Kingston 2009; Khan et al. 2021).To address these challenges, CRISPR-assisted proximity labeling with APEX2 (CASPEX) has emerged as a powerful method for studying chromatin-bound proteins at defined genomic locations. CASPEX utilizes a catalytically inactive mutant of Cas9 (dCas9) fused to the engineered ascorbate peroxidase (APEX2), enabling the selective labeling of proteins near a targeted DNA locus. This approach is based on the CRISPR-Cas9 targeting system, in which a sequence-specific guide RNA (gRNA) directs the dCas9–APEX2 fusion protein to a specific promoter, enhancer, or silencer of the GOI, allowing precise binding to specific genomic sequences (Myers et al. 2018). Upon activation by the addition of biotin-phenol and hydrogen peroxide, APEX2 catalyzes biotinylation of proteins within a ∼20 nm radius around the target locus. The biotinylated proteins are then captured using streptavidin affinity purification and analyzed by mass spectrometry, generating an unbiased list of chromatin-bound factors at the GOI (Hwang and Espenshade 2016). This labeling process captures dynamic protein–DNA interactions. CASPEX represents a significant advancement over existing chromatin proteomic techniques in several ways. In contrast to ChIP-MS, which requires antibodies for specific target proteins, CASPEX enables unbiased proteomic profiling of chromatin-bound proteins, even if their identity is unknown. The proximity labeling strategy allows for the detection of weak and transient interactions that are often difficult to capture using traditional immunoprecipitation-based methods (Qin et al. 2021). Another advantage is that CASPEX operates in living cells, preserving the native chromatin architecture and ensuring that the detected interactions reflect physiological conditions rather than artifacts introduced by fixation, chromatin shearing, or cross-linking (van Staalduinen et al. 2023).

This methodology has broad applications in studying gene regulation, chromatin remodeling, and TF dynamics. By allowing researchers to track changes in GOI-bound proteins under various conditions (such as cell differentiation, environmental stress, or drug treatment), CASPEX provides a useful tool for uncovering novel transcriptional regulators. It can also be applied to disease research, particularly in understanding how genetic mutations affect chromatin interactions. For example, mutations in genes associated with cancer or hematopoietic diseases often lead to altered TF binding and epigenetic dysregulation, which can now be studied in a more comprehensive manner using CASPEX (Okabe and Kaneda 2021; Morgan et al. 2022).

Another application of CASPEX is drug discovery and therapeutic target identification. By identifying the protein complexes that assemble at disease-associated regulatory elements, researchers can gain insight into potential therapeutic targets for modulating gene expression (Zou et al. 2024). Furthermore, CASPEX can be used to study long-range chromatin interactions, as modified versions of the technique have been adapted to identify proteins involved in enhancer–promoter looping and higher-order chromatin organization (Qiu et al. 2019; Ummethum and Hamperl 2020).

By combining CRISPR-Cas9-based locus targeting with high-resolution proteomic analysis, CASPEX achieves an unparalleled level of specificity and sensitivity. Unlike conventional methods, it allows for live-cell, antibody-independent, and unbiased profiling of regulatory protein complexes at specific genomic loci.

Previous Section Next Section

Important consideration when analyzing regulators of GOI expression by genomic modifications

CRISPR activation (CRISPRa) and CRISPRi are techniques that employ the CRISPR-Cas9 system to regulate gene expression without permanent modifications to the DNA sequence. CRISPRa is used to enhance gene expression through a programmable dCas9 protein. This protein is combined with activation domains, such as VPR, VP64, p65, or HSF1, as a fusion construct. These domains recruit TFs, leading to increased gene transcription. This approach has been widely applied for investigating gene function (Chavez et al. 2015; Konermann et al. 2015). CRISPRi functions in the opposite manner, concentrating on silencing gene expression. This method also employs dCas9, but instead of activation domains, protein is linked to repressor domains, like KRAB. When dCas9-KRAB is directed to a gene's promoter sequence or coding regions, it binds to them and physically obstructs transcriptional complexes from accessing these areas, thereby inhibiting transcription initiation. As a result, gene expression is suppressed, enabling the study of the effects of gene silencing and determination of its role in biological processes (Qi et al. 2013; Zalatan et al. 2015).

Nevertheless, epigenetic modifications pose challenges when using CRISPR-Cas9 technology. The activity of Cas9 nuclease can influence alterations in histone modifications and DNA methylation patterns at, or near, the targeted site, especially when CRISPR is employed to regulate transcription. For instance, targeting a gene promoter using CRISPRi can result in the recruitment of repressive histone marks, leading to broader changes in chromatin accessibility (Shi et al. 2025). These epigenetic changes may extend beyond the targeted gene, influencing adjacent regions and complicating data interpretation. Moreover, the chromatin context of the target location significantly impacts CRISPR-Cas9 efficiency and specificity. For example, heterochromatin regions are less accessible to Cas9, resulting in variable editing efficiencies and possible off-target effects in regions with similar sequence compositions (Schep et al. 2024). These unintended modifications to the chromatin structure and regulatory elements can obscure the direct impact of the intended genomic modification. To mitigate these issues, researchers are developing strategies to minimize unintended epigenetic alterations. These include using high-fidelity Cas9 variants to reduce off-target effects and employing epigenome editing tools that can modify specific histone marks or DNA methylation without introducing double-strand breaks (Pei et al. 2020). Additionally, comprehensive epigenomic profiling before and after CRISPR-Cas9 experiments could help identify and account for any unintended changes in the chromatin structure.

Previous Section Next Section

Summary

Understanding what regulates a GOI requires a multilayered experimental approach that integrates observational, functional, and molecular strategies. Initial RNA- and protein-centric assays provide spatial and quantitative data on transcript abundance and chromatin context. Although these techniques are informative, they remain largely correlative and require complementary validation to determine causality.

Functional assays such as CRISPR-Cas9 knock-ins offer powerful tools to create stable, endogenously tagged cell lines that faithfully report on GOI expression under native regulatory control. These systems allow dynamic monitoring of transcriptional responses and can be integrated with pooled CRISPR or siRNA libraries for high-throughput screening of candidate regulators.

To move beyond inference and identify proteins directly interacting with regulatory elements, techniques like CASPEX enable unbiased, live-cell proteomic profiling at specific genomic loci. By fusing dCas9 to APEX2, CASPEX facilitates high-resolution mapping of chromatin-associated proteins without the need for antibodies or cross-linking, capturing transient and dynamic interactions in physiological conditions.

Together, these methods form a cohesive experimental pipeline (Fig. 4; Table 2). Observational tools prioritize candidate regulators; functional assays validate their regulatory roles; and chromatin proteomics directly identify protein–DNA interactions at the GOI locus. The integration of these strategies (Fig. 5A,B) allows for a robust, system-level understanding of transcriptional regulation in both health and disease contexts.

View larger version:

Download as PowerPoint Slide

Figure 4.

Molecular assays used to investigate gene regulatory mechanisms, broadly categorized into four functional levels: (1) transcription factor (TF)–DNA binding, (2) mRNA abundance of the GOI, (3) GOI protein expression, and (4) GOI-associated signaling networks. (1) TF–DNA binding: Assays at this level, such as ChIP-seq, CUT&RUN, or EMSAs, provide direct evidence of a physical interaction between a TF and the regulatory elements of the GOI. These approaches assess binding specificity and occupancy but do not inform on the functional consequences of such binding. (2) GOI mRNA levels: Quantification of GOI transcripts (e.g., via RT-qPCR or RNA-seq) offers a measure of transcriptional output. Changes in mRNA abundance in response to TF perturbation can suggest regulatory influence but may still reflect indirect effects mediated by intermediate factors. (3) GOI protein levels: Techniques such as reporter assay or western blotting/mass spectrometry–based proteomics assess the steady-state levels of the GOI protein. Although changes at this level may reflect transcriptional regulation, they are also influenced by post-transcriptional and translational mechanisms, including mRNA stability, translation efficiency, and protein degradation rates. (4) GOI signaling networks: This includes analysis of downstream signaling pathways or phenotypic outputs modulated by the GOI protein. These readouts are inherently multilayered and integrate regulatory events occurring at several preceding molecular levels. It is critical to recognize that as we move downstream in the gene expression cascade, from TF binding to phenotypic outcome, additional layers of regulation become increasingly prominent. For instance, using GOI protein levels as a surrogate for transcriptional regulation by TF X can be misleading. An observed increase in GOI protein levels following TF X overexpression does not necessarily imply direct TF–GOI transcriptional activation. Instead, TF X may indirectly enhance GOI protein abundance by modulating expression of genes that encode mRNA-stabilizing factors, translation enhancers, or protein chaperones that extend GOI half-life. Thus, careful interpretation of molecular readouts is essential, and conclusions regarding direct regulatory relationships should be supported by complementary mechanistic evidence across multiple levels of analysis.

View larger version:

Download as PowerPoint Slide

Figure 5.

Integrated workflow for identifying direct promoter-bound regulators of a GOI. (A) This schematic outlines a stepwise strategy for decoding transcriptional regulation of a GOI, with a focus on identifying direct regulators acting at the promoter level. The process begins by selecting the GOI, including annotation of transcript variants, promoter structure, and determining regulatory elements. At this stage, mining publicly available data sets, such as ChIP-seq, CUT&Tag, and DamID profiles, can yield initial hypotheses regarding candidate transcription factors or chromatin regulators. Next, unbiased identification of promoter-associated proteins can be performed using locus-specific proteomics techniques such as PiCh or reverse ChIP (r-ChIP). In parallel, genome-wide CRISPR knockout screens coupled to endogenous reporters (e.g., fluorescent tagging or PLAYR-based detection) can be employed to identify both direct and indirect regulators of GOI expression. Following candidate identification, direct promoter occupancy should be validated using complementary in vitro and in vivo binding assays, including electrophoretic mobility shift assay (EMSA), ChIP, cleavage under targets and release using nuclease (CUT&RUN or CHEC), and DNA adenine methyltransferase identification (DamID), complemented by systematic evaluation of available public data sets. This step ensures that the regulators physically interact with the GOI promoter and are not acting solely via indirect pathways; consequently, it is critical to perform this analysis if candidate identification occurred via CRISPR screen as this will also identify non-promoter-bound indirect regulators. Finally, functional validation needs to be performed by perturbing candidate regulator levels—via overexpression, RNAi, or CRISPR-based approaches—and quantifying changes in GOI expression using qRT-PCR, RNA-FISH, or related transcript-level assays. Importantly, all assays should be conducted in physiologically relevant cellular contexts to preserve native chromatin architecture and regulatory interactions. (B) This iterative framework integrates discovery, validation, and functional characterization of promoter-bound regulatory factors and can be adapted to dissect transcriptional control of diverse genomic loci.

View this table:

Table 2.

Summary of experimental approaches for identifying transcriptional regulators of a gene of interest in human cells

Previous Section Next Section

Competing interest statement

The authors declare no competing interests.

Previous Section Next Section

Acknowledgments

U.L.M. and L.B. were supported by the Medical Research Council (MRC) (MR/X00855X/1). U.L.M. is also funded by the North West Cancer Research (RDG2021.15) and the MRC (DiMeN DTP2; supporting C.F.), as well as by a European Molecular Biology Organization Young Investigator Network Installation Grant (supporting A.D.B.) and NCN Sonata Bis grant no. 2023/50/E/NZ3/00281 (supporting A.D.B.).

Author contributions: A.D.B. led on the sections describing different high-throughput sequencing–led approaches. D.G. led on CASPEX-based approaches. M.S. led on imaging-based approached. L.B. led on cell-based endogenous reporter assays. M.W. led on Y2H. C.F. led on PLAYR. B.H. led on cell functional assays. M.G. led on luciferase reporter. M.R. did manuscript proofreading and editing. L.P. led on data mining, computational analysis, manuscript proofreading, and editing. U.L.M. led on figures, the introduction, manuscript proofreading, and editing.

Previous Section Next Section

Footnotes

Article published online before print. Article and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.281154.125.
Freely available online through the Genome Research Open Access option.

Received July 4, 2025.
Accepted December 17, 2025.

© 2026 Bates et al.; Published by Cold Spring Harbor Laboratory Press

This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

Previous Section

References

↵

Abugessaisa I, Ramilowski JA, Lizio M, Severin J, Hasegawa A, Harshbarger J, Kondo A, Noguchi S, Yip CW, Ooi JLC, et al. 2021. FANTOM enters 20th year: expansion of transcriptomic atlases and functional annotation of non-coding RNAs. Nucleic Acids Res 49: D892–d898. doi:10.1093/nar/gkaa1054

CrossRef Medline Google Scholar
↵

Acke A, Van Belle S, Louis B, Vitale R, Rocha S, Voet T, Debyser Z, Hofkens J. 2022. Expansion microscopy allows high resolution single cell analysis of epigenetic readers. Nucleic Acids Res 50: e100. doi:10.1093/nar/gkac521

CrossRef Medline Google Scholar
↵

Adato O, Sloutskin A, Komemi H, Brabb I, Duttke S, Bucher P, Unger R, Juven-Gershon T. 2024. ElemeNT 2023: an enhanced tool for detection and curation of core promoter elements. Bioinformatics 40: btae110. doi:10.1093/bioinformatics/btae110

CrossRef Medline Google Scholar
↵

Aditham AK, Markin CJ, Mokhtari DA, DelRosso N, Fordyce PM. 2021. High-throughput affinity measurements of transcription factor and DNA mutations reveal affinity and specificity determinants. Cell Syst 12: 112–127.e11. doi:10.1016/j.cels.2020.11.012

CrossRef Medline Google Scholar
↵

Akgol Oksuz B, Yang L, Abraham S, Venev SV, Krietenstein N, Parsi KM, Ozadam H, Oomen ME, Nand A, Mao H, et al. 2021. Systematic evaluation of chromosome conformation capture assays. Nat Methods 18: 1046–1055. doi:10.1038/s41592-021-01248-7

CrossRef Medline Google Scholar
↵

Albrecht F, List M, Bock C, Lengauer T. 2016. DeepBlue epigenomic data server: programmatic data retrieval and analysis of epigenome region sets. Nucleic Acids Res 44: W581–W586. doi:10.1093/nar/gkw211

CrossRef Medline Google Scholar
↵

Alfonso-Gonzalez C, Hilgers V. 2024. (Alternative) transcription start sites as regulators of RNA processing. Trends Cell Biol 34: 1018–1028. doi:10.1016/j.tcb.2024.02.010

CrossRef Medline Google Scholar
↵

Alipanahi B, Delong A, Weirauch MT, Frey BJ. 2015. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol 33: 831–838. doi:10.1038/nbt.3300

CrossRef Medline Google Scholar
↵

Aughey GN, Southall TD. 2016. Dam it's good! DamID profiling of protein-DNA interactions. Wiley Interdiscip Rev Dev Biol 5: 25–37. doi:10.1002/wdev.205

CrossRef Medline Google Scholar
↵

Babl V, Stöckl U, Tschochner H, Milkereit P, Griesenbeck J. 2015. Chromatin endogenous cleavage (ChEC) as a method to quantify protein interaction with genomic DNA in Saccharomyces cerevisiae. Methods Mol Biol 1334: 219–232. doi:10.1007/978-1-4939-2877-4_14

CrossRef Medline Google Scholar
↵

Badr CE. 2014. Bioluminescence imaging: basics and practical limitations. Methods Mol Biol 1098: 1–18. doi:10.1007/978-1-62703-718-1_1

CrossRef Medline Google Scholar
↵

Bailey TL, Johnson J, Grant CE, Noble WS. 2015. The MEME suite. Nucleic Acids Res 43: W39–W49. doi:10.1093/nar/gkv416

CrossRef Medline Google Scholar
↵

Bao X-R, Zhu Y-H, Yu D-J. 2019. DeepTF: accurate prediction of transcription factor binding sites by combining multi-scale convolution and long short-term memory neural network, pp. 126–138. Springer International Publishing, Cham, Switzerland.

Google Scholar
↵

Barras F, Marinus MG. 1989. The great GATC: DNA methylation in E. coli. Trends Genet 5: 139–143. doi:10.1016/0168-9525(89)90054-1

CrossRef Medline Google Scholar
↵

Bateman JR, Johnson JE. 2022. Altering enhancer–promoter linear distance impacts promoter competition in cis and in trans. Genetics 222: iyac098. doi:10.1093/genetics/iyac098

CrossRef Medline Google Scholar
↵

Bell JC, Jukam D, Teran NA, Risca VI, Smith OK, Johnson WL, Skotheim JM, Greenleaf WJ, Straight AF. 2018. Chromatin-associated RNA sequencing (ChAR-seq) maps genome-wide RNA-to-DNA contacts. eLife 7: e27024. doi:10.7554/eLife.27024

CrossRef Medline Google Scholar
↵

Bereiter R, Micura R. 2025. Synthesis of HBC fluorophores with an electrophilic handle for covalent attachment to Pepper RNA. Beilstein J Org Chem 21: 727–735. doi:10.3762/bjoc.21.56

CrossRef Medline Google Scholar
↵

Bikard D, Jiang W, Samai P, Hochschild A, Zhang F, Marraffini LA. 2013. Programmable repression and activation of bacterial gene expression using an engineered CRISPR-Cas system. Nucleic Acids Res 41: 7429–7437. doi:10.1093/nar/gkt520

CrossRef Medline Google Scholar
↵

Bindels DS, Haarbosch L, van Weeren L, Postma M, Wiese KE, Mastop M, Aumonier S, Gotthard G, Royant A, Hink MA, et al. 2017. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nat Methods 14: 53–56. doi:10.1038/nmeth.4074

CrossRef Medline Google Scholar
↵

Birk UJ. 2019. Super-resolution microscopy of chromatin. Genes (Basel) 10: 493. doi:10.3390/genes10070493

CrossRef Medline Google Scholar
↵

Brandão HB, Gabriele M, Hansen AS. 2021. Tracking and interpreting long-range chromatin interactions with super-resolution live-cell imaging. Curr Opin Cell Biol 70: 18–26. doi:10.1016/j.ceb.2020.11.002

CrossRef Medline Google Scholar
↵

Buckley SM, Delhove JM, Perocheau DP, Karda R, Rahim AA, Howe SJ, Ward NJ, Birrell MA, Belvisi MG, Arbuthnot P, et al. 2015. In vivo bioimaging with tissue-specific transcription factor activated luciferase reporters. Sci Rep 5: 11842. doi:10.1038/srep11842

CrossRef Medline Google Scholar
↵

Burke DT, Carle GF, Olson MV. 1987. Cloning of large segments of exogenous DNA into yeast by means of artificial chromosome vectors. Science 236: 806–812. doi:10.1126/science.3033825

Abstract/FREE Full Text
↵

Camino EM, Weinstein ML, List MP, Vellky JE, Rebeiz M, Williams TM. 2020. Red light/green light, a dual fluorescent protein reporter system to study enhancer-promoter specificity in Drosophila. Adv Genet 10: 985–997. doi:10.1534/g3.119.401033

Abstract/FREE Full Text
↵

Cartharius K, Frech K, Grote K, Klocke B, Haltmeier M, Klingenhoff A, Frisch M, Bayerlein M, Werner T. 2005. MatInspector and beyond: promoter analysis based on transcription factor binding sites. Bioinformatics 21: 2933–2942. doi:10.1093/bioinformatics/bti473

CrossRef Medline Google Scholar
↵

Chaumeil J, Micsinai M, Skok JA. 2013. Combined immunofluorescence and DNA FISH on 3D-preserved interphase nuclei to study changes in 3D nuclear organization. J Vis Exp 72: e50087. doi:10.3791/50087

CrossRef Google Scholar
↵

Chavez A, Scheiman J, Vora S, Pruitt BW, Tuttle M, P R Iyer E, Lin S, Kiani S, Guzman CD, Wiegand DJ, et al. 2015. Highly efficient Cas9-mediated transcriptional programming. Nat Methods 12: 326–328. doi:10.1038/nmeth.3312

CrossRef Medline Google Scholar
↵

Chen M, Compton ST, Coviello VF, Green ED, Ashlock MA. 1997. Transient gene expression from yeast artificial chromosome DNA in mammalian cells is enhanced by adenovirus. Nucleic Acids Res 25: 4416–4418. doi:10.1093/nar/25.21.4416

CrossRef Medline Google Scholar
↵

Chen H, Levo M, Barinov L, Fujioka M, Jaynes JB, Gregor T. 2018. Dynamic interplay between enhancer–promoter topology and gene activity. Nat Genet 50: 1296–1303. doi:10.1038/s41588-018-0175-z

CrossRef Medline Google Scholar
↵

Chen C, Hou J, Shi X, Yang H, Birchler JA, Cheng J. 2021a. DeepGRN: prediction of transcription factor binding site across cell-types using attention-based deep neural networks. BMC Bioinformatics 22: 38. doi:10.1186/s12859-020-03952-1

CrossRef Medline Google Scholar
↵

Chen W, Lu W, Wolynes PG, Komives EA. 2021b. Single-molecule conformational dynamics of a transcription factor reveals a continuum of binding modes controlling association and dissociation. Nucleic Acids Res 49: 11211–11223. doi:10.1093/nar/gkab874

CrossRef Medline Google Scholar
↵

Chen H, Yan G, Wen M-H, Brooks KN, Zhang Y, Huang P-S, Chen T-Y. 2024a. Advancements and practical considerations for biophysical research: navigating the challenges and future of super-resolution microscopy. Chem Biomed Imaging 2: 331–344. doi:10.1021/cbmi.4c00019

CrossRef Medline Google Scholar
↵

Chen Y, Liang R, Li Y, Jiang L, Ma D, Luo Q, Song G. 2024b. Chromatin accessibility: biological functions, molecular mechanisms and therapeutic application. Signal Transduct Target Ther 9: 340. doi:10.1038/s41392-024-02030-9

CrossRef Medline Google Scholar
↵

Cheng S, Mi R, Xu Y, Jin G, Zhang J, Zhou Y, Chen Z, Liu F. 2017. Ferritin heavy chain as a molecular imaging reporter gene in glioma xenografts. J Cancer Res Clin Oncol 143: 941–951. doi:10.1007/s00432-017-2356-z

CrossRef Medline Google Scholar
↵

Cherry JM. 2015. The Saccharomyces genome database: a tool for discovery. Cold Spring Harb Protoc 2015: pdb.top083840. doi:10.1101/pdb.top083840

Abstract/FREE Full Text
↵

Choi YJ, Oh SG, Singh TD, Ha JH, Kim DW, Lee SW, Jeong SY, Ahn BC, Lee J, Jeon YH. 2016. Visualization of the biological behavior of tumor-associated macrophages in living mice with colon cancer using multimodal optical reporter gene imaging. Neoplasia 18: 133–141. doi:10.1016/j.neo.2016.01.004

CrossRef Medline Google Scholar
↵

Choi JJ, Svaren J, Wang D. 2025. CoTF-reg reveals cooperative transcription factors in oligodendrocyte gene regulation using single-cell multi-omics. Commun Biol 8: 181. doi:10.1038/s42003-025-07570-6

CrossRef Medline Google Scholar
↵

Chuang LY, Yang CH, Lin MC, Yang CH. 2012. CpGPAP: CpG island predictor analysis platform. BMC Genet 13: 13. doi:10.1186/1471-2156-13-13

CrossRef Medline Google Scholar
↵

Chubb JR, Trcek T, Shenoy SM, Singer RH. 2006. Transcriptional pulsing of a developmental gene. Curr Biol 16: 1018–1025. doi:10.1016/j.cub.2006.03.092

CrossRef Medline Google Scholar
↵

Chung HK, Jacobs CL, Huo Y, Yang J, Krumm SA, Plemper RK, Tsien RY, Lin MZ. 2015. Tunable and reversible drug control of protein production via a self-excising degron. Nat Chem Biol 11: 713–720. doi:10.1038/nchembio.1869

CrossRef Medline Google Scholar
↵

Concilio SC, Russell SJ, Peng KW. 2021. A brief review of reporter gene imaging in oncolytic virotherapy and gene therapy. Mol Ther Oncolytics 21: 98–109. doi:10.1016/j.omto.2021.03.006

CrossRef Medline Google Scholar
↵

Core LJ, Lis JT. 2008. Transcription regulation through promoter-proximal pausing of RNA polymerase II. Science 319: 1791–1792. doi:10.1126/science.1150843

Abstract/FREE Full Text
↵

Crowe AR, Yue W. 2019. Semi-quantitative determination of protein expression using immunohistochemistry staining and analysis: an integrated protocol. Bio Protoc 9: e3465. doi:10.21769/BioProtoc.3465

CrossRef Google Scholar
↵

Curtis A, Calabro K, Galarneau JR, Bigio IJ, Krucker T. 2011. Temporal variations of skin pigmentation in C57BL/6 mice affect optical bioluminescence quantitation. Mol Imaging Biol 13: 1114–1123. doi:10.1007/s11307-010-0440-8

CrossRef Medline Google Scholar
↵

Dai Z, Dai X. 2012. Nuclear colocalization of transcription factor target genes strengthens coregulation in yeast. Nucleic Acids Res 40: 27–36. doi:10.1093/nar/gkr689

CrossRef Medline Google Scholar
↵

Damhofer H, Radzisheuskaya A, Helin K. 2021. Generation of locus-specific degradable tag knock-ins in mouse and human cell lines. STAR Protoc 2: 100575. doi:10.1016/j.xpro.2021.100575

CrossRef Medline Google Scholar
↵

Das PM, Ramachandran K, vanWert J, Singal R. 2004. Chromatin immunoprecipitation assay. BioTechniques 37: 961–969. doi:10.2144/04376RV01

CrossRef Medline Google Scholar
↵

Déjardin J, Kingston RE. 2009. Purification of proteins associated with specific genomic loci. Cell 136: 175–186. doi:10.1016/j.cell.2008.11.045

CrossRef Medline Google Scholar
↵

Deleuze V, Soler E, Andrieu-Soler C. 2024. Protocol for efficient CRISPR-Cas9-mediated fluorescent tag knockin in hard-to-transfect erythroid cell lines. STAR Protoc 5: 103016. doi:10.1016/j.xpro.2024.103016

CrossRef Medline Google Scholar
↵

Del'Guidice T, Lepetit-Stoffaes JP, Bordeleau LJ, Roberge J, Théberge V, Lauvaux C, Barbeau X, Trottier J, Dave V, Roy DC, et al. 2018. Membrane permeabilizing amphiphilic peptide delivers recombinant transcription factor and CRISPR-Cas9/Cpf1 ribonucleoproteins in hard-to-modify cells. PLoS One 13: e0195558. doi:10.1371/journal.pone.0195558

CrossRef Medline Google Scholar
↵

de Medeiros Oliveira M, Bonadio I, Lie de Melo A, Mendes Souza G, Durham AM. 2021. TSSFinder-fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Brief Bioinform 22: bbab198. doi:10.1093/bib/bbab198

CrossRef Medline Google Scholar
↵

Devenish RJ, Newlon CS. 1982. Isolation and characterization of yeast ring chromosome III by a method applicable to other circular DNAs. Gene 18: 277–288. doi:10.1016/0378-1119(82)90166-4

CrossRef Medline Google Scholar
↵

Diao S, Hong G, Antaris AL, Blackburn JL, Cheng K, Cheng Z, Dai H. 2015. Biological imaging without autofluorescence in the second near-infrared region. Nano Res 8: 3027–3034. doi:10.1007/s12274-015-0808-9

CrossRef Google Scholar
↵

Ding F, Zhan Y, Lu X, Sun Y. 2018. Recent advances in near-infrared II fluorophores for multifunctional biomedical imaging. Chem Sci 9: 4370–4380. doi:10.1039/C8SC01153B

CrossRef Medline Google Scholar
↵

Dixit A, Parnas O, Li B, Chen J, Fulco CP, Jerby-Arnon L, Marjanovic ND, Dionne D, Burks T, Raychowdhury R, et al. 2016. Perturb-seq: dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell 167: 1853–1866.e17. doi:10.1016/j.cell.2016.11.038

CrossRef Medline Google Scholar
↵

Dréos R, Ambrosini G, Groux R, Périer RC, Bucher P. 2018. MGA repository: a curated data resource for ChIP-seq and other genome annotated data. Nucleic Acids Res 46: D175–d180. doi:10.1093/nar/gkx995

CrossRef Medline Google Scholar
↵

Du Q, Luu PL, Stirzaker C, Clark SJ. 2015. Methyl-CpG-binding domain proteins: readers of the epigenome. Epigenomics 7: 1051–1073. doi:10.2217/epi.15.39

CrossRef Medline Google Scholar
↵

Duckworth AD, Gherardini PF, Sykorova M, Yasin F, Nolan GP, Slupsky JR, Kalakonda N. 2019. Multiplexed profiling of RNA and protein expression signatures in individual cells using flow or mass cytometry. Nat Protoc 14: 901–920. doi:10.1038/s41596-018-0120-8

CrossRef Medline Google Scholar
↵

Duckworth AD, Slupsky JR, Kalakonda N. 2024. highly multiplexed and simultaneous characterization of protein and RNA in single cells by flow or mass cytometry platforms using proximity ligation assay for RNA. In Single cell analysis: methods and protocols (ed. Gužvić M), pp. 143–165. Springer, New York. doi:10.1007/978-1-0716-3621-3_10

CrossRef Google Scholar
↵

Eismann B, Krieger TG, Beneke J, Bulkescher R, Adam L, Erfle H, Herrmann C, Eils R, Conrad C. 2020. Automated 3D light-sheet screening with high spatiotemporal resolution reveals mitotic phenotypes. J Cell Sci 133: jcs245043. doi:10.1242/jcs.245043

Abstract/FREE Full Text
↵

England CG, Ehlerding EB, Cai W. 2016. NanoLuc: a small luciferase Is brightening up the field of bioluminescence. Bioconjug Chem 27: 1175–1187. doi:10.1021/acs.bioconjchem.6b00112

CrossRef Medline Google Scholar
↵

Fang H, Bygrave AM, Roth RH, Johnson RC, Huganir RL. 2021. An optimized CRISPR/Cas9 approach for precise genome editing in neurons. eLife 10: e65202. doi:10.7554/eLife.65202

CrossRef Medline Google Scholar
↵

Faulkner EL, Pike JA, Densham RM, Garlick E, Thomas SG, Neely RK, Morris JR. 2022. Imaging nanoscale nuclear structures with expansion microscopy. J Cell Sci 135: jcs259009. doi:10.1242/jcs.259009

CrossRef Google Scholar
↵

Fleck JS, Jansen SMJ, Wollny D, Zenk F, Seimiya M, Jain A, Okamoto R, Santel M, He Z, Camp JG, et al. 2023. Inferring and perturbing cell fate regulomes in human brain organoids. Nature 621: 365–372. doi:10.1038/s41586-022-05279-8

CrossRef Medline Google Scholar
↵

Fung JJ, Blöcher-Juárez K, Khmelinskii A. 2022. High-throughput analysis of protein turnover with tandem fluorescent protein timers. Methods Mol Biol 2378: 85–100. doi:10.1007/978-1-0716-1732-8_6

CrossRef Medline Google Scholar
↵

Gardiner-Garden M, Frommer M. 1987. CpG islands in vertebrate genomes. J Mol Biol 196: 261–282. doi:10.1016/0022-2836(87)90689-9

CrossRef Medline Google Scholar
↵

Gaspar N, Handula M, Stroet MCM, Marella-Panth K, Haeck J, Kirkland TA, Hall MP, Encell LP, Dalm S, Lowik C, et al. 2024. A novel luciferase-based reporter gene technology for simultaneous optical and radionuclide imaging of cells. Int J Mol Sci 25: 8206. doi:10.3390/ijms25158206

CrossRef Medline Google Scholar
↵

Gaudreau-Lapierre A, Mulatz K, Béïque JC, Trinkle-Mulcahy L. 2021. Expansion microscopy-based imaging of nuclear structures in cultured cells. STAR Protoc 2: 100630. doi:10.1016/j.xpro.2021.100630

CrossRef Medline Google Scholar
↵

Gilbonio HE, Puckett GL, Nguyen E, Rieder LE. 2023. A hybrid RNA FISH immunofluorescence protocol on Drosophila polytene chromosomes. BMC Res Notes 16: 197. doi:10.1186/s13104-023-06482-0

CrossRef Medline Google Scholar
↵

Goldman M, Craft B, Swatloski T, Cline M, Morozova O, Diekhans M, Haussler D, Zhu J. 2015. The UCSC Cancer Genomics Browser: update 2015. Nucleic Acids Res 43: D812–D817. doi:10.1093/nar/gku1073

CrossRef Medline Google Scholar
↵

Gottschling DE. 1992. Telomere-proximal DNA in Saccharomyces cerevisiae is refractory to methyltransferase activity in vivo. Proc Natl Acad Sci 89: 4062–4065. doi:10.1073/pnas.89.9.4062

Abstract/FREE Full Text
↵

Green MR, Sambrook J. 2018. Constructing a standard curve for real-time polymerase chain reaction (PCR) experiments. Cold Spring Harb Protoc 2018. doi:10.1101/pdb.prot095026

Abstract/FREE Full Text
↵

Greil F, van der Kraan I, Delrow J, Smothers JF, de Wit E, Bussemaker HJ, van Driel R, Henikoff S, van Steensel B. 2003. Distinct HP1 and Su(var)3-9 complexes bind to sets of developmentally coexpressed genes depending on chromosomal location. Genes Dev 17: 2825–2838. doi:10.1101/gad.281503

Abstract/FREE Full Text
↵

Greil F, Moorman C, van Steensel B. 2006. DamID: mapping of in vivo protein-genome interactions using tethered DNA adenine methyltransferase. Methods Enzymol 410: 342–359. doi:10.1016/S0076-6879(06)10016-6

CrossRef Medline Google Scholar
↵

Grolmusz VK, Tóth EA, Baghy K, Likó I, Darvasi O, Kovalszky I, Matkó J, Rácz K, Patócs A. 2016. Fluorescence activated cell sorting followed by small RNA sequencing reveals stable microRNA expression during cell cycle progression. BMC Genomics 17: 412. doi:10.1186/s12864-016-2747-6

CrossRef Medline Google Scholar
↵

GTEx Consortium, Aguet F, Anand S, Ardlie KG, Gabriel S, Getz GA, Graubert A, Hadley K, Handsaker RE, Huang KH, et al. 2020. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369: 1318–1330. doi:10.1126/science.aaz1776

Abstract/FREE Full Text
↵

Hachiya T, Furukawa R, Shiwa Y, Ohmomo H, Ono K, Katsuoka F, Nagasaki M, Yasuda J, Fuse N, Kinoshita K, et al. 2017. Genome-wide identification of inter-individually variable DNA methylation sites improves the efficacy of epigenetic association studies. NPJ Genom Med 2: 11. doi:10.1038/s41525-017-0016-5

CrossRef Medline Google Scholar
↵

Hafner A, Boettiger A. 2023. The spatial organization of transcriptional control. Nat Rev Genet 24: 53–68. doi:10.1038/s41576-022-00526-0

CrossRef Google Scholar
↵

Hammal F, de Langen P, Bergon A, Lopez F, Ballester B. 2022. Remap 2022: a database of human, mouse, Drosophila and Arabidopsis regulatory regions from an integrative analysis of DNA-binding sequencing experiments. Nucleic Acids Res 50: D316–d325. doi:10.1093/nar/gkab996

CrossRef Medline Google Scholar
↵

Han H, Shim H, Shin D, Shim JE, Ko Y, Shin J, Kim H, Cho A, Kim E, Lee T, et al. 2015. TRRUST: a reference database of human transcriptional regulatory interactions. Sci Rep 5: 11432. doi:10.1038/srep11432

CrossRef Medline Google Scholar
↵

He SL, Green R. 2013. Northern blotting. In Methods in enzymology (ed. Lorsch J), Vol. 530, pp. 75–87. Academic Press, Cambridge, MA.

CrossRef Medline Google Scholar
↵

He L, Binari R, Huang J, Falo-Sanjuan J, Perrimon N. 2019. In vivo study of gene expression with an enhanced dual-color fluorescent transcriptional timer. eLife 8: e46181. doi:10.7554/eLife.46181

CrossRef Google Scholar
↵

Hellman LM, Fried MG. 2007. Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions. Nat Protoc 2: 1849–1861. doi:10.1038/nprot.2007.249

CrossRef Medline Google Scholar
↵

Henikoff S, Henikoff JG, Kaya-Okur HS, Ahmad K. 2020. Efficient chromatin accessibility mapping in situ by nucleosome-tethered tagmentation. eLife 9: e63274. doi:10.7554/eLife.63274

CrossRef Medline Google Scholar
↵

Hickey SM, Ung B, Bader C, Brooks R, Lazniewska J, Johnson IRD, Sorvina A, Logan J, Martini C, Moore CR, et al. 2021. Fluorescence microscopy: an outline of hardware, biological handling, and fluorophore considerations. Cells 11: 35. doi:10.3390/cells11010035

CrossRef Google Scholar
↵

Hocine S, Raymond P, Zenklusen D, Chao JA, Singer RH. 2013. Single-molecule analysis of gene expression using two-color RNA labeling in live yeast. Nat Methods 10: 119–121. doi:10.1038/nmeth.2305

CrossRef Medline Google Scholar
↵

Holsapple JS, Schnitzler L, Rusch L, Baldeweg TH, Neubert E, Kruss S, Erpenbeck L. 2023. Expansion microscopy of neutrophil nuclear structure and extracellular traps. Biophys Rep (N Y) 3: 100091. doi:10.1016/j.bpr.2022.10009

CrossRef Medline Google Scholar
↵

Hou R, Hon C-C, Huang Y. 2023. CamoTSS: analysis of alternative transcription start sites for cellular phenotypes and regulatory patterns from 5′ scRNA-seq data. Nat Commun 14: 7240. doi:10.1038/s41467-023-42636-1

CrossRef Medline Google Scholar
↵

Huang J, Zheng W, Zhang P, Lin Q, Chen Z, Xuan J, Liu C, Wu D, Huang Q, Zheng L, et al. 2023. ChIPBase v3.0: the encyclopedia of transcriptional regulations of non-coding RNAs and protein-coding genes. Nucleic Acids Res 51: D46–d56. doi:10.1093/nar/gkac1067

CrossRef Medline Google Scholar
↵

Hunt JMT, Samson CA, Rand AD, Sheppard HM. 2023. Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells. Hum Genet 142: 705–720. doi:10.1007/s00439-023-02561-1

CrossRef Medline Google Scholar
↵

Huxley C, Gnirke A. 1991. Transfer of yeast artificial chromosomes from yeast to mammalian cells. Bioessays 13: 545–550. doi:10.1002/bies.950131009

CrossRef Medline Google Scholar
↵

Hwang J, Espenshade PJ. 2016. Proximity-dependent biotin labelling in yeast using the engineered ascorbate peroxidase APEX2. Biochem J 473: 2463–2469. doi:10.1042/BCJ20160106

Abstract/FREE Full Text
↵

Ide S, Dejardin J. 2015. End-targeting proteomics of isolated chromatin segments of a mammalian ribosomal RNA gene promoter. Nat Commun 6: 6674. doi:10.1038/ncomms7674

CrossRef Medline Google Scholar
↵

Ijaz F, Ikegami K. 2021. Knock-in of labeled proteins into 5′UTR enables highly efficient generation of stable cell lines. Cell Struct Funct 46: 21–35. doi:10.1247/csf.21002

CrossRef Medline Google Scholar
↵

Ikeno M, Hasegawa Y. 2020. Applications of bottom-up human artificial chromosomes in cell research and cell engineering. Exp Cell Res 390: 111793. doi:10.1016/j.yexcr.2019.111793

CrossRef Medline Google Scholar
↵

Illingworth RS, Bird AP. 2009. Cpg islands – ‘a rough guide'. FEBS Lett 583: 1713–1720. doi:10.1016/j.febslet.2009.04.012

CrossRef Medline Google Scholar
↵

Jaenisch R, Bird A. 2003. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 33: 245–254. doi:10.1038/ng1089

CrossRef Medline Google Scholar
↵

Jenkins VK, Larkin A, Thurmond J. 2022. Using FlyBase: a database of Drosophila genes and genetics. Methods Mol Biol 2540: 1–34. doi:10.1007/978-1-0716-2541-5_1

CrossRef Medline Google Scholar
↵

Jiang S, Dai T, Li Q, Xu T, Zhang W, Sun J, Liu H. 2024. Generation of ASCL1-mCherry knock-in reporter in human embryonic stem cell line, WAe001-A-2E, using CRISPR/Cas9-based gene targeting. Stem Cell Res 80: 103500. doi:10.1016/j.scr.2024.103500

CrossRef Medline Google Scholar
↵

Jolma A, Yan J, Whitington T, Toivonen J, Nitta KR, Rastas P, Morgunova E, Enge M, Taipale M, Wei G, et al. 2013. DNA-binding specificities of human transcription factors. Cell 152: 327–339. doi:10.1016/j.cell.2012.12.009

CrossRef Medline Google Scholar
↵

Kaczynski J, Cook T, Urrutia R. 2003. Sp1- and Krüppel-like transcription factors. Genome Biol 4: 206. doi:10.1186/gb-2003-4-2-206

CrossRef Medline Google Scholar
↵

Käll L, Krogh A, Sonnhammer EL. 2004. A combined transmembrane topology and signal peptide prediction method. J Mol Biol 338: 1027–1036. doi:10.1016/j.jmb.2004.03.016

CrossRef Medline Google Scholar
↵

Kan SL, Saksouk N, Déjardin J. 2017. Proteome characterization of a chromatin locus using the proteomics of isolated chromatin segments approach. Methods Mol Biol 1550: 19–33. doi:10.1007/978-1-4939-6747-6_3

CrossRef Medline Google Scholar
↵

Kannan S, Sams T, Maury J, Workman CT. 2018. Reconstructing dynamic promoter activity profiles from reporter gene data. ACS Synth Biol 7: 832–841. doi:10.1021/acssynbio.7b00223

CrossRef Google Scholar
↵

Karr JP, Ferrie JJ, Tjian R, Darzacq X. 2022. The transcription factor activity gradient (TAG) model: contemplating a contact-independent mechanism for enhancer-promoter communication. Genes Dev 36: 7–16. doi:10.1101/gad.349160.121

Abstract/FREE Full Text
↵

Kawaji H, Lizio M, Itoh M, Kanamori-Katayama M, Kaiho A, Nishiyori-Sueki H, Shin JW, Kojima-Ishiyama M, Kawano M, Murata M, et al. 2014. Comparison of CAGE and RNA-seq transcriptome profiling using clonally amplified and single-molecule next-generation sequencing. Genome Res 24: 708–717. doi:10.1101/gr.156232.113

Abstract/FREE Full Text
↵

Kaya-Okur HS, Wu SJ, Codomo CA, Pledger ES, Bryson TD, Henikoff JG, Ahmad K, Henikoff S. 2019. CUT&tag for efficient epigenomic profiling of small samples and single cells. Nat Commun 10: 1930. doi:10.1038/s41467-019-09982-5

CrossRef Medline Google Scholar
↵

Kazuki Y, Kobayashi K, Aueviriyavit S, Oshima T, Kuroiwa Y, Tsukazaki Y, Senda N, Kawakami H, Ohtsuki S, Abe S, et al. 2013. Trans-chromosomic mice containing a human CYP3A cluster for prediction of xenobiotic metabolism in humans. Hum Mol Genet 22: 578–592. doi:10.1093/hmg/dds468

CrossRef Medline Google Scholar
↵

Keyaerts M, Caveliers V, Lahoutte T. 2012. Bioluminescence imaging: looking beyond the light. Trends Mol Med 18: 164–172. doi:10.1016/j.molmed.2012.01.005

CrossRef Medline Google Scholar
↵

Khan A, Fornes O, Stigliani A, Gheorghe M, Castro-Mondragon JA, van der Lee R, Bessy A, Chèneby J, Kulkarni SR, Tan G, et al. 2018. JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic Acids Res 46: D260–d266. doi:10.1093/nar/gkx1126

CrossRef Medline Google Scholar
↵

Khan N, Shahid S, Asif AR. 2021. Current analytical strategies in studying chromatin-associated-proteome (chromatome). Molecules 26: 6694. doi:10.3390/molecules26216694

CrossRef Medline Google Scholar
↵

Khmelinskii A, Knop M. 2014. Analysis of protein dynamics with tandem fluorescent protein timers. Methods Mol Biol 1174: 195–210. doi:10.1007/978-1-4939-0944-5_13

CrossRef Medline Google Scholar
↵

Kim TH, Dekker J. 2018. ChIP-quantitative polymerase chain reaction (ChIP-qPCR). Cold Spring Harb Protoc 2018. doi:10.1101/pdb.prot082628

Abstract/FREE Full Text
↵

Kim AH, Lee HM, Kim HS, Jung J, Seol H, Choi E, Lee S, Min Choi Y, Jun JK, Kim HS, et al. 2023. Establishment of TUBB3-mCherry knock-in human pluripotent stem cell line using CRISPR/Cas9 (SNUe003-A-4). Stem Cell Res 69: 103064. doi:10.1016/j.scr.2023.103064

CrossRef Medline Google Scholar
↵

Kim AH, Lee HM, Kim HS, Jeong SW, Jun JK, Jang J. 2024a. CRISPR/Cas9-mediated knock-in of a fluorescent reporter into the target locus of interest in human pluripotent stem cells. MethodsX 13: 102807. doi:10.1016/j.mex.2024.102807

CrossRef Medline Google Scholar
↵

Kim J, Kratz AF, Chen S, Sheng J, Kim HK, Zhang L, Singh BK, Chavez A. 2024b. High-throughput tagging of endogenous loci for rapid characterization of protein function. Sci Adv 10: eadg8771. doi:10.1126/sciadv.adg8771

CrossRef Medline Google Scholar
↵

Kimura Y, Hisano Y, Kawahara A, Higashijima S. 2014. Efficient generation of knock-in transgenic zebrafish carrying reporter/driver genes by CRISPR/Cas9-mediated genome engineering. Sci Rep 4: 6545. doi:10.1038/srep06545

CrossRef Medline Google Scholar
↵

Kleeman B, Olsson A, Newkold T, Kofron M, DeLay M, Hildeman D, Grimes HL. 2018. A guide to choosing fluorescent protein combinations for flow cytometric analysis based on spectral overlap. Cytometry A 93: 556–562. doi:10.1002/cyto.a.23360

CrossRef Medline Google Scholar
↵

Koch B, Nijmeijer B, Kueblbeck M, Cai Y, Walther N, Ellenberg J. 2018. Generation and validation of homozygous fluorescent knock-in cells using CRISPR-Cas9 genome editing. Nat Protoc 13: 1465–1487. doi:10.1038/nprot.2018.042

CrossRef Google Scholar
↵

Kolmykov S, Yevshin I, Kulyashov M, Sharipov R, Kondrakhin Y, Makeev VJ, Kulakovskiy IV, Kel A, Kolpakov F. 2021. GTRD: an integrated view of transcription regulation. Nucleic Acids Res 49: D104–D111. doi:10.1093/nar/gkaa1057

CrossRef Medline Google Scholar
↵

Konermann S, Brigham MD, Trevino AE, Joung J, Abudayyeh OO, Barcena C, Hsu PD, Habib N, Gootenberg JS, Nishimasu H, et al. 2015. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature 517: 583–588. doi:10.1038/nature14136

CrossRef Medline Google Scholar
↵

Kumar S, Ambrosini G, Bucher P. 2017. SNP2TFBS: a database of regulatory SNPs affecting predicted transcription factor binding site affinity. Nucleic Acids Res 45: D139–d144. doi:10.1093/nar/gkw1064

CrossRef Medline Google Scholar
↵

Kvon EZ. 2015. Using transgenic reporter assays to functionally characterize enhancers in animals. Genomics 106: 185–192. doi:10.1016/j.ygeno.2015.06.007

CrossRef Medline Google Scholar
↵

Kwak H, Fuda NJ, Core LJ, Lis JT. 2013. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339: 950–953. doi:10.1126/science.1229386

Abstract/FREE Full Text
↵

Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, Chen X, Taipale J, Hughes TR, Weirauch MT. 2018. The human transcription factors. Cell 172: 650–665. doi:10.1016/j.cell.2018.01.029

CrossRef Medline Google Scholar
↵

Larionov V, Kouprina N, Graves J, Resnick MA. 1996. Highly selective isolation of human DNAs from rodent-human hybrid cells as circular yeast artificial chromosomes by transformation-associated recombination cloning. Proc Natl Acad Sci 93: 13925–13930. doi:10.1073/pnas.93.24.13925

Abstract/FREE Full Text
↵

Ledesma-Dominguez L, Carbajal-Degante E, Moreno-Hagelsieb G, Pérez-Rueda E. 2024. DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes. Sci Rep 14: 9155. doi:10.1038/s41598-024-59487-5

CrossRef Medline Google Scholar
↵

Lee CY, Myong S. 2021. Probing steps in DNA transcription using single-molecule methods. J Biol Chem 297: 101086. doi:10.1016/j.jbc.2021.101086

CrossRef Medline Google Scholar
↵

Lee HM, Kim AH, Hwang S, Jung J, Seol H, Sung JJ, Jeong SM, Choi YM, Jun JK, Kim HS, et al. 2022. Generation of αMHC-EGFP knock-in in human pluripotent stem cell line, SNUe003-A-3 using CRISPR/Cas9-based gene targeting. Stem Cell Res 61: 102779. doi:10.1016/j.scr.2022.102779

CrossRef Medline Google Scholar
↵

Lesurf R, Cotto KC, Wang G, Griffith M, Kasaian K, Jones SJ, Montgomery SB, Griffith OL. 2016. OReganno 3.0: a community-driven resource for curated regulatory annotation. Nucleic Acids Res 44: D126–D132. doi:10.1093/nar/gkv1203

CrossRef Medline Google Scholar
↵

Li M, Wang Y, Liu M, Lan X. 2018a. Multimodality reporter gene imaging: construction strategies and application. Theranostics 8: 2954–2973. doi:10.7150/thno.24108

CrossRef Medline Google Scholar
↵

Li S, Chen LX, Peng XH, Wang C, Qin BY, Tan D, Han CX, Yang H, Ren XN, Liu F, et al. 2018b. Overview of the reporter genes and reporter mouse models. Animal Model Exp Med 1: 29–35. doi:10.1002/ame2.12008

CrossRef Medline Google Scholar
↵

Li M, Hunt J, Bhattacharyya A, Zhao X. 2019a. One-step generation of seamless luciferase gene knockin using CRISPR/Cas9 genome editing in human pluripotent stem cells. Methods Mol Biol 1942: 61–69. doi:10.1007/978-1-4939-9080-1_5

CrossRef Medline Google Scholar
↵

Li M, Zou D, Li Z, Gao R, Sang J, Zhang Y, Li R, Xia L, Zhang T, Niu G, et al. 2019b. EWAS atlas: a curated knowledgebase of epigenome-wide association studies. Nucleic Acids Res 47: D983–d988. doi:10.1093/nar/gky1027

CrossRef Medline Google Scholar
↵

Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P. 2015. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst 1: 417–425. doi:10.1016/j.cels.2015.12.004

CrossRef Medline Google Scholar
↵

Lichten CA, White R, Clark IBN, Swain PS. 2014. Unmixing of fluorescence spectra to resolve quantitative time-series measurements of gene expression in plate readers. BMC Biotechnol 14: 11. doi:10.1186/1472-6750-14-11

CrossRef Medline Google Scholar
↵

Lionnet T, Wu C. 2021. Single-molecule tracking of transcription protein dynamics in living cells: Seeing is believing, but what are we seeing? Curr Opin Genet Dev 67: 94–102. doi:10.1016/j.gde.2020.12.001

CrossRef Medline Google Scholar
↵

Liu Z, Tjian R. 2018. Visualizing transcription factor dynamics in living cells. J Cell Biol 217: 1181–1191. doi:10.1083/jcb.201710038

Abstract/FREE Full Text
↵

Lombardelli L, Logiodice F, Piccinni M-P. 2025. Evaluation of gene expression profiling by QuantiGene^TM 2.0 RNA assay. In Immunosenescence: methods and protocols (ed. Amoriello R, et al.), pp. 89–98. Springer, New York. doi:10.1007/978-1-0716-4128-6_8

CrossRef Google Scholar
↵

Long HK, Prescott SL, Wysocka J. 2016. Ever-changing landscapes: transcriptional enhancers in development and evolution. Cell 167: 1170–1187. doi:10.1016/j.cell.2016.09.018

CrossRef Medline Google Scholar
↵

Loyfer N, Magenheim J, Peretz A, Cann G, Bredno J, Klochendler A, Fox-Fisher I, Shabi-Porat S, Hecht M, Pelet T, et al. 2023. A DNA methylation atlas of normal human cell types. Nature 613: 355–364. doi:10.1038/s41586-022-05580-6

CrossRef Medline Google Scholar
↵

Lukyanov KA. 2022. Fluorescent proteins for a brighter science. Biochem Biophys Res Commun 633: 29–32. doi:10.1016/j.bbrc.2022.08.089

CrossRef Medline Google Scholar
↵

Luo Y, Hitz BC, Gabdank I, Hilton JA, Kagda MS, Lam B, Myers Z, Sud P, Jou J, Lin K, et al. 2020. New developments on the Encyclopedia of DNA Elements (ENCODE) data portal. Nucleic Acids Res 48: D882–d889. doi:10.1093/nar/gkz1062

CrossRef Medline Google Scholar
↵

Maguire CA, Bovenberg MS, Crommentuijn MH, Niers JM, Kerami M, Teng J, Sena-Esteves M, Badr CE, Tannous BA. 2013. Triple bioluminescence imaging for in vivo monitoring of cellular processes. Mol Ther Nucleic Acids 2: e99. doi:10.1038/mtna.2013.25

CrossRef Google Scholar
↵

Mahat DB, Kwak H, Booth GT, Jonkers IH, Danko CG, Patel RK, Waters CT, Munson K, Core LJ, Lis JT. 2016. Base-pair-resolution genome-wide mapping of active RNA polymerases using precision nuclear run-on (PRO-seq). Nat Protoc 11: 1455–1476. doi:10.1038/nprot.2016.086

CrossRef Medline Google Scholar
↵

Mahendrawada L, Warfield L, Donczew R, Hahn S. 2025. Low overlap of transcription factor DNA binding and regulatory targets. Nature 642: 796–804. doi:10.1038/s41586-025-08916-0

CrossRef Medline Google Scholar
↵

Mäntylä E, Montonen T, Azzari L, Mattola S, Hannula M, Vihinen-Ranta M, Hyttinen J, Vippola M, Foi A, Nymark S, et al. 2023. Iterative immunostaining combined with expansion microscopy and image processing reveals nanoscopic network organization of nuclear lamina. Mol Biol Cell 34: br13. doi:10.1091/mbc.E22-09-0448

CrossRef Medline Google Scholar
↵

Mathelier A, Wasserman WW. 2013. The next generation of transcription factor binding site prediction. PLoS Comput Biol 9: e1003214. doi:10.1371/journal.pcbi.1003214

CrossRef Medline Google Scholar
↵

Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, et al. 2006. TRANSFAC(r) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes. Nucleic Acids Res 34: D108–D110. doi:10.1093/nar/gkj143

CrossRef Medline Google Scholar
↵

McGettigan PA. 2013. Transcriptomics in the RNA-seq era. Curr Opin Chem Biol 17: 4–11. doi:10.1016/j.cbpa.2012.12.008

CrossRef Medline Google Scholar
↵

Meijering AEC, Biebricher AS, Sitters G, Brouwer I, Peterman EJG, Wuite GJL, Heller I. 2020. Imaging unlabeled proteins on DNA with super-resolution. Nucleic Acids Res 48: e34. doi:10.1093/nar/gkaa061

CrossRef Medline Google Scholar
↵

Merz K, Hondele M, Goetze H, Gmelch K, Stoeckl U, Griesenbeck J. 2008. Actively transcribed rRNA genes in S. cerevisiae are organized in a specialized chromatin associated with the high-mobility group protein Hmo1 and are largely devoid of histone molecules. Genes Dev 22: 1190–1204. doi:10.1101/gad.466908

Abstract/FREE Full Text
↵

Mezzanotte L, van‘t Root M, Karatas H, Goun EA, Löwik C. 2017. In vivo molecular bioluminescence imaging: new tools and applications. Trends Biotechnol 35: 640–652. doi:10.1016/j.tibtech.2017.03.012

CrossRef Medline Google Scholar
↵

Milosavljevic A. 2010. Putting epigenome comparison into practice. Nat Biotechnol 28: 1053–1056. doi:10.1038/nbt1010-1053

CrossRef Medline Google Scholar
↵

Mogayzel PJ Jr., Ashlock MA. 2000. CFTR intron 1 increases luciferase expression driven by CFTR 5′-flanking DNA in a yeast artificial chromosome. Genomics 64: 211–215. doi:10.1006/geno.2000.6119

CrossRef Medline Google Scholar
↵

Molina O, Vargiu G, Abad MA, Zhiteneva A, Jeyaprakash AA, Masumoto H, Kouprina N, Larionov V, Earnshaw WC. 2016. Epigenetic engineering reveals a balance between histone modifications and transcription in kinetochore maintenance. Nat Commun 7: 13334. doi:10.1038/ncomms13334

CrossRef Medline Google Scholar
↵

Morgan MP, Finnegan E, Das S. 2022. The role of transcription factors in the acquisition of the four latest proposed hallmarks of cancer and corresponding enabling characteristics. Semin Cancer Biol 86: 1203–1215. doi:10.1016/j.semcancer.2022.10.002

CrossRef Medline Google Scholar
↵

Morrow CS, Porter TJ, Moore DL. 2021. Fluorescent tagging of endogenous proteins with CRISPR/Cas9 in primary mouse neural stem cells. STAR Protoc 2: 100744. doi:10.1016/j.xpro.2021.100744

CrossRef Medline Google Scholar
↵

Mota A, Berezicki S, Wernersson E, Harbers L, Li-Wang X, Gradin K, Peuckert C, Crosetto N, Bienko M. 2022. FRET-FISH probes chromatin compaction at individual genomic loci in single cells. Nat Commun 13: 6680. doi:10.1038/s41467-022-34183-y

CrossRef Medline Google Scholar
↵

Moyroud E, Reymond MC, Hamès C, Parcy F, Scutt CP. 2009. The analysis of entire gene promoters by surface plasmon resonance. Plant J 59: 851–858. doi:10.1111/j.1365-313X.2009.03903.x

CrossRef Medline Google Scholar
↵

Myers SA, Wright J, Peckner R, Kalish BT, Zhang F, Carr SA. 2018. Discovery of proteins associated with a predefined genomic locus via dCas9-APEX-mediated proximity labeling. Nat Methods 15: 437–439. doi:10.1038/s41592-018-0007-1

CrossRef Medline Google Scholar
↵

Nabet B, Roberts JM, Buckley DL, Paulk J, Dastjerdi S, Yang A, Leggett AL, Erb MA, Lawlor MA, Souza A, et al. 2018. The dTAG system for immediate and target-specific protein degradation. Nat Chem Biol 14: 431–441. doi:10.1038/s41589-018-0021-8

CrossRef Medline Google Scholar
↵

Nakamura M, Gao Y, Dominguez AA, Qi LS. 2021. CRISPR technologies for precise epigenome editing. Nat Cell Biol 23: 11–22. doi:10.1038/s41556-020-00620-7

CrossRef Medline Google Scholar
↵

Nakato R, Sakata T. 2021. Methods for ChIP-seq analysis: a practical workflow and advanced applications. Methods 187: 44–53. doi:10.1016/j.ymeth.2020.03.005

CrossRef Google Scholar
↵

Nazarova M, Sexton T. 2026. The dance of promoters and enhancers in gene regulation: fast or slow, entwined or distant? J Mol Biol 438: 169223. doi:10.1016/j.jmb.2025.169223

CrossRef Medline Google Scholar
↵

Nepal C, Andersen JB. 2023. Alternative promoters in CpG depleted regions are prevalently associated with epigenetic misregulation of liver cancer transcriptomes. Nat Commun 14: 2712. doi:10.1038/s41467-023-38272-4

CrossRef Medline Google Scholar
↵

Nerella SG, Michaelides M, Minamimoto T, Innis RB, Pike VW, Eldridge MAG. 2023. PET reporter systems for the brain. Trends Neurosci 46: 941–952. doi:10.1016/j.tins.2023.08.007

CrossRef Medline Google Scholar
↵

Nierman WC, Feldblyum TV. 2001. Genomic library. In Encyclopedia of genetics (ed. Brenner S, Miller JH), pp. 865–872. Academic Press, New York.

Google Scholar
↵

Nishimura K, Fukagawa T, Takisawa H, Kakimoto T, Kanemaki M. 2009. An auxin-based degron system for the rapid depletion of proteins in nonplant cells. Nat Methods 6: 917–922. doi:10.1038/nmeth.1401

CrossRef Medline Google Scholar
↵

Nolan T, Hands RE, Bustin SA. 2006. Quantification of mRNA using real-time RT-PCR. Nat Protoc 1: 1559–1582. doi:10.1038/nprot.2006.236

CrossRef Medline Google Scholar
↵

Obayashi T, Kagaya Y, Aoki Y, Tadaka S, Kinoshita K. 2019. COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference. Nucleic Acids Res 47: D55–D62. doi:10.1093/nar/gky1155

CrossRef Medline Google Scholar
↵

Oh-Hashi K, Furuta E, Norisada J, Amaya F, Hirata Y, Kiuchi K. 2016. Application of NanoLuc to monitor the intrinsic promoter activity of GRP78 using the CRISPR/Cas9 system. Genes Cells 21: 1137–1143. doi:10.1111/gtc.12401

CrossRef Medline Google Scholar
↵

Okabe A, Kaneda A. 2021. Transcriptional dysregulation by aberrant enhancer activation and rewiring in cancer. Cancer Sci 112: 2081–2088. doi:10.1111/cas.14884

CrossRef Medline Google Scholar
↵

Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y, Kawaji H, Nakaki R, Sese J, Meno C. 2018. ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep 19: e46255. doi:10.15252/embr.201846255

Abstract/FREE Full Text
↵

Omasits U, Ahrens CH, Müller S, Wollscheid B. 2014. Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics 30: 884–886. doi:10.1093/bioinformatics/btt607

CrossRef Medline Google Scholar
↵

Oubounyt M, Louadi Z, Tayara H, Chong KT. 2019. DeePromoter: robust promoter predictor using deep learning. Front Genet 10: 286. doi:10.3389/fgene.2019.00286

CrossRef Medline Google Scholar
↵

Palozola KC, Donahue G, Liu H, Grant GR, Becker JS, Cote A, Yu H, Raj A, Zaret KS. 2017. Mitotic transcription and waves of gene reactivation during mitotic exit. Science 358: 119–122. doi:10.1126/science.aal4671

Abstract/FREE Full Text
↵

Panigrahi A, O'Malley BW. 2021. Mechanisms of enhancer action: the known and the unknown. Genome Biol 22: 108. doi:10.1186/s13059-021-02322-1

CrossRef Medline Google Scholar
↵

Park PJ. 2009. ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet 10: 669–680. doi:10.1038/nrg2641

CrossRef Medline Google Scholar
↵

Parteka-Tojek Z, Zhu JJ, Lee B, Jodkowska K, Wang P, Aaron J, Chew TL, Banecki K, Plewczynski D, Ruan Y. 2022. Super-resolution visualization of chromatin loop folding in human lymphoblastoid cells using interferometric photoactivated localization microscopy. Sci Rep 12: 8582. doi:10.1038/s41598-022-12568-9

CrossRef Medline Google Scholar
↵

Patrick PS, Hammersley J, Loizou L, Kettunen MI, Rodrigues TB, Hu DE, Tee SS, Hesketh R, Lyons SK, Soloviev D, et al. 2014. Dual-modality gene reporter for in vivo imaging. Proc Natl Acad Sci 111: 415–420. doi:10.1073/pnas.1319000111

Abstract/FREE Full Text
↵

Pavlou A, Cinquemani E, Geiselmann J, de Jong H. 2022. Maturation models of fluorescent proteins are necessary for unbiased estimates of promoter activity. Biophys J 121: 4179–4188. doi:10.1016/j.bpj.2022.09.021

CrossRef Medline Google Scholar
↵

Pei W-D, Zhang Y, Yin T-L, Yu Y. 2020. Epigenome editing by CRISPR/Cas9 in clinical settings: possibilities and challenges. Brief Funct Genomics 19: 215–228. doi:10.1093/bfgp/elz035

CrossRef Medline Google Scholar
↵

Perez G, Barber Galt P, Benet-Pages A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez Jairo N, Hinrichs Angie S, Lee Christopher M, et al. 2025. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res 53: D1243–D1249. doi:10.1093/nar/gkae974

CrossRef Medline Google Scholar
↵

Perez-Leal O, Nixon-Abell J, Barrero CA, Gordon JC, Oesterling J, Rico MC. 2021. Multiplex gene tagging with CRISPR-Cas9 for live-cell microscopy and application to study the role of SARS-CoV-2 proteins in autophagy. CRISPR J 4: 854–871. doi:10.1089/crispr.2021.0041

CrossRef Medline Google Scholar
↵

Ponger L, Mouchiroud D. 2002. CpGProD: identifying CpG islands associated with transcription start sites in large genomic mammalian sequences. Bioinformatics 18: 631–633. doi:10.1093/bioinformatics/18.4.631

CrossRef Medline Google Scholar
↵

Ponomartsev SV, Sinenko SA, Tomilin AN. 2022. Human artificial chromosomes and their transfer to target cells. Acta Naturae 14: 35–45. doi:10.32607/actanaturae.11670

CrossRef Medline Google Scholar
↵

Pownall ME, Miao L, Vejnar CE, M'Saad O, Sherrard A, Frederick MA, Benitez MDJ, Boswell CW, Zaret KS, Bewersdorf J, et al. 2023. Chromatin expansion microscopy reveals nanoscale organization of transcription and chromatin. Science 381: 92–100. doi:10.1126/science.ade5308

CrossRef Google Scholar
↵

Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, Lim WA. 2013. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152: 1173–1183. doi:10.1016/j.cell.2013.02.022

CrossRef Medline Google Scholar
↵

Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, Lim WA. 2021. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 184: 844. doi:10.1016/j.cell.2021.01.019

CrossRef Google Scholar
↵

Qin W, Cho KF, Cavanagh PE, Ting AY. 2021. Deciphering molecular interactions by proximity labeling. Nat Methods 18: 133–143. doi:10.1038/s41592-020-01010-5

CrossRef Medline Google Scholar
↵

Qiu W, Xu Z, Zhang M, Zhang D, Fan H, Li T, Wang Q, Liu P, Zhu Z, Du D, et al. 2019. Determination of local chromatin interactions using a combined CRISPR and peroxidase APEX2 system. Nucleic Acids Res 47: e52. doi:10.1093/nar/gkz134

CrossRef Medline Google Scholar
↵

Rauluseviciute I, Riudavets-Puig R, Blanc-Mathieu R, Castro-Mondragon JA, Ferenc K, Kumar V, Lemma RB, Lucas J, Chèneby J, Baranasic D, et al. 2024. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles. Nucleic Acids Res 52: D174–d182. doi:10.1093/nar/gkad1059

CrossRef Medline Google Scholar
↵

Refaat A, Yap ML, Pietersz G, Walsh APG, Zeller J, Del Rosal B, Wang X, Peter K. 2022. In vivo fluorescence imaging: success in preclinical imaging paves the way for clinical applications. J Nanobiotechnology 20: 450. doi:10.1186/s12951-022-01648-7

CrossRef Medline Google Scholar
↵

Rice P, Longden I, Bleasby A. 2000. EMBOSS: the European molecular biology open software suite. Trends Genet 16: 276–277. doi:10.1016/S0168-9525(00)02024-2

CrossRef Medline Google Scholar
↵

Richie CT, Whitaker LR, Whitaker KW, Necarsulmer J, Baldwin HA, Zhang Y, Fortuno L, Hinkle JJ, Koivula P, Henderson MJ, et al. 2017. Near-infrared fluorescent protein iRFP713 as a reporter protein for optogenetic vectors, a transgenic Cre-reporter rat, and other neuronal studies. J Neurosci Methods 284: 1–14. doi:10.1016/j.jneumeth.2017.03.020

CrossRef Medline Google Scholar
↵

Romanov SE, Kalashnikova DA, Laktionov PP. 2021. Methods of massive parallel reporter assays for investigation of enhancers. Vavilovskii Zhurnal Genet Selektsii 25: 344–355. doi:10.18699/VJ21.038

CrossRef Medline Google Scholar
↵

Rossi M, Massai L, Diamanti D, Fiengo P, De Rosa A, Magrini R, Magnoni L, Chellini S, Coniglio S, Diodato E, et al. 2016. Multimodal molecular imaging system for pathway-specific reporter gene expression. Eur J Pharm Sci 86: 136–142. doi:10.1016/j.ejps.2016.03.006

CrossRef Medline Google Scholar
↵

Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, Ma'ayan A. 2016. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford) 2016: baw100. doi:10.1093/database/baw100

CrossRef Medline Google Scholar
↵

Roy A, Kucukural A, Zhang Y. 2010. I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738. doi:10.1038/nprot.2010.5

CrossRef Medline Google Scholar
↵

Sadikot RT, Blackwell TS. 2005. Bioluminescence imaging. Proc Am Thorac Soc 2: 537–540. doi:10.1513/pats.200507-067DS

CrossRef Medline Google Scholar
↵

Saito-Moriya R, Nakayama J, Kamiya G, Kitada N, Obata R, Maki SA, Aoyama H. 2021. How to select firefly luciferin analogues for in vivo imaging. Int J Mol Sci 22: 1848. doi:10.3390/ijms22041848

CrossRef Medline Google Scholar
↵

Salataj E, Spilianakis CG, Chaumeil J. 2023. Single-cell detection of primary transcripts, their genomic loci and nuclear factors by 3D immuno-RNA/DNA FISH in T cells. Front Immunol 14: 1156077. doi:10.3389/fimmu.2023.1156077

CrossRef Medline Google Scholar
↵

Sanson KR, Hanna RE, Hegde M, Donovan KF, Strand C, Sullender ME, Vaimberg EW, Goodale A, Root DE, Piccioni F, et al. 2018. Optimized libraries for CRISPR-Cas9 genetic screens with multiple modalities. Nat Commun 9: 5416. doi:10.1038/s41467-018-07901-8

CrossRef Medline Google Scholar
↵

Schep R, Trauernicht M, Vergara X, Friskes A, Morris B, Gregoricchio S, Manzo SG, Zwart W, Beijersbergen Roderick L, Medema RH, et al. 2024. Chromatin context-dependent effects of epigenetic drugs on CRISPR-Cas9 editing. Nucleic Acids Res 52: 8815–8832. doi:10.1093/nar/gkae570

CrossRef Medline Google Scholar
↵

Schmid M, Durussel T, Laemmli UK. 2004. ChIC and ChEC; genomic mapping of chromatin proteins. Mol Cell 16: 147–157. doi:10.1016/j.molcel.2004.09.007

CrossRef Medline Google Scholar
↵

Schwalb B, Michel M, Zacher B, Frühauf K, Demel C, Tresch A, Gagneur J, Cramer P. 2016. TT-seq maps the human transient transcriptome. Science 352: 1225–1228. doi:10.1126/science.aad9841

Abstract/FREE Full Text
↵

Serganova I, Blasberg RG. 2019. Molecular imaging with reporter genes: has its promise been delivered? J Nucl Med 60: 1665–1681. doi:10.2967/jnumed.118.220004

Abstract/FREE Full Text
↵

Shahmuradov IA, Umarov RK, Solovyev VV. 2017. TSSPlant: a new tool for prediction of plant Pol II promoters. Nucleic Acids Res 45: e65. doi:10.1093/nar/gkw135

CrossRef Medline Google Scholar
↵

Shaner NC, Lambert GG, Chammas A, Ni Y, Cranfill PJ, Baird MA, Sell BR, Allen JR, Day RN, Israelsson M, et al. 2013. A bright monomeric green fluorescent protein derived from Branchiostoma lanceolatum. Nat Methods 10: 407–409. doi:10.1038/nmeth.2413

CrossRef Medline Google Scholar
↵

Sharma A, Toepfer CN, Ward T, Wasson L, Agarwal R, Conner DA, Hu JH, Seidman CE. 2018. CRISPR/Cas9-mediated fluorescent tagging of endogenous proteins in human pluripotent stem cells. Curr Protoc Hum Genet 96: 21.11.21–21.11.20. doi:10.1002/cphg.52

CrossRef Google Scholar
↵

Shi L, Li S, Zhu R, Lu C, Xu X, Li C, Huang X, Zhao X, Mao F, Li K. 2025. CRISPRepi: a multi-omic atlas for CRISPR-based epigenome editing. Nucleic Acids Res 53: D901–d913. doi:10.1093/nar/gkae1039

CrossRef Medline Google Scholar
↵

Shifera AS, Hardin JA. 2010. Factors modulating expression of Renilla luciferase from control plasmids used in luciferase reporter gene assays. Anal Biochem 396: 167–172. doi:10.1016/j.ab.2009.09.043

CrossRef Medline Google Scholar
↵

Siddharthan R. 2010. Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix. PLoS One 5: e9722. doi:10.1371/journal.pone.0009722

CrossRef Medline Google Scholar
↵

Smale ST. 2009. Nuclear run-on assay. Cold Spring Harb Protoc 2009: pdb.prot5329. doi:10.1101/pdb.prot5329

Abstract/FREE Full Text
↵

Smale ST. 2010. Luciferase assay. Cold Spring Harb Protoc 2010: pdb.prot5421. doi:10.1101/pdb.prot5421

Abstract/FREE Full Text
↵

Song Q, Decato B, Hong EE, Zhou M, Fang F, Qu J, Garvin T, Kessler M, Zhou J, Smith AD. 2013. A reference methylome database and analysis pipeline to facilitate integrative and comparative epigenomics. PLoS One 8: e81148. doi:10.1371/journal.pone.0081148

CrossRef Medline Google Scholar
↵

Specht EA, Braselmann E, Palmer AE. 2017. A critical and comparative review of fluorescent tools for live-cell imaging. Annu Rev Physiol 79: 93–117. doi:10.1146/annurev-physiol-022516-034055

CrossRef Medline Google Scholar
↵

Srivastava D, Mahony S. 2020. Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns. Biochim Biophys Acta Gene Regul Mech 1863: 194443. doi:10.1016/j.bbagrm.2019.194443

CrossRef Medline Google Scholar
↵

Stark A, Lin MF, Kheradpour P, Pedersen JS, Parts L, Carlson JW, Crosby MA, Rasmussen MD, Roy S, Deoras AN, et al. 2007. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures. Nature 450: 219–232. doi:10.1038/nature06340

CrossRef Medline Google Scholar
↵

Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, Stein TI, Nudel R, Lieder I, Mazor Y, et al. 2016. The geneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics 54: 1.30.31–1.30.33. doi:10.1002/cpbi.5

CrossRef Google Scholar
↵

Sternberg PW, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M, Arnaboldi V, Becerra A, Brown S, Cain S, et al. 2024. WormBase 2024: status and transitioning to Alliance infrastructure. Genetics 227: iyae050. doi:10.1093/genetics/iyae050

CrossRef Medline Google Scholar
↵

Stormo GD, Zhao Y. 2010. Determining the specificity of protein–DNA interactions. Nat Rev Genet 11: 751–760. doi:10.1038/nrg2845

CrossRef Medline Google Scholar
↵

Stormo GD, Schneider TD, Gold L, Ehrenfeucht A. 1982. Use of the ‘Perceptron’ algorithm to distinguish translational initiation sites in E. coli. Nucleic Acids Res 10: 2997–3011. doi:10.1093/nar/10.9.2997

CrossRef Medline Google Scholar
↵

Subach OM, Cranfill PJ, Davidson MW, Verkhusha VV. 2011. An enhanced monomeric blue fluorescent protein with the high chemical stability of the chromophore. PLoS One 6: e28674. doi:10.1371/journal.pone.0028674

CrossRef Medline Google Scholar
↵

Suske G, Bruford E, Philipsen S. 2005. Mammalian SP/KLF transcription factors: bring in the family. Genomics 85: 551–556. doi:10.1016/j.ygeno.2005.01.005

CrossRef Medline Google Scholar
↵

Suzuki A, Wakaguri H, Yamashita R, Kawano S, Tsuchihara K, Sugano S, Suzuki Y, Nakai K. 2015. DBTSS as an integrative platform for transcriptome, epigenome and genome sequence variation data. Nucleic Acids Res 43: D87–D91. doi:10.1093/nar/gku1080

CrossRef Medline Google Scholar
↵

Takai D, Jones PA. 2003. The CpG island searcher: a new WWW resource. In Silico Biol 3: 235–240.

Medline Google Scholar
↵

Tamura R, Kamiyama D. 2023. CRISPR-Cas9-mediated knock-in approach to insert the GFP(11) tag into the genome of a human cell line. Methods Mol Biol 2564: 185–201. doi:10.1007/978-1-0716-2667-2_8

CrossRef Medline Google Scholar
↵

Teufel F, Almagro Armenteros JJ, Johansen AR, Gíslason MH, Pihl SI, Tsirigos KD, Winther O, Brunak S, von Heijne G, Nielsen H. 2022. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat Biotechnol 40: 1023–1025. doi:10.1038/s41587-021-01156-3

CrossRef Medline Google Scholar
↵

Thomas CE, Ehrhardt A, Kay MA. 2003. Progress and problems with the use of viral vectors for gene therapy. Nat Rev Genet 4: 346–358. doi:10.1038/nrg1066

CrossRef Medline Google Scholar
↵

Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, et al. 2012. The accessible chromatin landscape of the human genome. Nature 489: 75–82. doi:10.1038/nature11232

CrossRef Medline Google Scholar
↵

Tovell H, Testa A, Maniaci C, Zhou H, Prescott AR, Macartney T, Ciulli A, Alessi DR. 2019. Rapid and reversible knockdown of endogenously tagged endosomal proteins via an optimized HaloPROTAC degrader. ACS Chem Biol 14: 882–892. doi:10.1021/acschembio.8b01016

CrossRef Medline Google Scholar
↵

Tsherniak A, Vazquez F, Montgomery PG, Weir BA, Kryukov G, Cowley GS, Gill S, Harrington WF, Pantel S, Krill-Burger JM, et al. 2017. Defining a cancer dependency map. Cell 170: 564–576.e16. doi:10.1016/j.cell.2017.06.010

CrossRef Medline Google Scholar
↵

Uemura T, Mori T, Kurihara T, Kawase S, Koike R, Satoga M, Cao X, Li X, Yanagawa T, Sakurai T, et al. 2016. Fluorescent protein tagging of endogenous protein in brain neurons using CRISPR/Cas9-mediated knock-in and in utero electroporation techniques. Sci Rep 6: 35861. doi:10.1038/srep35861

CrossRef Google Scholar
↵

Ummethum H, Hamperl S. 2020. Proximity labeling techniques to study chromatin. Front Genet 11: 450. doi:10.3389/fgene.2020.00450

CrossRef Google Scholar
↵

Uphoff S. 2016. Super-resolution microscopy and tracking of DNA-binding proteins in bacterial cells. Methods Mol Biol 1431: 221–234. doi:10.1007/978-1-4939-3631-1_16

CrossRef Medline Google Scholar
↵

VanBelzen J, Sakelaris B, Brickner DG, Marcou N, Riecke H, Mangan NM, Brickner JH. 2024. Chromatin endogenous cleavage provides a global view of yeast RNA polymerase II transcription kinetics. eLife 13: RP100764. doi:10.7554/eLife.100764

CrossRef Medline Google Scholar
↵

van Staalduinen J, van Staveren T, Grosveld F, Wendt KS. 2023. Live-cell imaging of chromatin contacts opens a new window into chromatin dynamics. Epigenetics Chromatin 16: 27. doi:10.1186/s13072-023-00503-9

CrossRef Medline Google Scholar
↵

van Steensel B, Henikoff S. 2000. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nat Biotechnol 18: 424–428. doi:10.1038/74487

CrossRef Medline Google Scholar
↵

Verfaillie A, Imrichova H, Janky R, Aerts S. 2015. Iregulon and i-cisTarget: reconstructing regulatory networks using motif and track enrichment. Curr Protoc Bioinformatics 52: 2.16.11–2.16.39. doi:10.1002/0471250953.bi0216s52

CrossRef Google Scholar
↵

Vickers TA, Crooke ST. 2016. Development of a quantitative BRET affinity assay for nucleic acid-protein interactions. PLoS One 11: e0161930. doi:10.1371/journal.pone.0161930

CrossRef Medline Google Scholar
↵

Vimalraj S. 2020. Alkaline phosphatase: structure, expression and its function in bone mineralization. Gene 754: 144855. doi:10.1016/j.gene.2020.144855

CrossRef Google Scholar
↵

Viushkov VS, Lomov NA, Rubtsov MA, Vassetzky YS. 2022. Visualizing the genome: experimental approaches for live-cell chromatin imaging. Cells 11: 4086. doi:10.3390/cells11244086

CrossRef Google Scholar
↵

Wang Z, Gerstein M, Snyder M. 2009. RNA-seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10: 57–63. doi:10.1038/nrg2484

CrossRef Medline Google Scholar
↵

Wang Y, Song F, Zhang B, Zhang L, Xu J, Kuang D, Li D, Choudhary MNK, Li Y, Hu M, et al. 2018. The 3D Genome Browser: a web-based browser for visualizing 3D genome organization and long-range chromatin interactions. Genome Biol 19: 151. doi:10.1186/s13059-018-1519-9

CrossRef Medline Google Scholar
↵

Wang F, Bai X, Wang Y, Jiang Y, Ai B, Zhang Y, Liu Y, Xu M, Wang Q, Han X, et al. 2021. ATACdb: a comprehensive human chromatin accessibility database. Nucleic Acids Res 49: D55–d64. doi:10.1093/nar/gkaa943

CrossRef Medline Google Scholar
↵

Wang P, Zhang G, Xu Z, Chen Z, Liu X, Wang C, Zheng C, Wang J, Zhang H, Yan A. 2022. Whole-cell FRET monitoring of transcription factor activities enables functional annotation of signal transduction systems in living bacteria. J Biol Chem 298: 102258. doi:10.1016/j.jbc.2022.102258

CrossRef Medline Google Scholar
↵

Wang Z, He Z, Liu Z, Qu M, Gao C, Wang C, Wang Y. 2023. A reverse chromatin immunoprecipitation technique based on the CRISPR–dCas9 system. Plant Physiol 191: 1505–1519. doi:10.1093/plphys/kiac506

CrossRef Medline Google Scholar
↵

Wasserman WW, Sandelin A. 2004. Applied bioinformatics for the identification of regulatory elements. Nature Reviews Genetics 5: 276–287. doi:10.1038/nrg1315

CrossRef Medline Google Scholar
↵

Wen X, Wang Y. 2024. Reverse chromatin immunoprecipitation (R-ChIP). Methods Mol Biol 2846: 123–132. doi:10.1007/978-1-0716-4071-5_8

CrossRef Medline Google Scholar
↵

Wen X, Wang J, Zhang D, Ding Y, Ji X, Tan Z, Wang Y. 2020. Reverse chromatin immunoprecipitation (R-ChIP) enables investigation of the upstream regulators of plant genes. Commun Biol 3: 770. doi:10.1038/s42003-020-01500-4

CrossRef Medline Google Scholar
↵

Wen G, Leen V, Rohand T, Sauer M, Hofkens J. 2023. Current progress in expansion microscopy: chemical strategies and applications. Chem Rev 123: 3299–3323. doi:10.1021/acs.chemrev.2c00711

CrossRef Medline Google Scholar
↵

Wernersson E, Gelali E, Girelli G, Wang S, Castillo D, Mattsson Langseth C, Verron Q, Nguyen HQ, Chattoraj S, Martinez Casals A, et al. 2024. Deconwolf enables high-performance deconvolution of widefield fluorescence microscopy images. Nat Methods 21: 1245–1256. doi:10.1038/s41592-024-02294-7

CrossRef Medline Google Scholar
↵

Wu C, Chie S-O, Ohmiya Y. 2007. Dual-reporter assay using two secreted luciferase genes. BioTechniques 42: 290–292. doi:10.2144/000112428

CrossRef Medline Google Scholar
↵

Wu B, Piatkevich KD, Lionnet T, Singer RH, Verkhusha VV. 2011. Modern fluorescent proteins and imaging technologies to study gene expression, nuclear localization, and dynamics. Curr Opin Cell Biol 23: 310–317. doi:10.1016/j.ceb.2010.12.004

CrossRef Medline Google Scholar
↵

Wu T, Lang J, Sun X, Zhang B, Liu Y, An R. 2013. Monitoring bone marrow stem cells with a reporter gene system in experimental middle cerebral artery occlusion rat models. J Nucl Med 54: 984–989. doi:10.2967/jnumed.112.109280

Abstract/FREE Full Text
↵

Wu F, Olson BG, Yao J. 2016. DamID-seq: genome-wide mapping of protein-DNA interactions by high throughput sequencing of adenine-methylated DNA fragments. J Vis Exp 107: e53620. doi:10.3791/53620

CrossRef Google Scholar
↵

Xu T, Close D, Handagama W, Marr E, Sayler G, Ripp S. 2016. The expanding toolbox of in vivo bioluminescent imaging. Front Oncol 6: 150. doi:10.3389/fonc.2016.00150

CrossRef Medline Google Scholar
↵

Xu J, Ma H, Ma H, Jiang W, Mela CA, Duan M, Zhao S, Gao C, Hahm ER, Lardo SM, et al. 2020. Super-resolution imaging reveals the evolution of higher-order chromatin folding in early carcinogenesis. Nat Commun 11: 1899. doi:10.1038/s41467-020-15718-7

CrossRef Medline Google Scholar
↵

Yamada N, Lai WKM, Farrell N, Pugh BF, Mahony S. 2019. Characterizing protein–DNA binding event subtypes in ChIP-exo data. Bioinformatics 35: 903–913. doi:10.1093/bioinformatics/bty703

CrossRef Medline Google Scholar
↵

Yan X, Ray P, Paulmurugan R, Tong R, Gong Y, Sathirachinda A, Wu JC, Gambhir SS. 2013. A transgenic tri-modality reporter mouse. PLoS One 8: e73580. doi:10.1371/journal.pone.0073580

CrossRef Medline Google Scholar
↵

Yang Y, Pe'er D. 2024. REUNION: transcription factor binding prediction and regulatory association inference from single-cell multi-omics data. Bioinformatics 40: i567–i575. doi:10.1093/bioinformatics/btae234

CrossRef Medline Google Scholar
↵

Yang L, Zhou T, Dror I, Mathelier A, Wasserman WW, Gordân R, Rohs R. 2014. TFBSshape: a motif database for DNA shape features of transcription factor binding sites. Nucleic Acids Res 42: D148–D155. doi:10.1093/nar/gkt1087

CrossRef Medline Google Scholar
↵

Yilmaz A, Mejia-Guerra MK, Kurz K, Liang X, Welch L, Grotewold E. 2011. AGRIS: the Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res 39: D1118–D1122. doi:10.1093/nar/gkq1120

CrossRef Medline Google Scholar
↵

Yin Y, Morgunova E, Jolma A, Kaasinen E, Sahu B, Khund-Sayeed S, Das PK, Kivioja T, Dave K, Zhong F, et al. 2017. Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science 356: eaaj2239. doi:10.1126/science.aaj2239

Abstract/FREE Full Text
↵

Young AP, Jackson DJ, Wyeth RC. 2020. A technical review and guide to RNA fluorescence in situ hybridization. PeerJ 8: e8806. doi:10.7717/peerj.8806

CrossRef Medline Google Scholar
↵

Zalatan JG, Lee ME, Almeida R, Gilbert LA, Whitehead EH, La Russa M, Tsai JC, Weissman JS, Dueber JE, Qi LS, et al. 2015. Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds. Cell 160: 339–350. doi:10.1016/j.cell.2014.11.052

CrossRef Medline Google Scholar
↵

Zambelli F, Pesole G, Pavesi G. 2009. Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes. Nucleic Acids Res 37: W247–W252. doi:10.1093/nar/gkp464

CrossRef Medline Google Scholar
↵

Zambelli F, Prazzoli GM, Pesole G, Pavesi G. 2012. Cscan: finding common regulators of a set of genes by using a collection of genome-wide ChIP-seq datasets. Nucleic Acids Res 40: W510–W515. doi:10.1093/nar/gks483

CrossRef Medline Google Scholar
↵

Zeng F, Beck V, Schuierer S, Garnier I, Manneville C, Agarinis C, Morelli L, Quinn L, Knehr J, Roma G, et al. 2020. A simple and efficient CRISPR technique for protein tagging. Cells 9: 2618. doi:10.3390/cells9122618

CrossRef Google Scholar
↵

Zentner GE, Kasinathan S, Xin B, Rohs R, Henikoff S. 2015. ChEC-seq kinetics discriminates transcription factor binding sites by DNA sequence and shape in vivo. Nat Commun 6: 8733. doi:10.1038/ncomms9733

CrossRef Medline Google Scholar
↵

Zhang Y, Bao W, Cao Y, Cong H, Chen B, Chen Y. 2022. A survey on protein–DNA-binding sites in computational biology. Brief Funct Genomics 21: 357–375. doi:10.1093/bfgp/elac009

CrossRef Medline Google Scholar
↵

Zhang M, Zong W, Zou D, Wang G, Zhao W, Yang F, Wu S, Zhang X, Guo X, Ma Y, et al. 2023. MethBank 4.0: an updated database of DNA methylation across a variety of species. Nucleic Acids Res 51: D208–D216. doi:10.1093/nar/gkac969

CrossRef Medline Google Scholar
↵

Zhao J, Tang J, Elfman J, Li H. 2020. RNase protection assay. In Chimeric RNA: methods and protocols (ed. Li H, Elfman J), pp. 109–116. Springer, New York. doi:10.1007/978-1-4939-9904-0_8

CrossRef Google Scholar
↵

Zheng R, Wan C, Mei S, Qin Q, Wu Q, Sun H, Chen C-H, Brown M, Zhang X, Meyer CA, et al. 2019. Cistrome Data Browser: expanded datasets and new tools for gene regulatory analysis. Nucleic Acids Res 47: D729–D735. doi:10.1093/nar/gky1094

CrossRef Medline Google Scholar
↵

Zhu Q, Wong AK, Krishnan A, Aure MR, Tadych A, Zhang R, Corney DC, Greene CS, Bongo LA, Kristensen VN, et al. 2015. Targeted exploration and analysis of large cross-platform human transcriptomic compendia. Nat Methods 12: 211–214. doi:10.1038/nmeth.3249

CrossRef Google Scholar
↵

Zhu F, Farnung L, Kaasinen E, Sahu B, Yin Y, Wei B, Dodonova SO, Nitta KR, Morgunova E, Taipale M, et al. 2018. The interaction landscape between transcription factors and the nucleosome. Nature 562: 76–81. doi:10.1038/s41586-018-0549-5

CrossRef Medline Google Scholar
↵

Zou F, Bai L. 2019. Using time-lapse fluorescence microscopy to study gene regulation. Methods 159–160: 138–145. doi:10.1016/j.ymeth.2018.12.010

CrossRef Google Scholar
↵

Zou M, Zhou H, Gu L, Zhang J, Fang L. 2024. Therapeutic target identification and drug discovery driven by chemical proteomics. Biology (Basel) 13: 555. doi:10.3390/biology13080555

CrossRef Medline Google Scholar
↵

Zulkower V, Page M, Ropers D, Geiselmann J, de Jong H. 2015. Robust reconstruction of gene expression profiles from reporter gene data using linear inversion. Bioinformatics 31: i71–i79. doi:10.1093/bioinformatics/btv246

CrossRef Medline Google Scholar

[1] ↵

Abugessaisa I, Ramilowski JA, Lizio M, Severin J, Hasegawa A, Harshbarger J, Kondo A, Noguchi S, Yip CW, Ooi JLC, et al. 2021. FANTOM enters 20th year: expansion of transcriptomic atlases and functional annotation of non-coding RNAs. Nucleic Acids Res 49: D892–d898. doi:10.1093/nar/gkaa1054

CrossRef Medline Google Scholar

[2] ↵

Acke A, Van Belle S, Louis B, Vitale R, Rocha S, Voet T, Debyser Z, Hofkens J. 2022. Expansion microscopy allows high resolution single cell analysis of epigenetic readers. Nucleic Acids Res 50: e100. doi:10.1093/nar/gkac521

CrossRef Medline Google Scholar

[3] ↵

Adato O, Sloutskin A, Komemi H, Brabb I, Duttke S, Bucher P, Unger R, Juven-Gershon T. 2024. ElemeNT 2023: an enhanced tool for detection and curation of core promoter elements. Bioinformatics 40: btae110. doi:10.1093/bioinformatics/btae110

CrossRef Medline Google Scholar

[4] ↵

Aditham AK, Markin CJ, Mokhtari DA, DelRosso N, Fordyce PM. 2021. High-throughput affinity measurements of transcription factor and DNA mutations reveal affinity and specificity determinants. Cell Syst 12: 112–127.e11. doi:10.1016/j.cels.2020.11.012

CrossRef Medline Google Scholar

[5] ↵

Akgol Oksuz B, Yang L, Abraham S, Venev SV, Krietenstein N, Parsi KM, Ozadam H, Oomen ME, Nand A, Mao H, et al. 2021. Systematic evaluation of chromosome conformation capture assays. Nat Methods 18: 1046–1055. doi:10.1038/s41592-021-01248-7

CrossRef Medline Google Scholar

[6] ↵

Albrecht F, List M, Bock C, Lengauer T. 2016. DeepBlue epigenomic data server: programmatic data retrieval and analysis of epigenome region sets. Nucleic Acids Res 44: W581–W586. doi:10.1093/nar/gkw211

CrossRef Medline Google Scholar

[7] ↵

Alfonso-Gonzalez C, Hilgers V. 2024. (Alternative) transcription start sites as regulators of RNA processing. Trends Cell Biol 34: 1018–1028. doi:10.1016/j.tcb.2024.02.010

CrossRef Medline Google Scholar

[8] ↵

Alipanahi B, Delong A, Weirauch MT, Frey BJ. 2015. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol 33: 831–838. doi:10.1038/nbt.3300

CrossRef Medline Google Scholar

[9] ↵

Aughey GN, Southall TD. 2016. Dam it's good! DamID profiling of protein-DNA interactions. Wiley Interdiscip Rev Dev Biol 5: 25–37. doi:10.1002/wdev.205

CrossRef Medline Google Scholar

[10] ↵

Babl V, Stöckl U, Tschochner H, Milkereit P, Griesenbeck J. 2015. Chromatin endogenous cleavage (ChEC) as a method to quantify protein interaction with genomic DNA in Saccharomyces cerevisiae. Methods Mol Biol 1334: 219–232. doi:10.1007/978-1-4939-2877-4_14

CrossRef Medline Google Scholar

[11] ↵

Badr CE. 2014. Bioluminescence imaging: basics and practical limitations. Methods Mol Biol 1098: 1–18. doi:10.1007/978-1-62703-718-1_1

CrossRef Medline Google Scholar

[12] ↵

Bailey TL, Johnson J, Grant CE, Noble WS. 2015. The MEME suite. Nucleic Acids Res 43: W39–W49. doi:10.1093/nar/gkv416

CrossRef Medline Google Scholar

[13] ↵

Bao X-R, Zhu Y-H, Yu D-J. 2019. DeepTF: accurate prediction of transcription factor binding sites by combining multi-scale convolution and long short-term memory neural network, pp. 126–138. Springer International Publishing, Cham, Switzerland.

Google Scholar

[14] ↵

Barras F, Marinus MG. 1989. The great GATC: DNA methylation in E. coli. Trends Genet 5: 139–143. doi:10.1016/0168-9525(89)90054-1

CrossRef Medline Google Scholar

[15] ↵

Bateman JR, Johnson JE. 2022. Altering enhancer–promoter linear distance impacts promoter competition in cis and in trans. Genetics 222: iyac098. doi:10.1093/genetics/iyac098

CrossRef Medline Google Scholar

[16] ↵

Bell JC, Jukam D, Teran NA, Risca VI, Smith OK, Johnson WL, Skotheim JM, Greenleaf WJ, Straight AF. 2018. Chromatin-associated RNA sequencing (ChAR-seq) maps genome-wide RNA-to-DNA contacts. eLife 7: e27024. doi:10.7554/eLife.27024

CrossRef Medline Google Scholar

[17] ↵

Bereiter R, Micura R. 2025. Synthesis of HBC fluorophores with an electrophilic handle for covalent attachment to Pepper RNA. Beilstein J Org Chem 21: 727–735. doi:10.3762/bjoc.21.56

CrossRef Medline Google Scholar

[18] ↵

Bikard D, Jiang W, Samai P, Hochschild A, Zhang F, Marraffini LA. 2013. Programmable repression and activation of bacterial gene expression using an engineered CRISPR-Cas system. Nucleic Acids Res 41: 7429–7437. doi:10.1093/nar/gkt520

CrossRef Medline Google Scholar

[19] ↵

Bindels DS, Haarbosch L, van Weeren L, Postma M, Wiese KE, Mastop M, Aumonier S, Gotthard G, Royant A, Hink MA, et al. 2017. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nat Methods 14: 53–56. doi:10.1038/nmeth.4074

CrossRef Medline Google Scholar

[20] ↵

Birk UJ. 2019. Super-resolution microscopy of chromatin. Genes (Basel) 10: 493. doi:10.3390/genes10070493

CrossRef Medline Google Scholar

[21] ↵

Brandão HB, Gabriele M, Hansen AS. 2021. Tracking and interpreting long-range chromatin interactions with super-resolution live-cell imaging. Curr Opin Cell Biol 70: 18–26. doi:10.1016/j.ceb.2020.11.002

CrossRef Medline Google Scholar

[22] ↵

Buckley SM, Delhove JM, Perocheau DP, Karda R, Rahim AA, Howe SJ, Ward NJ, Birrell MA, Belvisi MG, Arbuthnot P, et al. 2015. In vivo bioimaging with tissue-specific transcription factor activated luciferase reporters. Sci Rep 5: 11842. doi:10.1038/srep11842

CrossRef Medline Google Scholar

[23] ↵

Burke DT, Carle GF, Olson MV. 1987. Cloning of large segments of exogenous DNA into yeast by means of artificial chromosome vectors. Science 236: 806–812. doi:10.1126/science.3033825

Abstract/FREE Full Text

[24] ↵

Camino EM, Weinstein ML, List MP, Vellky JE, Rebeiz M, Williams TM. 2020. Red light/green light, a dual fluorescent protein reporter system to study enhancer-promoter specificity in Drosophila. Adv Genet 10: 985–997. doi:10.1534/g3.119.401033

Abstract/FREE Full Text

[25] ↵

Cartharius K, Frech K, Grote K, Klocke B, Haltmeier M, Klingenhoff A, Frisch M, Bayerlein M, Werner T. 2005. MatInspector and beyond: promoter analysis based on transcription factor binding sites. Bioinformatics 21: 2933–2942. doi:10.1093/bioinformatics/bti473

CrossRef Medline Google Scholar

[26] ↵

Chaumeil J, Micsinai M, Skok JA. 2013. Combined immunofluorescence and DNA FISH on 3D-preserved interphase nuclei to study changes in 3D nuclear organization. J Vis Exp 72: e50087. doi:10.3791/50087

CrossRef Google Scholar

[27] ↵

Chavez A, Scheiman J, Vora S, Pruitt BW, Tuttle M, P R Iyer E, Lin S, Kiani S, Guzman CD, Wiegand DJ, et al. 2015. Highly efficient Cas9-mediated transcriptional programming. Nat Methods 12: 326–328. doi:10.1038/nmeth.3312

CrossRef Medline Google Scholar

[28] ↵

Chen M, Compton ST, Coviello VF, Green ED, Ashlock MA. 1997. Transient gene expression from yeast artificial chromosome DNA in mammalian cells is enhanced by adenovirus. Nucleic Acids Res 25: 4416–4418. doi:10.1093/nar/25.21.4416

CrossRef Medline Google Scholar

[29] ↵

Chen H, Levo M, Barinov L, Fujioka M, Jaynes JB, Gregor T. 2018. Dynamic interplay between enhancer–promoter topology and gene activity. Nat Genet 50: 1296–1303. doi:10.1038/s41588-018-0175-z

CrossRef Medline Google Scholar

[30] ↵

Chen C, Hou J, Shi X, Yang H, Birchler JA, Cheng J. 2021a. DeepGRN: prediction of transcription factor binding site across cell-types using attention-based deep neural networks. BMC Bioinformatics 22: 38. doi:10.1186/s12859-020-03952-1

CrossRef Medline Google Scholar

[31] ↵

Chen W, Lu W, Wolynes PG, Komives EA. 2021b. Single-molecule conformational dynamics of a transcription factor reveals a continuum of binding modes controlling association and dissociation. Nucleic Acids Res 49: 11211–11223. doi:10.1093/nar/gkab874

CrossRef Medline Google Scholar

[32] ↵

Chen H, Yan G, Wen M-H, Brooks KN, Zhang Y, Huang P-S, Chen T-Y. 2024a. Advancements and practical considerations for biophysical research: navigating the challenges and future of super-resolution microscopy. Chem Biomed Imaging 2: 331–344. doi:10.1021/cbmi.4c00019

CrossRef Medline Google Scholar

[33] ↵

Chen Y, Liang R, Li Y, Jiang L, Ma D, Luo Q, Song G. 2024b. Chromatin accessibility: biological functions, molecular mechanisms and therapeutic application. Signal Transduct Target Ther 9: 340. doi:10.1038/s41392-024-02030-9

CrossRef Medline Google Scholar

[34] ↵

Cheng S, Mi R, Xu Y, Jin G, Zhang J, Zhou Y, Chen Z, Liu F. 2017. Ferritin heavy chain as a molecular imaging reporter gene in glioma xenografts. J Cancer Res Clin Oncol 143: 941–951. doi:10.1007/s00432-017-2356-z

CrossRef Medline Google Scholar

[35] ↵

Cherry JM. 2015. The Saccharomyces genome database: a tool for discovery. Cold Spring Harb Protoc 2015: pdb.top083840. doi:10.1101/pdb.top083840

Abstract/FREE Full Text

[36] ↵

Choi YJ, Oh SG, Singh TD, Ha JH, Kim DW, Lee SW, Jeong SY, Ahn BC, Lee J, Jeon YH. 2016. Visualization of the biological behavior of tumor-associated macrophages in living mice with colon cancer using multimodal optical reporter gene imaging. Neoplasia 18: 133–141. doi:10.1016/j.neo.2016.01.004

CrossRef Medline Google Scholar

[37] ↵

Choi JJ, Svaren J, Wang D. 2025. CoTF-reg reveals cooperative transcription factors in oligodendrocyte gene regulation using single-cell multi-omics. Commun Biol 8: 181. doi:10.1038/s42003-025-07570-6

CrossRef Medline Google Scholar

[38] ↵

Chuang LY, Yang CH, Lin MC, Yang CH. 2012. CpGPAP: CpG island predictor analysis platform. BMC Genet 13: 13. doi:10.1186/1471-2156-13-13

CrossRef Medline Google Scholar

[39] ↵

Chubb JR, Trcek T, Shenoy SM, Singer RH. 2006. Transcriptional pulsing of a developmental gene. Curr Biol 16: 1018–1025. doi:10.1016/j.cub.2006.03.092

CrossRef Medline Google Scholar

[40] ↵

Chung HK, Jacobs CL, Huo Y, Yang J, Krumm SA, Plemper RK, Tsien RY, Lin MZ. 2015. Tunable and reversible drug control of protein production via a self-excising degron. Nat Chem Biol 11: 713–720. doi:10.1038/nchembio.1869

CrossRef Medline Google Scholar

[41] ↵

Concilio SC, Russell SJ, Peng KW. 2021. A brief review of reporter gene imaging in oncolytic virotherapy and gene therapy. Mol Ther Oncolytics 21: 98–109. doi:10.1016/j.omto.2021.03.006

CrossRef Medline Google Scholar

[42] ↵

Core LJ, Lis JT. 2008. Transcription regulation through promoter-proximal pausing of RNA polymerase II. Science 319: 1791–1792. doi:10.1126/science.1150843

Abstract/FREE Full Text

[43] ↵

Crowe AR, Yue W. 2019. Semi-quantitative determination of protein expression using immunohistochemistry staining and analysis: an integrated protocol. Bio Protoc 9: e3465. doi:10.21769/BioProtoc.3465

CrossRef Google Scholar

[44] ↵

Curtis A, Calabro K, Galarneau JR, Bigio IJ, Krucker T. 2011. Temporal variations of skin pigmentation in C57BL/6 mice affect optical bioluminescence quantitation. Mol Imaging Biol 13: 1114–1123. doi:10.1007/s11307-010-0440-8

CrossRef Medline Google Scholar

[45] ↵

Dai Z, Dai X. 2012. Nuclear colocalization of transcription factor target genes strengthens coregulation in yeast. Nucleic Acids Res 40: 27–36. doi:10.1093/nar/gkr689

CrossRef Medline Google Scholar

[46] ↵

Damhofer H, Radzisheuskaya A, Helin K. 2021. Generation of locus-specific degradable tag knock-ins in mouse and human cell lines. STAR Protoc 2: 100575. doi:10.1016/j.xpro.2021.100575

CrossRef Medline Google Scholar

[47] ↵

Das PM, Ramachandran K, vanWert J, Singal R. 2004. Chromatin immunoprecipitation assay. BioTechniques 37: 961–969. doi:10.2144/04376RV01

CrossRef Medline Google Scholar

[48] ↵

Déjardin J, Kingston RE. 2009. Purification of proteins associated with specific genomic loci. Cell 136: 175–186. doi:10.1016/j.cell.2008.11.045

CrossRef Medline Google Scholar

[49] ↵

Deleuze V, Soler E, Andrieu-Soler C. 2024. Protocol for efficient CRISPR-Cas9-mediated fluorescent tag knockin in hard-to-transfect erythroid cell lines. STAR Protoc 5: 103016. doi:10.1016/j.xpro.2024.103016

CrossRef Medline Google Scholar

[50] ↵

Del'Guidice T, Lepetit-Stoffaes JP, Bordeleau LJ, Roberge J, Théberge V, Lauvaux C, Barbeau X, Trottier J, Dave V, Roy DC, et al. 2018. Membrane permeabilizing amphiphilic peptide delivers recombinant transcription factor and CRISPR-Cas9/Cpf1 ribonucleoproteins in hard-to-modify cells. PLoS One 13: e0195558. doi:10.1371/journal.pone.0195558

CrossRef Medline Google Scholar

[51] ↵

de Medeiros Oliveira M, Bonadio I, Lie de Melo A, Mendes Souza G, Durham AM. 2021. TSSFinder-fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Brief Bioinform 22: bbab198. doi:10.1093/bib/bbab198

CrossRef Medline Google Scholar

[52] ↵

Devenish RJ, Newlon CS. 1982. Isolation and characterization of yeast ring chromosome III by a method applicable to other circular DNAs. Gene 18: 277–288. doi:10.1016/0378-1119(82)90166-4

CrossRef Medline Google Scholar

[53] ↵

Diao S, Hong G, Antaris AL, Blackburn JL, Cheng K, Cheng Z, Dai H. 2015. Biological imaging without autofluorescence in the second near-infrared region. Nano Res 8: 3027–3034. doi:10.1007/s12274-015-0808-9

CrossRef Google Scholar

[54] ↵

Ding F, Zhan Y, Lu X, Sun Y. 2018. Recent advances in near-infrared II fluorophores for multifunctional biomedical imaging. Chem Sci 9: 4370–4380. doi:10.1039/C8SC01153B

CrossRef Medline Google Scholar

[55] ↵

Dixit A, Parnas O, Li B, Chen J, Fulco CP, Jerby-Arnon L, Marjanovic ND, Dionne D, Burks T, Raychowdhury R, et al. 2016. Perturb-seq: dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell 167: 1853–1866.e17. doi:10.1016/j.cell.2016.11.038

CrossRef Medline Google Scholar

[56] ↵

Dréos R, Ambrosini G, Groux R, Périer RC, Bucher P. 2018. MGA repository: a curated data resource for ChIP-seq and other genome annotated data. Nucleic Acids Res 46: D175–d180. doi:10.1093/nar/gkx995

CrossRef Medline Google Scholar

[57] ↵

Du Q, Luu PL, Stirzaker C, Clark SJ. 2015. Methyl-CpG-binding domain proteins: readers of the epigenome. Epigenomics 7: 1051–1073. doi:10.2217/epi.15.39

CrossRef Medline Google Scholar

[58] ↵

Duckworth AD, Gherardini PF, Sykorova M, Yasin F, Nolan GP, Slupsky JR, Kalakonda N. 2019. Multiplexed profiling of RNA and protein expression signatures in individual cells using flow or mass cytometry. Nat Protoc 14: 901–920. doi:10.1038/s41596-018-0120-8

CrossRef Medline Google Scholar

[59] ↵

Duckworth AD, Slupsky JR, Kalakonda N. 2024. highly multiplexed and simultaneous characterization of protein and RNA in single cells by flow or mass cytometry platforms using proximity ligation assay for RNA. In Single cell analysis: methods and protocols (ed. Gužvić M), pp. 143–165. Springer, New York. doi:10.1007/978-1-0716-3621-3_10

CrossRef Google Scholar

[60] ↵

Eismann B, Krieger TG, Beneke J, Bulkescher R, Adam L, Erfle H, Herrmann C, Eils R, Conrad C. 2020. Automated 3D light-sheet screening with high spatiotemporal resolution reveals mitotic phenotypes. J Cell Sci 133: jcs245043. doi:10.1242/jcs.245043

Abstract/FREE Full Text

[61] ↵

England CG, Ehlerding EB, Cai W. 2016. NanoLuc: a small luciferase Is brightening up the field of bioluminescence. Bioconjug Chem 27: 1175–1187. doi:10.1021/acs.bioconjchem.6b00112

CrossRef Medline Google Scholar

[62] ↵

Fang H, Bygrave AM, Roth RH, Johnson RC, Huganir RL. 2021. An optimized CRISPR/Cas9 approach for precise genome editing in neurons. eLife 10: e65202. doi:10.7554/eLife.65202

CrossRef Medline Google Scholar

[63] ↵

Faulkner EL, Pike JA, Densham RM, Garlick E, Thomas SG, Neely RK, Morris JR. 2022. Imaging nanoscale nuclear structures with expansion microscopy. J Cell Sci 135: jcs259009. doi:10.1242/jcs.259009

CrossRef Google Scholar

[64] ↵

Fleck JS, Jansen SMJ, Wollny D, Zenk F, Seimiya M, Jain A, Okamoto R, Santel M, He Z, Camp JG, et al. 2023. Inferring and perturbing cell fate regulomes in human brain organoids. Nature 621: 365–372. doi:10.1038/s41586-022-05279-8

CrossRef Medline Google Scholar

[65] ↵

Fung JJ, Blöcher-Juárez K, Khmelinskii A. 2022. High-throughput analysis of protein turnover with tandem fluorescent protein timers. Methods Mol Biol 2378: 85–100. doi:10.1007/978-1-0716-1732-8_6

CrossRef Medline Google Scholar

[66] ↵

Gardiner-Garden M, Frommer M. 1987. CpG islands in vertebrate genomes. J Mol Biol 196: 261–282. doi:10.1016/0022-2836(87)90689-9

CrossRef Medline Google Scholar

[67] ↵

Gaspar N, Handula M, Stroet MCM, Marella-Panth K, Haeck J, Kirkland TA, Hall MP, Encell LP, Dalm S, Lowik C, et al. 2024. A novel luciferase-based reporter gene technology for simultaneous optical and radionuclide imaging of cells. Int J Mol Sci 25: 8206. doi:10.3390/ijms25158206

CrossRef Medline Google Scholar

[68] ↵

Gaudreau-Lapierre A, Mulatz K, Béïque JC, Trinkle-Mulcahy L. 2021. Expansion microscopy-based imaging of nuclear structures in cultured cells. STAR Protoc 2: 100630. doi:10.1016/j.xpro.2021.100630

CrossRef Medline Google Scholar

[69] ↵

Gilbonio HE, Puckett GL, Nguyen E, Rieder LE. 2023. A hybrid RNA FISH immunofluorescence protocol on Drosophila polytene chromosomes. BMC Res Notes 16: 197. doi:10.1186/s13104-023-06482-0

CrossRef Medline Google Scholar

[70] ↵

Goldman M, Craft B, Swatloski T, Cline M, Morozova O, Diekhans M, Haussler D, Zhu J. 2015. The UCSC Cancer Genomics Browser: update 2015. Nucleic Acids Res 43: D812–D817. doi:10.1093/nar/gku1073

CrossRef Medline Google Scholar

[71] ↵

Gottschling DE. 1992. Telomere-proximal DNA in Saccharomyces cerevisiae is refractory to methyltransferase activity in vivo. Proc Natl Acad Sci 89: 4062–4065. doi:10.1073/pnas.89.9.4062

Abstract/FREE Full Text

[72] ↵

Green MR, Sambrook J. 2018. Constructing a standard curve for real-time polymerase chain reaction (PCR) experiments. Cold Spring Harb Protoc 2018. doi:10.1101/pdb.prot095026

Abstract/FREE Full Text

[73] ↵

Greil F, van der Kraan I, Delrow J, Smothers JF, de Wit E, Bussemaker HJ, van Driel R, Henikoff S, van Steensel B. 2003. Distinct HP1 and Su(var)3-9 complexes bind to sets of developmentally coexpressed genes depending on chromosomal location. Genes Dev 17: 2825–2838. doi:10.1101/gad.281503

Abstract/FREE Full Text

[74] ↵

Greil F, Moorman C, van Steensel B. 2006. DamID: mapping of in vivo protein-genome interactions using tethered DNA adenine methyltransferase. Methods Enzymol 410: 342–359. doi:10.1016/S0076-6879(06)10016-6

CrossRef Medline Google Scholar

[75] ↵

Grolmusz VK, Tóth EA, Baghy K, Likó I, Darvasi O, Kovalszky I, Matkó J, Rácz K, Patócs A. 2016. Fluorescence activated cell sorting followed by small RNA sequencing reveals stable microRNA expression during cell cycle progression. BMC Genomics 17: 412. doi:10.1186/s12864-016-2747-6

CrossRef Medline Google Scholar

[76] ↵

GTEx Consortium, Aguet F, Anand S, Ardlie KG, Gabriel S, Getz GA, Graubert A, Hadley K, Handsaker RE, Huang KH, et al. 2020. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369: 1318–1330. doi:10.1126/science.aaz1776

Abstract/FREE Full Text

[77] ↵

Hachiya T, Furukawa R, Shiwa Y, Ohmomo H, Ono K, Katsuoka F, Nagasaki M, Yasuda J, Fuse N, Kinoshita K, et al. 2017. Genome-wide identification of inter-individually variable DNA methylation sites improves the efficacy of epigenetic association studies. NPJ Genom Med 2: 11. doi:10.1038/s41525-017-0016-5

CrossRef Medline Google Scholar

[78] ↵

Hafner A, Boettiger A. 2023. The spatial organization of transcriptional control. Nat Rev Genet 24: 53–68. doi:10.1038/s41576-022-00526-0

CrossRef Google Scholar

[79] ↵

Hammal F, de Langen P, Bergon A, Lopez F, Ballester B. 2022. Remap 2022: a database of human, mouse, Drosophila and Arabidopsis regulatory regions from an integrative analysis of DNA-binding sequencing experiments. Nucleic Acids Res 50: D316–d325. doi:10.1093/nar/gkab996

CrossRef Medline Google Scholar

[80] ↵

Han H, Shim H, Shin D, Shim JE, Ko Y, Shin J, Kim H, Cho A, Kim E, Lee T, et al. 2015. TRRUST: a reference database of human transcriptional regulatory interactions. Sci Rep 5: 11432. doi:10.1038/srep11432

CrossRef Medline Google Scholar

[81] ↵

He SL, Green R. 2013. Northern blotting. In Methods in enzymology (ed. Lorsch J), Vol. 530, pp. 75–87. Academic Press, Cambridge, MA.

CrossRef Medline Google Scholar

[82] ↵

He L, Binari R, Huang J, Falo-Sanjuan J, Perrimon N. 2019. In vivo study of gene expression with an enhanced dual-color fluorescent transcriptional timer. eLife 8: e46181. doi:10.7554/eLife.46181

CrossRef Google Scholar

[83] ↵

Hellman LM, Fried MG. 2007. Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions. Nat Protoc 2: 1849–1861. doi:10.1038/nprot.2007.249

CrossRef Medline Google Scholar

[84] ↵

Henikoff S, Henikoff JG, Kaya-Okur HS, Ahmad K. 2020. Efficient chromatin accessibility mapping in situ by nucleosome-tethered tagmentation. eLife 9: e63274. doi:10.7554/eLife.63274

CrossRef Medline Google Scholar

[85] ↵

Hickey SM, Ung B, Bader C, Brooks R, Lazniewska J, Johnson IRD, Sorvina A, Logan J, Martini C, Moore CR, et al. 2021. Fluorescence microscopy: an outline of hardware, biological handling, and fluorophore considerations. Cells 11: 35. doi:10.3390/cells11010035

CrossRef Google Scholar

[86] ↵

Hocine S, Raymond P, Zenklusen D, Chao JA, Singer RH. 2013. Single-molecule analysis of gene expression using two-color RNA labeling in live yeast. Nat Methods 10: 119–121. doi:10.1038/nmeth.2305

CrossRef Medline Google Scholar

[87] ↵

Holsapple JS, Schnitzler L, Rusch L, Baldeweg TH, Neubert E, Kruss S, Erpenbeck L. 2023. Expansion microscopy of neutrophil nuclear structure and extracellular traps. Biophys Rep (N Y) 3: 100091. doi:10.1016/j.bpr.2022.10009

CrossRef Medline Google Scholar

[88] ↵

Hou R, Hon C-C, Huang Y. 2023. CamoTSS: analysis of alternative transcription start sites for cellular phenotypes and regulatory patterns from 5′ scRNA-seq data. Nat Commun 14: 7240. doi:10.1038/s41467-023-42636-1

CrossRef Medline Google Scholar

[89] ↵

Huang J, Zheng W, Zhang P, Lin Q, Chen Z, Xuan J, Liu C, Wu D, Huang Q, Zheng L, et al. 2023. ChIPBase v3.0: the encyclopedia of transcriptional regulations of non-coding RNAs and protein-coding genes. Nucleic Acids Res 51: D46–d56. doi:10.1093/nar/gkac1067

CrossRef Medline Google Scholar

[90] ↵

Hunt JMT, Samson CA, Rand AD, Sheppard HM. 2023. Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells. Hum Genet 142: 705–720. doi:10.1007/s00439-023-02561-1

CrossRef Medline Google Scholar

[91] ↵

Huxley C, Gnirke A. 1991. Transfer of yeast artificial chromosomes from yeast to mammalian cells. Bioessays 13: 545–550. doi:10.1002/bies.950131009

CrossRef Medline Google Scholar

[92] ↵

Hwang J, Espenshade PJ. 2016. Proximity-dependent biotin labelling in yeast using the engineered ascorbate peroxidase APEX2. Biochem J 473: 2463–2469. doi:10.1042/BCJ20160106

Abstract/FREE Full Text

[93] ↵

Ide S, Dejardin J. 2015. End-targeting proteomics of isolated chromatin segments of a mammalian ribosomal RNA gene promoter. Nat Commun 6: 6674. doi:10.1038/ncomms7674

CrossRef Medline Google Scholar

[94] ↵

Ijaz F, Ikegami K. 2021. Knock-in of labeled proteins into 5′UTR enables highly efficient generation of stable cell lines. Cell Struct Funct 46: 21–35. doi:10.1247/csf.21002

CrossRef Medline Google Scholar

[95] ↵

Ikeno M, Hasegawa Y. 2020. Applications of bottom-up human artificial chromosomes in cell research and cell engineering. Exp Cell Res 390: 111793. doi:10.1016/j.yexcr.2019.111793

CrossRef Medline Google Scholar

[96] ↵

Illingworth RS, Bird AP. 2009. Cpg islands – ‘a rough guide'. FEBS Lett 583: 1713–1720. doi:10.1016/j.febslet.2009.04.012

CrossRef Medline Google Scholar

[97] ↵

Jaenisch R, Bird A. 2003. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 33: 245–254. doi:10.1038/ng1089

CrossRef Medline Google Scholar

[98] ↵

Jenkins VK, Larkin A, Thurmond J. 2022. Using FlyBase: a database of Drosophila genes and genetics. Methods Mol Biol 2540: 1–34. doi:10.1007/978-1-0716-2541-5_1

CrossRef Medline Google Scholar

[99] ↵

Jiang S, Dai T, Li Q, Xu T, Zhang W, Sun J, Liu H. 2024. Generation of ASCL1-mCherry knock-in reporter in human embryonic stem cell line, WAe001-A-2E, using CRISPR/Cas9-based gene targeting. Stem Cell Res 80: 103500. doi:10.1016/j.scr.2024.103500

CrossRef Medline Google Scholar

[100] ↵

Jolma A, Yan J, Whitington T, Toivonen J, Nitta KR, Rastas P, Morgunova E, Enge M, Taipale M, Wei G, et al. 2013. DNA-binding specificities of human transcription factors. Cell 152: 327–339. doi:10.1016/j.cell.2012.12.009

CrossRef Medline Google Scholar

[101] ↵

Kaczynski J, Cook T, Urrutia R. 2003. Sp1- and Krüppel-like transcription factors. Genome Biol 4: 206. doi:10.1186/gb-2003-4-2-206

CrossRef Medline Google Scholar

[102] ↵

Käll L, Krogh A, Sonnhammer EL. 2004. A combined transmembrane topology and signal peptide prediction method. J Mol Biol 338: 1027–1036. doi:10.1016/j.jmb.2004.03.016

CrossRef Medline Google Scholar

[103] ↵

Kan SL, Saksouk N, Déjardin J. 2017. Proteome characterization of a chromatin locus using the proteomics of isolated chromatin segments approach. Methods Mol Biol 1550: 19–33. doi:10.1007/978-1-4939-6747-6_3

CrossRef Medline Google Scholar

[104] ↵

Kannan S, Sams T, Maury J, Workman CT. 2018. Reconstructing dynamic promoter activity profiles from reporter gene data. ACS Synth Biol 7: 832–841. doi:10.1021/acssynbio.7b00223

CrossRef Google Scholar

[105] ↵

Karr JP, Ferrie JJ, Tjian R, Darzacq X. 2022. The transcription factor activity gradient (TAG) model: contemplating a contact-independent mechanism for enhancer-promoter communication. Genes Dev 36: 7–16. doi:10.1101/gad.349160.121

Abstract/FREE Full Text

[106] ↵

Kawaji H, Lizio M, Itoh M, Kanamori-Katayama M, Kaiho A, Nishiyori-Sueki H, Shin JW, Kojima-Ishiyama M, Kawano M, Murata M, et al. 2014. Comparison of CAGE and RNA-seq transcriptome profiling using clonally amplified and single-molecule next-generation sequencing. Genome Res 24: 708–717. doi:10.1101/gr.156232.113

Abstract/FREE Full Text

[107] ↵

Kaya-Okur HS, Wu SJ, Codomo CA, Pledger ES, Bryson TD, Henikoff JG, Ahmad K, Henikoff S. 2019. CUT&tag for efficient epigenomic profiling of small samples and single cells. Nat Commun 10: 1930. doi:10.1038/s41467-019-09982-5

CrossRef Medline Google Scholar

[108] ↵

Kazuki Y, Kobayashi K, Aueviriyavit S, Oshima T, Kuroiwa Y, Tsukazaki Y, Senda N, Kawakami H, Ohtsuki S, Abe S, et al. 2013. Trans-chromosomic mice containing a human CYP3A cluster for prediction of xenobiotic metabolism in humans. Hum Mol Genet 22: 578–592. doi:10.1093/hmg/dds468

CrossRef Medline Google Scholar

[109] ↵

Keyaerts M, Caveliers V, Lahoutte T. 2012. Bioluminescence imaging: looking beyond the light. Trends Mol Med 18: 164–172. doi:10.1016/j.molmed.2012.01.005

CrossRef Medline Google Scholar

[110] ↵

Khan A, Fornes O, Stigliani A, Gheorghe M, Castro-Mondragon JA, van der Lee R, Bessy A, Chèneby J, Kulkarni SR, Tan G, et al. 2018. JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic Acids Res 46: D260–d266. doi:10.1093/nar/gkx1126

CrossRef Medline Google Scholar

[111] ↵

Khan N, Shahid S, Asif AR. 2021. Current analytical strategies in studying chromatin-associated-proteome (chromatome). Molecules 26: 6694. doi:10.3390/molecules26216694

CrossRef Medline Google Scholar

[112] ↵

Khmelinskii A, Knop M. 2014. Analysis of protein dynamics with tandem fluorescent protein timers. Methods Mol Biol 1174: 195–210. doi:10.1007/978-1-4939-0944-5_13

CrossRef Medline Google Scholar

[113] ↵

Kim TH, Dekker J. 2018. ChIP-quantitative polymerase chain reaction (ChIP-qPCR). Cold Spring Harb Protoc 2018. doi:10.1101/pdb.prot082628

Abstract/FREE Full Text

[114] ↵

Kim AH, Lee HM, Kim HS, Jung J, Seol H, Choi E, Lee S, Min Choi Y, Jun JK, Kim HS, et al. 2023. Establishment of TUBB3-mCherry knock-in human pluripotent stem cell line using CRISPR/Cas9 (SNUe003-A-4). Stem Cell Res 69: 103064. doi:10.1016/j.scr.2023.103064

CrossRef Medline Google Scholar

[115] ↵

Kim AH, Lee HM, Kim HS, Jeong SW, Jun JK, Jang J. 2024a. CRISPR/Cas9-mediated knock-in of a fluorescent reporter into the target locus of interest in human pluripotent stem cells. MethodsX 13: 102807. doi:10.1016/j.mex.2024.102807

CrossRef Medline Google Scholar

[116] ↵

Kim J, Kratz AF, Chen S, Sheng J, Kim HK, Zhang L, Singh BK, Chavez A. 2024b. High-throughput tagging of endogenous loci for rapid characterization of protein function. Sci Adv 10: eadg8771. doi:10.1126/sciadv.adg8771

CrossRef Medline Google Scholar

[117] ↵

Kimura Y, Hisano Y, Kawahara A, Higashijima S. 2014. Efficient generation of knock-in transgenic zebrafish carrying reporter/driver genes by CRISPR/Cas9-mediated genome engineering. Sci Rep 4: 6545. doi:10.1038/srep06545

CrossRef Medline Google Scholar

[118] ↵

Kleeman B, Olsson A, Newkold T, Kofron M, DeLay M, Hildeman D, Grimes HL. 2018. A guide to choosing fluorescent protein combinations for flow cytometric analysis based on spectral overlap. Cytometry A 93: 556–562. doi:10.1002/cyto.a.23360

CrossRef Medline Google Scholar

[119] ↵

Koch B, Nijmeijer B, Kueblbeck M, Cai Y, Walther N, Ellenberg J. 2018. Generation and validation of homozygous fluorescent knock-in cells using CRISPR-Cas9 genome editing. Nat Protoc 13: 1465–1487. doi:10.1038/nprot.2018.042

CrossRef Google Scholar

[120] ↵

Kolmykov S, Yevshin I, Kulyashov M, Sharipov R, Kondrakhin Y, Makeev VJ, Kulakovskiy IV, Kel A, Kolpakov F. 2021. GTRD: an integrated view of transcription regulation. Nucleic Acids Res 49: D104–D111. doi:10.1093/nar/gkaa1057

CrossRef Medline Google Scholar

[121] ↵

Konermann S, Brigham MD, Trevino AE, Joung J, Abudayyeh OO, Barcena C, Hsu PD, Habib N, Gootenberg JS, Nishimasu H, et al. 2015. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature 517: 583–588. doi:10.1038/nature14136

CrossRef Medline Google Scholar

[122] ↵

Kumar S, Ambrosini G, Bucher P. 2017. SNP2TFBS: a database of regulatory SNPs affecting predicted transcription factor binding site affinity. Nucleic Acids Res 45: D139–d144. doi:10.1093/nar/gkw1064

CrossRef Medline Google Scholar

[123] ↵

Kvon EZ. 2015. Using transgenic reporter assays to functionally characterize enhancers in animals. Genomics 106: 185–192. doi:10.1016/j.ygeno.2015.06.007

CrossRef Medline Google Scholar

[124] ↵

Kwak H, Fuda NJ, Core LJ, Lis JT. 2013. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339: 950–953. doi:10.1126/science.1229386

Abstract/FREE Full Text

[125] ↵

Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, Chen X, Taipale J, Hughes TR, Weirauch MT. 2018. The human transcription factors. Cell 172: 650–665. doi:10.1016/j.cell.2018.01.029

CrossRef Medline Google Scholar

[126] ↵

Larionov V, Kouprina N, Graves J, Resnick MA. 1996. Highly selective isolation of human DNAs from rodent-human hybrid cells as circular yeast artificial chromosomes by transformation-associated recombination cloning. Proc Natl Acad Sci 93: 13925–13930. doi:10.1073/pnas.93.24.13925

Abstract/FREE Full Text

[127] ↵

Ledesma-Dominguez L, Carbajal-Degante E, Moreno-Hagelsieb G, Pérez-Rueda E. 2024. DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes. Sci Rep 14: 9155. doi:10.1038/s41598-024-59487-5

CrossRef Medline Google Scholar

[128] ↵

Lee CY, Myong S. 2021. Probing steps in DNA transcription using single-molecule methods. J Biol Chem 297: 101086. doi:10.1016/j.jbc.2021.101086

CrossRef Medline Google Scholar

[129] ↵

Lee HM, Kim AH, Hwang S, Jung J, Seol H, Sung JJ, Jeong SM, Choi YM, Jun JK, Kim HS, et al. 2022. Generation of αMHC-EGFP knock-in in human pluripotent stem cell line, SNUe003-A-3 using CRISPR/Cas9-based gene targeting. Stem Cell Res 61: 102779. doi:10.1016/j.scr.2022.102779

CrossRef Medline Google Scholar

[130] ↵

Lesurf R, Cotto KC, Wang G, Griffith M, Kasaian K, Jones SJ, Montgomery SB, Griffith OL. 2016. OReganno 3.0: a community-driven resource for curated regulatory annotation. Nucleic Acids Res 44: D126–D132. doi:10.1093/nar/gkv1203

CrossRef Medline Google Scholar

[131] ↵

Li M, Wang Y, Liu M, Lan X. 2018a. Multimodality reporter gene imaging: construction strategies and application. Theranostics 8: 2954–2973. doi:10.7150/thno.24108

CrossRef Medline Google Scholar

[132] ↵

Li S, Chen LX, Peng XH, Wang C, Qin BY, Tan D, Han CX, Yang H, Ren XN, Liu F, et al. 2018b. Overview of the reporter genes and reporter mouse models. Animal Model Exp Med 1: 29–35. doi:10.1002/ame2.12008

CrossRef Medline Google Scholar

[133] ↵

Li M, Hunt J, Bhattacharyya A, Zhao X. 2019a. One-step generation of seamless luciferase gene knockin using CRISPR/Cas9 genome editing in human pluripotent stem cells. Methods Mol Biol 1942: 61–69. doi:10.1007/978-1-4939-9080-1_5

CrossRef Medline Google Scholar

[134] ↵

Li M, Zou D, Li Z, Gao R, Sang J, Zhang Y, Li R, Xia L, Zhang T, Niu G, et al. 2019b. EWAS atlas: a curated knowledgebase of epigenome-wide association studies. Nucleic Acids Res 47: D983–d988. doi:10.1093/nar/gky1027

CrossRef Medline Google Scholar

[135] ↵

Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P. 2015. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst 1: 417–425. doi:10.1016/j.cels.2015.12.004

CrossRef Medline Google Scholar

[136] ↵

Lichten CA, White R, Clark IBN, Swain PS. 2014. Unmixing of fluorescence spectra to resolve quantitative time-series measurements of gene expression in plate readers. BMC Biotechnol 14: 11. doi:10.1186/1472-6750-14-11

CrossRef Medline Google Scholar

[137] ↵

Lionnet T, Wu C. 2021. Single-molecule tracking of transcription protein dynamics in living cells: Seeing is believing, but what are we seeing? Curr Opin Genet Dev 67: 94–102. doi:10.1016/j.gde.2020.12.001

CrossRef Medline Google Scholar

[138] ↵

Liu Z, Tjian R. 2018. Visualizing transcription factor dynamics in living cells. J Cell Biol 217: 1181–1191. doi:10.1083/jcb.201710038

Abstract/FREE Full Text

[139] ↵

Lombardelli L, Logiodice F, Piccinni M-P. 2025. Evaluation of gene expression profiling by QuantiGene^TM 2.0 RNA assay. In Immunosenescence: methods and protocols (ed. Amoriello R, et al.), pp. 89–98. Springer, New York. doi:10.1007/978-1-0716-4128-6_8

CrossRef Google Scholar

[140] ↵

Long HK, Prescott SL, Wysocka J. 2016. Ever-changing landscapes: transcriptional enhancers in development and evolution. Cell 167: 1170–1187. doi:10.1016/j.cell.2016.09.018

CrossRef Medline Google Scholar

[141] ↵

Loyfer N, Magenheim J, Peretz A, Cann G, Bredno J, Klochendler A, Fox-Fisher I, Shabi-Porat S, Hecht M, Pelet T, et al. 2023. A DNA methylation atlas of normal human cell types. Nature 613: 355–364. doi:10.1038/s41586-022-05580-6

CrossRef Medline Google Scholar

[142] ↵

Lukyanov KA. 2022. Fluorescent proteins for a brighter science. Biochem Biophys Res Commun 633: 29–32. doi:10.1016/j.bbrc.2022.08.089

CrossRef Medline Google Scholar

[143] ↵

Luo Y, Hitz BC, Gabdank I, Hilton JA, Kagda MS, Lam B, Myers Z, Sud P, Jou J, Lin K, et al. 2020. New developments on the Encyclopedia of DNA Elements (ENCODE) data portal. Nucleic Acids Res 48: D882–d889. doi:10.1093/nar/gkz1062

CrossRef Medline Google Scholar

[144] ↵

Maguire CA, Bovenberg MS, Crommentuijn MH, Niers JM, Kerami M, Teng J, Sena-Esteves M, Badr CE, Tannous BA. 2013. Triple bioluminescence imaging for in vivo monitoring of cellular processes. Mol Ther Nucleic Acids 2: e99. doi:10.1038/mtna.2013.25

CrossRef Google Scholar

[145] ↵

Mahat DB, Kwak H, Booth GT, Jonkers IH, Danko CG, Patel RK, Waters CT, Munson K, Core LJ, Lis JT. 2016. Base-pair-resolution genome-wide mapping of active RNA polymerases using precision nuclear run-on (PRO-seq). Nat Protoc 11: 1455–1476. doi:10.1038/nprot.2016.086

CrossRef Medline Google Scholar

[146] ↵

Mahendrawada L, Warfield L, Donczew R, Hahn S. 2025. Low overlap of transcription factor DNA binding and regulatory targets. Nature 642: 796–804. doi:10.1038/s41586-025-08916-0

CrossRef Medline Google Scholar

[147] ↵

Mäntylä E, Montonen T, Azzari L, Mattola S, Hannula M, Vihinen-Ranta M, Hyttinen J, Vippola M, Foi A, Nymark S, et al. 2023. Iterative immunostaining combined with expansion microscopy and image processing reveals nanoscopic network organization of nuclear lamina. Mol Biol Cell 34: br13. doi:10.1091/mbc.E22-09-0448

CrossRef Medline Google Scholar

[148] ↵

Mathelier A, Wasserman WW. 2013. The next generation of transcription factor binding site prediction. PLoS Comput Biol 9: e1003214. doi:10.1371/journal.pcbi.1003214

CrossRef Medline Google Scholar

[149] ↵

Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, et al. 2006. TRANSFAC(r) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes. Nucleic Acids Res 34: D108–D110. doi:10.1093/nar/gkj143

CrossRef Medline Google Scholar

[150] ↵

McGettigan PA. 2013. Transcriptomics in the RNA-seq era. Curr Opin Chem Biol 17: 4–11. doi:10.1016/j.cbpa.2012.12.008

CrossRef Medline Google Scholar

[151] ↵

Meijering AEC, Biebricher AS, Sitters G, Brouwer I, Peterman EJG, Wuite GJL, Heller I. 2020. Imaging unlabeled proteins on DNA with super-resolution. Nucleic Acids Res 48: e34. doi:10.1093/nar/gkaa061

CrossRef Medline Google Scholar

[152] ↵

Merz K, Hondele M, Goetze H, Gmelch K, Stoeckl U, Griesenbeck J. 2008. Actively transcribed rRNA genes in S. cerevisiae are organized in a specialized chromatin associated with the high-mobility group protein Hmo1 and are largely devoid of histone molecules. Genes Dev 22: 1190–1204. doi:10.1101/gad.466908

Abstract/FREE Full Text

[153] ↵

Mezzanotte L, van‘t Root M, Karatas H, Goun EA, Löwik C. 2017. In vivo molecular bioluminescence imaging: new tools and applications. Trends Biotechnol 35: 640–652. doi:10.1016/j.tibtech.2017.03.012

CrossRef Medline Google Scholar

[154] ↵

Milosavljevic A. 2010. Putting epigenome comparison into practice. Nat Biotechnol 28: 1053–1056. doi:10.1038/nbt1010-1053

CrossRef Medline Google Scholar

[155] ↵

Mogayzel PJ Jr., Ashlock MA. 2000. CFTR intron 1 increases luciferase expression driven by CFTR 5′-flanking DNA in a yeast artificial chromosome. Genomics 64: 211–215. doi:10.1006/geno.2000.6119

CrossRef Medline Google Scholar

[156] ↵

Molina O, Vargiu G, Abad MA, Zhiteneva A, Jeyaprakash AA, Masumoto H, Kouprina N, Larionov V, Earnshaw WC. 2016. Epigenetic engineering reveals a balance between histone modifications and transcription in kinetochore maintenance. Nat Commun 7: 13334. doi:10.1038/ncomms13334

CrossRef Medline Google Scholar

[157] ↵

Morgan MP, Finnegan E, Das S. 2022. The role of transcription factors in the acquisition of the four latest proposed hallmarks of cancer and corresponding enabling characteristics. Semin Cancer Biol 86: 1203–1215. doi:10.1016/j.semcancer.2022.10.002

CrossRef Medline Google Scholar

[158] ↵

Morrow CS, Porter TJ, Moore DL. 2021. Fluorescent tagging of endogenous proteins with CRISPR/Cas9 in primary mouse neural stem cells. STAR Protoc 2: 100744. doi:10.1016/j.xpro.2021.100744

CrossRef Medline Google Scholar

[159] ↵

Mota A, Berezicki S, Wernersson E, Harbers L, Li-Wang X, Gradin K, Peuckert C, Crosetto N, Bienko M. 2022. FRET-FISH probes chromatin compaction at individual genomic loci in single cells. Nat Commun 13: 6680. doi:10.1038/s41467-022-34183-y

CrossRef Medline Google Scholar

[160] ↵

Moyroud E, Reymond MC, Hamès C, Parcy F, Scutt CP. 2009. The analysis of entire gene promoters by surface plasmon resonance. Plant J 59: 851–858. doi:10.1111/j.1365-313X.2009.03903.x

CrossRef Medline Google Scholar

[161] ↵

Myers SA, Wright J, Peckner R, Kalish BT, Zhang F, Carr SA. 2018. Discovery of proteins associated with a predefined genomic locus via dCas9-APEX-mediated proximity labeling. Nat Methods 15: 437–439. doi:10.1038/s41592-018-0007-1

CrossRef Medline Google Scholar

[162] ↵

Nabet B, Roberts JM, Buckley DL, Paulk J, Dastjerdi S, Yang A, Leggett AL, Erb MA, Lawlor MA, Souza A, et al. 2018. The dTAG system for immediate and target-specific protein degradation. Nat Chem Biol 14: 431–441. doi:10.1038/s41589-018-0021-8

CrossRef Medline Google Scholar

[163] ↵

Nakamura M, Gao Y, Dominguez AA, Qi LS. 2021. CRISPR technologies for precise epigenome editing. Nat Cell Biol 23: 11–22. doi:10.1038/s41556-020-00620-7

CrossRef Medline Google Scholar

[164] ↵

Nakato R, Sakata T. 2021. Methods for ChIP-seq analysis: a practical workflow and advanced applications. Methods 187: 44–53. doi:10.1016/j.ymeth.2020.03.005

CrossRef Google Scholar

[165] ↵

Nazarova M, Sexton T. 2026. The dance of promoters and enhancers in gene regulation: fast or slow, entwined or distant? J Mol Biol 438: 169223. doi:10.1016/j.jmb.2025.169223

CrossRef Medline Google Scholar

[166] ↵

Nepal C, Andersen JB. 2023. Alternative promoters in CpG depleted regions are prevalently associated with epigenetic misregulation of liver cancer transcriptomes. Nat Commun 14: 2712. doi:10.1038/s41467-023-38272-4

CrossRef Medline Google Scholar

[167] ↵

Nerella SG, Michaelides M, Minamimoto T, Innis RB, Pike VW, Eldridge MAG. 2023. PET reporter systems for the brain. Trends Neurosci 46: 941–952. doi:10.1016/j.tins.2023.08.007

CrossRef Medline Google Scholar

[168] ↵

Nierman WC, Feldblyum TV. 2001. Genomic library. In Encyclopedia of genetics (ed. Brenner S, Miller JH), pp. 865–872. Academic Press, New York.

Google Scholar

[169] ↵

Nishimura K, Fukagawa T, Takisawa H, Kakimoto T, Kanemaki M. 2009. An auxin-based degron system for the rapid depletion of proteins in nonplant cells. Nat Methods 6: 917–922. doi:10.1038/nmeth.1401

CrossRef Medline Google Scholar

[170] ↵

Nolan T, Hands RE, Bustin SA. 2006. Quantification of mRNA using real-time RT-PCR. Nat Protoc 1: 1559–1582. doi:10.1038/nprot.2006.236

CrossRef Medline Google Scholar

[171] ↵

Obayashi T, Kagaya Y, Aoki Y, Tadaka S, Kinoshita K. 2019. COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference. Nucleic Acids Res 47: D55–D62. doi:10.1093/nar/gky1155

CrossRef Medline Google Scholar

[172] ↵

Oh-Hashi K, Furuta E, Norisada J, Amaya F, Hirata Y, Kiuchi K. 2016. Application of NanoLuc to monitor the intrinsic promoter activity of GRP78 using the CRISPR/Cas9 system. Genes Cells 21: 1137–1143. doi:10.1111/gtc.12401

CrossRef Medline Google Scholar

[173] ↵

Okabe A, Kaneda A. 2021. Transcriptional dysregulation by aberrant enhancer activation and rewiring in cancer. Cancer Sci 112: 2081–2088. doi:10.1111/cas.14884

CrossRef Medline Google Scholar

[174] ↵

Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y, Kawaji H, Nakaki R, Sese J, Meno C. 2018. ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep 19: e46255. doi:10.15252/embr.201846255

Abstract/FREE Full Text

[175] ↵

Omasits U, Ahrens CH, Müller S, Wollscheid B. 2014. Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics 30: 884–886. doi:10.1093/bioinformatics/btt607

CrossRef Medline Google Scholar

[176] ↵

Oubounyt M, Louadi Z, Tayara H, Chong KT. 2019. DeePromoter: robust promoter predictor using deep learning. Front Genet 10: 286. doi:10.3389/fgene.2019.00286

CrossRef Medline Google Scholar

[177] ↵

Palozola KC, Donahue G, Liu H, Grant GR, Becker JS, Cote A, Yu H, Raj A, Zaret KS. 2017. Mitotic transcription and waves of gene reactivation during mitotic exit. Science 358: 119–122. doi:10.1126/science.aal4671

Abstract/FREE Full Text

[178] ↵

Panigrahi A, O'Malley BW. 2021. Mechanisms of enhancer action: the known and the unknown. Genome Biol 22: 108. doi:10.1186/s13059-021-02322-1

CrossRef Medline Google Scholar

[179] ↵

Park PJ. 2009. ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet 10: 669–680. doi:10.1038/nrg2641

CrossRef Medline Google Scholar

[180] ↵

Parteka-Tojek Z, Zhu JJ, Lee B, Jodkowska K, Wang P, Aaron J, Chew TL, Banecki K, Plewczynski D, Ruan Y. 2022. Super-resolution visualization of chromatin loop folding in human lymphoblastoid cells using interferometric photoactivated localization microscopy. Sci Rep 12: 8582. doi:10.1038/s41598-022-12568-9

CrossRef Medline Google Scholar

[181] ↵

Patrick PS, Hammersley J, Loizou L, Kettunen MI, Rodrigues TB, Hu DE, Tee SS, Hesketh R, Lyons SK, Soloviev D, et al. 2014. Dual-modality gene reporter for in vivo imaging. Proc Natl Acad Sci 111: 415–420. doi:10.1073/pnas.1319000111

Abstract/FREE Full Text

[182] ↵

Pavlou A, Cinquemani E, Geiselmann J, de Jong H. 2022. Maturation models of fluorescent proteins are necessary for unbiased estimates of promoter activity. Biophys J 121: 4179–4188. doi:10.1016/j.bpj.2022.09.021

CrossRef Medline Google Scholar

[183] ↵

Pei W-D, Zhang Y, Yin T-L, Yu Y. 2020. Epigenome editing by CRISPR/Cas9 in clinical settings: possibilities and challenges. Brief Funct Genomics 19: 215–228. doi:10.1093/bfgp/elz035

CrossRef Medline Google Scholar

[184] ↵

Perez G, Barber Galt P, Benet-Pages A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez Jairo N, Hinrichs Angie S, Lee Christopher M, et al. 2025. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res 53: D1243–D1249. doi:10.1093/nar/gkae974

CrossRef Medline Google Scholar

[185] ↵

Perez-Leal O, Nixon-Abell J, Barrero CA, Gordon JC, Oesterling J, Rico MC. 2021. Multiplex gene tagging with CRISPR-Cas9 for live-cell microscopy and application to study the role of SARS-CoV-2 proteins in autophagy. CRISPR J 4: 854–871. doi:10.1089/crispr.2021.0041

CrossRef Medline Google Scholar

[186] ↵

Ponger L, Mouchiroud D. 2002. CpGProD: identifying CpG islands associated with transcription start sites in large genomic mammalian sequences. Bioinformatics 18: 631–633. doi:10.1093/bioinformatics/18.4.631

CrossRef Medline Google Scholar

[187] ↵

Ponomartsev SV, Sinenko SA, Tomilin AN. 2022. Human artificial chromosomes and their transfer to target cells. Acta Naturae 14: 35–45. doi:10.32607/actanaturae.11670

CrossRef Medline Google Scholar

[188] ↵

Pownall ME, Miao L, Vejnar CE, M'Saad O, Sherrard A, Frederick MA, Benitez MDJ, Boswell CW, Zaret KS, Bewersdorf J, et al. 2023. Chromatin expansion microscopy reveals nanoscale organization of transcription and chromatin. Science 381: 92–100. doi:10.1126/science.ade5308

CrossRef Google Scholar

[189] ↵

Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, Lim WA. 2013. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152: 1173–1183. doi:10.1016/j.cell.2013.02.022

CrossRef Medline Google Scholar

[190] ↵

Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, Lim WA. 2021. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 184: 844. doi:10.1016/j.cell.2021.01.019

CrossRef Google Scholar

[191] ↵

Qin W, Cho KF, Cavanagh PE, Ting AY. 2021. Deciphering molecular interactions by proximity labeling. Nat Methods 18: 133–143. doi:10.1038/s41592-020-01010-5

CrossRef Medline Google Scholar

[192] ↵

Qiu W, Xu Z, Zhang M, Zhang D, Fan H, Li T, Wang Q, Liu P, Zhu Z, Du D, et al. 2019. Determination of local chromatin interactions using a combined CRISPR and peroxidase APEX2 system. Nucleic Acids Res 47: e52. doi:10.1093/nar/gkz134

CrossRef Medline Google Scholar

[193] ↵

Rauluseviciute I, Riudavets-Puig R, Blanc-Mathieu R, Castro-Mondragon JA, Ferenc K, Kumar V, Lemma RB, Lucas J, Chèneby J, Baranasic D, et al. 2024. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles. Nucleic Acids Res 52: D174–d182. doi:10.1093/nar/gkad1059

CrossRef Medline Google Scholar

[194] ↵

Refaat A, Yap ML, Pietersz G, Walsh APG, Zeller J, Del Rosal B, Wang X, Peter K. 2022. In vivo fluorescence imaging: success in preclinical imaging paves the way for clinical applications. J Nanobiotechnology 20: 450. doi:10.1186/s12951-022-01648-7

CrossRef Medline Google Scholar

[195] ↵

Rice P, Longden I, Bleasby A. 2000. EMBOSS: the European molecular biology open software suite. Trends Genet 16: 276–277. doi:10.1016/S0168-9525(00)02024-2

CrossRef Medline Google Scholar

[196] ↵

Richie CT, Whitaker LR, Whitaker KW, Necarsulmer J, Baldwin HA, Zhang Y, Fortuno L, Hinkle JJ, Koivula P, Henderson MJ, et al. 2017. Near-infrared fluorescent protein iRFP713 as a reporter protein for optogenetic vectors, a transgenic Cre-reporter rat, and other neuronal studies. J Neurosci Methods 284: 1–14. doi:10.1016/j.jneumeth.2017.03.020

CrossRef Medline Google Scholar

[197] ↵

Romanov SE, Kalashnikova DA, Laktionov PP. 2021. Methods of massive parallel reporter assays for investigation of enhancers. Vavilovskii Zhurnal Genet Selektsii 25: 344–355. doi:10.18699/VJ21.038

CrossRef Medline Google Scholar

[198] ↵

Rossi M, Massai L, Diamanti D, Fiengo P, De Rosa A, Magrini R, Magnoni L, Chellini S, Coniglio S, Diodato E, et al. 2016. Multimodal molecular imaging system for pathway-specific reporter gene expression. Eur J Pharm Sci 86: 136–142. doi:10.1016/j.ejps.2016.03.006

CrossRef Medline Google Scholar

[199] ↵

Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, Ma'ayan A. 2016. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford) 2016: baw100. doi:10.1093/database/baw100

CrossRef Medline Google Scholar

[200] ↵

Roy A, Kucukural A, Zhang Y. 2010. I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738. doi:10.1038/nprot.2010.5

CrossRef Medline Google Scholar

[201] ↵

Sadikot RT, Blackwell TS. 2005. Bioluminescence imaging. Proc Am Thorac Soc 2: 537–540. doi:10.1513/pats.200507-067DS

CrossRef Medline Google Scholar

[202] ↵

Saito-Moriya R, Nakayama J, Kamiya G, Kitada N, Obata R, Maki SA, Aoyama H. 2021. How to select firefly luciferin analogues for in vivo imaging. Int J Mol Sci 22: 1848. doi:10.3390/ijms22041848

CrossRef Medline Google Scholar

[203] ↵

Salataj E, Spilianakis CG, Chaumeil J. 2023. Single-cell detection of primary transcripts, their genomic loci and nuclear factors by 3D immuno-RNA/DNA FISH in T cells. Front Immunol 14: 1156077. doi:10.3389/fimmu.2023.1156077

CrossRef Medline Google Scholar

[204] ↵

Sanson KR, Hanna RE, Hegde M, Donovan KF, Strand C, Sullender ME, Vaimberg EW, Goodale A, Root DE, Piccioni F, et al. 2018. Optimized libraries for CRISPR-Cas9 genetic screens with multiple modalities. Nat Commun 9: 5416. doi:10.1038/s41467-018-07901-8

CrossRef Medline Google Scholar

[205] ↵

Schep R, Trauernicht M, Vergara X, Friskes A, Morris B, Gregoricchio S, Manzo SG, Zwart W, Beijersbergen Roderick L, Medema RH, et al. 2024. Chromatin context-dependent effects of epigenetic drugs on CRISPR-Cas9 editing. Nucleic Acids Res 52: 8815–8832. doi:10.1093/nar/gkae570

CrossRef Medline Google Scholar

[206] ↵

Schmid M, Durussel T, Laemmli UK. 2004. ChIC and ChEC; genomic mapping of chromatin proteins. Mol Cell 16: 147–157. doi:10.1016/j.molcel.2004.09.007

CrossRef Medline Google Scholar

[207] ↵

Schwalb B, Michel M, Zacher B, Frühauf K, Demel C, Tresch A, Gagneur J, Cramer P. 2016. TT-seq maps the human transient transcriptome. Science 352: 1225–1228. doi:10.1126/science.aad9841

Abstract/FREE Full Text

[208] ↵

Serganova I, Blasberg RG. 2019. Molecular imaging with reporter genes: has its promise been delivered? J Nucl Med 60: 1665–1681. doi:10.2967/jnumed.118.220004

Abstract/FREE Full Text

[209] ↵

Shahmuradov IA, Umarov RK, Solovyev VV. 2017. TSSPlant: a new tool for prediction of plant Pol II promoters. Nucleic Acids Res 45: e65. doi:10.1093/nar/gkw135

CrossRef Medline Google Scholar

[210] ↵

Shaner NC, Lambert GG, Chammas A, Ni Y, Cranfill PJ, Baird MA, Sell BR, Allen JR, Day RN, Israelsson M, et al. 2013. A bright monomeric green fluorescent protein derived from Branchiostoma lanceolatum. Nat Methods 10: 407–409. doi:10.1038/nmeth.2413

CrossRef Medline Google Scholar

[211] ↵

Sharma A, Toepfer CN, Ward T, Wasson L, Agarwal R, Conner DA, Hu JH, Seidman CE. 2018. CRISPR/Cas9-mediated fluorescent tagging of endogenous proteins in human pluripotent stem cells. Curr Protoc Hum Genet 96: 21.11.21–21.11.20. doi:10.1002/cphg.52

CrossRef Google Scholar

[212] ↵

Shi L, Li S, Zhu R, Lu C, Xu X, Li C, Huang X, Zhao X, Mao F, Li K. 2025. CRISPRepi: a multi-omic atlas for CRISPR-based epigenome editing. Nucleic Acids Res 53: D901–d913. doi:10.1093/nar/gkae1039

CrossRef Medline Google Scholar

[213] ↵

Shifera AS, Hardin JA. 2010. Factors modulating expression of Renilla luciferase from control plasmids used in luciferase reporter gene assays. Anal Biochem 396: 167–172. doi:10.1016/j.ab.2009.09.043

CrossRef Medline Google Scholar

[214] ↵

Siddharthan R. 2010. Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix. PLoS One 5: e9722. doi:10.1371/journal.pone.0009722

CrossRef Medline Google Scholar

[215] ↵

Smale ST. 2009. Nuclear run-on assay. Cold Spring Harb Protoc 2009: pdb.prot5329. doi:10.1101/pdb.prot5329

Abstract/FREE Full Text

[216] ↵

Smale ST. 2010. Luciferase assay. Cold Spring Harb Protoc 2010: pdb.prot5421. doi:10.1101/pdb.prot5421

Abstract/FREE Full Text

[217] ↵

Song Q, Decato B, Hong EE, Zhou M, Fang F, Qu J, Garvin T, Kessler M, Zhou J, Smith AD. 2013. A reference methylome database and analysis pipeline to facilitate integrative and comparative epigenomics. PLoS One 8: e81148. doi:10.1371/journal.pone.0081148

CrossRef Medline Google Scholar

[218] ↵

Specht EA, Braselmann E, Palmer AE. 2017. A critical and comparative review of fluorescent tools for live-cell imaging. Annu Rev Physiol 79: 93–117. doi:10.1146/annurev-physiol-022516-034055

CrossRef Medline Google Scholar

[219] ↵

Srivastava D, Mahony S. 2020. Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns. Biochim Biophys Acta Gene Regul Mech 1863: 194443. doi:10.1016/j.bbagrm.2019.194443

CrossRef Medline Google Scholar

[220] ↵

Stark A, Lin MF, Kheradpour P, Pedersen JS, Parts L, Carlson JW, Crosby MA, Rasmussen MD, Roy S, Deoras AN, et al. 2007. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures. Nature 450: 219–232. doi:10.1038/nature06340

CrossRef Medline Google Scholar

[221] ↵

Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, Stein TI, Nudel R, Lieder I, Mazor Y, et al. 2016. The geneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics 54: 1.30.31–1.30.33. doi:10.1002/cpbi.5

CrossRef Google Scholar

[222] ↵

Sternberg PW, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M, Arnaboldi V, Becerra A, Brown S, Cain S, et al. 2024. WormBase 2024: status and transitioning to Alliance infrastructure. Genetics 227: iyae050. doi:10.1093/genetics/iyae050

CrossRef Medline Google Scholar

[223] ↵

Stormo GD, Zhao Y. 2010. Determining the specificity of protein–DNA interactions. Nat Rev Genet 11: 751–760. doi:10.1038/nrg2845

CrossRef Medline Google Scholar

[224] ↵

Stormo GD, Schneider TD, Gold L, Ehrenfeucht A. 1982. Use of the ‘Perceptron’ algorithm to distinguish translational initiation sites in E. coli. Nucleic Acids Res 10: 2997–3011. doi:10.1093/nar/10.9.2997

CrossRef Medline Google Scholar

[225] ↵

Subach OM, Cranfill PJ, Davidson MW, Verkhusha VV. 2011. An enhanced monomeric blue fluorescent protein with the high chemical stability of the chromophore. PLoS One 6: e28674. doi:10.1371/journal.pone.0028674

CrossRef Medline Google Scholar

[226] ↵

Suske G, Bruford E, Philipsen S. 2005. Mammalian SP/KLF transcription factors: bring in the family. Genomics 85: 551–556. doi:10.1016/j.ygeno.2005.01.005

CrossRef Medline Google Scholar

[227] ↵

Suzuki A, Wakaguri H, Yamashita R, Kawano S, Tsuchihara K, Sugano S, Suzuki Y, Nakai K. 2015. DBTSS as an integrative platform for transcriptome, epigenome and genome sequence variation data. Nucleic Acids Res 43: D87–D91. doi:10.1093/nar/gku1080

CrossRef Medline Google Scholar

[228] ↵

Takai D, Jones PA. 2003. The CpG island searcher: a new WWW resource. In Silico Biol 3: 235–240.

Medline Google Scholar

[229] ↵

Tamura R, Kamiyama D. 2023. CRISPR-Cas9-mediated knock-in approach to insert the GFP(11) tag into the genome of a human cell line. Methods Mol Biol 2564: 185–201. doi:10.1007/978-1-0716-2667-2_8

CrossRef Medline Google Scholar

[230] ↵

Teufel F, Almagro Armenteros JJ, Johansen AR, Gíslason MH, Pihl SI, Tsirigos KD, Winther O, Brunak S, von Heijne G, Nielsen H. 2022. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat Biotechnol 40: 1023–1025. doi:10.1038/s41587-021-01156-3

CrossRef Medline Google Scholar

[231] ↵

Thomas CE, Ehrhardt A, Kay MA. 2003. Progress and problems with the use of viral vectors for gene therapy. Nat Rev Genet 4: 346–358. doi:10.1038/nrg1066

CrossRef Medline Google Scholar

[232] ↵

Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, et al. 2012. The accessible chromatin landscape of the human genome. Nature 489: 75–82. doi:10.1038/nature11232

CrossRef Medline Google Scholar

[233] ↵

Tovell H, Testa A, Maniaci C, Zhou H, Prescott AR, Macartney T, Ciulli A, Alessi DR. 2019. Rapid and reversible knockdown of endogenously tagged endosomal proteins via an optimized HaloPROTAC degrader. ACS Chem Biol 14: 882–892. doi:10.1021/acschembio.8b01016

CrossRef Medline Google Scholar

[234] ↵

Tsherniak A, Vazquez F, Montgomery PG, Weir BA, Kryukov G, Cowley GS, Gill S, Harrington WF, Pantel S, Krill-Burger JM, et al. 2017. Defining a cancer dependency map. Cell 170: 564–576.e16. doi:10.1016/j.cell.2017.06.010

CrossRef Medline Google Scholar

[235] ↵

Uemura T, Mori T, Kurihara T, Kawase S, Koike R, Satoga M, Cao X, Li X, Yanagawa T, Sakurai T, et al. 2016. Fluorescent protein tagging of endogenous protein in brain neurons using CRISPR/Cas9-mediated knock-in and in utero electroporation techniques. Sci Rep 6: 35861. doi:10.1038/srep35861

CrossRef Google Scholar

[236] ↵

Ummethum H, Hamperl S. 2020. Proximity labeling techniques to study chromatin. Front Genet 11: 450. doi:10.3389/fgene.2020.00450

CrossRef Google Scholar

[237] ↵

Uphoff S. 2016. Super-resolution microscopy and tracking of DNA-binding proteins in bacterial cells. Methods Mol Biol 1431: 221–234. doi:10.1007/978-1-4939-3631-1_16

CrossRef Medline Google Scholar

[238] ↵

VanBelzen J, Sakelaris B, Brickner DG, Marcou N, Riecke H, Mangan NM, Brickner JH. 2024. Chromatin endogenous cleavage provides a global view of yeast RNA polymerase II transcription kinetics. eLife 13: RP100764. doi:10.7554/eLife.100764

CrossRef Medline Google Scholar

[239] ↵

van Staalduinen J, van Staveren T, Grosveld F, Wendt KS. 2023. Live-cell imaging of chromatin contacts opens a new window into chromatin dynamics. Epigenetics Chromatin 16: 27. doi:10.1186/s13072-023-00503-9

CrossRef Medline Google Scholar

[240] ↵

van Steensel B, Henikoff S. 2000. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nat Biotechnol 18: 424–428. doi:10.1038/74487

CrossRef Medline Google Scholar

[241] ↵

Verfaillie A, Imrichova H, Janky R, Aerts S. 2015. Iregulon and i-cisTarget: reconstructing regulatory networks using motif and track enrichment. Curr Protoc Bioinformatics 52: 2.16.11–2.16.39. doi:10.1002/0471250953.bi0216s52

CrossRef Google Scholar

[242] ↵

Vickers TA, Crooke ST. 2016. Development of a quantitative BRET affinity assay for nucleic acid-protein interactions. PLoS One 11: e0161930. doi:10.1371/journal.pone.0161930

CrossRef Medline Google Scholar

[243] ↵

Vimalraj S. 2020. Alkaline phosphatase: structure, expression and its function in bone mineralization. Gene 754: 144855. doi:10.1016/j.gene.2020.144855

CrossRef Google Scholar

[244] ↵

Viushkov VS, Lomov NA, Rubtsov MA, Vassetzky YS. 2022. Visualizing the genome: experimental approaches for live-cell chromatin imaging. Cells 11: 4086. doi:10.3390/cells11244086

CrossRef Google Scholar

[245] ↵

Wang Z, Gerstein M, Snyder M. 2009. RNA-seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10: 57–63. doi:10.1038/nrg2484

CrossRef Medline Google Scholar

[246] ↵

Wang Y, Song F, Zhang B, Zhang L, Xu J, Kuang D, Li D, Choudhary MNK, Li Y, Hu M, et al. 2018. The 3D Genome Browser: a web-based browser for visualizing 3D genome organization and long-range chromatin interactions. Genome Biol 19: 151. doi:10.1186/s13059-018-1519-9

CrossRef Medline Google Scholar

[247] ↵

Wang F, Bai X, Wang Y, Jiang Y, Ai B, Zhang Y, Liu Y, Xu M, Wang Q, Han X, et al. 2021. ATACdb: a comprehensive human chromatin accessibility database. Nucleic Acids Res 49: D55–d64. doi:10.1093/nar/gkaa943

CrossRef Medline Google Scholar

[248] ↵

Wang P, Zhang G, Xu Z, Chen Z, Liu X, Wang C, Zheng C, Wang J, Zhang H, Yan A. 2022. Whole-cell FRET monitoring of transcription factor activities enables functional annotation of signal transduction systems in living bacteria. J Biol Chem 298: 102258. doi:10.1016/j.jbc.2022.102258

CrossRef Medline Google Scholar

[249] ↵

Wang Z, He Z, Liu Z, Qu M, Gao C, Wang C, Wang Y. 2023. A reverse chromatin immunoprecipitation technique based on the CRISPR–dCas9 system. Plant Physiol 191: 1505–1519. doi:10.1093/plphys/kiac506

CrossRef Medline Google Scholar

[250] ↵

Wasserman WW, Sandelin A. 2004. Applied bioinformatics for the identification of regulatory elements. Nature Reviews Genetics 5: 276–287. doi:10.1038/nrg1315

CrossRef Medline Google Scholar

[251] ↵

Wen X, Wang Y. 2024. Reverse chromatin immunoprecipitation (R-ChIP). Methods Mol Biol 2846: 123–132. doi:10.1007/978-1-0716-4071-5_8

CrossRef Medline Google Scholar

[252] ↵

Wen X, Wang J, Zhang D, Ding Y, Ji X, Tan Z, Wang Y. 2020. Reverse chromatin immunoprecipitation (R-ChIP) enables investigation of the upstream regulators of plant genes. Commun Biol 3: 770. doi:10.1038/s42003-020-01500-4

CrossRef Medline Google Scholar

[253] ↵

Wen G, Leen V, Rohand T, Sauer M, Hofkens J. 2023. Current progress in expansion microscopy: chemical strategies and applications. Chem Rev 123: 3299–3323. doi:10.1021/acs.chemrev.2c00711

CrossRef Medline Google Scholar

[254] ↵

Wernersson E, Gelali E, Girelli G, Wang S, Castillo D, Mattsson Langseth C, Verron Q, Nguyen HQ, Chattoraj S, Martinez Casals A, et al. 2024. Deconwolf enables high-performance deconvolution of widefield fluorescence microscopy images. Nat Methods 21: 1245–1256. doi:10.1038/s41592-024-02294-7

CrossRef Medline Google Scholar

[255] ↵

Wu C, Chie S-O, Ohmiya Y. 2007. Dual-reporter assay using two secreted luciferase genes. BioTechniques 42: 290–292. doi:10.2144/000112428

CrossRef Medline Google Scholar

[256] ↵

Wu B, Piatkevich KD, Lionnet T, Singer RH, Verkhusha VV. 2011. Modern fluorescent proteins and imaging technologies to study gene expression, nuclear localization, and dynamics. Curr Opin Cell Biol 23: 310–317. doi:10.1016/j.ceb.2010.12.004

CrossRef Medline Google Scholar

[257] ↵

Wu T, Lang J, Sun X, Zhang B, Liu Y, An R. 2013. Monitoring bone marrow stem cells with a reporter gene system in experimental middle cerebral artery occlusion rat models. J Nucl Med 54: 984–989. doi:10.2967/jnumed.112.109280

Abstract/FREE Full Text

[258] ↵

Wu F, Olson BG, Yao J. 2016. DamID-seq: genome-wide mapping of protein-DNA interactions by high throughput sequencing of adenine-methylated DNA fragments. J Vis Exp 107: e53620. doi:10.3791/53620

CrossRef Google Scholar

[259] ↵

Xu T, Close D, Handagama W, Marr E, Sayler G, Ripp S. 2016. The expanding toolbox of in vivo bioluminescent imaging. Front Oncol 6: 150. doi:10.3389/fonc.2016.00150

CrossRef Medline Google Scholar

[260] ↵

Xu J, Ma H, Ma H, Jiang W, Mela CA, Duan M, Zhao S, Gao C, Hahm ER, Lardo SM, et al. 2020. Super-resolution imaging reveals the evolution of higher-order chromatin folding in early carcinogenesis. Nat Commun 11: 1899. doi:10.1038/s41467-020-15718-7

CrossRef Medline Google Scholar

[261] ↵

Yamada N, Lai WKM, Farrell N, Pugh BF, Mahony S. 2019. Characterizing protein–DNA binding event subtypes in ChIP-exo data. Bioinformatics 35: 903–913. doi:10.1093/bioinformatics/bty703

CrossRef Medline Google Scholar

[262] ↵

Yan X, Ray P, Paulmurugan R, Tong R, Gong Y, Sathirachinda A, Wu JC, Gambhir SS. 2013. A transgenic tri-modality reporter mouse. PLoS One 8: e73580. doi:10.1371/journal.pone.0073580

CrossRef Medline Google Scholar

[263] ↵

Yang Y, Pe'er D. 2024. REUNION: transcription factor binding prediction and regulatory association inference from single-cell multi-omics data. Bioinformatics 40: i567–i575. doi:10.1093/bioinformatics/btae234

CrossRef Medline Google Scholar

[264] ↵

Yang L, Zhou T, Dror I, Mathelier A, Wasserman WW, Gordân R, Rohs R. 2014. TFBSshape: a motif database for DNA shape features of transcription factor binding sites. Nucleic Acids Res 42: D148–D155. doi:10.1093/nar/gkt1087

CrossRef Medline Google Scholar

[265] ↵

Yilmaz A, Mejia-Guerra MK, Kurz K, Liang X, Welch L, Grotewold E. 2011. AGRIS: the Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res 39: D1118–D1122. doi:10.1093/nar/gkq1120

CrossRef Medline Google Scholar

[266] ↵

Yin Y, Morgunova E, Jolma A, Kaasinen E, Sahu B, Khund-Sayeed S, Das PK, Kivioja T, Dave K, Zhong F, et al. 2017. Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science 356: eaaj2239. doi:10.1126/science.aaj2239

Abstract/FREE Full Text

[267] ↵

Young AP, Jackson DJ, Wyeth RC. 2020. A technical review and guide to RNA fluorescence in situ hybridization. PeerJ 8: e8806. doi:10.7717/peerj.8806

CrossRef Medline Google Scholar

[268] ↵

Zalatan JG, Lee ME, Almeida R, Gilbert LA, Whitehead EH, La Russa M, Tsai JC, Weissman JS, Dueber JE, Qi LS, et al. 2015. Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds. Cell 160: 339–350. doi:10.1016/j.cell.2014.11.052

CrossRef Medline Google Scholar

[269] ↵

Zambelli F, Pesole G, Pavesi G. 2009. Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes. Nucleic Acids Res 37: W247–W252. doi:10.1093/nar/gkp464

CrossRef Medline Google Scholar

[270] ↵

Zambelli F, Prazzoli GM, Pesole G, Pavesi G. 2012. Cscan: finding common regulators of a set of genes by using a collection of genome-wide ChIP-seq datasets. Nucleic Acids Res 40: W510–W515. doi:10.1093/nar/gks483

CrossRef Medline Google Scholar

[271] ↵

Zeng F, Beck V, Schuierer S, Garnier I, Manneville C, Agarinis C, Morelli L, Quinn L, Knehr J, Roma G, et al. 2020. A simple and efficient CRISPR technique for protein tagging. Cells 9: 2618. doi:10.3390/cells9122618

CrossRef Google Scholar

[272] ↵

Zentner GE, Kasinathan S, Xin B, Rohs R, Henikoff S. 2015. ChEC-seq kinetics discriminates transcription factor binding sites by DNA sequence and shape in vivo. Nat Commun 6: 8733. doi:10.1038/ncomms9733

CrossRef Medline Google Scholar

[273] ↵

Zhang Y, Bao W, Cao Y, Cong H, Chen B, Chen Y. 2022. A survey on protein–DNA-binding sites in computational biology. Brief Funct Genomics 21: 357–375. doi:10.1093/bfgp/elac009

CrossRef Medline Google Scholar

[274] ↵

Zhang M, Zong W, Zou D, Wang G, Zhao W, Yang F, Wu S, Zhang X, Guo X, Ma Y, et al. 2023. MethBank 4.0: an updated database of DNA methylation across a variety of species. Nucleic Acids Res 51: D208–D216. doi:10.1093/nar/gkac969

CrossRef Medline Google Scholar

[275] ↵

Zhao J, Tang J, Elfman J, Li H. 2020. RNase protection assay. In Chimeric RNA: methods and protocols (ed. Li H, Elfman J), pp. 109–116. Springer, New York. doi:10.1007/978-1-4939-9904-0_8

CrossRef Google Scholar

[276] ↵

Zheng R, Wan C, Mei S, Qin Q, Wu Q, Sun H, Chen C-H, Brown M, Zhang X, Meyer CA, et al. 2019. Cistrome Data Browser: expanded datasets and new tools for gene regulatory analysis. Nucleic Acids Res 47: D729–D735. doi:10.1093/nar/gky1094

CrossRef Medline Google Scholar

[277] ↵

Zhu Q, Wong AK, Krishnan A, Aure MR, Tadych A, Zhang R, Corney DC, Greene CS, Bongo LA, Kristensen VN, et al. 2015. Targeted exploration and analysis of large cross-platform human transcriptomic compendia. Nat Methods 12: 211–214. doi:10.1038/nmeth.3249

CrossRef Google Scholar

[278] ↵

Zhu F, Farnung L, Kaasinen E, Sahu B, Yin Y, Wei B, Dodonova SO, Nitta KR, Morgunova E, Taipale M, et al. 2018. The interaction landscape between transcription factors and the nucleosome. Nature 562: 76–81. doi:10.1038/s41586-018-0549-5

CrossRef Medline Google Scholar

[279] ↵

Zou F, Bai L. 2019. Using time-lapse fluorescence microscopy to study gene regulation. Methods 159–160: 138–145. doi:10.1016/j.ymeth.2018.12.010

CrossRef Google Scholar

[280] ↵

Zou M, Zhou H, Gu L, Zhang J, Fang L. 2024. Therapeutic target identification and drug discovery driven by chemical proteomics. Biology (Basel) 13: 555. doi:10.3390/biology13080555

CrossRef Medline Google Scholar

[281] ↵

Zulkower V, Page M, Ropers D, Geiselmann J, de Jong H. 2015. Robust reconstruction of gene expression profiles from reporter gene data using linear inversion. Bioinformatics 31: i71–i79. doi:10.1093/bioinformatics/btv246

CrossRef Medline Google Scholar

A systematic guide for identifying transcription factors that directly regulate the expression of a gene of interest

Abstract

Computational prediction of regulatory sites and TF binding sites

Prediction of potential regulatory sites

TF binding prediction

A summary of TF binding prediction for nonadvanced users

Data mining genome-wide studies

Identifying regulatory regions

Direct TF binding

Secondary sources that suggest regulators of your GOI

Practical gene regulation data mining for nonadvanced user

Experimentally validating TFs by measuring transcription from the GOI

Experimentally validating TFs by measuring GOI transcript levels

Experimentally validating TFs by measuring GOI protein product

Reporter assays

In vivo imaging of reporter genes

Functional assays

Measuring the protein product

Measuring GOI protein levels using fluorescent tagging

Cell-based assays

Experimentally validating if candidate TF binds to the GOI

Microscopic imaging of TF–GOI interactions

High-resolution imaging of TF–GOI interactions

DNA adenine methyltransferase identification

Chromatin endogenous cleavage

Cleavage under targets and tagmentation

How to make an ideal cell-based reporter assay

Summary

Identification of proteins bound to the regulatory elements of the GOI within the cell

Reverse chromatin immunoprecipitation

The PICh approach

Artificial chromosomes

CRISPR-assisted proximity labeling with APEX2

Important consideration when analyzing regulators of GOI expression by genomic modifications

Summary

Competing interest statement

Acknowledgments

Footnotes

References

This Article

Article Category

Services

Citing Articles

Google Scholar

PubMed/NCBI

ORCID

Share

Preprint Server

Navigate This Article

Current Issue

In This Issue