Part:BBa_K5237008
Staple subunit: rGCN4
rGCN4 is an engineered, reversed variant of the yeast transcription factor GCN4, featuring a basic region and a leucine zipper dimerization domain. We used rGCN4 to study DNA-binding affinity for our "Mini staples", which bring two DNA target sites into proximity by binding them simultaneously.
Next to the well-studied linear DNA sequence, the 3D spatial organization of DNA plays a crucial role in gene regulation, cell fate, disease development and more. However, the tools to precisely manipulate this genomic architecture remain limited, rendering it challenging to explore the full potential of the 3D genome in synthetic biology. We, the iGEM Team Heidelberg 2024, have developed PICasSO, a powerful molecular toolbox based on various DNA-binding proteins to address this issue.
The PICasSO part collection offers a comprehensive, modular platform for precise manipulation and re-programming of DNA-DNA interactions using protein staples in living cells, enabling researchers to recreate natural 3D genomic interactions, such as enhancer hijacking, or to design entirely new spatial architectures for gene regulation. Beyond its versatility, PICasSO includes robust assay systems to support the engineering, optimization, and testing of new staples, ensuring functionality in vitro and in vivo. We took special care to include parts crucial for testing every step of the cycle (design, build, test, learn) when engineering new parts.
At its heart, the PICasSO part collection consists of three categories.
(i) Our DNA-binding proteins include our finalized enhancer hijacking Cas staple as well as half staples that can be used by scientists to compose entirely new Cas staples in the future. We also include our Simple staples that serve as controls for successful stapling and can be further engineered to create alternative, simpler and more compact staples.
(ii) As functional elements, we list additional parts that enhance the functionality of our Cas and Basic staples. These consist of protease-cleavable peptide linkers and inteins that allow condition-specific, dynamic stapling in vivo. Besides staple functionality, we also include the parts that enable the efficient delivery of PICasSO's constructs with our interkingdom conjugation system.
(iii) As the final category of our collection, we provide parts that support the use of our custom readout systems. These include components of our established FRET-based proximity assay system, enabling users to confirm accurate stapling. Additionally, we offer a complementary, application-oriented testing system for functional readouts via a luciferase reporter, which allows for straightforward experimental simulation of enhancer hijacking in mammalian cells.
The following table gives a comprehensive overview of all parts in our PICasSO toolbox. The highlighted parts showed exceptional performance as described on our iGEM wiki and can serve as a reference. The other parts in the collection are versatile building blocks designed to provide future iGEMers with the flexibility to engineer their own custom Cas staples, enabling further optimization and innovation.
Our part collection includes:
DNA-binding proteins: The building blocks for engineering custom staples for DNA-DNA interactions, with a modular system ensuring easy assembly.
BBa_K5237000 | fgRNA Entry vector MbCas12a-SpCas9 | Entry vector for simple fgRNA cloning via SapI |
BBa_K5237001 | Staple subunit: dMbCas12a-Nucleoplasmin NLS | Staple subunit that can be combined with sgRNA or fgRNA and dCas9 to form a functional staple |
BBa_K5237002 | Staple subunit: SV40 NLS-dSpCas9-SV40 NLS | Staple subunit that can be combined with sgRNA or fgRNA and dCas12a to form a functional staple |
BBa_K5237003 | Cas Staple: SV40 NLS-dMbCas12a-dSpCas9-Nucleoplasmin NLS | Functional Cas staple that can be combined with sgRNA or fgRNA to bring two DNA strands into close proximity |
BBa_K5237004 | Staple subunit: Oct1-DBD | Staple subunit that can be combined to form a functional staple, for example with TetR. Can also be combined with a fluorescent protein as part of the FRET proximity assay |
BBa_K5237005 | Staple subunit: TetR | Staple subunit that can be combined to form a functional staple, for example with Oct1. Can also be combined with a fluorescent protein as part of the FRET proximity assay |
BBa_K5237006 | Simple staple: TetR-Oct1 | Functional staple that can be used to bring two DNA strands in close proximity |
BBa_K5237007 | Staple subunit: GCN4 | Staple subunit that can be combined to form a functional staple, for example with rGCN4 |
BBa_K5237008 | Staple subunit: rGCN4 | Staple subunit that can be combined to form a functional staple, for example with GCN4 |
BBa_K5237009 | Mini staple: bGCN4 | Assembled staple with minimal size that can be further engineered |
Functional elements: Protease-cleavable peptide linkers and inteins are used to control and modify staples for further optimization for custom applications
BBa_K5237010 | Cathepsin B-cleavable Linker: GFLG | Cathepsin B-cleavable peptide linker that can be used to combine two staple subunits to make responsive staples |
BBa_K5237011 | Cathepsin B Expression Cassette | Expression Cassette for the overexpression of cathepsin B |
BBa_K5237012 | Caged NpuN Intein | A caged NpuN split intein fragment that undergoes protein trans-splicing after protease activation. Can be used to create functionalized staple units |
BBa_K5237013 | Caged NpuC Intein | A caged NpuC split intein fragment that undergoes protein trans-splicing after protease activation. Can be used to create functionalized staple units |
BBa_K5237014 | fgRNA processing cassette | Processing cassette to produce multiple fgRNAs from one transcript, which can be used for multiplexed 3D genome reprogramming |
BBa_K5237015 | Intimin anti-EGFR Nanobody | Interkingdom conjugation between bacteria and mammalian cells, as an alternative delivery tool for large constructs |
BBa_K4643003 | incP origin of transfer | Origin of transfer that can be cloned into the plasmid vector and used for conjugation as a means of delivery |
Readout Systems: FRET and enhancer recruitment to measure proximity of stapled DNA in living bacterial and mammalian cells, enabling swift testing and easy development of new systems
BBa_K5237016 | FRET-Donor: mNeonGreen-Oct1 | FRET donor fluorophore fused to the Oct1-DBD, which binds to the Oct1 binding cassette. Can be used to visualize DNA-DNA proximity |
BBa_K5237017 | FRET-Acceptor: TetR-mScarlet-I | Acceptor part for the FRET assay binding the TetR binding cassette. Can be used to visualize DNA-DNA proximity |
BBa_K5237018 | Oct1 Binding Cassette | DNA sequence containing 12 Oct1 binding motifs, compatible with various assays such as the FRET proximity assay |
BBa_K5237019 | TetR Binding Cassette | DNA sequence containing TetR binding motifs, can be used for different assays such as the FRET proximity assay |
BBa_K5237020 | Cathepsin B-Cleavable Trans-Activator: NLS-Gal4-GFLG-VP64 | Readout system that responds to protease activity. It was used to test the cathepsin B-cleavable linker |
BBa_K5237021 | NLS-Gal4-VP64 | Trans-activating enhancer that can be used to simulate enhancer hijacking |
BBa_K5237022 | mCherry Expression Cassette: UAS, minimal Promoter, mCherry | Readout system for enhancer binding. It was used to test the cathepsin B-cleavable linker |
BBa_K5237023 | Oct1 - 5x UAS binding cassette | Oct1 and UAS binding cassette that was used for the simulated enhancer hijacking assay |
BBa_K5237024 | TRE-minimal promoter- firefly luciferase | Contains Firefly luciferase controlled by a minimal promoter. It was used as a luminescence readout for simulated enhancer hijacking |
1. Sequence overview
- Compatible with RFC[10]
- Compatible with RFC[12]
- Compatible with RFC[21]
- Compatible with RFC[23]
- Compatible with RFC[25]
- Compatible with RFC[1000]
2. Usage and Biology
rGCN4 is an engineered variant of the yeast transcription factor GCN4 (BBa_K5237007). In contrast to GCN4, which binds the CRE target sequence with its N-terminal region, rGCN4 was engineered as a reverse variant that binds a modified DNA target sequence with its C-terminal region. The reported binding affinity of rGCN4 for its optimal target sequence INVii (5' GTCAtaTGAC 3'; upper-case letters indicate direct interaction between protein and DNA) compares favorably to the binding affinity of wild-type GCN4 for its native target sequence. In our project, we first analyzed the DNA-binding affinity of rGCN4 and then fused it to GCN4 to create a functional, minimal bivalent protein staple called Mini staple (BBa_K5237009) (Hollenbeck & Oakley, 2000).
3. Assembly and part evolution
The rGCN4 amino acid sequence was taken from the literature (Hollenbeck & Oakley, 2000) and codon-optimized for E. coli. A FLAG-tag (DYKDDDDK) was added to the N-terminus for protein purification. If necessary, the FLAG-tag can be cleaved off using enterokinase. The FLAG-rGCN4 sequence was cloned into a T7 expression vector and expressed in E. coli BL21 (DE3) cells.
4. Results
4.1 Protein expression and purification
The FLAG-rGCN4 protein was readily expressed in E. coli BL21 (DE3) and purified using an anti-FLAG resin. Fractions taken during purification were analyzed by SDS-PAGE. Purified protein was quantified using a Lowry assay; 3.4 mg/mL was obtained, corresponding to 422 µM monomeric FLAG-rGCN4.
4.2 Electrophoretic mobility shift assay
rGCN4 was tested for its DNA-binding capability using an electrophoretic mobility shift assay (EMSA). The protein was incubated with a DNA probe containing the rGCN4 binding site (INVii), and formation of the protein-DNA complex was analyzed by native PAGE.
To further analyze DNA binding, quantitative shift assays were performed for GCN4 (BBa_K5237007) and rGCN4. 0.5 µM DNA was incubated with varying concentrations of protein until equilibrium was reached. After electrophoresis, bands were stained with SYBR-Safe and quantified based on pixel intensity. The obtained values were fitted to Equation 1, which describes the formation of a 2:1 protein-DNA complex:
$$\Theta_{app} = \Theta_{min} + (\Theta_{max} - \Theta_{min}) \times \frac{K_a^2 \, [L]_{tot}^2}{1 + K_a^2 \, [L]_{tot}^2}$$
Equation 1
Here, [L]tot is the total protein monomer concentration and Ka is the apparent monomeric equilibrium association constant. Θmin and Θmax are the experimentally determined site saturation values (for this experiment, 0 and 1 were chosen for min and max, respectively). GCN4 binds to its optimal DNA binding motif with an apparent dissociation constant Kd of (0.293 ± 0.033) × 10⁻⁶ M, which is almost identical to the binding affinity of rGCN4 for INVii, with a Kd of (0.298 ± 0.030) × 10⁻⁶ M.
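To make the fitting step concrete, the following is a minimal sketch of Equation 1 and a simple least-squares estimate of Ka. It is not the analysis code used for this part: the function names, the log-spaced grid search, and the concentration series are illustrative assumptions, and the "data" are synthetic, noise-free points generated from the reported rGCN4 Kd. The sketch also assumes Kd = 1/Ka with Θmin = 0 and Θmax = 1.

```python
def fraction_bound(l_tot, ka, theta_min=0.0, theta_max=1.0):
    """Equation 1: apparent site saturation for a 2:1 protein-DNA complex.

    l_tot: total protein monomer concentration in M
    ka:    apparent monomeric association constant in 1/M
    """
    x = (ka * l_tot) ** 2
    return theta_min + (theta_max - theta_min) * x / (1.0 + x)


def fit_ka(concs, thetas, log_lo=4.0, log_hi=9.0, n=5001):
    """Estimate Ka by scanning a log-spaced grid and minimizing squared error.

    The grid bounds (1e4 to 1e9 1/M) are an assumption chosen for illustration.
    """
    best_ka, best_sse = None, float("inf")
    for i in range(n):
        ka = 10 ** (log_lo + (log_hi - log_lo) * i / (n - 1))
        sse = sum((fraction_bound(c, ka) - t) ** 2 for c, t in zip(concs, thetas))
        if sse < best_sse:
            best_ka, best_sse = ka, sse
    return best_ka


# Synthetic, noise-free example data (NOT measured values):
# generated from the apparent Kd reported for rGCN4 in the text.
kd_true = 0.298e-6  # M
concs = [0.05e-6, 0.1e-6, 0.2e-6, 0.3e-6, 0.5e-6, 1e-6, 2e-6]  # M, hypothetical titration
thetas = [fraction_bound(c, 1.0 / kd_true) for c in concs]

kd_fit = 1.0 / fit_ka(concs, thetas)  # assumes Kd = 1/Ka
print(f"fitted Kd = {kd_fit:.3e} M")
```

Under these assumptions, half-maximal saturation is reached when the total monomer concentration equals Kd, which is a quick sanity check for any fitted value. For real band intensities, a dedicated nonlinear least-squares routine would be the more usual choice than a grid scan.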
The apparent dissociation constants determined for GCN4 ((0.293 ± 0.033) × 10⁻⁶ M) and rGCN4 ((0.298 ± 0.030) × 10⁻⁶ M) are roughly 3-fold (GCN4) to 10-fold (rGCN4) higher than those described in the literature ((9 ± 6) × 10⁻⁸ M for GCN4 and (2.9 ± 0.8) × 10⁻⁸ M for rGCN4) (Hollenbeck et al., 2001). The differences could be due to the lower sensitivity of SYBR-Safe staining compared to radio-labeled oligonucleotides. More likely, the protein concentration was overestimated: additional, lower-intensity bands in the SDS-PAGE analysis indicate co-purification of small amounts of unspecific proteins, which also contribute to the Lowry signal.
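The size of the discrepancy can be checked with a few lines of arithmetic. The sketch below only divides the central Kd values quoted in the text (uncertainties are ignored); the dictionary and variable names are illustrative.

```python
# Central Kd values in M, as quoted in the text (uncertainties ignored).
measured = {"GCN4": 0.293e-6, "rGCN4": 0.298e-6}   # this work, EMSA with SYBR-Safe
literature = {"GCN4": 9e-8, "rGCN4": 2.9e-8}       # Hollenbeck et al., 2001

# Fold difference = measured Kd / literature Kd (values > 1 mean weaker apparent binding).
fold = {name: measured[name] / literature[name] for name in measured}

for name, value in fold.items():
    print(f"{name}: measured Kd is {value:.1f}-fold higher than the literature value")
```

Given the large relative uncertainty of the literature value for GCN4, these ratios are best read as order-of-magnitude estimates.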
An N-terminal FLAG-tag fusion could potentially decrease binding affinity through steric hindrance of the protein-DNA interaction. Interestingly, the difference in binding affinity between GCN4 and rGCN4 appears negligible. Since GCN4 binds DNA via its N-terminal region and rGCN4 via its C-terminal region, a tag that interfered directly with DNA binding would be expected to affect GCN4 more strongly, so the FLAG-tag likely does not directly influence DNA binding. However, it may influence dimerization of the proteins, which is necessary for DNA binding. To investigate this further, the FLAG-tag can be cleaved off using enterokinase and potential changes in binding affinity analyzed.
4.3 In Silico Characterization using DaVinci
We developed the in silico model DaVinci for rapid engineering and development of our PICasSO system. DaVinci acts as a digital twin to PICasSO, designed to understand the forces acting on our system, refine experimental parameters, and find optimal connections between protein staples and target DNA. We calibrated DaVinci using literature values and our own experimental affinity data, obtained by EMSA with purified proteins. This enabled us to simulate enhancer hijacking in silico, providing valuable input for the design of further experiments. We applied the same approach to our part collection. DaVinci is divided into three phases: static structure prediction, all-atom dynamics simulation, and long-range DNA dynamics simulation. We applied the first two to our parts, characterizing the structure and dynamics of the DNA-binding interaction.
In our efforts to create a bivalent DNA-binding protein of minimal size, we created a Mini staple (BBa_K5237009) consisting of GCN4 (BBa_K5237007) fused to rGCN4 via a GSG linker. The structure and binding affinity of rGCN4 were predicted and calculated. Furthermore, different possible linkers were tested and compared based on dimerization and DNA-binding affinity.
5. References
Hollenbeck, J. J., Gurnon, D. G., Fazio, G. C., Carlson, J. J., & Oakley, M. G. (2001). A GCN4 Variant with a C-Terminal Basic Region Binds to DNA with Wild-Type Affinity. Biochemistry, 40(46), 13833–13839.
Hollenbeck, J. J., & Oakley, M. G. (2000). GCN4 Binds with High Affinity to DNA Sequences Containing a Single Consensus Half-Site. Biochemistry, 39(21), 6380–6389. https://doi.org/10.1021/bi992705n