Coding

Part:BBa_K5237008

Designed by: Marik Mueller   Group: iGEM24_Heidelberg   (2024-09-29)
Revision as of 09:07, 30 September 2024 by Marik (Talk | contribs)

BBa_K5237008

Staple subunit: rGCN4

rGCN4 is an engineered, reverse, variant of the yeast transcription factor GCN4, featuring a basic region and a leucine zipper dimerization domain. We used rGCN4 to study DNA-binding kinetics in our "Mini staples" that bring two DNA target sites into proximity by binding them simultaneously.

 

The PICasSO Toolbox


Figure 1: Example how the part collection can be used to engineer new staples


The 3D organization of the genome plays a crucial role in regulating gene expression in eukaryotic cells, impacting cellular behavior, evolution, and disease. Beyond the linear DNA sequence, the spatial arrangement of chromatin, influenced by DNA-DNA interactions, shapes pathways of gene regulation. However, the tools to precisely manipulate this genomic architecture remain limited, rendering it challenging to explore the full potential of the 3D genome in synthetic biology. We - iGEM Team Heidelberg 2024 - have developed PICasSO, a powerful molecular toolbox based on various DNA-binding proteins to address this issue.

The PICasSO part collection offers a comprehensive, modular platform for precise manipulation and re-programming of DNA-DNA interactions using protein staples in living cells, enabling researchers to recreate natural 3D genomic interactions, such as enhancer hijacking, or to design entirely new spatial architectures for gene regulation. Beyond its versatility, PICasSO includes robust assay systems to support the engineering, optimization, and testing of new staples, ensuring functionality in vitro and in vivo. We took special care to include parts crucial for testing every step of the cycle (design, build, test, learn) when engineering new parts

At its heart, the PICasSO part collection consists of three categories.
(i) Our DNA-binding proteins include our finalized enhancer hijacking Cas staple as well as half staples that can be used by scientists to compose entirely new Cas staples in the future. We also include our simple staples that serve as controls for successful stapling and can be further engineered to create alternative, simpler and more compact staples.
(ii) As functional elements, we list additional parts that enhance the functionality of our Cas and Basic staples. These consist of protease-cleavable peptide linkers and inteins that allow condition-specific, dynamic stapling in vivo. Besides staple functionality, we also include the parts to enable the efficient delivery of PICasSO's constructs with our interkingdom conjugation system.
(iii) As the final component of our collection, we provide parts that support the use of our custom readout systems. These include components of our established FRET-based proximity assay system, enabling users to confirm accurate stapling. Additionally, we offer a complementary, application-oriented testing system for functional readout via a luciferase reporter, which allows for straightforward experimental simulation of enhancer hijacking.

The following table gives a complete overview of all parts in our PICasSO toolbox. The highlighted parts showed exceptional performance as described on our iGEM wiki and can serve as a reference. The other parts in the collection are versatile building blocks designed to provide future iGEMers with the flexibility to engineer their own custom Cas staples, enabling further optimization and innovation.

Our part collection includes:

DNA-binding proteins: The building blocks for engineering of custom staples for DNA-DNA interactions with a modular system ensuring easy assembly.
BBa_K5237000 fgRNA Entryvector MbCas12a-SpCas9 Entryvector for simple fgRNA cloning via SapI
BBa_K5237001 Staple subunit: dMbCas12a-Nucleoplasmin NLS Staple subunit that can be combined to form a functional staple, for example with fgRNA and dCas9
BBa_K5237002 Staple subunit: SV40 NLS-dSpCas9-SV40 NLS Staple subunit that can be combined to form a functional staple, for example with our fgRNA or dCas12a
BBa_K5237003 Cas-Staple: SV40 NLS-dMbCas12a-dSpCas9-Nucleoplasmin NLS Functional Cas staple that can be combined with sgRNA or fgRNA to bring two DNA strands in close proximity
BBa_K5237004 Staple subunit: Oct1-DBD Staple subunit that can be combined to form a functional staple, for example with TetR.
Can also be combined with a fluorescent protein as part of the FRET proximity assay
BBa_K5237005 Staple subunit: TetR Staple subunit that can be combined to form a functional staple, for example with Oct1.
Can also be combined with a fluorescent protein as part of the FRET proximity assay
BBa_K5237006 Simple taple: TetR-Oct1 Functional staple that can be used to bring two DNA strands in close proximity
BBa_K5237007 Staple subunit: GCN4 Staple subunit that can be combined to form a functional staple, for example with rGCN4
BBa_K5237008 Staple subunit: rGCN4 Staple subunit that can be combined to form a functional staple, for example with rGCN4
BBa_K5237009 Mini staple: bGCN4 Assembled staple with minimal size that can be further engineered
Functional elements: Protease cleavable peptide linkers and inteins are used to control and modify staples for further optimization for custom applications.
BBa_K5237010 Cathepsin B-Cleavable Linker (GFLG) Cathepsin B cleavable peptide linker, that can be used to combine two staple subunits ,to make responsive staples
BBa_K5237011 Cathepsin B Expression Cassette Cathepsin B which can be selectively express to cut the cleavable linker
BBa_K5237012 Caged NpuN Intein Undergoes protein transsplicing after protease activation, can be used to create functionalized staple units
BBa_K5237013 Caged NpuC Intein Undergoes protein transsplicing after protease activation, can be used to create functionalized staple units
BBa_K5237014 fgRNA processing casette Processing casette to produce multiple fgRNAs from one transcript, can be used for multiplexing
BBa_K5237015 Intimin anti-EGFR Nanobody Interkindom conjugation between bacteria and mammalian cells, as alternative delivery tool for large constructs
Readout Systems: FRET and enhancer recruitment to measure proximity of stapled DNA in bacterial and mammalian living cells enabling swift testing and easy development for new systems.
BBa_K5237016 FRET-Donor: mNeonGreen-Oct1 Donor part for the FRET assay binding the Oct1 binding cassette. Can be used to visualize DNA-DNA proximity
BBa_K5237017 FRET-Acceptor: TetR-mScarlet-I Acceptor part for the FRET assay binding the TetR binding cassette. Can be used to visualize DNA-DNA proximity
BBa_K5237018 Oct1 Binding Casette DNA sequence containing 12 Oct1 binding motifs, can be used for different assays such as the FRET proximity assay
BBa_K5237019 TetR Binding Cassette DNA sequence containing 12 Oct1 binding motifs, can be used for different assays such as the FRET proximity assay
BBa_K5237020 Cathepsin B-Cleavable Trans-Activator: NLS-Gal4-GFLG-VP64 Readout system that responds to protease activity. It was used to test Cathepsin-B cleavable linker.
BBa_K5237021 NLS-Gal4-VP64 Trans-activating enhancer, that can be used to simulate enhancer hijacking.
BBa_K5237022 mCherry Expression Cassette: UAS, minimal Promotor, mCherry Readout system for enhancer binding. It was used to test Cathepsin-B cleavable linker.
BBa_K5237023 Oct1 - 5x UAS binding casette Oct1 and UAS binding cassette, that was used for the simulated enhancer hijacking assay.
BBa_K5237024 TRE-minimal promoter- firefly luciferase Contains Firefly luciferase controlled by a minimal promoter. It was used as a luminescence readout for simulated enhancer hijacking.

1. Sequence overview

Sequence and Features


Assembly Compatibility:
  • 10
    COMPATIBLE WITH RFC[10]
  • 12
    COMPATIBLE WITH RFC[12]
  • 21
    COMPATIBLE WITH RFC[21]
  • 23
    COMPATIBLE WITH RFC[23]
  • 25
    COMPATIBLE WITH RFC[25]
  • 1000
    COMPATIBLE WITH RFC[1000]

2. Usage and Biology

rGCN4 is an engineered variant of the yeast transcription factor GCN4. GCN4 transcription factor (BBa_K5237007). In contrast to GCN4 that binds the CRE target sequence with the N-terminal region, rGCN4 was engineered as a reverse variant, binding a modified DNA target sequence with the C-terminal region. The described binding affinity of rGCN4 to it's optimal target sequence INVii (5' GTCAtaTGAC 3', upper case letters indicate direct interaction between protein and DNA), compares favorably to the wild-type GCN4 binding affinity to its native target sequence. In our project we first wanted to analyze the DNA binding affinity of rGCN4 and then fuse it to GCN4 to create a functional minimal bivalent protein staple called Mini staple (BBa_K5237009). (Hollenbeck & Oakley 1999)

3. Assembly and part evolution

The rGCN4 amino acid sequence was taken from literature (Hollenbeck & Oakley 1999) and codon optimized for E. coli. A FLAG-tag (DYKDDDDK) was added to the N-terminus for protein purification. If necessary, thhe FLAG-tag can be cleaved off using an Enterokinase, if necessary. The FLAG-GCN4 sequence was cloned into a T7 expression vector and expressed using E. coli BL21 (DE3) cells.

4. Results

4.1 Protein expression and purification

The FLAG-rGCN4 protein could be readily expressed in E. coli BL21 (DE3). The protein was purified using an anti-FLAG resin. Fractions taken during purification were analyzed by SDS-PAGE. Purified protein was quantified using a Lowry assay, 3.4 mg/mL were obtained, resulting in 422 µM of monomeric FLAG-GCN4.

Figure 2: SDS-PAGE analysis of FLAG-GCN4 purification Fractions analysed are the raw lysate, flow through and eluate. Depicted is GCN4 (this part), rGCN4 (BBa_K5237008), and bGCN4 (BBa_K5237009). Protein size is indicated next to construct name and purified band with protein of interest highlighted by a red box.

4.2 Electrophoretic Mobility shift assay

The Electrophoretic mobility shift assay (EMSA) is a widely adopted method used to study DNA-protein interactions. EMSa functions on the basis that nucleic acids bound to proteins have reduced electrophoretic mobility, compared to their counterpart. (Hellman & Fried, 2007). Mobility-shift assays can both be used to qualitatively assess DNA binding capabilities or quantitatively to determine binding stoichiometry and kinetics such as the apparent dissociation constant (Kd) (Fried, 1989).

Figure 3: Overview Image of Electrophoretic Mobility Shift Assay (EMSA)

rGCN4 was tested for its DNA-binding capabilities using an electrophoretic mobility shift assay (EMSA). The protein was incubated with a DNA probe containing the rGCN4 binding site (INVii). The formation of a protein-DNA complex was analyzed by native PAGE. To further analyze DNA binding, quantitative shift assays were performed for GCN4 (BBa_K5237008) and rGCN4. 0.5 µM DNA were incubated with varying concentrations of protein until equilibration. After electrophoresis, bands were stained with SYBR-Safe and quantified based on pixel intensity. The obtained values were fitted to equation 1, describing formation of a 2:1 protein-DNA complex:

Θapp = Θmin + (Θmax - Θmin) × (Ka2 [L]tot2) / (1 + Ka2 [L]tot2) Equation 1

Here [L]tot describes the total protein monomer concentration, Ka corresponds to the apparent monomeric equilibration constant. The Θmin/max values are the experimentally determined site saturation values (For this experiment 0 and 1 were chosen for min and max respectively). GCN4 binds to its optimal DNA binding motif with an apparent dissociation constant Kk of (0.2930.033)×10-6 M, which is almost identical to the rGCN4 binding affinity to INVii a d of (0.2980.030)×10-6 M.

Figure 4: Quantitative EMSAQuantitative assessment of binding affinity for GCN4 and rGCN4. Proteins of different concentrations were incubated with 0.5 µM DNA until equilibrium and fraction bound analyzed, after gel electrophoresis, by dividing pixel intensity of bound fraction with pixel intensity of bound and unbound fraction. At least three separate measurements were conducted for each data point. Values are presented as mean +/- SD.

The apparent binding kinetics calculated for GCN4 ((0.2930.033) × 10-6 M) and rGCN4 ((0.2980.030) × 10-6 M) are approximately a factor 10 higher then those described in literature ((96) × 10-8 M for GCN4 and (2.90.8) × 10-8 M for rGCN4) (Hollenbeck et al., 2001). The differences could be due to the lower sensitivity of SYBR-Safe staining compared to radio-labeled oligos. Most likely, the protein concentration was miscalculated due to the presence of additional (lower intensity) bands in the SDS-PAGE analysis, indicating the co-purification of small amounts of unspecific proteins.

The FLAG-tag fusion to the N-terminus of proteins could potentially decrease binding affinity, likely due to steric hindrance affecting the interaction with DNA. Interestingly, the differences in binding affinity between GCN4 and rGCN4 appear negligible. Since GCN4 binds to DNA via its N-terminus and rGCN4 binds C-terminally, the FLAG-tag likely does not directly influence DNA binding. However, it may influence the dimerization of the proteins, which is necessary for DNA binding. To further investigate this, the FLAG-tag can be cleaved using an enterokinase and potential changes in binding affinity analyzed

5. References

Hollenbeck, J. J., Gurnon, D. G., Fazio, G. C., Carlson, J. J., & Oakley, M. G. (2001). A GCN4 Variant with a C-Terminal Basic Region Binds to DNA with Wild-Type Affinity. Biochemistry, 40(46), 13833–13839.

Hollenbeck, J. J., & Oakley, M. G. (2000). GCN4 Binds with High Affinity to DNA Sequences Containing a Single Consensus Half-Site. Biochemistry, 39(21), 6380–6389. https://doi.org/10.1021/bi992705n

[edit]
Categories
Parameters
None