Difference between revisions of "Part:BBa K5237008"

Line 67: Line 67:
 
       </li>
 
       </li>
 
       <li class="toclevel-1 tocsection-5"><a href="#4"><span class="tocnumber">4</span> <span
 
       <li class="toclevel-1 tocsection-5"><a href="#4"><span class="tocnumber">4</span> <span
        class="toctext">Results</span></a>
+
            class="toctext">Results</span></a>
    <ul>
+
        <ul>
      <li class="toclevel-2 tocsection-6"><a href="#4.1"><span class="tocnumber">4.1</span> <span class="toctext">Protein expression and
+
          <li class="toclevel-2 tocsection-6"><a href="#4.1"><span class="tocnumber">4.1</span> <span
          purification</span></a>
+
                class="toctext">Protein expression and
 +
                purification</span></a>
 +
          </li>
 +
          <li class="toclevel-2 tocsection-7"><a href="#4.2"><span class="tocnumber">4.2</span> <span
 +
                class="toctext">Electrophoretic Mobility
 +
                shift assay</span></a>
 +
          </li>
 +
          <li class="toclevel-2 tocsection-8"><a href="#4.3"><span class="tocnumber">4.3</span> <span
 +
                class="toctext"><i>In Silico</i>
 +
                Characterization using DaVinci</span></a>
 +
          </li>
 +
        </ul>
 
       </li>
 
       </li>
      <li class="toclevel-2 tocsection-7"><a href="#4.2"><span class="tocnumber">4.2</span> <span class="toctext">Electrophoretic Mobility
 
          shift assay</span></a>
 
      </li>
 
      <li class="toclevel-2 tocsection-8"><a href="#4.3"><span class="tocnumber">4.3</span> <span class="toctext"><i>In Silico</i>
 
          Characterization using DaVinci</span></a>
 
    </ul>
 
  </li>
 
 
       <li class="toclevel-1 tocsection-8"><a href="#5"><span class="tocnumber">5</span> <span
 
       <li class="toclevel-1 tocsection-8"><a href="#5"><span class="tocnumber">5</span> <span
 
             class="toctext">References</span></a>
 
             class="toctext">References</span></a>
Line 98: Line 102:
 
     <p>
 
     <p>
 
       <br />
 
       <br />
       Next to the well-studied linear DNA sequence, the 3D spatial organization of DNA plays a crucial role in gene
+
       Next to the well-studied linear DNA sequence, the <b>3D spatial organization</b> of DNA plays a crucial role in
       regulation,
+
       gene regulation,
       cell fate, disease development and more. However, the tools to precisely manipulate this genomic architecture
+
       cell fate, disease development and more. However, the tools to precisely manipulate this genomic
       remain limited, rendering it challenging to explore the full potential of the
+
       architecture remain limited, rendering it challenging to explore the full potential of the
       3D genome in synthetic biology. We - iGEM Team Heidelberg 2024 - have developed PICasSO, a powerful molecular
+
       3D genome in synthetic biology. We - iGEM Team Heidelberg 2024 - have developed PICasSO, a <b>powerful
      toolbox based on various DNA-binding proteins to address this issue.
+
        molecular toolbox</b> based on various DNA-binding proteins to address this issue.
 
     </p>
 
     </p>
 
     <p>
 
     <p>
 
       The <b>PICasSO</b> part collection offers a comprehensive, modular platform for precise manipulation and
 
       The <b>PICasSO</b> part collection offers a comprehensive, modular platform for precise manipulation and
       re-programming
+
       <b>re-programming
      of DNA-DNA interactions using protein staples in living cells, enabling researchers to recreate natural 3D genomic
+
        of DNA-DNA interactions</b> using protein staples in living cells, enabling researchers to recreate natural 3D
 +
      genomic
 
       interactions, such as enhancer hijacking, or to design entirely new spatial architectures for gene regulation.
 
       interactions, such as enhancer hijacking, or to design entirely new spatial architectures for gene regulation.
       Beyond its versatility, PICasSO includes robust assay systems to support the engineering, optimization, and
+
       Specifically, the fusion of two DNA binding proteins enables to artifically bring distant genomic loci into
 +
      proximty.
 +
      To unlock the system's full potential, we introduce versatile <b>chimeric CRISPR/Cas complexes</b>, connected
 +
      either on
 +
      the protein or the guide RNA level. These1 complexes are reffered to as protein- or Cas staples. Beyond its
 +
      versatility, PICasSO includes <b>robust assay</b> systems to support the engineering, optimization, and
 
       testing of new staples, ensuring functionality <i>in vitro</i> and <i>in vivo</i>. We took special care to include
 
       testing of new staples, ensuring functionality <i>in vitro</i> and <i>in vivo</i>. We took special care to include
       parts crucial for testing every step of the cycle (design, build, test, learn) when engineering new parts.
+
       parts crucial for testing every step of the cycle (design, build, test, learn) when <b>engineering new parts</b>.
 
     </p>
 
     </p>
 
     <p>At its heart, the PICasSO part collection consists of three categories. <br /><b>(i)</b> Our <b>DNA-binding
 
     <p>At its heart, the PICasSO part collection consists of three categories. <br /><b>(i)</b> Our <b>DNA-binding
Line 326: Line 336:
 
   <h1>2. Usage and Biology</h1>
 
   <h1>2. Usage and Biology</h1>
 
   <p>
 
   <p>
     rGCN4 is an engineered variant of the yeast transcription factor GCN4.
+
     rGCN4 is an engineered variant of the yeast GCN4 transcription factor (<a
    GCN4 transcription factor (<a href="https://parts.igem.org/Part:BBa_K5237007">BBa_K5237007</a>).
+
      href="https://parts.igem.org/Part:BBa_K5237007">BBa_K5237007</a>).
     In contrast to GCN4 that binds the CRE target sequence with the N-terminal region, rGCN4 was engineered as a reverse
+
     In contrast to GCN4 that binds the CRE target sequence with the N-terminal, rGCN4 binds the modified INVii (5'
     variant, binding a modified DNA target sequence with the C-terminal region. The described binding affinity of rGCN4
+
     GTCAtaTGAC 3') DNA target sequence with the C-terminal region.
     to it's optimal target sequence INVii (5' GTCAtaTGAC 3', upper case letters indicate direct interaction between
+
     rGCN4 is an engineered variant of the yeast GCN4 transcription factor (<a
    protein and DNA), compares favorably to the wild-type GCN4 binding affinity to its native target sequence.
+
      href="https://parts.igem.org/Part:BBa_K5237007">BBa_K5237007</a>).
     In our project we first
+
     In our project, we used rGCN4 to study DNA-binding kinetics in our "Mini staples" (<a
    wanted
+
       href="https://parts.igem.org/Part:BBa_K5237009">BBa_K5237009</a>) that bring two DNA target sites into proximity
    to analyze the DNA binding affinity of rGCN4 and then fuse it to GCN4 to create a
+
    by binding them simultaneously.
    functional minimal bivalent protein staple called Mini staple (<a
+
 
       href="https://parts.igem.org/Part:BBa_K5237009">BBa_K5237009</a>). (Hollenbeck &amp; Oakley 1999)
+
 
   </p>
 
   </p>
 
</section>
 
</section>
Line 344: Line 353:
 
     The rGCN4 amino acid sequence was taken from literature (Hollenbeck &amp; Oakley 1999) and codon optimized for <i>E.
 
     The rGCN4 amino acid sequence was taken from literature (Hollenbeck &amp; Oakley 1999) and codon optimized for <i>E.
 
       coli</i>.
 
       coli</i>.
     A FLAG-tag (DYKDDDDK) was added to the N-terminus for protein purification. If necessary, thhe FLAG-tag can be
+
     A FLAG-tag (DYKDDDDK) was added to the N-terminus for protein purification. The FLAG-tag can be
 
     cleaved off using an Enterokinase, if necessary.
 
     cleaved off using an Enterokinase, if necessary.
     The FLAG-GCN4 sequence was cloned into a T7 expression vector and expressed using <i>E. coli</i> BL21 (DE3) cells.
+
     Expression of FLAG-GCN4 was done under an IPTG inducible T7 promoter in <i>E. coli</i> BL21 (DE3) cells.
 
   </p>
 
   </p>
 
</section>
 
</section>
Line 353: Line 362:
 
   <section id="4.1">
 
   <section id="4.1">
 
     <h2>4.1 Protein expression and purification</h2>
 
     <h2>4.1 Protein expression and purification</h2>
     <p>The FLAG-rGCN4 protein could be readily expressed in <i>E. coli</i> BL21 (DE3). The protein was purified using an
+
     <p>The FLAG-rGCN4 protein could be readily expressed in <i>E. coli</i>. The protein was purified using an
       anti-FLAG resin.
+
       anti-FLAG resin. Fractions taken during purification were analyzed by SDS-PAGE and the protein concentration of
      Fractions taken during purification were analyzed by SDS-PAGE. Purified protein was quantified using a Lowry
+
      the eluted protein determined with a lowry protein assay.
       assay, 3.4 mg/mL were obtained, resulting in 422 µM of monomeric FLAG-GCN4.
+
       A yield of 3.4 mg/mL was obtained, corresponding to 422 µM of monomeric FLAG-GCN4.
 
     </p>
 
     </p>
 
     <div class="thumb">
 
     <div class="thumb">
Line 377: Line 386:
 
   <section id="4.2">
 
   <section id="4.2">
 
     <h2>4.2 Electrophoretic Mobility shift assay</h2>
 
     <h2>4.2 Electrophoretic Mobility shift assay</h2>
     <div class="thumb tright">
+
     <div class="thumb tright" style="margin:0;">
 
       <div class="thumbinner" style="width:310px;">
 
       <div class="thumbinner" style="width:310px;">
 
         <img alt="" class="thumbimage"
 
         <img alt="" class="thumbimage"
Line 394: Line 403:
 
     assays can both be used to qualitatively assess DNA binding capabilities or quantitatively to determine binding
 
     assays can both be used to qualitatively assess DNA binding capabilities or quantitatively to determine binding
 
     stoichiometry and kinetics such as the apparent dissociation constant (K<sub>d</sub>) (Fried, 1989).
 
     stoichiometry and kinetics such as the apparent dissociation constant (K<sub>d</sub>) (Fried, 1989).
    </p>
+
 
 
     <p>
 
     <p>
       rGCN4 was tested for its DNA-binding capabilities using an electrophoretic mobility shift assay (EMSA). The
+
       To analyze the binding DNA affinity an EMSA was performed, in which
       protein was incubated with a DNA probe containing the rGCN4 binding site (INVii). The formation of a protein-DNA
+
       rGCN4 was incubated in binding buffer with a 20 bp DNA probe containing the <i>INVii</i> rGCN4 binding
      complex was analyzed by native PAGE.
+
      sequence (5' GTCAtaTGAC 3') until equilibration.
       To further analyze DNA binding, quantitative shift assays were performed for GCN4 (<a
+
      Subsequently the formed protein-DNA complexes were loaded on a native PAGE. Afterwards the DNA bands were stained
         href="https://parts.igem.org/Part:BBa:K5237007">BBa_K5237008</a>) and rGCN4.
+
      with SYBR-safe. <br>
 +
       To further analyze DNA binding, quantitative shift assays were performed for GCN4 and rGCN4 (<a
 +
         href="https://parts.igem.org/Part:BBa:K5237008">BBa_K5237008</a>).
 
       0.5 µM DNA were incubated with varying concentrations of protein until equilibration. After
 
       0.5 µM DNA were incubated with varying concentrations of protein until equilibration. After
 
       electrophoresis, bands were stained with SYBR-Safe and quantified based on pixel intensity. The
 
       electrophoresis, bands were stained with SYBR-Safe and quantified based on pixel intensity. The
Line 415: Line 426:
 
       experimentally
 
       experimentally
 
       determined site saturation values (For this experiment 0 and 1 were chosen for min and max
 
       determined site saturation values (For this experiment 0 and 1 were chosen for min and max
       respectively). GCN4 binds to its optimal DNA binding motif with an apparent dissociation
+
       respectively).
      constant K<sub>k</sub> of (0.2930.033)×10<sup>-6</sup> M, which is almost identical to the
+
      rGCN4 binding
+
      affinity to INVii a <sub>d</sub> of (0.2980.030)×10<sup>-6</sup> M.
+
 
     </p>
 
     </p>
 
     <div class="thumb">
 
     <div class="thumb">
Line 435: Line 443:
 
       </div>
 
       </div>
 
       <p>
 
       <p>
         The apparent binding kinetics calculated for GCN4 ((0.2930.033) × 10<sup>-6</sup> M) and rGCN4
+
         rGCN4 binds to INVii with an apparent dissociation constant
         ((0.2980.030) × 10<sup>-6</sup> M) are
+
        K<sub>D</sub> of K<sub>d</sub> of (0.298 &#177; 0.030) × 10<sup>-6</sup> M, which is almost identical to the
         approximately a factor 10 higher then those described in literature ((96) × 10<sup>-8</sup> M for
+
         GCN4 dissociation constant of (0.293 &#177; 0.033) × 10<sup>-6</sup> M
 +
         Comparing them to literature values, our dissociation constants are approximately a factor 10 higher then those
 +
        described in literature ((96) × 10<sup>-8</sup> M for
 
         GCN4 and (2.90.8) × 10<sup>-8</sup> M for rGCN4) (Hollenbeck <i class="italic">et al.</i>, 2001). The
 
         GCN4 and (2.90.8) × 10<sup>-8</sup> M for rGCN4) (Hollenbeck <i class="italic">et al.</i>, 2001). The
 
         differences could be due to the lower sensitivity of SYBR-Safe staining compared to radio-labeled oligos.
 
         differences could be due to the lower sensitivity of SYBR-Safe staining compared to radio-labeled oligos.
 
         Most likely, the protein concentration was miscalculated due to the presence of additional (lower intensity)
 
         Most likely, the protein concentration was miscalculated due to the presence of additional (lower intensity)
 
         bands in
 
         bands in
         the SDS-PAGE analysis, indicating the co-purification of small amounts of unspecific proteins.
+
         the SDS-PAGE analysis, indicating co-purification of small amounts of unspecific proteins.
 
         <br /><br />
 
         <br /><br />
 
         The FLAG-tag fusion to the N-terminus of proteins could potentially decrease binding affinity, likely due
 
         The FLAG-tag fusion to the N-terminus of proteins could potentially decrease binding affinity, likely due
Line 449: Line 459:
 
         C-terminally, the FLAG-tag likely does not directly influence DNA binding. However, it may influence the
 
         C-terminally, the FLAG-tag likely does not directly influence DNA binding. However, it may influence the
 
         dimerization of the proteins, which is necessary for DNA binding. To further investigate this, the
 
         dimerization of the proteins, which is necessary for DNA binding. To further investigate this, the
         FLAG-tag can be cleaved using an enterokinase and potential changes in binding affinity analyzed
+
         FLAG-tag can be cleaved using an enterokinase and potential changes in binding affinity analyzed.
 +
        Furthermore, coiled coil formation, and the amount of dimeric and monomeric proteins could be further analyzed
 +
        with circular dichroism spectroscopy (Greenfield, 2006).
 
       </p>
 
       </p>
 
     </div>
 
     </div>
Line 462: Line 474:
 
       refine experimental parameters, and find optimal connections between protein staples and target DNA.
 
       refine experimental parameters, and find optimal connections between protein staples and target DNA.
 
       We calibrated DaVinci with literature and our own experimental affinity data obtained via EMSA assays and
 
       We calibrated DaVinci with literature and our own experimental affinity data obtained via EMSA assays and
       purified
+
       purified proteins.<br>
      proteins. This enabled us to simulate enhancer hijacking in silico, providing valuable input for the design of
+
      further
+
      experiments. Additionally, we apply the same approach to our part collection.
+
 
       DaVinci is divided into three phases: static structure prediction, all-atom dynamics simulation, and long-ranged
 
       DaVinci is divided into three phases: static structure prediction, all-atom dynamics simulation, and long-ranged
       dna
+
       dna dynamics simulation. We applied the first two to our parts, characterizing structure and dynamics of the
      dynamics simulation. We applied the first two to our parts, characterizing structure and dynamics of the
+
 
       dna-binding
 
       dna-binding
 
       interaction.
 
       interaction.
 
     </p>
 
     </p>
 
     <p>
 
     <p>
       In our efforts to create a bivalent DNA binding protein with minimal size, we created a Mini staple (<a href="https://parts.igem.org/Part:BBa_K5237009" target="_blank">BBa_K5237009</a>)  
+
       In our efforts to create a bivalent DNA binding protein with minimal size, we created a Mini staple (<a
 +
        href="https://parts.igem.org/Part:BBa_K5237009" target="_blank">BBa_K5237009</a>)
 
       consisting of
 
       consisting of
       GCN4 (<a href="https://parts.igem.org/Part:BBa_K5237007" target="_blank">BBa_K5237007</a>) fused with an GSG-linker to rGCN4. The structure and binding affinity of rGCN4 were predicted and calculated.
+
       GCN4 fused with an GSG-linker to rGCN4 (<a href="https://parts.igem.org/Part:BBa_K5237008"
       Furthermore different possible linkers were tested, based on dimerization and DNA binding affinity.
+
        target="_blank">BBa_K5237008</a>). The structure and binding affinity of GCN4 were predicted and calculated.
 +
       Furthermore different possible linkers were tested.<br>
 +
      We assessed the flexibility and rigidity of our constructs through the pLDDT values assigned
 +
      during the predictions. Figure 2 illustrates the variation in linkers using the ('GGGGS')<sub>n</sub>
 +
      sequence for flexible linkers and the ('EAAAK')<sub>n</sub> sequence for rigid linkers (Arai <i>et al.</i>, 2001).
 +
      The predictions are colored by their pLDDT scores, which act as a surrogate measure of
 +
      chain rigidity (Akdel <i>et al.</i>, 2022; Guo <i>et al.</i>, 2022). The construct C (Fig. 5) was tested as part
 +
      BBa_K5237009 in the Wetlab - it did not bind DNA on any side as the structure is to rigid
 +
      which hinders dimerisation of the two subunits
 +
 
 
     </p>
 
     </p>
 +
    <div class="thumb">
 +
      <div class="thumbinner" style="width:80%;">
 +
        <img alt="" class="thumbimage"
 +
          src="https://static.igem.wiki/teams/5237/model/structure-figure-3.svg" style="width:99%;" />
 +
        <div class="thumbcaption">
 +
          <i><b>Figure 5: Variation of linkers connecting our mini staples.</b>
 +
            Panels A (BBa_K5237007) and B (BBa_K5237008) show
 +
            orientations of the leucine zipper, each bound to DNA. Panels C to I display linker variations colored by their pLDDT
 +
            confidence score, which serves as a surrogate for chain flexibility (Akdel et al., 2022). Note that panels H and I are
 +
            not bound to the second DNA strand. All structures were predicted using the AlphaFold server (Google DeepMind,
 +
            2024).
 +
          </i>
 +
        </div>
 +
      </div>
 +
    </div>
 +
   
 +
  </section>
 
</section>
 
</section>
<section id="5">
+
  <section id="5">
  <h1>5. References</h1>
+
    <h1>5. References</h1>
  <p>Hollenbeck, J. J., Gurnon, D. G., Fazio, G. C., Carlson, J. J., &amp; Oakley, M. G. (2001). A GCN4 Variant with a
+
    <p>Akdel, M., Pires, D. E. V., Pardo, E. P., Janes, J., Zalevsky, A. O., Meszaros, B., Bryant, P., Good, L. L., Laskowski, R. A., Pozzati, G., Shenoy, A., Zhu, W., Kundrotas, P., Serra, V. R., Rodrigues, C. H. M., Dunham, A. S., Burke, D., Borkakoti, N., Velankar, S., … Beltrao, P. (2022). A structural biology community assessment of AlphaFold2 applications. <i>Nat Struct Mol Biol, 29</i>(11), 1056–1067. <a href="https://doi.org/10.1038/s41594-022-00849-w" target="_blank">https://doi.org/10.1038/s41594-022-00849-w</a></p>
    C-Terminal Basic Region Binds to DNA with Wild-Type Affinity. <em>Biochemistry, 40</em>(46), 13833–13839.</p>
+
 
  <p>Hollenbeck, J. J., &amp; Oakley, M. G. (2000). GCN4 Binds with High Affinity to DNA Sequences Containing a Single
+
<p>Arai, R., Ueda, H., Kitayama, A., Kamiya, N., & Nagamune, T. (2001). Design of the linkers which effectively separate domains of a bifunctional fusion protein. <i>Protein Engineering, Design and Selection, 14</i>(8), 529–532. <a href="https://doi.org/10.1093/protein/14.8.529" target="_blank">https://doi.org/10.1093/protein/14.8.529</a></p>
    Consensus Half-Site. <em>Biochemistry, 39</em>(21), 6380–6389. <a href="https://doi.org/10.1021/bi992705n"
+
 
      target="_blank">https://doi.org/10.1021/bi992705n</a></p>
+
<p>Chen, X., Zaro, J. L., & Shen, W.-C. (2013). Fusion protein linkers: Property, design and functionality. <i>Advanced Drug Delivery Reviews, 65</i>(10), 1357–1369. <a href="https://doi.org/10.1016/j.addr.2012.09.039" target="_blank">https://doi.org/10.1016/j.addr.2012.09.039</a></p>
 +
 
 +
<p>Google DeepMind. (2024). AlphaFold Server. <a href="https://alphafoldserver.com/terms" target="_blank">https://alphafoldserver.com/terms</a></p>
 +
<p>Guo, H.-B., Perminov, A., Bekele, S., Kedziora, G., Farajollahi, S., Varaljay, V., Hinkle, K., Molinero, V., Meister, K., Hung, C., Dennis, P., Kelley-Loughnane, N., & Berry, R. (2022). AlphaFold2 models indicate that protein sequence determines both structure and dynamics. <i>Scientific Reports, 12</i>(1), 10696. <a href="https://doi.org/10.1038/s41598-022-14382-9" target="_blank">https://doi.org/10.1038/s41598-022-14382-9</a></p>
 +
 
 +
    <p>Hollenbeck, J. J., Gurnon, D. G., Fazio, G. C., Carlson, J. J., &amp; Oakley, M. G. (2001). A GCN4 Variant with a
 +
      C-Terminal Basic Region Binds to DNA with Wild-Type Affinity. <em>Biochemistry, 40</em>(46), 13833–13839.</p>
 +
    <p>Hollenbeck, J. J., &amp; Oakley, M. G. (2000). GCN4 Binds with High Affinity to DNA Sequences Containing a Single
 +
      Consensus Half-Site. <em>Biochemistry, 39</em>(21), 6380–6389. <a href="https://doi.org/10.1021/bi992705n"
 +
        target="_blank">https://doi.org/10.1021/bi992705n</a></p>
 +
  </section>
 
</section>
 
</section>
  
 
</html>
 
</html>

Revision as of 00:33, 2 October 2024

BBa_K5237008

Staple subunit: rGCN4

rGCN4 is an engineered, reverse, variant of the yeast transcription factor GCN4, featuring a basic region and a leucine zipper dimerization domain. We used rGCN4 to study DNA-binding kinetics in our "Mini staples" that bring two DNA target sites into proximity by binding them simultaneously.



The PICasSO Toolbox
Figure 1: How our part collection can be used to engineer new staples


Next to the well-studied linear DNA sequence, the 3D spatial organization of DNA plays a crucial role in gene regulation, cell fate, disease development and more. However, the tools to precisely manipulate this genomic architecture remain limited, rendering it challenging to explore the full potential of the 3D genome in synthetic biology. We - iGEM Team Heidelberg 2024 - have developed PICasSO, a powerful molecular toolbox based on various DNA-binding proteins to address this issue.

The PICasSO part collection offers a comprehensive, modular platform for precise manipulation and re-programming of DNA-DNA interactions using protein staples in living cells, enabling researchers to recreate natural 3D genomic interactions, such as enhancer hijacking, or to design entirely new spatial architectures for gene regulation. Specifically, the fusion of two DNA binding proteins enables to artifically bring distant genomic loci into proximty. To unlock the system's full potential, we introduce versatile chimeric CRISPR/Cas complexes, connected either on the protein or the guide RNA level. These1 complexes are reffered to as protein- or Cas staples. Beyond its versatility, PICasSO includes robust assay systems to support the engineering, optimization, and testing of new staples, ensuring functionality in vitro and in vivo. We took special care to include parts crucial for testing every step of the cycle (design, build, test, learn) when engineering new parts.

At its heart, the PICasSO part collection consists of three categories.
(i) Our DNA-binding proteins include our finalized enhancer hijacking Cas staple as well as half staples that can be used by scientists to compose entirely new Cas staples in the future. We also include our Simple staples that serve as controls for successful stapling and can be further engineered to create alternative, simpler and more compact staples.
(ii) As functional elements, we list additional parts that enhance the functionality of our Cas and Basic staples. These consist of protease-cleavable peptide linkers and inteins that allow condition-specific, dynamic stapling in vivo. Besides staple functionality, we also include the parts to enable the efficient delivery of PICasSO's constructs with our interkingdom conjugation system.
(iii) As the final category of our collection, we provide parts that support the use of our custom readout systems. These include components of our established FRET-based proximity assay system, enabling users to confirm accurate stapling. Additionally, we offer a complementary, application-oriented testing system for functional readouts via a luciferase reporter, which allows for straightforward experimental simulation of enhancer hijacking in mammalian cells.

The following table gives a comprehensive overview of all parts in our PICasSO toolbox. The highlighted parts showed exceptional performance as described on our iGEM wiki and can serve as a reference. The other parts in the collection are versatile building blocks designed to provide future iGEMers with the flexibility to engineer their own custom Cas staples, enabling further optimization and innovation.

Our part collection includes:

DNA-binding proteins: The building blocks for engineering of custom staples for DNA-DNA interactions with a modular system ensuring easy assembly.
BBa_K5237000 fgRNA Entry vector MbCas12a-SpCas9 Entryvector for simple fgRNA cloning via SapI
BBa_K5237001 Staple subunit: dMbCas12a-Nucleoplasmin NLS Staple subunit that can be combined with sgRNA or fgRNA and dCas9 to form a functional staple
BBa_K5237002 Staple subunit: SV40 NLS-dSpCas9-SV40 NLS Staple subunit that can be combined witha sgRNA or fgRNA and dCas12avto form a functional staple
BBa_K5237003 Cas Staple: SV40 NLS-dMbCas12a-dSpCas9-Nucleoplasmin NLS Functional Cas staple that can be combined with sgRNA or fgRNA to bring two DNA strands into close proximity
BBa_K5237004 Staple subunit: Oct1-DBD Staple subunit that can be combined to form a functional staple, for example with TetR.
Can also be combined with a fluorescent protein as part of the FRET proximity assay
BBa_K5237005 Staple subunit: TetR Staple subunit that can be combined to form a functional staple, for example with Oct1.
Can also be combined with a fluorescent protein as part of the FRET proximity assay
BBa_K5237006 Simple staple: TetR-Oct1 Functional staple that can be used to bring two DNA strands in close proximity
BBa_K5237007 Staple subunit: GCN4 Staple subunit that can be combined to form a functional staple, for example with rGCN4
BBa_K5237008 Staple subunit: rGCN4 Staple subunit that can be combined to form a functional staple, for example with rGCN4
BBa_K5237009 Mini staple: bGCN4 Assembled staple with minimal size that can be further engineered
Functional elements: Protease-cleavable peptide linkers and inteins are used to control and modify staples for further optimization for custom applications
BBa_K5237010 Cathepsin B-cleavable Linker: GFLG Cathepsin B-cleavable peptide linker that can be used to combine two staple subunits to make responsive staples
BBa_K5237011 Cathepsin B Expression Cassette Expression Cassette for the overexpression of cathepsin B
BBa_K5237012 Caged NpuN Intein A caged NpuN split intein fragment that undergoes protein trans-splicing after protease activation. Can be used to create functionalized staples units
BBa_K5237013 Caged NpuC Intein A caged NpuC split intein fragment that undergoes protein trans-splicing after protease activation. Can be used to create functionalized staples units
BBa_K5237014 fgRNA processing casette Processing casette to produce multiple fgRNAs from one transcript, that can be used for multiplexed 3D genome reprograming
BBa_K5237015 Intimin anti-EGFR Nanobody Interkindom conjugation between bacteria and mammalian cells, as alternative delivery tool for large constructs
BBa_K4643003 incP origin of transfer Origin of transfer that can be cloned into the plasmid vector and used for conjugation as a means of delivery
Readout Systems: FRET and enhancer recruitment to measure proximity of stapled DNA in bacterial and mammalian living cells enabling swift testing and easy development for new systems
BBa_K5237016 FRET-Donor: mNeonGreen-Oct1 FRET Donor-Fluorpohore fused to Oct1-DBD that binds to the Oct1 binding cassette. Can be used to visualize DNA-DNA proximity
BBa_K5237017 FRET-Acceptor: TetR-mScarlet-I Acceptor part for the FRET assay binding the TetR binding cassette. Can be used to visualize DNA-DNA proximity
BBa_K5237018 Oct1 Binding Casette DNA sequence containing 12 Oct1 binding motifs, compatible with various assays such as the FRET proximity assay
BBa_K5237019 TetR Binding Cassette DNA sequence containing 12 Oct1 binding motifs, can be used for different assays such as the FRET proximity assay
BBa_K5237020 Cathepsin B-Cleavable Trans-Activator: NLS-Gal4-GFLG-VP64 Readout system that responds to protease activity. It was used to test cathepsin B-cleavable linker
BBa_K5237021 NLS-Gal4-VP64 Trans-activating enhancer, that can be used to simulate enhancer hijacking
BBa_K5237022 mCherry Expression Cassette: UAS, minimal Promotor, mCherry Readout system for enhancer binding. It was used to test cathepsin B-cleavable linker
BBa_K5237023 Oct1 - 5x UAS binding casette Oct1 and UAS binding cassette, that was used for the simulated enhancer hijacking assay
BBa_K5237024 TRE-minimal promoter- firefly luciferase Contains Firefly luciferase controlled by a minimal promoter. It was used as a luminescence readout for simulated enhancer hijacking

1. Sequence overview

Sequence and Features


Assembly Compatibility:
  • 10
    COMPATIBLE WITH RFC[10]
  • 12
    COMPATIBLE WITH RFC[12]
  • 21
    COMPATIBLE WITH RFC[21]
  • 23
    COMPATIBLE WITH RFC[23]
  • 25
    COMPATIBLE WITH RFC[25]
  • 1000
    COMPATIBLE WITH RFC[1000]

2. Usage and Biology

rGCN4 is an engineered variant of the yeast GCN4 transcription factor (BBa_K5237007). In contrast to GCN4 that binds the CRE target sequence with the N-terminal, rGCN4 binds the modified INVii (5' GTCAtaTGAC 3') DNA target sequence with the C-terminal region. rGCN4 is an engineered variant of the yeast GCN4 transcription factor (BBa_K5237007). In our project, we used rGCN4 to study DNA-binding kinetics in our "Mini staples" (BBa_K5237009) that bring two DNA target sites into proximity by binding them simultaneously.

3. Assembly and part evolution

The rGCN4 amino acid sequence was taken from literature (Hollenbeck & Oakley 1999) and codon optimized for E. coli. A FLAG-tag (DYKDDDDK) was added to the N-terminus for protein purification. The FLAG-tag can be cleaved off using an Enterokinase, if necessary. Expression of FLAG-GCN4 was done under an IPTG inducible T7 promoter in E. coli BL21 (DE3) cells.

4. Results

4.1 Protein expression and purification

The FLAG-rGCN4 protein could be readily expressed in E. coli. The protein was purified using an anti-FLAG resin. Fractions taken during purification were analyzed by SDS-PAGE and the protein concentration of the eluted protein determined with a lowry protein assay. A yield of 3.4 mg/mL was obtained, corresponding to 422 µM of monomeric FLAG-GCN4.

Figure 2: SDS-PAGE analysis of FLAG-GCN4 purification Fractions analysed are the raw lysate, flow through and eluate. Depicted is GCN4 (this part), rGCN4 (BBa_K5237008), and bGCN4 (BBa_K5237009). Protein size is indicated next to construct name and purified band with protein of interest highlighted by a red box.

4.2 Electrophoretic Mobility shift assay

Figure 3: Overview Image of Electrophoretic Mobility Shift Assay (EMSA)

The Electrophoretic mobility shift assay (EMSA) is a widely adopted method used to study DNA-protein interactions. EMSA functions on the basis that nucleic acids bound to proteins have reduced electrophoretic mobility, compared to their counterpart. (Hellman & Fried, 2007). Mobility-shift assays can both be used to qualitatively assess DNA binding capabilities or quantitatively to determine binding stoichiometry and kinetics such as the apparent dissociation constant (Kd) (Fried, 1989).

To analyze the binding DNA affinity an EMSA was performed, in which rGCN4 was incubated in binding buffer with a 20 bp DNA probe containing the INVii rGCN4 binding sequence (5' GTCAtaTGAC 3') until equilibration. Subsequently the formed protein-DNA complexes were loaded on a native PAGE. Afterwards the DNA bands were stained with SYBR-safe.
To further analyze DNA binding, quantitative shift assays were performed for GCN4 and rGCN4 (BBa_K5237008). 0.5 µM DNA were incubated with varying concentrations of protein until equilibration. After electrophoresis, bands were stained with SYBR-Safe and quantified based on pixel intensity. The obtained values were fitted to equation 1, describing formation of a 2:1 protein-DNA complex:

Θapp = Θmin + (Θmax - Θmin) × (Ka2 [L]tot2) / (1 + Ka2 [L]tot2) Equation 1

Here [L]tot describes the total protein monomer concentration, Ka corresponds to the apparent monomeric equilibration constant. The Θmin/max values are the experimentally determined site saturation values (For this experiment 0 and 1 were chosen for min and max respectively).

Figure 4: Quantitative EMSAQuantitative assessment of binding affinity for GCN4 and rGCN4. Proteins of different concentrations were incubated with 0.5 µM DNA until equilibrium and fraction bound analyzed, after gel electrophoresis, by dividing pixel intensity of bound fraction with pixel intensity of bound and unbound fraction. At least three separate measurements were conducted for each data point. Values are presented as mean +/- SD.

rGCN4 binds to INVii with an apparent dissociation constant KD of Kd of (0.298 ± 0.030) × 10-6 M, which is almost identical to the GCN4 dissociation constant of (0.293 ± 0.033) × 10-6 M Comparing them to literature values, our dissociation constants are approximately a factor 10 higher then those described in literature ((96) × 10-8 M for GCN4 and (2.90.8) × 10-8 M for rGCN4) (Hollenbeck et al., 2001). The differences could be due to the lower sensitivity of SYBR-Safe staining compared to radio-labeled oligos. Most likely, the protein concentration was miscalculated due to the presence of additional (lower intensity) bands in the SDS-PAGE analysis, indicating co-purification of small amounts of unspecific proteins.

The FLAG-tag fusion to the N-terminus of proteins could potentially decrease binding affinity, likely due to steric hindrance affecting the interaction with DNA. Interestingly, the differences in binding affinity between GCN4 and rGCN4 appear negligible. Since GCN4 binds to DNA via its N-terminus and rGCN4 binds C-terminally, the FLAG-tag likely does not directly influence DNA binding. However, it may influence the dimerization of the proteins, which is necessary for DNA binding. To further investigate this, the FLAG-tag can be cleaved using an enterokinase and potential changes in binding affinity analyzed. Furthermore, coiled coil formation, and the amount of dimeric and monomeric proteins could be further analyzed with circular dichroism spectroscopy (Greenfield, 2006).

4.3 In Silico Characterization using DaVinci

We developed the in silico model DaVinci for rapid engineering and development of our PiCasSO system. DaVinci acts as a digital twin to PiCasSO, designed to understand the forces acting on our system, refine experimental parameters, and find optimal connections between protein staples and target DNA. We calibrated DaVinci with literature and our own experimental affinity data obtained via EMSA assays and purified proteins.
DaVinci is divided into three phases: static structure prediction, all-atom dynamics simulation, and long-ranged dna dynamics simulation. We applied the first two to our parts, characterizing structure and dynamics of the dna-binding interaction.

In our efforts to create a bivalent DNA binding protein with minimal size, we created a Mini staple (BBa_K5237009) consisting of GCN4 fused with an GSG-linker to rGCN4 (BBa_K5237008). The structure and binding affinity of GCN4 were predicted and calculated. Furthermore different possible linkers were tested.
We assessed the flexibility and rigidity of our constructs through the pLDDT values assigned during the predictions. Figure 2 illustrates the variation in linkers using the ('GGGGS')n sequence for flexible linkers and the ('EAAAK')n sequence for rigid linkers (Arai et al., 2001). The predictions are colored by their pLDDT scores, which act as a surrogate measure of chain rigidity (Akdel et al., 2022; Guo et al., 2022). The construct C (Fig. 5) was tested as part BBa_K5237009 in the Wetlab - it did not bind DNA on any side as the structure is to rigid which hinders dimerisation of the two subunits

Figure 5: Variation of linkers connecting our mini staples. Panels A (BBa_K5237007) and B (BBa_K5237008) show orientations of the leucine zipper, each bound to DNA. Panels C to I display linker variations colored by their pLDDT confidence score, which serves as a surrogate for chain flexibility (Akdel et al., 2022). Note that panels H and I are not bound to the second DNA strand. All structures were predicted using the AlphaFold server (Google DeepMind, 2024).

5. References

Akdel, M., Pires, D. E. V., Pardo, E. P., Janes, J., Zalevsky, A. O., Meszaros, B., Bryant, P., Good, L. L., Laskowski, R. A., Pozzati, G., Shenoy, A., Zhu, W., Kundrotas, P., Serra, V. R., Rodrigues, C. H. M., Dunham, A. S., Burke, D., Borkakoti, N., Velankar, S., … Beltrao, P. (2022). A structural biology community assessment of AlphaFold2 applications. Nat Struct Mol Biol, 29(11), 1056–1067. https://doi.org/10.1038/s41594-022-00849-w

Arai, R., Ueda, H., Kitayama, A., Kamiya, N., & Nagamune, T. (2001). Design of the linkers which effectively separate domains of a bifunctional fusion protein. Protein Engineering, Design and Selection, 14(8), 529–532. https://doi.org/10.1093/protein/14.8.529

Chen, X., Zaro, J. L., & Shen, W.-C. (2013). Fusion protein linkers: Property, design and functionality. Advanced Drug Delivery Reviews, 65(10), 1357–1369. https://doi.org/10.1016/j.addr.2012.09.039

Google DeepMind. (2024). AlphaFold Server. https://alphafoldserver.com/terms

Guo, H.-B., Perminov, A., Bekele, S., Kedziora, G., Farajollahi, S., Varaljay, V., Hinkle, K., Molinero, V., Meister, K., Hung, C., Dennis, P., Kelley-Loughnane, N., & Berry, R. (2022). AlphaFold2 models indicate that protein sequence determines both structure and dynamics. Scientific Reports, 12(1), 10696. https://doi.org/10.1038/s41598-022-14382-9

Hollenbeck, J. J., Gurnon, D. G., Fazio, G. C., Carlson, J. J., & Oakley, M. G. (2001). A GCN4 Variant with a C-Terminal Basic Region Binds to DNA with Wild-Type Affinity. Biochemistry, 40(46), 13833–13839.

Hollenbeck, J. J., & Oakley, M. G. (2000). GCN4 Binds with High Affinity to DNA Sequences Containing a Single Consensus Half-Site. Biochemistry, 39(21), 6380–6389. https://doi.org/10.1021/bi992705n