Part:BBa_K2926001
Cas13a Lbu
This part codes for Cas13a derived from Leptotrichia buccalis. The natural EcoRI and PstI restriction sites were removed via mutagenesis PCR.
Sequence and Features
- 10COMPATIBLE WITH RFC[10]
- 12COMPATIBLE WITH RFC[12]
- 21INCOMPATIBLE WITH RFC[21]Illegal BglII site found at 344
Illegal BglII site found at 1280
Illegal BglII site found at 1544
Illegal BglII site found at 2048
Illegal BglII site found at 2534
Illegal BglII site found at 2642
Illegal BglII site found at 2972 - 23COMPATIBLE WITH RFC[23]
- 25COMPATIBLE WITH RFC[25]
- 1000INCOMPATIBLE WITH RFC[1000]Illegal SapI.rc site found at 2653
Proof of concept: Cas13a as a Cell Death inducing system (CeDIS)
The results of the cultivation with Cas13a Lbu however shows that the cells reach a premature stationary phase. This indicates, that the growth of the yeast is decreased and indicates collateral cleavage events of the Cas13a. Over the duration of 10 hours there is no significant increase in the OD600 of the induced cells carrying the CeDIS. However, the yeast containing the Cas13a show a significant decrease of growth even with the uninduced cells. The control WT S. cerevisiae also shows a difference in growth on raffinose and galactose. The variation of the OD600 for the Cas carrying yeast can be assigned to the variation in carbon source. While these results show that the CeDIS containing Cas13a is active and effective within yeast, it also indicates that there is a background activity when raffinose is used as an inhibitor of the GALL promoter. Even in the uninduced state, Cas is expressed a on a low level , which leads to a decrease in growth, even if it is less effective than in the induced state.
This can potentially be explained by the structure of raffinose. Raffinose is a trisaccharaide consisting of galactose, glucose and fructose.
The tests have been conducted for both Cas13a Lwa and Cas13a Lsh.
Contribution (Waterloo iGEM 2021)
Summary: CRISPR-Cas systems, such as CRIPSR-Cas13a from Leptotrichia buccalis, have a multitude of applications. iGEM Bielefeld-CeBiTec 2019's CeDIS, described above, is an excellent example of one of these applications: RNA targeting and subsequent nonspecific cleavage of RNA to induce cell death. In the discussion below, Waterloo iGEM 2021 hopes to add background information on the usage of Cas13a. As well, we hope to highlight another application of Cas13a, which has become the focus of recent SARS-CoV-2 diagnostics methods, and is a critical mechanism in the functionality of NeuroDetech, Waterloo iGEM's 2021 project.
Documentation: Generally, CRISPR-Cas systems involve a Cas protein and a CRISPR guide RNA (gRNA), where the CRISPR-Cas complex exhibits endonuclease activity. Cas13 is a Type VI Cas protein, which is unique among the Cas family, as it cleaves single-stranded RNA (ssRNA). This is in contrast to Cas9 and Cas12, which target double-stranded DNA (Koonin & Makarova, 2019). Even more notably, upon recognition of its ssRNA target, Cas13 cleaves the target, remains bound, and exhibits nonspecific nuclease activity, indiscriminately cleaving ssRNA other than the target (Koonin & Makarova, 2019).
There are several relatively well-characterized Cas13 proteins that originate from different species of the Leptotrichia genus, such as Cas13a from L. wadei, L. buccalis, and L. shahii. Of these Cas13a variants, L. shahii was the earliest discovered; however, diagnostics tools have favoured L. wadei and L. buccalis for their increased analyte sensitivity. In fact, Fozouni et al. (2020) utilized Cas13a from L. buccalis to detect SAR-CoV-2 RNA, favouring the L. buccalis variant as it had the highest sensitivity compared to the other Cas13a variants (Fozouni et al., 2020). In addition, Cas13a from L. buccalis has been shown to be accurate enough to distinguish even single nucleotide polymorphisms (SNPs) (Fozouni et al., 2020). This makes this part the optimal Leptotrichia-derived Cas13 part for use in diagnostics applications.
Generally, the successful design of CRISPR guide RNA sequences for use with Cas13a requires that the following guidelines be met:
- CRISPR guide RNA sequences should have a total length of ~64 bp. This is in contrast with Cas9, whose guide RNA sequences can span over 100 bp (Koonin & Makarova, 2019).
- Of the recommended 64 bp length, 30 bp should be reserved for a stem and hairpin region, which is bound by Cas13a. Fozouni et al. (2020) suggests that the following 30-nucleotide sequence is appropriate as a stem sequence at the 5' end of the CRISPR guide RNA (Fozouni et al., 2020): 5′-GACCACCCCAAAAAUGAAGGGGACUAAAAC-3′
- The remaining ~34 bp can consist of a complementary RNA sequence to the target ssRNA.
As mentioned, the magic of CRISPR-Cas13a often comes down to its nonspecific RNase activity upon detection of its target ssRNA. In diagnostics applications, a major limitation of utilizing CRISPR-Cas systems is that amplification of the target DNA/RNA must be performed before detection using CRISPR-Cas. A common detection method utilizing this workflow is called SHERLOCK. However, Cas13's nonspecific RNase activity upon target detection inherently allows for the detection of a single molecule of the target RNA to be 'amplified'. In turn, the amplification step before CRISPR-Cas13 detection can effectively be eliminated. This novel method of CRISPR-Cas13 detection, boasting higher sensitivity than the traditional SHERLOCK method, is called SATORI, and it is the crux of the detection of SARS-CoV-2 RNA by Fozouni et al. (2020). Once a sample containing SARS-CoV-2 RNA was added to a solution containing CRISPR-Cas13a as well as a fluorophore and quencher duo, linked by ssRNA, the CRISPR-Cas13a's nonspecific RNase activity was activated by detection of the SARS-CoV-2 RNA. This resulted in the collateral cleavage of the ssRNA linking fluorophores and quenchers, thereby producing fluorescence proportional to the amount of SARS-CoV-2 RNA added (Fozouni et al., 2020).
Waterloo iGEM's 2021 project, NeuroDetech, consisted of a set of microfluidic assay chips designed to quantify ADHD-associated biomarkers and gene markers. Inspired by the work of Fozouni et al. (2020), we adapted the SATORI detection method for the sensitive detection of mRNA containing ADHD-associated mutations in urine samples. A binding molecule, consisting of biotinylated CRIPSR-Cas13a, would bind to the ADHD-associated mRNA transcript in a urine sample. Then, at the test chamber of the microfluidic assay, any unbound CRISPR-Cas13 would bind to molecules of the target RNA conjugated to the test chamber. Subsequent release to the test chamber of fluorophores and quenchers linked by ssRNA would result in fluorescence, where a signal lower than a threshold level would indicate that the patient is likely to have the ADHD-associated mutation. More information can be found on Waterloo iGEM 2021's wiki page: https://2021.igem.org/Team:Waterloo
Contribution (iGEM22_Thessaloniki_Meta)
The existing part that we used for our experiments and we decided to improve is the Cas13a Lbu (BBa_K2926001) designed by iGEM19_Bielefeld-CeBiTec team. This part codes for the Cas13a derived from Leptotrichia buccalis. According to the part’s documentation in the registry the EcoRI and PstI site have been removed to succeed RFC [10] compatibility. This part was used by the Bielefeld-CeBiTec team for assays in S.cerevisiae.
Introduction
A fundamental prerequisite for the practical application of recombinant enzyme-based diagnostic devices is the production of the enzyme in large quantities and in a functional form. However, the protein expression is a complex process which involves hundreds of molecular components and is affected by many variables which need to be optimized for efficient protein production in large quantities (Lipońska et al., 2018). The Cas13a protein derived from Leptotrichia Buccalis has a considerable molecular weight of approximately 138kDA and the effective recombinant Cas13a protein production in Escherichia coli can often be challenging.
Optimization approaches for the protein production can be applied in different steps of the whole process, however we decided to focus our efforts on optimization strategies related with the translation step of protein synthesis. Specifically, two approaches were followed towards protein production optimization. The first strategy or else cis-optimization approach involves the usage of a DNA CDS sequence that is optimal for Cas13a production in E.coli based on codon metrics and codon frequency. The second approach or else trans-optimization involves the incorporation of the SUMO (small ubiquitin-related modifier) protein in fusion with Cas13a protein for efficient protein production in the soluble cytoplasmic function. The addition of the SUMO solubility tag in fusion with the Cas13a, could enhance protein expression and solubility, decrease protein degradation and simplify protein purification (Butt et al., 2005). As described above, two different optimization approaches were followed for the improvement of the part BBa_K2926001:
- Cis-optimization: codon optimization for efficient Cas13a production in ecoli.
- Trans-optimization: generate gene chimera for Cas13a production in fusion with SUMO solubility tag.
Cis-optimization: codon optimization for efficient Cas13a production in E.coli.
In the context of synthetic biology, codon optimization is widely used to ameliorate gene expression in heterologous expression systems. Codon optimization is based on the basic principle of the genetic code that the distribution of the 64 unique DNA codons is non-random. Within a genome exist both rare and abundant DNA codons and their distribution varies across all organisms. The host-specific codon usage bias (CUB) influences translation efficiency especially when there is a high dominance of rare codons in the genetic sequence that is intended for translation. A common strategy for common optimization is the replacement of the rare codons with more frequently occurring ones, in accordance with the CUB of the specific organism. Since we decided to express LbuCas13a protein in BL21 (DE3) E.coli strain, we followed a codon optimization approach to achieve maximum protein expression efficiency in the desired E.coli strain.
Trans-optimization: generate gene chimera for Cas13a production in fusion with SUMO solubility tag.
SUMO protein has been previously fused to the N-terminus of several proteins leading to increased expression and solubility of the protein of interest. But how does SUMO protein enhance protein solubility and proper folding? The answer lies in the structure of SUMO protein. Specifically, SUMO has an inner hydrophobic core and an external hydrophilic surface, thus exerts a detergent-like effect in proteins that cannot easily acquire their proper folding. Despite the advantages of SUMO addition, one drawback of this protein expression strategy is the necessary cleavage of the solubility tag. However, many SUMO proteases such as Ulp1, which are members of the cysteine protease superfamily can be used to cleave the SUMO tag without affecting the N-terminus of the desired protein (Panavas et al., 2009). Therefore, by expressing the LbuCas13a protein in fusion with the SUMO-chaperone protein we aim to enhance the quantity of Cas13a protein that accumulates to the soluble cytoplasmic fraction, succeeding proper protein folding and subsequent efficient SUMO tag removal.
Rare codon analysis for expression in E.coli
Utilizing the Rare Codon Analysis tool from Genscript we evaluated the codon usage frequency of the two coding sequences as described in detail below.
Codon optimized SUMO-LbuCas13a (BBa K4170014) for E.coli expression
According to the Rare Codon analysis tool, the codon optimized SUMO-LbuCas13a protein displayes a high CAI value (approximately 0.8). The CAI value is a codon adaptation index and the higher the CAI value the higher the change that the gene will expressed efficiently. The CAI value equal to 0.8 is acceptable ensuring a high change of efficient protein expression. In addition, the percentage of low frequency codons (CFD) based on the E.coli host organism is about 3%. This minimum percentage ensures that the percentage of rare codons in our sequence is extremely low and cannot affect the translational efficiency.
Cas13a Lbu part (BBa_K2926001) deposited from iGEM19_Bielefeld-CeBiTec team
On the other hand, the Cas13a Lbu part (BBa_K2926001) deposited from iGEM19_Bielefeld-CeBiTec team display a relatively low CAI value (approximately 0.65) which deviates from the acceptable limits for efficient protein expression in E.coli. In additon, the percentage of low frequency codons (CFD) is 7%, higher that the codon optimized SUMO-LbuCas13a sequence deposited by : iGEM22_Thessaloniki_Meta. These relativelly low values could reveal the reduced protein expression efficiency in E.coli. However, these values (CAI & CFD) are clearly improved when the S. cerevisiae Yeast Strain is selected as a host organism.
Comparative experiments
To demonstrate that our modifications had a positive effect on the efficiency of Cas13a protein production, we conducted comparative protein production experiments between the “improved” LbuCas13a (https://parts.igem.org/Part:BBa_K4170016) and the analogous protein from BBa_K2926001 part. For the implementation of the comparative experiments, we had to replace the “optimized” SUMO-LbuCas13a protein from the pSB1C3 plasmid with the Lbu Cas13a deposited from iGEM19_Bielefeld-CeBiTec team in the registry, keeping constant all the other regulatory elements of the genetic device. The detailed cloning strategy of the LbuCas13a coding device under T7 promoter (https://parts.igem.org/Part:BBa_K4170016) is described on the corresponding part registry page and on the results page of our wiki (https://2022.igem.wiki/thessaloniki-meta/results).
Cloning strategy of Cas13a Lbu (BBa_K2926001) for comparative experiments.
The CDS of Cas13a Lbu (BBa_K2926001) flanked by appropriate recognition sequences of BsaI restriction enzyme was ordered from IDT and cloned downstream of the T7 promoter in pSB1C3 plasmid with Golden Gate assembly. This part is the Cas13a Lbu part.
The cloning process is described in detail below:
Step 1. PCR amplification
- PCR amplification with Ev and Pv standard primers from Basic SevaBrick Assembly (Damalas et al., 2020) using pSB1C3 (Bba_J36400-2022 DNA distribution Kit) a template (Figure 1). This PCR produces the pSB1C3 backbone part ready for Golden Gate assembly.
- PCR amplification with cas13a T7 P0 FWD and SUMOLESS RVS primers using the LbuCas13a coding device under T7 promoter (BBa_K4170016-link) as a template (Figure 2). This PCR produces the LacI-promoter-RBS part ready for Golden Gate assembly.
Step 2. Golden Gate-based sevaBrick assembly
- Golden Gate assembly of the PCR amplified LacI-promoter-RBS and Cas13a Lbu parts (BBa_K2926001) along with the ‘linearized’ pSB1C3 backbone part for the efficient construction of the LbuCas13a coding sequence under the transcriptional control of the Lac Repressor.
- The procedure of the protein expression initiates with the preparation of the bacterial pre-culture (15ml) incubated overnight at 37 °C. The following day the pre-culture was diluted in 1L of nutrient media, followed by a further incubation at 37°C of the diluted pre-culture until Optical Density (OD600) reached 0.7 – 0.8 (continuous measurements at the photometer at specific time points).
- To induce LbuCas13a protein expression we added IPTG to the 1L bacterial culture to achieve a final concentration of 1mM. The bacterial culture was then incubated for 6 hours at 25°C and the bacterial culture was centrifuged at 10.000 g for 20 min at 4 °C and the supernatant was discarded. The bacterial pellets were resuspended in binding buffer (3oomM NaCl, 50mM NaH2PO4, 10mM imidazole, pH 8) followed by successive freeze-thaw cycles. After the freeze-thaw steps, Lysozyme (100mg/ml) and Triton x-100 were added and ultrasonication was performed to lyse the bacterial membrane. After ultrasonication, DNase (1mg/ml), MgCl2 (8mM) and Protease Inhibitor (PI) were added and the samples were incubated in a cold room for 2 hours on a rotator machine.
- To obtain the soluble fraction of the protein, refrigerated centrifugation was carried out and the supernatant was then filtered by using 0.22 μm filter units. The remaining pellet constitutes the Inclusion Bodies (IBs), which also contains the insoluble form of the protein of interest. To isolate the SUMO-LbuCas13a protein from the IBs, the bacterial pellet is subjected to successive washing steps using 3 different buffers (A, B, C) followed by ultracentrifugation at 30.000g for 20 min at 4 C. The process is completed by resuspending the protein in L-Arginine solution which promotes the proper folding of the protein restoring its functional quaternary structure.
The Golden Gate assembly products underwent transformation into E.coli DH5a competent cells and then colony PCR was performed, using the primers VR and VF2. Picking sample from different colonies and then evaluating the results on a 1 % agarose gel electrophoresis we concluded that the Golden Gate assembly was successful in the first colony. The PCR product in the 1st colony has the suitable number of basepairs (5392bp), while the 2nd colony showed no PCR amplification. For the colony PCR procedure, from the agar plate half amount of each colony was picked and diluted on 10 μl of dH20. The other half amount was picked for liquid overnight culture.
The final plasmid after the Golden Gate assembly contains the same DNA elements/features as the cloned final SUMO-LbuCas13a coding device under T7 promoter (BBa_K4170016), replacing the CDS of the codon-optimized Cas13a with the CDS of Cas13a Lbu (BBa_K2926001). Furthermore, the CDS of the molecular chaperone SUMO sequence has also been removed. In addition, utilizing this cloning strategy we incorporated the 6xHis affinity tag at the N-terminus of the Cas13a Lbu (BBa_K2926001) to facilitate the efficient purification of the protein utilizing a Ni-NTA affinity purification methodology. The genetic sequences of the Cas13a Lbu expression device are illustrated at the following map derived from snapgene. The Cas13a Lbu coding device inserted into the pSB1C3 backbone generated a new composite part which is deposited at the IGEM Registry (BBa_K4170056 https://parts.igem.org/Part:BBa_K4170056 )
Expression of recombinant LbuCas13a proteins.
General protein expression protocol
This protocol was followed for all the protein expression experiments with minor modifications at the IPTG concentration, induction temperature or time of induction .
SDS-PAGE gel electrophoresis and Western blotting of Cas13a Lbu (BBa_K2926001)
Equal volumes of protein samples corresponding to the filtered soluble and resuspended insoluble (IBs) solution respectively, were loaded into each well of the SDS-PAGE gel for separation based on molecular weight. After gel electrophoresis completion, suitable images of the gels were obtained. After the separation of the protein mixture through the SDS-PAGE gel electrophoresis, it was transferred to the PVDF membrane for western blotting. After the initial blocking step, the addition of the anti-His primary and the anti-mouse alkaline Phosphatase-Labeled secondary antibody, the protein bands were detected using the alkaline phosphatase detection method.
Analyzing the results from the SDS-PAGE gel electrophoresis and the Western Blotting (above figure ) we can conclude that the Cas13a Lbu (BBa_K2926001) protein is detected only in the insoluble fraction which corresponds to the inclusion bodies of the bacteria. The protein band which corresponds to the LbuCas13a protein is detected at the molecular weight of 130kDa. However, no LbuCas13a protein band is detected at the soluble cytoplasmic fraction. The process of obtaining bioactive functional protein from inclusion bodies is usually labor intensive and the yields of the recovered recombinant protein are often low.
SDS-PAGE gel electrophoresis and Western blotting of “improved” SUMO-LbuCas13a protein (BBa_K4170016)
Equal volumes of protein samples corresponding to the filtered soluble and resuspended insoluble (IBs) solution respectively, were loaded into each well of the SDS-PAGE gel for separation based on molecular weight. After gel electrophoresis completion, suitable images of the gels were obtained. After the separation of the protein mixture through the SDS–PAGE gel electrophoresis, it was transferred to the PVDF membrane for western blotting. After the initial blocking step, the addition of the anti-His primary and the anti-mouse alkaline Phosphatase-Labeled secondary antibody, the protein bands were detected using the alkaline phosphatase detection method.
Analyzing the results from the SDS-PAGE gel electrophoresis and the Western Blotting (above figure) we can conclude that the SUMO-LbuCas13a is detected in both the soluble and insoluble fraction (Inclusion Bodies). The protein band which corresponds to the SUMO-LbuCas13a fusion protein is detected at the molecular weight of 155kDa. Therefore, we can assume that the addition of the SUMO solubility tag enhanced the soluble fraction of the LbuCas13a protein. The purification of soluble proteins is less expensive and time consuming compared to the process needed for protein purification and recovery from inclusion bodies. Utilizing the chaperone-mediated LbuCas13a protein recovery from soluble fraction ensures the integrity of the refolded proteins. The resolubilization procedures required to recover the protein from inclusion bodies can disturb the integrity of the protein. In addition, if we compare the Figures which correspond to the Cas13a Lbu (BBa_K2926001) and SUMO-LbuCas13a (BBa_K4170016) respectively, we can understand that the codon optimization procedure enhanced the total protein yield in both the soluble and the insoluble fraction of the LbuCas13a protein. Further information about the downstream experiments regarding the SUMO-LbuCas13a purification with His SpinTrap purification columns and the enzymatic removal of the SUMO protein can be found on the results and on the experiments pages of iGEM22_Thessaloniki_Meta team Wiki (https://2022.igem.wiki/thessaloniki-meta/results).
References
Butt, T., Edavettal, S., Hall, J. and Mattern, M., 2005. SUMO fusion technology for difficult-to-express proteins. Protein Expression and Purification, 43(1), pp.1-9.
Damalas, S., Batianis, C., Martin‐Pascual, M., Lorenzo, V. and Martins dos Santos, V., 2020. SEVA 3.1: enabling interoperability of DNA assembly among the SEVA, BioBricks and Type IIS restriction enzyme standards. Microbial Biotechnology, 13(6), pp.1793-1806.
Fozouni, P., Son, S., Derby, M. D. de L., Knott, G. J., Gray, C. N., D’Ambrosio, M. V., Zhao, C., Switz, N. A., Kumar, G. R., Stephens, S. I., Boehm, D., Tsou, C.-L., Shu, J., Bhuiya, A., Armstrong, M., Harris, A. R., Chen, P.-Y., Osterloh, J. M., & Ott, M. (2020, December 4). Amplification-free detection of SARS-COV-2 with CRISPR-CAS13a and mobile phone microscopy. Cell. 184(2), 323-333.e9. https://www.sciencedirect.com/science/article/abs/pii/S0092867420316238.
Koonin, E. V., & Makarova, K. S. (2019). Origins and evolution of CRISPR-Cas systems. Philosophical Transactions of the Royal Society B: Biological Sciences, 374(1772). https://royalsocietypublishing.org/doi/10.1098/rstb.2018.0087
Lipońska, A., Ousalem, F., Aalberts, D., Hunt, J. and Boël, G., 2018. The new strategies to overcome challenges in protein production in bacteria. Microbial Biotechnology, 12(1), pp.44-47.
Panavas, T., Sanders, C. and Butt, T., 2009. SUMO Fusion Technology for Enhanced Protein Production in Prokaryotic and Eukaryotic Expression Systems. Methods in Molecular Biology, pp.303-317.None |