CAT-Seq Esterase (Ready for Expression - Toehold 3 and Trigger 2)

Figure 1. Abstract scheme of the Catalytic Activity Sequencing

This composite part contains a ready-for-expression CAT-Seq Esterase whose translation is controlled by a Toehold Switch and a Trigger RNA (BBa_K2621013 and BBa_K2621015, respectively). It also incorporates a T7 promoter (BBa_I719005) and two CAT-Seq barcodes.

CAT-Seq Esterase is a hydrolase that was used in the Catalytic Activity Sequencing system for its promiscuous ability to hydrolyse N4-acyl-2'-deoxycytidine triphosphates (Substrate Nucleotides). The enzyme accepts various N4-acyl-2'-deoxycytidine triphosphates as substrates for hydrolysis, leaving 2'-deoxycytidine triphosphate (Product Nucleotide).

In sequence it is most similar to tannases (EC 3.1.1.20) and feruloyl esterases (EC 3.1.1.73); however, to date no enzyme that hydrolyses the N4-amide bond in modified deoxycytidine triphosphates has been characterized.

It is the main component of the Catalytic Activity Sequencing (CAT-Seq) method, a high-throughput toolkit for assessing the activity-sequence relationships of catalytic biomolecules and genetic regulatory parts.

Sequence and Features

Assembly Compatibility:

RFC[10]: compatible
RFC[12]: incompatible (illegal NheI sites found at 1773 and 1933)
RFC[21]: incompatible (illegal BamHI site found at 819)
RFC[23]: compatible
RFC[25]: incompatible (illegal NgoMIV site found at 635)
RFC[1000]: compatible
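
The illegal sites reported above can in principle be located by a plain motif search. The following is a minimal sketch, not the Registry's own checker: it scans a sequence for the recognition sequences of the three enzymes named in the report (NheI GCTAGC, BamHI GGATCC, NgoMIV GCCGGC). The function name, the 1-based coordinate convention and the toy sequence are assumptions made for illustration.

```python
# Recognition sequences of the enzymes named in the compatibility report.
# All three motifs are palindromic, so scanning one strand is sufficient.
ILLEGAL_SITES = {
    "NheI": "GCTAGC",
    "BamHI": "GGATCC",
    "NgoMIV": "GCCGGC",
}


def find_sites(sequence, sites=ILLEGAL_SITES):
    """Return {enzyme: [start positions]} for every motif found.

    Positions are 1-based (an assumption of this sketch).
    """
    seq = sequence.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + 1)  # convert 0-based index to 1-based
            start = seq.find(motif, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


# Toy sequence (NOT the real part sequence): one NheI and one BamHI site.
toy = "AAAGCTAGCTTTGGATCCAAA"
print(find_sites(toy))
```

Running the same function on the full part sequence would be one way to cross-check the positions listed in the report.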

Introduction

Biology

Description of the CAT-Seq esterase

CAT-Seq Esterase is a hydrolase with a promiscuous ability to hydrolyse N4-acyl-2'-deoxycytidine triphosphates (Substrate Nucleotides). In sequence it is most similar to tannases (EC 3.1.1.20) and feruloyl esterases (EC 3.1.1.73); however, to date no enzyme that hydrolyses the N4-amide bond in modified deoxycytidine triphosphates has been characterized.

More specifically, the enzyme accepts various N4-acyl-2'-deoxycytidine triphosphates as substrates for hydrolysis, leaving 2'-deoxycytidine triphosphate (Product Nucleotide).

To better understand the nature of this biological part, we generated a structural homology model based on solved structures of homologous feruloyl esterases. Currently known feruloyl esterases contain a catalytic triad in their active sites consisting of serine, aspartate and histidine. By aligning CAT-Seq Esterase to the structural 3WMT template, we could also identify a catalytic triad consisting of Ser195, Asp429 and His467. The sequence similarities and the positions of the catalytic triad amino acids are highlighted in the illustration below (Fig. 1).

Figure 1. Superposition of the generated homology model (light blue) and the template used (yellow, PDB ID: 6G21). Catalytic triad residues are marked in red boxes with numbering. Calcium ion shown as grey sphere.

In-Silico design of the CAT-Seq esterase mutants

The purpose of making esterase mutants was to benchmark the accuracy of the CAT-Seq system. We aimed to make Esterase mutants with slightly different activities. After deriving the mutations, we measured their activities using standard methods and then compared them with the CAT-Seq measurements. The esterase and its mutants can therefore be used for accuracy calibration when setting up CAT-Seq for the first time.

The residues chosen for mutation were scattered around the catalytic triad of the esterase and involved mainly polar and aromatic side chains. In the literature, some of these side chains are assumed to form bonds that stabilize the binding of the substrate (Fig. 2).

Figure 2. View of the CAT-Seq Esterase Active Center. Residue coloring: catalytic triad - red, mutation positions - orange, calcium-coordinating residues - cyan. Catalytic residues as well as successful mutations are numbered in boxes. Hydrogen bonds in the catalytic triad are represented as black dashed lines.

After carefully selecting the mutations in silico, we synthesized the esterase mutants and measured their activity in the laboratory. The mutations successfully altered the activity of the original Esterase. None of the mutants we managed to test showed complete inactivation of catalytic activity. Please head to the results section for further information.

The Overview of the Toehold Riboregulators

Figure 1. Toehold switches repress translation through base pairs programmed before and after the start codon (AUG), leaving the RBS and start codon regions completely unpaired. The toehold domain a binds to a complementary a* domain on the trigger RNA and initiates strand displacement, which activates translation (Green, Alexander A. et al.).

A toehold is a short RNA sequence that contains a ribosome binding site and a start codon followed by a 9-amino-acid linker. Importantly, it forms a stable secondary RNA hairpin structure that, in addition to locking the start codon in the stem loop, sequesters the ribosome binding site in a bulge loop.

As a consequence of this stable secondary structure, the ribosome cannot bind and initiate translation of the downstream gene. The linker codes for low-molecular-weight amino acids added to the N terminus of the gene of interest. This sequence increases the orthogonality of the toehold switches, because it forms the base of the stem loop and locks the start codon in it. The trigger RNA binds the 5’ end of the toehold and initiates strand displacement by linear-linear interaction.

As a result, the ribosome binding site and the start codon become accessible for ribosome binding and translation initiation. Since the trigger RNA binds the 5’ end of the toehold sequence, the nucleotide composition of that region is an important factor in the degree of cross-interaction between different toehold systems. Employing a specific linker sequence increases the number of unique triggers with minimal cross-interaction.
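
The pairing between the toehold domain a and the a* domain of the trigger can be illustrated with a simple complementarity check. This is a minimal sketch assuming strict Watson-Crick pairing only: it ignores G-U wobble pairs and folding thermodynamics, so it is not a design tool. The sequences and function names are hypothetical and are not taken from the actual parts.

```python
# Strict Watson-Crick RNA complement table (G-U wobble pairs are ignored).
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def trigger_matches(toehold_domain, trigger):
    """True if the trigger contains a region (a*) fully complementary
    to the given toehold domain (a)."""
    return reverse_complement(toehold_domain) in trigger.upper()


# Hypothetical sequences for illustration only.
toehold_a = "GGGAUCUAUUAC"
cognate_trigger = "AA" + reverse_complement(toehold_a) + "CC"
noncognate_trigger = "AAUUUUUUUUUUUUCC"

print(trigger_matches(toehold_a, cognate_trigger))     # cognate pair
print(trigger_matches(toehold_a, noncognate_trigger))  # non-cognate pair
```

A trigger that fails this check lacks a fully complementary a* region, which is the situation the orthogonal switch/trigger groups are designed to produce.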

Usage with CAT-Seq (Catalytic Activity Sequencing)

About CAT-Seq

Figure 1. Abstract scheme of the Catalytic Activity Sequencing method

CAT-Seq stands for Catalytic Activity Sequencing - a system designed and built for high-speed characterization of the activity and interactions of catalytic and regulatory biological parts. You can learn more about CAT-Seq [http://2018.igem.org/Team:Vilnius-Lithuania-OG by clicking this link].

Catalytic Activity Sequencing Overview

Library preparation - A library of catalytic biomolecules is prepared.
Library encapsulation into droplets - Library fragments are physically separated by encapsulating each one in a picoliter water droplet. Substrate nucleotides, the targets for the catalytic biomolecules, are encapsulated as well.
Catalytic biomolecule production - Catalytic biomolecules are produced in each droplet.
Catalysis of the substrate conversion - Catalytic biomolecules may recognise the Substrate Nucleotides as a target for chemical reaction catalysis. Depending on biomolecule activity, a specific number of nucleotides with removed substrates (Product Nucleotides) accumulates in each droplet.
Activity Recording
1. Droplet merging - Each prior droplet is merged with a new droplet that contains DNA amplification mix and reference nucleotides. The reference nucleotides help to track the number of Product Nucleotides.
2. DNA amplification - The unique catalytic biomolecule DNA in each droplet is amplified. During the amplification, the Product Nucleotides and the Reference Nucleotides are incorporated into the DNA sequence.
Activity Reading by Nanopore Sequencing - All of the droplets are broken and the amplified DNA is sequenced. During sequencing, the biomolecule’s activity is retrieved by counting reference and Product Nucleotides (substrate removed), together with the sequence of the particular biomolecule variant.
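
The final read-out step can be sketched as a per-variant tally of incorporated nucleotides. This is a simplified illustration under stated assumptions, not the team's analysis pipeline: it assumes each read has already been reduced to a barcode plus counts of Product and Reference Nucleotides, and it uses the product fraction as the activity proxy. The record layout, names and numbers are hypothetical.

```python
from collections import defaultdict


def product_fraction_by_barcode(reads):
    """Aggregate per-read nucleotide counts into one activity proxy per barcode.

    `reads` is an iterable of (barcode, product_count, reference_count).
    Returns {barcode: product / (product + reference)}.
    """
    totals = defaultdict(lambda: [0, 0])
    for barcode, product, reference in reads:
        totals[barcode][0] += product
        totals[barcode][1] += reference
    return {
        barcode: product / (product + reference)
        for barcode, (product, reference) in totals.items()
        if product + reference > 0  # skip barcodes with no usable counts
    }


# Hypothetical reads: a more active and a less active variant.
reads = [
    ("variant_A", 30, 10),
    ("variant_A", 30, 10),
    ("variant_B", 5, 35),
]
print(product_fraction_by_barcode(reads))
```

In this toy data, variant_A incorporates more Product Nucleotides relative to Reference Nucleotides than variant_B, which under the assumptions above would indicate a more active biomolecule.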

Determining the accuracy of CAT-Seq

Building a CAT-Seq pipeline in your own laboratory requires the CAT-Seq esterase in order to troubleshoot the system and assess the measurement accuracy and precision. In other words, the esterase can be used to calibrate CAT-Seq.

Together with the Esterase, its substrate attached to a nucleotide (the Substrate Nucleotide) is required. In the standard case, the Substrate Nucleotide is N4-benzoyl-2'-deoxycytidine triphosphate. If the Esterase catalyzes the removal of the substrate from the nucleotide, it becomes the Product Nucleotide - 2'-deoxycytidine triphosphate.

Genetic Regulatory Part activity and cross-interaction assessment

While Catalytic Activity Sequencing began as a method for recording the activity of catalytic biomolecules, we have also created a way to adjust CAT-Seq to record the activities of regulatory parts. In addition to the activities, cross-interactions of different regulatory parts can also be measured.

When assessing the activities and sequences of libraries of catalytic biomolecules in CAT-Seq, the activity is measured and recorded as a function of the amount of Product Nucleotide produced in each droplet.

Yet the activity of the catalytic biomolecule is not the only factor that influences the amount of Product Nucleotides produced. If all of the droplets contained the same catalytic biomolecule, but each droplet had a different concentration of that biomolecule, we would obtain different amounts of Product Nucleotides. For example, droplets with a large amount of the biomolecule may produce a large number of Product Nucleotides, and vice versa. The default and well-characterized catalytic biomolecule in CAT-Seq for regulatory part characterization is the CAT-Seq Esterase.

Figure 1. A scheme illustrating the dependence of the Product Nucleotide concentration on the amount of Catalytic Biomolecule in droplets.

Example for Toehold cross-interaction determination

Next, we want to give an example of how to record regulatory part interactions using CAT-Seq. In this case, we will be using Toehold Switches.

Toehold Switch systems are composed of two RNA strands referred to as the Switch and the Trigger. The Switch RNA contains the coding sequence of the gene being regulated and forms a hairpin structure that includes the RBS. While the hairpin structure is formed, translation is inhibited. The Trigger RNA is a molecule that can selectively bind to the Switch RNA and expose the RBS of the gene to ribosomes. Once that happens, protein translation can be initiated.

Once again, only the first part of the general CAT-Seq design needs to be changed - the library preparation. Instead of a catalytic biomolecule library, a single enzyme is used. Two libraries then need to be prepared - one for Toehold Switches and another for Trigger RNAs. The Trigger RNA library also requires a separate T7 promoter for RNA expression. Both libraries must then be ligated to the enzyme DNA fragment. The resulting library contains fragments that all carry the same catalytic biomolecule (in the general case, the CAT-Seq Esterase), yet each of them has a random combination of a specific Toehold Switch and Trigger RNA.

Figure 1. Comparison between Standard CAT-Seq library constructs and Regulatory part adjusted CAT-Seq library constructs (For cross-interaction recording).

After encapsulation, the droplets that contain a correct Trigger RNA and Toehold Switch combination (in which the Trigger RNA binds to that specific Toehold Switch) can produce catalytic biomolecules. In turn, those catalytic biomolecules can produce Product Nucleotides.

In other droplets, where the Trigger RNA binds to the Toehold Switch with weaker affinity, fewer catalytic biomolecules and Product Nucleotides are produced. Finally, in the droplets in which the Trigger RNA does not bind to the Toehold Switch, no biomolecules or Product Nucleotides are produced.
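
The combinatorial library described in this section can be enumerated in a few lines. The sketch below is illustrative only: it pairs the three Toehold Switch parts with the three Trigger RNA parts listed on this page and flags a pair as cognate when both belong to the same group. The dictionary layout and function name are assumptions; the part identifiers are those given on this page.

```python
from itertools import product

# Part identifiers as listed on this page, keyed by group number.
TOEHOLDS = {1: "BBa_K2621011", 2: "BBa_K2621012", 3: "BBa_K2621013"}
TRIGGERS = {1: "BBa_K2621014", 2: "BBa_K2621015", 3: "BBa_K2621016"}


def build_library():
    """Enumerate every Toehold Switch / Trigger RNA combination.

    A pair is marked cognate when switch and trigger share a group number;
    only cognate pairs are expected to allow esterase expression.
    """
    library = []
    for (switch_group, switch), (trigger_group, trigger) in product(
        TOEHOLDS.items(), TRIGGERS.items()
    ):
        library.append(
            {
                "toehold": switch,
                "trigger": trigger,
                "cognate": switch_group == trigger_group,
            }
        )
    return library


library = build_library()
for member in library:
    print(member["toehold"], member["trigger"], member["cognate"])
```

Of the nine combinations, three are cognate. The combination carried by this composite part (Toehold 3 with Trigger 2) is one of the six non-cognate pairs.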

Part Characterization (Vilnius-Lithuania Overgraduate 2018)

Kinetic characterization of the CAT-Seq Esterase

Spectrophotometric kinetic data, based on the decay of absorbance at 310 nm due to catalytic conversion of the substrate nucleotide, were gathered using a range of starting substrate nucleotide concentrations. A Michaelis-Menten curve for the CAT-Seq esterase was plotted using the acquired data. The data shown in Fig. 1 show a good fit (R2 = 0.9449) to a standard Michaelis-Menten curve.

Figure 1. Michaelis Menten Curve generated for CAT-Seq Esterase enzyme. The initial velocity of the enzyme was determined spectrophotometrically as a decay of absorbance at 310 nm at different starting substrate nucleotide concentrations. The graph shows the average values of three independent experiments.

Michaelis-Menten plot transformations were generated for the CAT-Seq esterase enzyme (BBa_K2621000). Fig. 2 displays the Lineweaver-Burk and Hanes-Woolf transformation plots. Based on the equation coefficients of the Hanes-Woolf transformation, the experimentally determined Vmax value is 17.2 µM/min and the Km value is 86 µM.
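
For reference, the Hanes-Woolf transformation rewrites the Michaelis-Menten equation v = Vmax·[S] / (Km + [S]) as [S]/v = [S]/Vmax + Km/Vmax, so a straight-line fit of [S]/v against [S] has slope 1/Vmax and intercept Km/Vmax. The sketch below shows how Vmax and Km follow from those coefficients. It does not use the measured data: the substrate concentrations are illustrative, and the velocities are synthetic, noise-free values generated from the Vmax and Km reported above.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def hanes_woolf(substrate, velocity):
    """Estimate (Vmax, Km) from the Hanes-Woolf plot of [S]/v versus [S]."""
    ratios = [s / v for s, v in zip(substrate, velocity)]
    slope, intercept = linear_fit(substrate, ratios)
    vmax = 1.0 / slope      # slope = 1 / Vmax
    km = intercept * vmax   # intercept = Km / Vmax
    return vmax, km


def michaelis_menten(s, vmax, km):
    """Initial velocity predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)


# Synthetic, noise-free data (NOT the measured values), in µM and µM/min.
substrate = [10.0, 25.0, 50.0, 100.0, 200.0, 400.0]
velocity = [michaelis_menten(s, 17.2, 86.0) for s in substrate]

vmax, km = hanes_woolf(substrate, velocity)
print(round(vmax, 1), round(km, 1))
```

With real, noisy measurements the two linearizations weight errors differently, which is one reason the Lineweaver-Burk and Hanes-Woolf plots can yield slightly different estimates.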

Figure 1. Two Michaelis Menten Curve transformations for CAT-Seq Esterase enzyme. Lineweaver-Burk and Hanes-Woolf transformations for Michaelis Menten curve generated earlier. Both of the transformations were performed on data acquired from 3 independent experiments. Graphs show the fitted linear function and its correlation coefficient.

Esterase and its mutants catalytic activity determination

Ten esterase mutants carrying mutations at bioinformatically predicted sites were created. To see why and how the mutations were designed in silico, [http://2018.igem.org/Team:Vilnius-Lithuania-OG/Model please click this link]. Each mutant was constructed using PCR and synthesized using an in vitro transcription and translation kit, and its catalytic activity towards N4-benzoyl-2'-deoxycytidine triphosphate was tested.

Figure 3. Relative activity of in silico generated CAT-Seq esterase mutants. CAT-Seq esterase mutant sequences were generated in silico and synthesised using an in vitro transcription and translation kit. The graph shows the relative catalytic activity of each generated mutant, measured spectrophotometrically, and the corresponding mutation site. The activity was normalized to the wild type (WT) enzyme.

The reaction kinetics were measured spectrophotometrically as a decrease of absorbance. Figure 3 displays the relative hydrolysis speed of each mutant generated. As seen from these results, a variety of mutants showing different catalysis speeds were produced. Some of the amino acid changes affected the activity drastically, for example Trp224 to Tyr, Lys227 to Arg or Glu509 to Lys. Other mutations, such as Asn107 to Asp or Glu194 to Ala, only modulated the activity. Additionally, a large 8-amino-acid deletion at position Pro348-Hy356 caused only a moderate decrease in the enzyme's activity.

Esterase and mutants activity assessment using CAT-Seq

The in silico designed mutant library was subjected to catalytic activity sequencing. Using the data preparation and analysis pipeline, the mean methylation scores, which arise from different ratios of catalytically converted and reference nucleotides, were filtered and extracted for each barcoded mutant DNA template from the DNA carrying the catalytic activity information (in the form of the ratio of incorporated reference to catalytically converted nucleotides). The collected data were normalized to the Wild Type Esterase and the K227R mutant (lowest activity).

Figure 1. Comparison of in bulk and CAT-Seq measured esterase mutant relative activity. The in silico generated Esterase mutant library was subjected to catalytic activity sequencing. The mean methylation scores for each barcoded mutant DNA template were filtered and extracted. The collected data were normalized to the Wild Type CAT-Seq Esterase and the K227R mutant (lowest activity). The relative activity, extracted from the mean methylation score of each mutant read, is compared to measurement data gathered in standard sized reactions (in bulk).

The relative methylation score (reference nucleotide count) of each mutant read corresponds to the activity of the enzyme it encodes. The higher the activity of the expressed enzyme, the lower the methylation score assigned, due to catalytic conversion of substrate nucleotides. The comparison of the results gathered with the CAT-Seq method and the spectrophotometric data from standard sized reactions (in bulk) (Fig. 1) supports the viability of the CAT-Seq approach. The activity readings extracted from the DNA sequence correlate closely with the in bulk measurement data. The activity of each Esterase mutant is measured accurately and is assigned to the corresponding DNA sequence.
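
The exact normalization formula is not given on this page. One plausible reading, shown here purely as an assumption, is a two-point rescaling in which the Wild Type score maps to a relative activity of 1 and the K227R score maps to 0, with the axis inverted because a lower methylation score means higher activity. The function name and all scores below are hypothetical.

```python
def relative_activity(scores, wild_type="WT", lowest="K227R"):
    """Rescale mean methylation scores to relative activity.

    Assumed mapping (not confirmed by the source): WT -> 1.0, K227R -> 0.0.
    Lower methylation score means higher activity, hence the inversion.
    """
    wt_score = scores[wild_type]
    low_score = scores[lowest]
    span = low_score - wt_score
    if span == 0:
        raise ValueError("WT and lowest-activity scores must differ")
    return {name: (low_score - score) / span for name, score in scores.items()}


# Hypothetical mean methylation scores, for illustration only.
scores = {"WT": 0.20, "K227R": 0.80, "mutant_X": 0.50}
for name, activity in relative_activity(scores).items():
    print(name, round(activity, 2))
```

A mutant whose score lies midway between the two anchors receives a relative activity of about 0.5, which can then be compared with the in bulk relative activity of the same mutant.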

Cross-interaction measurements of Toehold Switches

Nine library members, composed of 3 unique Toehold Switch sequences and 3 unique activating RNA sequences, termed Trigger RNAs, were designed to test the orthogonality and regulatory characteristics of each part.

Each regulatory part, consisting of one toehold sequence upstream of the CAT-Seq esterase gene (BBa_K2621000) and one trigger sequence, was constructed. First, the orthogonality of each toehold:activating RNA pair and their regulatory characteristics were tested in bulk.

The constructed library members were synthesized using an in vitro transcription and translation kit, and their catalytic activity towards N4-benzoyl-2'-deoxycytidine triphosphate was tested. The reaction kinetics were measured spectrophotometrically as a decrease of absorbance due to hydrolysis of the substrate nucleotide.

Figure 5. The catalytic activity of each Toehold:Trigger RNA construct, shown as a matrix. CAT-Seq esterase mutant sequences were placed downstream of the riboregulatory sequences with the corresponding trigger part and their catalytic activity was measured. The decrease of absorbance corresponds to a catalytically active enzyme. The graph, shown as a matrix, indicates that only Toehold sequences expressed with their corresponding Trigger RNA produce an active enzyme molecule.

Figure 5 displays the relative hydrolysis speed of each regulatory part variant in the form of a matrix. The decrease of absorbance shown on the Y axis corresponds to the catalytic conversion of substrate nucleotides. In the control experiment, in which the standard esterase was expressed, absorbance declines over time.

Among the library members, the same decrease of absorbance is seen only on the diagonal of the matrix. This means that active catalytic molecules are expressed only when both regulatory molecules of the same group are present - Toehold1 (BBa_K2621011) with Trigger1 (BBa_K2621014), Toehold2 (BBa_K2621012) with Trigger2 (BBa_K2621015) and Toehold3 (BBa_K2621013) with Trigger3 (BBa_K2621016). None of the regulatory sequences show any cross-talk with the other groups. These results confirm that a working control library of toehold riboswitches was generated for screening the parameters and orthogonality of regulatory sequences.

Cross-interaction measurements of Toehold Switches using CAT-Seq

In addition to ribosome binding sites, a Toehold regulatory sequence library composed of different toehold and trigger pairs was constructed and subjected to the catalytic activity sequencing method:

BBa_K2621011 - Toehold Switch 1
BBa_K2621012 - Toehold Switch 2
BBa_K2621013 - Toehold Switch 3
BBa_K2621014 - Trigger RNA 1
BBa_K2621015 - Trigger RNA 2
BBa_K2621016 - Trigger RNA 3

The mean methylation scores (reference nucleotide count) for each barcoded DNA template carrying a different regulatory sequence were filtered and extracted from the DNA carrying the catalytic activity information (in the form of the ratio of incorporated reference to catalytically converted nucleotides).

Figure 23. The evaluation of Toehold-Trigger riboregulatory sequence orthogonality using CAT-Seq. The catalytic activity of esterase genes regulated by different Toehold switches was measured using CAT-Seq. The mean methylation scores for each barcoded regulatory construct DNA template were filtered and assigned. Low methylation scores correspond to actively expressed protein and are only assigned when both Toehold and trigger sequences from the same group are present, verifying the already measured orthogonality of the regulatory parts.

The graph displays the mean methylation (reference nucleotide) scores assigned to each barcoded toehold-trigger construct read. Based on the results, low methylation scores are only assigned when both Toehold and trigger sequences from the same group are present, due to the esterase being expressed. These results agree closely with the standard (not in droplet) measurement results obtained earlier.

Based on this, it can be concluded that the CAT-Seq activity sequencing method can be used as a precise and accurate way to screen and assign the activity and orthogonality of regulatory sequences.

Part:BBa_K2621043
