Assembly:RBS-CDS issues

Revision as of 02:44, 23 June 2006 by Ahessel (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The transfer curve of a BioBrick device is measured from input TIPS to output TIPS. For a protein-DNA inhibition system, this transfer curve depends on the rates and efficiencies of many stages in a chain that includes transcriptional efficiency, mRNA lifetime, ribosome binding efficiency, translation initiation and efficiency, protein lifetime, protein concentration, protein multimerization, protein-DNA binding rates and efficiencies, cooperative effects with other molecules, RNA polymerase effects, system copy count, and other system specific effects. If BioBrick parts are to be interoperable, intentional engineering of this chain of events will be required to produce a desired transfer function.

Different ribosome binding sites (RBS) bind ribosomes with different efficiencies. While the overall efficiency may not be directly proportional to this efficiency, it is an important variable. In the future, we expect that this data book will have a table such as the following: Standard BioBrick Ribosome Binding Sites Part Number Binding Efficiency Sequence BBa_B0030

0.6 att aaa gag gag aaa BBa_B0031 0.07 tca cac agg aaa cc BBa_B0032 0.3 tca cac agg aaa g

A BioBrick engineer armed with this table could perform a first-order reengineering of the transfer function by measuring a component with BBa_B0031 and then selecting a more appropriate RBS.

In order to allow easy variation in the RBS, some BioBrick alpha parts have their RBS and their protein coding regions implemented in separate parts. The combined, normalized, part is also available for convenience. In order to divide the BioBrick part into its RBS and coding region, a set of BioBrick restriction sites must be designed. Standard Assembly and the RBS

The RBS is physically near the start codon (in BioBricks, this will always be ATG). The "sequence logo" for the RBS is shown below. The size of each letter is proportional to the frequency of that base in the RBS. From Tom Schneider, "A Gallery of Sequence Logos"

This figure shows the RBS as a hump of G's or A's about 10 bases before the start codon. This leaves little room for a BioBrick restriction enzyme site. The standard assembly method of BioBricks normally results in an 8-base region containing a 6-base mixed SpeI/XbaI restriction site surrounded by a base on each end as shown below. If a start codon followed this sequence, the RBS would be significantly disturbed.

--T ACTAGA G--

This problem is solved by changing the rightmost base from a G to a T. This results in the sequence shown below and limits the impact of the BioBrick overhead to six bases.

Summary

When variable RBS sequences are developed as BioBrick parts, they should have the standard BioBrick prefix and suffix as follows: Variable RBS Prefix

GAATTC GCGGCCGC T TCTAGA G[RBS]

EcoRI  NotI      XbaI

Variable RBS Suffix

[RBS]T ACTAGT A GCGGCCG CTGCAG

      SpeI     NotI    PstI

When a protein coding region is designed to be preceded by a BioBrick RBS part, it should have the following prefix and suffix sequences: Protein Coding Region Prefix*

GAATTC GCGGCCGC T TCTAG[ATG Remaining CDS]

EcoRI  NotI      XbaI

Protein Coding Region Suffix

[CDS]T ACTAGT A GCGGCCG CTGCAG

      SpeI     NotI    PstI

Notes

  • The only change is that the protein coding region prefix does NOT end with AGAG. instead, it ends AGATG with the ATG also being the start codon of the protein coding region.

In BioBrick parts, we prefer using TAA as the stop codon.