Help:Protein coding

Browse protein coding parts!

Protein-coding parts are parts that create functional proteins. Most of the parts presented on this page are protein-coding regions only -- a few of them, however, currently also contain RBS sites.

Regulations for Protein-Coding Parts

Every BioBrick coding region consists of the following structure:

It begins with a standard start codon: "ATG"
It ends with two "stop" codons: "TAA","TAA", i.e., "TAATAA".

The actual protein-coding sequence begins with the start codon and ends immediately before the two stop codons. Thus the structure of a BioBrick protein-coding region sequence looks like: "ATG-[your coding region]-TAATAA " (Note that ATG ---> AUG in the mRNA transcript, and AUG codes for methionine. Depending on which protein you have, the methionine may or may not remain as the first residue in your expressed amino acid sequence.)

Direction

A coding region can point RNA polymerase in either the forward or reverse direction depending on which strand of the double-stranded DNA molecule it binds to. Currently most BioBrick parts transcribe DNA in the forward direction. By convention, in the scientific literature (and in textbooks) sequences are always presented with the forward direction to the right. In the case of BioBricks, however, the location of the cloning sites determines the "forward" direction. The BioBrick prefix contains the EcoRI and XbaI restriction sites, and when a coding sequence starts (with its AUG initiation codon) at the prefix, it's said to run in the forward direction. However, nothing prevents us from inserting the coding sequence the other way around with the AUG adjacent to the BioBrick suffix and the coding sequence running from right to left. We define this as the "reverse" orientation.

References and Sources for Protein-Coding Information

There are a number of excellent sources for protein-coding sequence information. They include:

PubMed (Entrez Protein) [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&itool=toolbar link]
UniProt [http://www.pir.uniprot.org/ link]
UniProt-SwissProt/TrEMBL [http://www.ebi.ac.uk/swissprot/ link]

3-Dimensional structural information resides at:

Protein Data Bank (PDB) [http://www.rcsb.org/pdb/Welcome.do link]

Protein Barcodes

a BioBrick Barcode, shown under Part Design:Features, ssDNA viewing mode

Many protein coding parts in the Registry are trackable through the use of pieces of DNA which are:

non-coding (no "start" codon), rare, and about 25 base pairs in length. These sequences are known as Barcodes.

Help:Protein coding

Contents

Regulations for Protein-Coding Parts

Direction

References and Sources for Protein-Coding Information

Protein Barcodes

Tags