Coding

Part:BBa_K1150000

Designed by: Freiburg 2013   Group: iGEM13_Freiburg   (2013-09-15)
Revision as of 00:40, 9 October 2013 by TU Munich (Talk | contribs)

dCas9

dCas9
Function Binding protein
Use in Mammalian cells
RFC standard RFC 25
Backbone pSB1C3
Organism Streptococcus pyogenes
Source Feng Zhang, Addgene
Submitted by [http://2013.igem.org/Team:Freiburg Freiburg 2013]

dCas9 is a codon optimized and standardized (RFC 25) protein for human cell lines. Interacting with a DNA-binding RNA and fused with different effector domains it can be used for specific gene regulation.

Cas9 is the main protein of the CRISPR/Cas system II of Streptococcus pyogenes. CRISPR systems protect bacteria and archaea from phages by recognizing and cleaving of invading phage DNA. This recognition is based on Watson Crick base pairing between a short RNA, called crRNA, and the complementary DNA strand. A second RNA, called tracrRNA, connects crRNA and Cas9. These three parts together form a protein-RNA-DNA complex with the targeted DNA strand [1].
Cas9 became of great interest for research concerning DNA targeting, because of its ability to recognize site specific DNA strands by a crRNA.
At first the functionality of Cas9 was modified by exchanging aminoacids. As a result, Cas9 was able to introduce mutations within the genome of several organisms by causing double strand breaks [2][3]. Then, it was converted from a nuclease to a nickase introducing single strand breaks [4] and lately it was converted to an enzymatically inactive form, called dCas9 [5].
This dCas9 is codon optimized for human cell lines and standardized (RFC 25). It can be used as a DNA binding protein, that can be fused with different effectors in order to regulate gene expression.




Sequence and Features


Assembly Compatibility:
  • 10
    COMPATIBLE WITH RFC[10]
  • 12
    COMPATIBLE WITH RFC[12]
  • 21
    INCOMPATIBLE WITH RFC[21]
    Illegal BglII site found at 248
  • 23
    COMPATIBLE WITH RFC[23]
  • 25
    COMPATIBLE WITH RFC[25]
  • 1000
    COMPATIBLE WITH RFC[1000]


Protein data table for BioBrick BBa_K1150000 automatically created by the BioBrick-AutoAnnotator version 1.0
Nucleotide sequence in RFC 25, so ATGGCCGGC and ACCGGT were added (in italics) to the 5' and 3' ends: (underlined part encodes the protein)
 ATGGCCGGCGACAAGAAG ... GGAGGCGACACCGGT
 ORF from nucleotide position -8 to 4107 (excluding stop-codon)
Amino acid sequence: (RFC 25 scars in shown in bold, other sequence features underlined; both given below)

101 
201 
301 
401 
501 
601 
701 
801 
901 
1001 
1101 
1201 
1301 
MAGDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFF
HRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEE
NPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSD
AILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDL
LRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNF
DKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLL
KIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIH
DDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKE
HPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFD
NLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALI
KKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKT
EVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKL
PKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHR
DKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDTG*
Sequence features: (with their position in the amino acid sequence, see the list of supported features)
RFC25 scar (shown in bold): 659 to 660, 1355 to 1356
Amino acid composition:
Ala (A)75 (5.5%)
Arg (R)77 (5.6%)
Asn (N)70 (5.1%)
Asp (D)98 (7.1%)
Cys (C)2 (0.1%)
Gln (Q)52 (3.8%)
Glu (E)108 (7.9%)
Gly (G)71 (5.2%)
His (H)31 (2.3%)
Ile (I)93 (6.8%)
Leu (L)148 (10.8%)
Lys (K)150 (10.9%)
Met (M)22 (1.6%)
Phe (F)63 (4.6%)
Pro (P)35 (2.6%)
Ser (S)76 (5.5%)
Thr (T)66 (4.8%)
Trp (W)7 (0.5%)
Tyr (Y)55 (4.0%)
Val (V)73 (5.3%)
Amino acid counting
Total number:1372
Positively charged (Arg+Lys):227 (16.5%)
Negatively charged (Asp+Glu):206 (15.0%)
Aromatic (Phe+His+Try+Tyr):156 (11.4%)
Biochemical parameters
Atomic composition:C7120H11348N1944O2104S24
Molecular mass [Da]:158617.6
Theoretical pI:9.01
Extinction coefficient at 280 nm [M-1 cm-1]:120450 / 120575 (all Cys red/ox)
Plot for hydrophobicity, charge, predicted secondary structure, solvent accessability, transmembrane helices and disulfid bridges 
Codon usage
Organism:E. coliB. subtilisS. cerevisiaeA. thalianaP. patensMammals
Codon quality (CAI):good (0.71)good (0.70)very bad (0.00)good (0.65)excellent (0.89)excellent (0.94)
Alignments (obtained from PredictProtein.org)
   There were no alignments for this protein in the data base. The BLAST search was initialized and should be ready in a few hours.
Predictions (obtained from PredictProtein.org)
   There were no predictions for this protein in the data base. The prediction was initialized and should be ready in a few hours.
The BioBrick-AutoAnnotator was created by TU-Munich 2013 iGEM team. For more information please see the documentation.
If you have any questions, comments or suggestions, please leave us a comment.

References

[1] Westra E.R., Swarts D.C., Staals R.H., Jore M.M., Brouns S.J., van der Oost J. (2012). The CRISPRs, they are a-changin': how prokaryotes generate adaptive immunity. Annu Rev Genet. 46, 311-39
[2] Mali P., Yang L., Esvelt K.M., Aach J., Guell M., DiCarlo J.E., Norville J.E., Church G.M. (2013). RNA-guided human genome engineering via Cas9. Science 339(6121), 823-6
[3] Jiang W., Bikard D., Cox D., Zhang F., Marraffini L.A. (2013). RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat Biotechnol. 31(3), 233-9
[4] Cong, L., Ran, F.A., Cox, D., Lin, S., Barretto, R., Habib, N., Hsu, P.D., Wu, X., Jiang, W., Marraffini, L.A., Zhang, F. (2013). Multiplex Genome Engineering Using CRISPR/Cas Systems. Science 339 (6121), 819-23
[5] Qi L.S., Larson M.H., Gilbert L.A., Doudna J.A., Weissman J.S., Arkin A.P., Lim W.A. (2013). Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152(5), 1173-83

[edit]
Categories
//function/crispr
//function/crispr/cas9
Parameters
None