Part:BBa_K5396004
Barbie1-Cys
This BARBIE1 protein is modified with an additional amino acid (cysteine). This enhancement allows it to be effectively utilized in our biosensor technology.
Part Generation
The BARBIE1-Cys fragment was generated from a PCR reaction using primers that specifically amplify the linker-BARBIE1-linker region of BBa_K5396001. The reverse primer used in this reaction adds a codon that encodes the amino acid cysteine at the end of the sequence.
Protein Design
Starting from the BaCBM2 structure model generated by the AlphaFold2 software, we performed docking assays with six types of plastic: polypropylene (PP), polyethylene (PE), polyethylene terephthalate (PET), nylon (NY), polyvinyl chloride (PVC) and polystirene (PS). We made the docking using Gnina software with relaxed parameters to screen many proteins and features for plastic affinity.
Thereafter, the produced overlaps were removed by the docking assays using the ChimeraX software, as well as used for visualization and sequence manipulation. A reverse folding was then performed with the protein output from the docking using the LigandMPNN tool. The original protein set generated from the docking was filtered to maintain just unique positions, considering the associated score , without overlap between them.
By doing that, 6.000 sequences were generated for each ligand, totalizing 6 plastics x 6.000 sequences = 36.000 sequences, as illustrated in Figure 1. The consensus sequence from the 36000 sequence originated our most optimized protein sequence sensitive for several plastics types was named as BARBIE1!
Figure 1. Protein-ligand docking representation of the BARBIE1 protein docked with PP, PE, PET, NY, PVC.
The BARBIE1 protein was redocked with the same plastics as before, once more using Gnina software. The result comparing its affinity with BaCBM2 in silico assays performed are described in the barplot of Figure 5. Comparing the predicted affinity between the original and the modified protein, it is notable a substantial increase for all plastics, in particular for PE, PP, and PS, highlighting the effectiveness of the processing pipeline.
Besides the monomers tests, we also wanted to test the affinity using different sizes of plastic in order to guarantee that this could be a valuable parameter to future analysis and experiments. Therefore, the tested ligands were PE and PET plastics with 50 and 25 repeating units, respectively. As a result, the previous behavior at maintaining a higher KD for BARBIE1 when compared to BaCBM2 was preserved.
Computational Modeling
Proteins with carbohydrate-binding modules (CBM) can not only be found as single units but also in more units or as part of larger multi-domain proteins. In light of the importance of understanding the thermodynamics basis for structural composition, the newest version of Alpha Fold 3 (AF3) was used [doi.org/10.1038/s41586-024-07487-w]. The model was used to optimize protein-protein interaction and test high-order oligomers. To effectively model the water filter system closer to reality, the first step was to predict the state of the proteins.
The proposed proteins to be evaluated in Alpha Pulldown are the B1-CBM (pink), BaCBM2 (blue), and 1A3N (green), which is a reference protein that forms the deoxy human hemoglobin.
Figure X. Protein comparative of the tested proteins. In (a), it is shown the BARBIE1 structure, in (b) the BaCBM2, and in (c) the 1A3N protein.
With 4 subunits as shown in Figure X+1, the 1A3N multicomplex protein forms a pocket in the middle region to store oxygen molecules. Since this reference protein was resolved with more subunits, it can serve as a baseline for the prediction of the other proteins.
Figure X+1. 1A3N protein representation with four repeating units.
About the predicted template modeling (pTM) values:
Concerning the interface predicted template modeling (iPTM) values:
Figure X+2. pTM and iPTM values calculated by AlphaFold3 for BARBIE1, BaCBM2, and 1A3N with different repeating units.
As it can be seen in Figure X+2, both BARBIE1, BaCBM2, and 1A3N achieved high scores as monomers. On the other hand, only the 1A3N as a multimer achieved higher values, such as its two and three subunits iPTM, as expected. Therefore, it is possible to affirm that it is very unlikely for both B1-CBM and BaCBM2 to form a multimer, which can be assured by the deoxy human hemoglobin results. This result allows the modeling of B1-CBM as a monomer, which simplifies the system.
Interaction Properties
In order to further our knowledge in the protein properties, we compared our resulting sequence BARBIE1 with the original BaCBM2 in different tests. Firstly, we compared each protein's tertiary structure, as shown in Figure X+3. As a result, it is notable the similarity between them, specially in the secondary structures, in which is notable the presence of the same amount of beta sheets. In the upper left part of the BARBIE1 structure, however, there is a notable difference between them.
Figure X+3. Tertiary structure of BARBIE1 on the left (pink) and BaCBM2 on the right (blue).
This subtable contrast between the original and modified protein is shown in Figure X+4, in which we aligned the structures using VMD and calculated the root mean square deviation (RMSD). As it is possible to analyze in the aligned structures, it is very similar visibly, with a resulting RMSD of 0.56 Å (RMSDs < 2 Å indicates high similarity).
Figure X+4. Alignment of BARBIE1 (magenta) and BaCBM2 (blue) tertiary structures resulting in a RMSD of 0.56.
After that, we assessed the electrostatic surface of each protein using the ChimeraX tool, which can be valuable for identifying binding sites for ligands, as well as stability and its behavior when solvated.
In Figure X+5, both proteins are represented with the electrostatic surface. While on one hand the red parts stand for the negative electrostatic, the blue parts stand for the positive electrostatic. Therefore, it is notable a major presence of negative values for BARBIE1 when compared to BaCBM2, which may indicate a higher affinity with positive ions or positive charged ligands.
Figure X+5. Electrostatic surface represented in the left for BARBIE1 and in the right for BaCBM2. The red regions indicate positive electrostatic and the negative are represented as blue.
Following on, we calculated the proteins hydrophobicities also using ChimeraX. In this way, we can at the same time understand the water interacting regions, corresponding to the more hydrophilic parts, and possible binding sites, generally indicated by hydrophobic parts. In Figure X+6, the hydrophilic regions are represented as blue and the hydrophobic regions as yellow.
Figure X+6. Hydrophobicity surface represented in the left for BARBIE1 and in the right for BaCBM2. The blue regions indicate hydrophilic and the hydrophobicity is represented as yellow.
In general, while on one hand hydrophilic regions are exposed to aqueous solvents in the exterior of the structure, hydrophobic regions are buried in its interior. When confronting both structures, it is notable an inverse behavior on the BARBIE1 structure compared to BaCBM2: the hydrophobic parts are not buried, but exposed to the surface. Since more hydrophobic sites are exposed to the surface, it may point to a better understanding of our results. The better scores were achieved by choosing more hydrophobic amino acids in the protein primary structure, which enabled the creation of more pockets and the subsequent increase of KD. Therefore, the choice is notable for a higher plastic affinity, but also a lesser water solubility.
Sequence and Features
- 10COMPATIBLE WITH RFC[10]
- 12COMPATIBLE WITH RFC[12]
- 21COMPATIBLE WITH RFC[21]
- 23COMPATIBLE WITH RFC[23]
- 25INCOMPATIBLE WITH RFC[25]Illegal AgeI site found at 91
- 1000COMPATIBLE WITH RFC[1000]
None |