<aside> <img src="/icons/push-pin_green.svg" alt="/icons/push-pin_green.svg" width="40px" /> Key Links: http://docs.google.com/spreadsheets/d/1AsYRLlrRLd6I8abxNHfuz1OtFTSqYZ87_kefBMsxhMo/edit?gid=0#gid=0 https://docs.google.com/spreadsheets/d/19_u8Sd8TdseHP6yAVrDSjIKf0vDU9RNH3SEACx8L22Y/edit?gid=0#gid=0
</aside>
How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons)
$$ \frac{500 grams}{100 Daltons} = \frac{500 grams}{100 * 1.6610^{-24}grams} = \underline{3.01210^{24}\:Molecules} $$
Why humans eat beef but do not become a cow, eat fish but do not become fish?
Why there are only 20 natural amino acids?
Can you make other non-natural amino acids? Design some new amino acids.
Where did amino acids come from before enzymes that make them, and before life started?
If you make an alpha-helix using D-amino acids, what handedness (right or left) would you expect?
Can you discover additional helices in proteins?
Why most molecular helices are right-handed?
Why do beta-sheets tend to aggregate?
Why many amyloid diseases form b-sheet?
Design a b-sheet motif that forms a well-ordered structure.
In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins.
Briefly describe the protein you selected and why you selected it.
Identify the amino acid sequence of your protein.
How long is it? What is the most frequent amino acid? You can use this notebook to count most frequent amino acid - https://colab.research.google.com/drive/1vlAU_Y84lb04e4Nnaf1axU8nQA6_QBP1?usp=sharing

How many protein sequence homologs are there for your protein?Hint: Use the pBLAST tool to search for homologs and ClustalOmega to align and visualize them. Tutorial Here
Ans: A 100 sequences come up when searched for:
clustalo-I20250528-222515-0706-49990485-p1m.aln-clustal_num

Does your protein belong to any protein family?
Identify the structure page of your protein in RCSB
Ans: https://www.rcsb.org/structure/1A3N

When was the structure solved? Is it a good quality structure? Good quality structure is the one with good resolution. Smaller the better (Resolution: 2.70 Å)
Are there any other molecules in the solved structure apart from protein?
Does your protein belong to any structure classification family?
Ans: Globins



Open the structure of your protein in any 3D molecule visualization software:
PyMol Tutorial Here (hint: ChatGPT is good at PyMol commands)

Visualize the protein as "cartoon", "ribbon" and "ball and stick".
Ans: Cartoon, ribbon, sticks and spheres in order



Color the protein by secondary structure. Does it have more helices or sheets?
We can see only helices and no sheets

Color the protein by residue type. What can you tell about the distribution of hydrophobic vs hydrophilic residues?
Ans: They are distributed alternatively in the helices of the protein

Visualize the surface of the protein. Does it have any "holes" (aka binding pockets)?
Ans: Yes a clear hole is present in the centre

<aside> <img src="/icons/snippet_lightgray.svg" alt="/icons/snippet_lightgray.svg" width="40px" /> Resources: https://colab.research.google.com/drive/1hXStRY9VCyw52n17uWdWQBj__IcR2ztK?usp=sharing#scrollTo=38gFJBazNdzJ
</aside>
Fold your protein with AlphaFold or ESMFold or Boltz and compare it to the real structure.

Predicted Structure

Real Structure from https://www.rcsb.org/structure/1A3N
Comment on:
Inverse-fold your structure with ProteinMPNN
<aside> <img src="/icons/slide_yellow.svg" alt="/icons/slide_yellow.svg" width="40px" /> MIT/Harvard Students can optionally present this on Wednesday, March 5. Due for all on March 11
</aside>