Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Creates hydrophobic surface patches wit many Ala side chains #95

Open
gha2012 opened this issue Feb 29, 2024 · 2 comments
Open

Creates hydrophobic surface patches wit many Ala side chains #95

gha2012 opened this issue Feb 29, 2024 · 2 comments

Comments

@gha2012
Copy link

gha2012 commented Feb 29, 2024

Hi,
thank you very much for making this available! I am using this together with RFDiffusion to create small protein complexes and the interfaces look very good in many cases. However, I found that ProteinMPNN often creates very hydrophobic polyAla surface patches. I am a bit worried that this will lead to solubility issues. Is there a parameter to control this?
Thanks for any suggestions!

image

@drewschaub
Copy link

drewschaub commented Mar 13, 2024

I'm not one of the developers.

Have you looked at https://github.com/nrbennet/dl_binder_design The binder design protocol might be what you're looking for. It couples proteinmpnn with AF2 and predicts solubility and binding affinity using AF2 scores.

If running proteinmpnn by itself, I'll run a few jobs using different models and different values for T to generate several sequences. I'll then filter out sequences with high counts of alanine. I'll also calculate pI values as the models have a tendency to generate a lot of charged residues (e.g. glutamic acid).

The issue of repeats isn't something unique to ProteinMPNN, I notice it when running ESM.

It's also not unique to this domain. If I use OpenAI's whisper to transcribe audio it's common for it to generate repeats there as well.

@gha2012
Copy link
Author

gha2012 commented Mar 14, 2024

Thank you for your comment! Yes, I am using the binder design protocol. I guess I should have posted the question there but I thought this is related to proteinmpnn.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants