ProteinMPNN: Robust deep learning based protein sequence design

J. Dauparas et al.

ProteinMPNN is a computational framework for designing protein sequences. It uses a neural network-based approach to predict protein sequences. The primary goal is to design new proteins with specific shapes and functions, which has numerous applications in biotechnology, medicine, and research. Furthermore, RFDiffusion outputs can be used in ProteinMPNN to refine these backbones and design sequences that fold correctly and exhibit the desired functionalities. This combined approach allows the creation of new proteins with specific shapes and functions tailored to various needs.

Example use case: Design protein sequences

Technology: Message Passing Neural Network (MPNN)

Limitation:

  • Some of the parameters were kept default. Please see this page for more details.
  • Some of the examples are currently not implemented. Please see this page for more details.

Parameters Guideline:

1) ProteinMPNN Task: Simple Monomer Task

  • Chain List: None
  • Fix or Design Specific Residue: None
  • Tie Specific Residues: None

2) ProteinMPNN Task: Simple Multi-Chain Task

  • Chain List: A C
  • Fix or Design Specific Residue: None
  • Tie Specific Residues: None

3) ProteinMPNN Task: Fixed Specific Residue Task or Design Specific Residue Task

  • Chain List: A C
  • Fix or Design Specific Residue: 1 2 3 4 5 6 7 8 23 25, 10 11 12 13 14 15 16 17 18 19 20 40
  • Tie Specific Residues: None

4) ProteinMPNN Task: Tie Some Position Together Task

  • Chain List: A C
  • Fix or Design Specific Residue: None
  • Tie Specific Residues: 1 2 3 4 5 6 7 8, 1 2 3 4 5 6 7 8

5) ProteinMPNN Task: Homooligomer Task

  • Chain List: None
  • Fix or Design Specific Residue: None
  • Tie Specific Residues: None
Citation:
J. Dauparas et al., Robust deep learning–based protein sequence design using ProteinMPNN.Science378,49-56(2022). DOI:10.1126/science.add2187
Released: Jun-25-2024
v0.1
Structural Bioinformatics
Protein Sequence Design
fa
4
5
share
Example Results
View Source
Previous Job Parameters
Your previous job parameters will show up here
so you can keep track of your jobs

Upload a PDB File

No files selected!
OR
Use Demo Data
Your file can be in the following formats:
pdb
• The Protein Data Bank (PDB) data format is a standard file format used to store information about the three-dimensional structures of biological macromolecules.

Set Parameters

Simple Monomer Task
1
10
100