But what does it really mean to "design" a protein model? It means you explore a protein structure and then simplify the way the protein is visually displayed to make the key features of the protein that help communicate your molecular story more obvious.
This can mean hiding some atoms that are not important for your protein story, changing the display format of certain parts of your protein structure or changing colors to best highlight the most important parts of the protein. Watch the video below for a more thorough description from Dr. Tim Herman.
The first step in designing a molecular model is accessing Jmol
The Jmol User Design Environment (JUDE) will give you access to Jmol, and let practice all of the commands, techniques and tools described throughout this Training Guide. A link to the JUDE webpage can also be found at the bottom left of your screen throughout Jmol Training Guide.
Click to Open theNote: Jmol was originally a Java-based program that ran locally on your computer, rather than through a web page. For those who prefer to continue using the Java-based version of Jmol rather than the JUDE web page, we have included expandable sections with instructions specifically for Java-based Jmol users, as shown below.
Java-Based Jmol: InstallingThe Jmol User Design Environment webpage is comprised of several main areas, described in more detail below.
In order to view a molecule in Jmol, you need a molecular structure file that contains the location in 3-dimensional space of each atom in the molecule. Every atom is assigned an Atom Number, along with an (X, Y, Z) Coordinate to mark its location.
The image below is an example of the inner contents of a molecular structure file. For more information on structure files and how they are determined, visit these RCSB Protein Databank resources:
There are two main types of molecular structure formats you can use with Jmol: Protein Databank (.PDB) files and SMILES Sequences.
SMILES (Simplified Molecular Input Line Entry Specification) Sequences are single lines of text that describe both the atoms and bonds in a small molecular structure (usually under ~50 atoms).
Atoms are represented by their element abbreviations (C, N, O, P, Cl, Br, etc.), an equals sign "=" represents double bonds and the pound sign "#" represents triple bonds. Branching is indicated by brackets "()" and rings are indicated by pairs of digits. A few examples are shown below.
SMILES Sequences can be found from a variety of online drug and small molecule databases, including the following websites.
To open a SMILES Sequence in the CBM's Jmol User Design Environment, click the red "Open a New Molecular Structure" button to expand the load menu. Then simply type or paste the SMILES Sequence you wish to load into the first text input area and hit "Load".
Protein Databank (.pdb) files are the most common molecular structure file format for macromolecules such as proteins.
The RCSB Protein Databank (https://www.rcsb.org/) is an international database that contains over 150,000 protein structures. Each structure hosted on the Protein Databank has a unique four character long alpha-numeric identifier, referred to as the structure's PDB ID.
Protein structures on the RCSB Protein Databank can be searched by molecule name, PDB ID, or researcher name. Often more than one .pdb file will exist for a specific type of protein. For example, you will see hundreds of .pdb file entries if you search for a relatively common protein like Insulin.
The Protein Databank's Molecule of the Month (http://pdb101.rcsb.org/motm) can help identify the best .pdb files for common proteins. Each Molecule of the Month article provides an introduction to the structure and function of a specific protein, a discussion of its relevance to health and disease, and links to related entries.
Each structure on the RCSB Protein Databank has a unique PDB ID and Structure Summary page, with a variety of useful information about the structure.
Just above the protein structure's PDB ID should be a series of tabs, the fourth of which is the Sequence Tab. This shows the exact amino acid sequence for the protein, as well as secondary structure information.
To open a PDB File in the CBM's Jmol User Design Environment, first click the red "Open a New Molecular Structure" button to expand the load menu. Then simply type or paste the four character alpha-numeric PDB-ID for the protein structure you wish to load into the second text input area and hit "Load".
Note: If you want to work with a local PDB file directly from your computer, that is not publicly available on the RCSB Protein Databank, you will need to use the Java-based version of Jmol.
Java-Based Jmol: Opening PDB FilesAny time you design something cool in Jmol, you should save it! Saving your work allows you to revisit the design later, share the design with collaborators, and revert back to your saved state if something goes wrong.
The simplest way to save your design progress is to create a Script (.spt) file. This single file will contain all of the information needed to reproduce the current molecule representation, allowing you to open it back up exactly as it was when you saved it.
To save a .spt file in the CBM's Jmol User Design Environment, click the orange "Save Your Design" button to expand the save menu and then click the "Save .SPT File" button.
Note: Your filename must include the extension ".spt" to save properly. Also note that depending on your browser, you may need to press the "OK" button twice to initiate the save/download.
You can also save several other file formats from the JUDE Web Page, each with a specific application:
Image (.jpg) Files are static 2D images of your molecule design, perfect for use in PowerPoint, Word, posters or embedding in an email. To save a .jpg file, click the orange "Save your Design" button to expand the save menu, and then click the "Save .JPG File" button.
3D Printing (.stl) Files are the standard input filetype for most low cost plastic 3D printers, and will allow you to 3D print your own molecule designs! To save a .stl file, click the orange "Save your Design" button to expand the save menu, and then click the "Save .STL File" button.
Note: .STL files exported for 3D printing can be quite large. If your browser freezes or you receive an error when attempting to export an STL file from the JUDE Web Page, click on the section below for a work-around.
Java-Based Jmol: Saving Large STL FilesCommand History (.txt) Files are a simple text list of the commands you have entered into the Jmol User Design Environment. These can be useful in tracking your design progress, or for use in the CBM's Jmol Exploration Web Page Creator. To save a History .txt file, click the orange "Save your Design" button to expand the save menu, and then click the "Save History .TXT File" button.
To reload a previously saved .spt file, simply drag the file from your local computer into the Jmol Display Window in the JUDE Web Page. This will bring up your design in the Jmol Display window and allow you to continue editing.
This same drag-and-drop technique will also work if you are using the Java-based version of Jmol.
At this point, you should be able to access Jmol through the Jmol User Design Environment (JUDE), load SMILES and Protein Data Bank (.pdb) Files, and save and reload your work.
But that is just the beginning. . . The real power of Jmol lies in manipulating the display of your molecule. In the next section, Simplifying Your Protein, you will learn how to explore a molecular structure and view the backbone of a protein.