
Scientists work with proteins just as they work with metals and other inorganic materials, designing new substances with enhanced properties, such as the ability to survive at high temperatures. But doing so involves sorting through the nearly endless possible ways to rearrange a protein’s components, called amino acids — an extremely time-consuming and computer-intensive task.
By applying a computational technique for alloy design called cluster expansion, the MIT researchers and their colleagues were able to search through potential amino acid configurations up to 100 million times faster than with conventional techniques. The work could prove useful in many fields, including medicine and biotechnology, which stand to benefit from the revamped proteins with superior properties.
“In an alloyed material, there are a number of different elements — such as nickel, chromium and iron in stainless steel — which are arranged on a lattice of sites,” [Assistant Professor Dane] Morgan said. “It’s the same thing with a protein: You have different amino acids occupying various sites in the protein backbone.” The trick is determining exactly how to reshuffle these components to enhance the properties of the larger structure, whether it’s a metal or a biological molecule.
Each protein’s function depends on its unique three-dimensional structure, which, in turn, rests on the molecule’s specific linear chain, or sequence, of 20 different amino acids. In protein design, scientists start with a protein of known sequence, structure and function — such as an industrial enzyme that breaks up grime. They then hunt for modified amino acid arrangements that augment the molecule’s natural function.
Beginning with an existing protein structure does reduce the number of possible ways to rearrange the amino acids, but the numbers are still mind-boggling.
“For a sequence of just 100 different amino acids, you have 20100 possible combinations,” Morgan said. “My guess is this is more than the number of atoms in the universe.”
“Cluster expansion breaks the design problem into meaningful pieces that you can get your brain and your computer around,” Morgan said.
To demonstrate the ability of cluster expansion to manage this complexity, the team focused on protein stability. Highly stable proteins fold into tight three-dimensional structures; less stable ones tend to fall apart. Because a protein’s stability relates to its energy in the folded, 3-D state, scientists can calculate an energy term to predict whether a particular amino acid sequence will adopt a robust structure.
Cluster expansion breaks an amino acid sequence into small subclusters, consisting of one, two, three or more amino acids. An effective energy term is then determined for every possible amino acid sequence within each subcluster. Once the energies of each subcluster are known, they can be quickly added to give the energy of the entire sequence.
When the team applied the method to two well-known proteins, they found it calculated amino acid sequence energies that matched well with those computed by an established technique. The difference? The traditional technique needed more than three minutes to make each energy calculation while cluster expansion took just a microsecond.
[Assistant Professor Amy] Keating is excited about the potential contribution of cluster expansion to biology. “It may make some genomic studies that we have been dreaming about feasible whereas they really weren’t before.”
I figured I’d drop in a little hard science for the weekend. If you wish to see something natural and beautiful, Google “metallography” then click on “Images”. Wander.














