Google just announced that its Artificial Intelligence algorithm, AlphaFold, made a major advance in protein foldinga. To help make sense of this, I am interviewingb Pierre Baldic (h-index: 106d), one of the leading AI scientists in the world, and also my PhD advisor back in the day. Pierre is a Distinguished Professor at UC Irvine, and has been using AI to solve scientific problems for decades. He literally wrote the textbooke I used in graduate school, and his Deep Learning in Sciencef book is due to be published March, 2021.

What is Deep Learning?

Figure from

this paperg at PNAS. Deep Learning networks are often depicted in this way, with information flowing through nodes in a network to make a computation..

AlphaFold is a type of Deep Learning algorithm, a neural network that learns from data. That “learning” ability is one reason it is called Artificial Intelligence. Deep Learning can be used in many ways to address many tasks. In science, one task we care about is protein folding.

What is Protein Folding?

So what is “protein folding”? We want to know the three-dimensional structure of proteins from the amino acid sequence. Proteins are defined by a sequence of letters, where each letter corresponds to a different building blockh, like this:

VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR

The task of protein folding is to translate this sequence of building blocks into a three dimensional structure, like this:

The

protein structure of hemoglobini, in the style of David Goodsell. This is the structure encoded in the the amino acid sequence above.

That structure becomes important. It can give important insight into how the protein functions. In this case, we are look at hemoglobin, the oxygen carrying protein in our blood. Shifts in the structure when it binds oxygen are important to its function.

The shift in structure is key to how hemoglobin carries oxygen in our blood.

There are several experimental techniques to determine protein structure: X-Ray crystallographyj, NMRk, and cryo-EMl. These techniques have been used for decades, and produced thousands of protein structures that are stored in the Protein Data Bankm. However, they are expensive and slow, and they often do not work.

So, could we build a computer program that could look at all the structures solved so far, and learn patterns from it, so that it could now predict the structure of a new protein? That is the basic challenge and opportunity of protein folding. As simple as that sounds, it turns out to be a very hard problem, for a whole host of reasons. But AI algorithms are proving to be particularly effective at this question

With AlphaFold’s announcement, is protein folding a solved problem now? This tweet from a scientists nails the answer. “Protein folding is not a solved problem.n” Still, this is exciting. What exactly does it mean?

How is Deep Learning Impacting Science?

This is just one more example, on a growing list, of where Deep Learning is making a large impact on how scientists conduct their work. Pierre Baldi has been applying deep learning to solve scientific problems for decades. We will be discussing how Deep Learning is coming of age, and how it is shaping science in many ways.

References

  1. https://deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology
  2. https://www.youtube.com/watch?v=0Y2kQf54tpM
  3. http://www.igb.uci.edu/~pfbaldi/
  4. https://scholar.google.com/citations?user=RhFhIIgAAAAJ&hl=en
  5. https://www.amazon.com/Bioinformatics-Learning-Approach-Adaptive-Computation/dp/026202506X?tag=swamidass-20
  6. https://amazon.com/dp/0618352104
  7. https://www.pnas.org/content/116/4/1074
  8. https://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties
  9. https://pdb101.rcsb.org/motm/41
  10. https://en.wikipedia.org/wiki/X-ray_crystallography
  11. https://en.wikipedia.org/wiki/Nuclear_magnetic_resonance_spectroscopy_of_proteins
  12. https://en.wikipedia.org/wiki/Cryogenic_electron_microscopy
  13. https://www.rcsb.org/
  14. https://twitter.com/lpachter/status/1333702159850893312?s=20