5 Tweets 366 reads Jun 20, 2022
Protein structure can be predicted with high accuracy from a single sequence.
The @HelixonBio team has developed OmegaFold, which achieves performance similar to the MSA-based versions of RoseTTAFold (RF) and AlphaFold2 (AF2) while taking only a single sequence as input. 1/5
OmegaFold leverages a protein language model and a new Geoformer module to enforce geometric consistency among single/pair embeddings. 2/5
The protein language model (PLM) first learns evolutionary context implicitly from large unaligned datasets. With new training techniques, we found improved long-range contact prediction. 3/5
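For readers unfamiliar with how long-range contact prediction is usually scored, here is a minimal sketch. It is not taken from OmegaFold. It assumes the common convention that a contact is a residue pair closer than 8 Å and that "long-range" means a sequence separation of at least 24; the function name and parameters are hypothetical.

```python
import numpy as np

def long_range_precision(pred_prob, true_dist, top_k, min_sep=24, cutoff=8.0):
    """Precision of the top_k highest-scoring long-range pairs.

    pred_prob: (L, L) predicted contact probabilities.
    true_dist: (L, L) reference inter-residue distances in angstroms.
    min_sep and cutoff are assumed conventions, not values from the thread.
    """
    p = np.asarray(pred_prob, dtype=float)
    d = np.asarray(true_dist, dtype=float)
    # Upper-triangle pairs (i, j) with j - i >= min_sep.
    i, j = np.triu_indices(p.shape[0], k=min_sep)
    if i.size == 0:
        raise ValueError("sequence too short for any long-range pair")
    # Rank pairs by predicted probability, highest first.
    order = np.argsort(-p[i, j])[:top_k]
    hits = d[i, j][order] < cutoff
    return float(hits.mean())
```

A common choice for `top_k` is the sequence length L or a fraction of it.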
However, the learned embeddings (and the predicted distogram) are usually not geometrically valid. Geoformer updates the pair and single embeddings to make them more geometrically consistent. It works well on proteins such as antibodies, where no evolutionary info is available. 4/5
With no need to search for MSAs, OmegaFold (OF) is also super fast: predicting a normal-size protein takes only a few seconds.
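One way to picture "not geometrically valid": distances between real 3D points always satisfy the triangle inequality, while independently predicted pairwise distances need not. The sketch below only counts such violations in a distance matrix. It is an illustration under that assumption, not the Geoformer itself, and the function name is hypothetical.

```python
import numpy as np

def triangle_violations(dist, tol=1e-6):
    """Count ordered triples (i, j, k) with dist[i, k] > dist[i, j] + dist[j, k] + tol."""
    d = np.asarray(dist, dtype=float)
    # via[i, j, k] = d[i, j] + d[j, k]: path length from i to k through j.
    via = d[:, :, None] + d[None, :, :]
    # direct[i, j, k] = d[i, k], broadcast over the middle index j.
    direct = d[:, None, :]
    return int(np.sum(direct > via + tol))
```

A matrix computed from actual coordinates gives zero violations; a predicted matrix with many violations cannot correspond to any 3D structure. Memory grows as L^3, so this is only suitable for short sequences.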
Preprint & code will be released soon. Just the pilot episode. Stay tuned for more. 5/5
