Researchers from the University of Virginia have developed an AI-driven approach to explore structural similarities and relationships across the protein universe. Their study, published in Nature Communications, challenges the conventional notions about protein structure relationships and identifies many faint relationships missed by traditional methods. The team, led by Phil Bourne, dean of the School of Data Science, has created a computational framework called DeepUrfold that combines deep learning and a new conceptual model, the Urfold, to detect and quantify these protein relationships at scale.

Uncovering Faint Protein Relationships
The traditional approach to classifying proteins into separate, non-overlapping bins has been challenged by the researchers at the University of Virginia. Their AI-driven framework, DeepUrfold, views protein relationships in terms of “communities” and identifies faint structural relationships across the protein universe that were previously missed.
This new methodology allows researchers to move beyond thinking of protein similarities in static, geometric terms and toward a more integrated approach. The study’s findings could have far-reaching implications for our understanding of protein structure and evolution, as well as the development of new therapeutic strategies and biotechnological applications.
Pioneering AI Advances in Structural Bioinformatics
The researchers behind this groundbreaking work are no strangers to scientific breakthroughs. Phil Bourne, the founding dean of the School of Data Science, is renowned for his contributions to the field of structural bioinformatics and computational biology. Earlier in his career, Bourne co-led the development of the RCSB Protein Data Bank, a comprehensive repository of protein structure information that has been instrumental in driving the field forward and paving the way for advancements like AlphaFold.
The team also includes Cam Mura, a senior scientist with the School of Data Science and the Department of Biomedical Engineering, who has an extensive background in structural and computational biology, as well as Eli Draizen, a recent UVA alumnus and current postdoctoral scholar in computational biology at the University of California, San Francisco.
The Urfold Approach: Redefining Protein Structure Relationships
At the heart of this study is the Urfold, a new conceptual model that allows for two proteins to exhibit architectural similarity despite having differing topologies or “folds.” This novel approach challenges the traditional way of thinking about protein structure relationships and enables the DeepUrfold framework to detect and quantify these faint relationships at an unprecedented scale.
The authors’ computational framework combines deep learning-based techniques with the Urfold model, allowing them to capture and describe these distant relationships between proteins that were previously considered unrelated. By viewing protein relationships in terms of “communities” rather than separate, non-overlapping bins, the researchers have opened up new avenues for exploring the intricate web of protein structure and evolution.