When genetic testing reveals a rare DNA mutation, doctors and patients are frequently left in the dark about what it actually means. Now, researchers have developed a powerful new way to determine whether a patient with a mutation is likely to actually develop disease, a concept known in genetics as penetrance.
The team set out to solve this problem using artificial intelligence (AI) and routine lab tests like cholesterol, blood counts, and kidney function. Details of the findings were reported in the journal Science. Their new method combines machine learning with electronic health records to offer a more accurate, data-driven view of genetic risk.
Traditional genetic studies often rely on a simple yes/no diagnosis to classify patients. But many diseases, like high blood pressure, diabetes, or cancer, don’t fit neatly into binary categories. The researchers trained AI models to quantify disease on a spectrum, offering more nuanced insight into how disease risk plays out in real life.
Using more than 1 million electronic health records, the researchers built AI models for 10 common diseases. They then applied these models to people known to have rare genetic variants, generating a score between 0 and 1 that reflects the likelihood of developing the disease.
A higher score, closer to 1, suggests a variant may be more likely to contribute to disease, while a lower score indicates minimal or no risk. The team calculated “ML penetrance” scores for more than 1,600 genetic variants.
Some of the results were surprising, say the investigators. Variants previously labeled as “uncertain” showed clear disease signals, while others thought to cause disease had little effect in real-world data.