Sessions

21 – Machine Learning, Data Democratization and My Generation’s Future

Machine learning, for good or ill, has the potential to fundamentally alter nearly every aspect of my generation’s lives as we move forward. The question is, what can be done to influence the way machine learning enters our lives to make sure the effects are as positive as possible? For my generation, getting access to the resources necessary to experiment with machine learning is a challenging task. Much of the world’s data is locked up in data silos that only large organizations can use. However, when data is made available, new possibilities emerge.

I will discuss how NIH’s ClinVar repository has paved the way for Variantexplorer.org. While paying only hosting fees (and having a great deal of support and advice from many generous people), I was able to construct a site that summaries cross laboratory genetic variant classification conflicts. In addition to describing how the site works, I will share the challenges I faced constructing it and share my views on steps that can be taken to enhance the usability of data that has already been published. In particular, I will provide an update on my progress in repurposing the ClinVar data file in a manner intended to facilitate machine learning. I greatly appreciate this opportunity to address the HAS19 audience and share my views on how this audience could fundamentally empower my generation, as we come of age, to profoundly improve healthcare for all.