Skip to Main Content

The National Institutes of Health wants your DNA, and the DNA of 1 million other Americans, for an ambitious project called All of Us. Its goal — to “uncover paths toward delivering precision medicine” — is a good one. But until it can safeguard participants’ genetic privacy, you should decline the invitation to join unless you fully understand and accept the risks.

DNA databases like All of Us could provide valuable medical breakthroughs such as identifying new disease risk factors and potential drug targets. But these benefits could come with a high price: increased risk to individuals’ genetic data privacy, something that current U.S. laws do not adequately protect.

This month, the NIH announced it was throwing open the doors to enrollment in All of Us. This comes at a time when genetic and data privacy is in the public eye. In late April, California police caught the alleged “Golden State Killer” by using an online DNA database called GEDmatch. In mid-May, the same database was used to help solve a double homicide committed in Washington state in 1987. Earlier this year, the Cambridge Analytica scandal broke, showing that a data analytics firm collected private information on up to 87 million Facebook users. Notwithstanding the benefits to law enforcement, these and other revelations are eroding public trust in social media and genealogy websites like 23andMe and Without trust, it will be difficult for programs like the All of Us initiative to succeed.


Far more complex than fingerprints, a genetic profile is the single most identifiable characteristic an individual has. Such profiles contain a treasure trove of information about individuals and their health, such as predispositions for cancer, neurodegenerative disease, and mental illness. It’s not only genetic data that the All of Us Program will obtain. The project aims to collect biospecimens and data about donors’ medical histories, lifestyles, families, and psychological health. It will also solicit data from wearables like Fitbits and Apple Watches.

Our current health privacy laws were created before genetic privacy became an issue, and they don’t adequately protect it. For example, the Health Insurance Portability and Accountability Act (HIPAA), the primary U.S. health privacy law, does not apply to companies like GEDmatch, 23andMe, or Nor does it apply to the All of Us program, its corporate partners, or new forms of medical data gathered from sources like websites, apps, and wearables.


HIPAA applies only to what it calls “covered entities” — individuals and organizations traditionally associated with health care, such as doctors, hospitals, insurance companies, and their business associates. HIPAA holds covered entities to a high standard of care, requiring that they maintain the confidentiality of patient data and penalizing them in the event of a data breach. In many cases, the All of Us program may have more sensitive information about you than your doctor. But the program is not your physician and is not subject to the duties imposed on health care providers by HIPAA and other regulations such as state medical licensing laws.

Beyond HIPAA, few laws prohibit police from accessing genetic data stored in public or private databases. There are even fewer restrictions on government access to genetic data if national security is at risk. That means contributing DNA to genealogy services could expose users and their families to law enforcement scrutiny. For example, if your relative’s DNA is found at a crime scene, you could be dragged into an investigation due to your kinship. Even a distant relative’s data could provide probable cause for law enforcement to conduct a search or interrogation.

Theft is also a concern. Consider what could happen if hackers stole genetic data from the All of Us database or a consumer genealogy site. Once it has been disseminated, it would be impossible to retrieve and conceal again. Hackers could hold the data for ransom or sell it to third parties such as data brokers or unscrupulous employers.

DNA databases can also be sold. In 1997, Iceland and a company called DeCODE Genetics launched a national database, which now contains DNA from nearly half the Icelandic population. In 2012, the pharmaceutical company Amgen bought the database and now profits from the insights gleaned from it. All of Us will share data with corporate partners, including Verily, Google’s life sciences division.

We urge legislators to consider expanding HIPAA’s definition of covered entities to include app developers, websites, and other companies that collect and analyze health data, including genetic information. In 2014, California passed a law that treats all companies that handle medical information as health care providers under the state’s Confidentiality of Medical Information Act. To protect consumer genetic data, other states could follow California’s lead. Even better, Congress could amend HIPAA to bring all companies that handle health data into its definition of covered entities. In that case, genealogy sites would be required to use the same privacy standards as doctors and hospitals.

At the same time, organizations and companies that collect genetic information, from 23andMe to the NIH, must be clearer and more straightforward about conveying how they protect individuals’ genetic data. Current contract law governs the protection of genetic data through the agreements users sign when providing genetic information. These agreements are notoriously vague and difficult to understand, and they can give companies nearly limitless rights to use an individual’s genetic information — and to change their policies at any time without consent. To protect consumer privacy, genetic testing entities must create privacy policies that prioritize clarity. They should also allow users to opt-out of data sharing that does not directly benefit the public or contribute to user test results.

To create laws that better govern both public and private genetic data collection and DNA databases, courts and lawmakers should draw from the concept of information fiduciaries, coined by Jack Balkin, a professor at Yale Law School, to describe the special relationship of trust between consumers and entities that collect their personal or sensitive information. Treating DNA databases like information fiduciaries would impose on them legal duties of care, confidentiality, and loyalty. They would be obliged to act reasonably toward users, safeguard their data, and avoid conflicts of interest that could exploit them.

Public and private DNA databases have the potential to produce great social benefits, from improving cancer treatment to catching serial killers. But these benefits shouldn’t come at the risk of exposing individuals’ private genetic data. Before donating their DNA to private or public databases, individuals should ask hard questions — and read the fine print — to make sure their genetic information will remain private and protected.

Erosion of trust will discourage people from contributing their DNA and diminish the scientific benefits of programs like All of Us. To benefit fully from DNA databases, we must create and enforce fair industry standards, reform existing health laws to account for new sources of medical data, and create laws that protect genetic privacy rights.

Mason Marks, M.D., and Tiffany Li, J.D. are fellows at Yale Law School’s Information Society Project. Li also directs the Wikimedia/Yale Law School Initiative on Intermediaries and Information.

  • “In that case, genealogy sites would be required to use the same privacy standards as doctors and hospitals.”

    That’s a bold proposal but you might consider broadly consulting with the genetic genealogy community before pushing hard for this. In some cases, this could force some genealogy websites to shut down with little or no privacy benefit.

Comments are closed.