Cracking the Code of Health: How Primates Unlock the Secrets of Human Genes

The completion of the Human Genome Project transpired on April 14, 2003. During that epoch, besides humanity, researchers had solely decoded the complete genomes of three mammals: murine creatures, rodents, and our closest cousins, chimpanzees, thus hindering the formulation of a comprehensive lateral comparison. Scientists discerned that sans such a horizontal juxtaposition, deriving substantive deductions solely from the human genome sequence proved arduous. Consequently, over the subsequent two decades, scientists systematically unraveled the entire genomes of hundreds of mammals, eventually drawing certain statistically significant inferences.

To amalgamate research endeavors across diverse nations and augment operational efficacy, over 150 scholars hailing from 30 scientific bastions worldwide forged a multinational coalition, embarking collectively on a scholarly venture christened Zoonomia. This initiative converged the genomes of 241 mammals, encompassing Homo sapiens, to inaugurate a unified repository. Encompassing 80% of extant mammalian species, this repository essentially encapsulates the entire gamut of mammalian taxa.

The publication of “Science” magazine on April 27, 2023, disseminated 11 papers penned by Zoonomia project luminaries, elucidating the strides of this endeavor. Among them, one paper postulated that placental mammals commenced diversification during the epoch of dinosaurs. Another meticulously juxtaposed the gene sequences of the aforementioned 241 genomes, revealing complete identity in 10.2% of the gene sequences, indicative of the paramount significance of this genetic subset, wherein any mutation portended fatality. Subsequent inquiry unveiled the preponderance of these DNA sequences outside the protein coding realm, falling within the rubric of regulatory genes. The precise functionalities of approximately half of these regulatory genes remain enigmatic, positing them as a veritable “gold mine” for prospective genetic exploration.

Yet, the object of utmost intrigue pertains incontrovertibly to the human genome, particularly its nexus with human maladies. Historically, this conundrum primarily underwent scrutiny via genome-wide association analysis (GWAS). This methodology singularly fixated on the human genome itself, discerning disparities between healthy individuals and patients. However, this modus operandi merely established correlations between genes and afflictions, precluding the substantiation of causal relationships. Given the involvement of myriad genetic loci in numerous maladies, pinpointing the decisive disparity remained elusive.

Contrarily, the Zoonomia project adopts a divergent approach, endeavoring to ascertain the gene most predisposed to malady by scrutinizing the evolutionary genetic annals of mammals. Upon computational analysis, researchers deduced the comparative viability of this research paradigm vis-à-vis the GWAS methodology, affirming its diminished research exigency by at least one magnitude.

The GWAS methodology proves excessively rudimentary, precluding discernment of the repercussions ensuing from a particular gene mutation. Although the cost of DNA sequencing plummets, potentially enabling millions worldwide to undergo genome sequencing, the import of a genetic mutation remains inscrutable. The dearth of comprehension regarding the correlation between a protein’s amino acid sequence and its physiological function precludes prognostication of health ramifications attributable solely to genetic mutation.

In redress, Kyle Fahey, an artificial intelligence savant affiliated with Illumina, a renowned DNA sequencing entity, conceived a sterling stratagem. Citing the evolutionary proximity between humans and primates, Fahey postulated near homology between human and primate proteins. Ergo, the occurrence of a particular amino acid variance in humans, mirrored in other primates, augurs benignancy, having withstood evolutionary vicissitudes. By extrapolation, the comprehensive sequencing of extant primate genomes, juxtaposed with the human genome, affords a colossal repository delineating genetic congruences and disparities. Subsequent integration with three-dimensional protein structure data within an artificial intelligence neural network facilitates discernment of the benign or malignant nature of newfound genetic mutations.

Fahey proffered this concept in 2018, notwithstanding the scant sequencing of the primate genome. Undeterred, Fahey collaborated with Thomas Marquez Bonnet, a geneticist at the University of Pompeu Fabra in Spain. Leveraging Bonnet’s extensive primate genome repertoire, Fahey conducted preliminary investigations, culminating in a paper published in “Nature Genetics” on July 23, 2018.

This treatise corroborated the feasibility of the aforementioned notion, albeit bemoaning the inadequacy of the primate genome database. Bonnet, in turn, rallied colleagues worldwide, amassing the genome sequences of 233 primate specimens from 24 nations, encompassing all 16 families of the Primates order, thereby encapsulating the panoply of primate diversity.

Armed with this exhaustive primate genome repository, scientists could transcend erstwhile limitations, as evidenced by the publication of eight papers in “Science” magazine on June 2, 2023. These papers, crafted by geneticists worldwide, harnessed the primate genome database to interrogate diverse facets pertinent to humanity. Foremost among them, Bonnet and Fahey’s collaboration identified 4.3 million common missense mutations shared by humans and primates, whereupon scrutiny against extant genetic pathology databases revealed a benignity rate of 98.7%, evincing remarkable concordance.

Compared to relying solely on the mammalian genome sequence repository, wherein the benignity rate of genetic mutations dwindles to 87%, this disparity underscores the efficacy of the primate-centric approach. In essence, this research bequeaths a superlative gene annotation database to artificial intelligence systems, augmenting data volume by over 50-fold. Geneticists prognosticate that henceforth, this augmented database, in concert with three-dimensional protein structure data, will enable the precise delineation of the benign or malignant nature of any novel genetic mutation, thereby unveiling inscrutable facets of human health.

error: Content is protected !!