  {"id":32760,"date":"2018-11-13T16:26:54","date_gmt":"2018-11-13T21:26:54","guid":{"rendered":"https:\/\/digital.hbs.edu\/platform-rctom\/submission\/23-and-who-deep-research-into-our-genetic-code\/"},"modified":"2018-11-14T01:33:58","modified_gmt":"2018-11-14T06:33:58","slug":"23-who-deep-research-into-our-genetic-code","status":"publish","type":"hck-submission","link":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/","title":{"rendered":"23 &amp; Who?: Deep Research into our Genetic Code"},"content":{"rendered":"<p><span style=\"font-weight: 400\"><br \/>\nAdvances in genomics research have ushered in a new era of analysis into our physiological and behavioral traits, uncovering insights about our identity and its determinants. This research links our phenotype (observable characteristics) with our genotype (genetic composition) in hopes of identifying common attributes that allow us to better understand and predict differences in our personality and health. The human genome, however, is remarkably large &#8211; it contains three billion base pairs of genetic material, which if written out, would fill over 200 New York City telephone books (averaging 1000 pages each)\u00a0<\/span><span style=\"font-weight: 400\"><sup>1<\/sup><\/span><span style=\"font-weight: 400\">. It is also remarkably complex: \u00a0there are roughly ~20,000 genes and even more regions that control how these genes are expressed. Recently discovered techniques in machine learning offer a new opportunity to understand genetic differences at scale for the first time, providing companies like 23&amp;Me the chance to drive consumer applications in precision medicine and genetic lifestyle analysis.<\/span><\/p>\n<p><b><br \/>\nThe Big Problem:<\/b><\/p>\n<p><span style=\"font-weight: 400\">The Human Genome Project, completed in 2003, offered our first mapping of the human genome<\/span><span style=\"font-weight: 400\">, identifying all of the 20,500 genes in human DNA, and determining the sequences of its 3 billion chemical base pairs\u00a0<sup>2<\/sup>. However, decades after its completion, scientists are still looking to establish more causal relationships between genotype and phenotype traits. Why the lacking developments? Two primary factors hinder our progress: \u00a0scale and decoding complexity. To the former, small variations in genes and regulatory regions are ultimately what make each of us unique (and, unfortunately, result in disease). These small variations are very hard to detect within such a large data set. To the latter, manual genome sequencing is still very error prone and may lead to misread DNA elements.<\/span><\/p>\n<p><b><br \/>\nThe Cluster of Solutions:<\/b><\/p>\n<p><span style=\"font-weight: 400\">To tackle these issues, 23&amp;Me has used statistical techniques such as deep neural nets to understand correlations in genetics data, generate hypotheses for clinical validation, and standardize the genome reconstruction process. This is primarily done in two ways:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Identify connections between genotypic and phenotypic patterns using unsupervised machine learning, a technique that clusters genes by their expression in cells and tissues.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Improve sequencing methodologies by transforming the genome reconstruction problem into an image classification problem, which results in greater speed and accuracy\u00a0<\/span><span style=\"font-weight: 400\"><sup>3<\/sup><\/span><span style=\"font-weight: 400\">.<\/span><\/li>\n<\/ol>\n<p>Unsupervised machine learning clusters individuals within an unlabeled data set, indicating key traits that are consistent and relevant to the genetic factors driving our susceptibility to disease. 23&amp;Me applies this method to ancestry data as well, mapping the locations in your genome that carry small amounts of information about your family\u2019s geographic history.<\/p>\n<figure id=\"attachment_32746\" aria-describedby=\"caption-attachment-32746\" style=\"width: 500px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-32746 size-large\" src=\"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM-1024x592.png\" alt=\"23&amp;Me Ancestry History\" width=\"500\" height=\"290\" srcset=\"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM-1024x592.png 1024w, https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM-300x173.png 300w, https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM-768x444.png 768w, https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/Screen-Shot-2018-11-13-at-12.37.49-PM-600x347.png 600w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/a><figcaption id=\"caption-attachment-32746\" class=\"wp-caption-text\"><em>23&amp;Me Ancestry History<\/em><\/figcaption><\/figure>\n<p><span style=\"font-weight: 400\">Accordingly, their use of machine learning to improve sequencing methodologies has enabled improved accuracy and innovation. The genome sequencing process begins with high-throughput sequencing (HTS), a method that produces ~1 billion short sequences of bases, but are unfortunately not organized into a human-recognizable genome sequence <\/span><span style=\"font-weight: 400\"><sup>4<\/sup><\/span><span style=\"font-weight: 400\">. Deep learning methods facilitate a true genome sequence (guanine, cytosine, adenine and thymine organized into 23 pairs of chromosomes) from HTS sequencer data with significantly greater accuracy than previous classical methods<\/span><span style=\"font-weight: 400\"> <sup>5<\/sup><\/span><span style=\"font-weight: 400\">. <\/span><\/p>\n<p><b><br \/>\nOur Genetic Future:<\/b><\/p>\n<p><span style=\"font-weight: 400\">Genomic applications in machine learning have enabled a wave of research and consumer innovation towards understanding how our DNA influences who we are. However, genetic makeup is only one piece of our map of human physiology. The next wave of innovation will arise from combining this data with other clinical research to build a true picture of health and disease. These partnerships can uncover additional findings, including how our phenotype expression changes within varying environmental conditions (e.g. lack of sleep, high levels of exercise) through comparison with other clinical measurements. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Machine learning applications in genetics research can produce generative innovations as well. By learning the characteristics of genotypes that produce desired phenotype expressions, we may begin to edit and change our own genomes to reduce our susceptibility to disease.\u00a0<\/span><\/p>\n<p><b><br \/>\nGenetic Philosophy:<\/b><\/p>\n<p><span style=\"font-weight: 400\">While ripe with promise, machine learning applications in genetics carry deep, philosophical questions about the nature of privacy, data sharing, and the human condition. Critics push back on data sharing and commercial opportunities\u00a0 &#8211; should we allow 23&amp;Me to negotiate contractual agreements to share data with 3rd parties? And should they be able to profit from their genetic analysis\u00a0<\/span><span style=\"font-weight: 400\"><sup>6<\/sup><\/span><span style=\"font-weight: 400\">?\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Perhaps more broadly, these applications provoke questions about what makes us human. If we are able to sequence and modify our own genome to optimize for desired traits, should we? Are we sacrificing our morality to improve our material wealth? Genomics research can answer many questions about our physiology, but these questions will require a deeper philosophical reflection into the essence of our humanity and values.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><em>(798 words)<\/em><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><strong>References:<\/strong><\/p>\n<p><span style=\"font-weight: 400\">[1] \u00a0&#8220;NOVA Online | Cracking The Code Of Life | Genome Facts&#8221;. 2018. <\/span><i><span style=\"font-weight: 400\">Pbs.Org<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/www.pbs.org\/wgbh\/nova\/genome\/facts.html.<\/span><\/p>\n<p><span style=\"font-weight: 400\">[2] \u00a0&#8220;Human Genome Project Information&#8221;. 2018. <\/span><i><span style=\"font-weight: 400\">Web.Ornl.Gov<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/web.ornl.gov\/sci\/techresources\/Human_Genome\/index.shtml.<\/span><\/p>\n<p><span style=\"font-weight: 400\">[3] \u00a02018. <\/span><i><span style=\"font-weight: 400\">Biorxiv.Org<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/www.biorxiv.org\/content\/biorxiv\/early\/2016\/12\/14\/092890.full.pdf.<\/span><\/p>\n<p><span style=\"font-weight: 400\">[4] \u00a0Thangaraj, Andrew. 2018. &#8220;Evaluating Deepvariant: A New Deep Learning Variant Caller From The Google Brain Team&#8221;. <\/span><i><span style=\"font-weight: 400\">Inside Dnanexus<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/blog.dnanexus.com\/2017-12-05-evaluating-deepvariant-googles-machine-learning-variant-caller\/.<\/span><\/p>\n<p><span style=\"font-weight: 400\">[5] \u00a02018. <\/span><i><span style=\"font-weight: 400\">Permalinks.23Andme.Com<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/permalinks.23andme.com\/pdf\/23_17-GeneticWeight_Feb2017.pdf.<\/span><\/p>\n<p><span style=\"font-weight: 400\">[6] \u00a0Brodwin, Erin. 2018. &#8220;DNA-Testing Company 23Andme Has Signed A $300 Million Deal With A Drug Giant. Here&#8217;s How To Delete Your Data If That Freaks You Out.&#8221;. <\/span><i><span style=\"font-weight: 400\">Business Insider<\/span><\/i><span style=\"font-weight: 400\">. https:\/\/www.businessinsider.com\/dna-testing-delete-your-data-23andme-ancestry-2018-7.<\/span><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How can machine learning answer the centuries-old question of who we really are? \u00a0<\/p>\n","protected":false},"author":11864,"featured_media":33121,"comment_status":"open","ping_status":"closed","template":"","categories":[2462,2215,208,346,4778],"class_list":["post-32760","hck-submission","type-hck-submission","status-publish","has-post-thumbnail","hentry","category-deep-learning","category-genetics","category-health","category-machine-learning","category-unsupervised-learning","hck-taxonomy-industry-biotechnology","hck-taxonomy-country-united-states"],"connected_submission_link":"https:\/\/d3.harvard.edu\/platform-rctom\/assignment\/rc-tom-challenge-2018\/","yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management\" \/>\n<meta property=\"og:description\" content=\"How can machine learning answer the centuries-old question of who we really are? \u00a0\" \/>\n<meta property=\"og:url\" content=\"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/\" \/>\n<meta property=\"og:site_name\" content=\"Technology and Operations Management\" \/>\n<meta property=\"article:modified_time\" content=\"2018-11-14T06:33:58+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/GenomeTesting-468428788-Converted-5.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2548\" \/>\n\t<meta property=\"og:image:height\" content=\"898\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/\",\"name\":\"23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/GenomeTesting-468428788-Converted-5.jpg\",\"datePublished\":\"2018-11-13T21:26:54+00:00\",\"dateModified\":\"2018-11-14T06:33:58+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/#primaryimage\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/GenomeTesting-468428788-Converted-5.jpg\",\"contentUrl\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/GenomeTesting-468428788-Converted-5.jpg\",\"width\":2548,\"height\":898},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/23-who-deep-research-into-our-genetic-code\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Submissions\",\"item\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"23 &amp; Who?: Deep Research into our Genetic Code\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/#website\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/\",\"name\":\"Technology and Operations Management\",\"description\":\"MBA Student Perspectives\",\"potentialAction\":[{\"@type\":\"性视界Action\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/","og_locale":"en_US","og_type":"article","og_title":"23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management","og_description":"How can machine learning answer the centuries-old question of who we really are? \u00a0","og_url":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/","og_site_name":"Technology and Operations Management","article_modified_time":"2018-11-14T06:33:58+00:00","og_image":[{"width":2548,"height":898,"url":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/GenomeTesting-468428788-Converted-5.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/","url":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/","name":"23 &amp; Who?: Deep Research into our Genetic Code - Technology and Operations Management","isPartOf":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/#website"},"primaryImageOfPage":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/#primaryimage"},"image":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/#primaryimage"},"thumbnailUrl":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/GenomeTesting-468428788-Converted-5.jpg","datePublished":"2018-11-13T21:26:54+00:00","dateModified":"2018-11-14T06:33:58+00:00","breadcrumb":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/#primaryimage","url":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/GenomeTesting-468428788-Converted-5.jpg","contentUrl":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/GenomeTesting-468428788-Converted-5.jpg","width":2548,"height":898},{"@type":"BreadcrumbList","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/23-who-deep-research-into-our-genetic-code\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/d3.harvard.edu\/platform-rctom\/"},{"@type":"ListItem","position":2,"name":"Submissions","item":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/"},{"@type":"ListItem","position":3,"name":"23 &amp; Who?: Deep Research into our Genetic Code"}]},{"@type":"WebSite","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/#website","url":"https:\/\/d3.harvard.edu\/platform-rctom\/","name":"Technology and Operations Management","description":"MBA Student Perspectives","potentialAction":[{"@type":"性视界Action","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/d3.harvard.edu\/platform-rctom\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission\/32760","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission"}],"about":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/types\/hck-submission"}],"author":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/users\/11864"}],"replies":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/comments?post=32760"}],"version-history":[{"count":0,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission\/32760\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/media\/33121"}],"wp:attachment":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/media?parent=32760"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/categories?post=32760"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}