  {"id":32089,"date":"2018-11-13T14:20:17","date_gmt":"2018-11-13T19:20:17","guid":{"rendered":"https:\/\/digital.hbs.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/"},"modified":"2018-11-13T14:27:31","modified_gmt":"2018-11-13T19:27:31","slug":"developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs","status":"publish","type":"hck-submission","link":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/","title":{"rendered":"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs"},"content":{"rendered":"<p>Few industries are as ubiquitous and digitized as banking, which sits on massive amounts of clean, structured, financial data. Yet by and large, banks have maintained the same business model for centuries. The opportunity for disruption has enabled startups to flood the sector: Stripe, Apple and Square are changing how we pay for things, while digital currencies, peer-to-peer lending, \u2018alternative-data\u2019 providers and pay-as-you-go insurance platforms are threatening the universal bank model. In this essay, I investigate how an investment bank like <em>Goldman Sachs<\/em> can harness the power of its data through machine learning (ML). Due to its breadth of revenue streams, Goldman has the ability to address almost any area of \u2018B2B\u2019 finance, from investment banking, investment management, securities and lending. Nevertheless, monetizing B2B client data will be difficult given strong data protection and privacy agreements. As a result, I see Goldman Sachs making a strategic pivot towards consumer finance, where data constraints are less stringent.<\/p>\n<p>In investment banking, strict NDA agreements limit the amount of proprietary data collected in any M&amp;A transaction. As a result, the next frontier of process innovation will come from incorporating \u201calternative data\u201d (e.g. credit card transactions, satellite images or weather forecasts) to develop proprietary insights on M&amp;A opportunities. An investment bank armed with such insights would become a more strategic advisor to its clients and grow its market share as a result. A recent FinTech unicorn, Dataminr, is using big data from a variety of non-traditional sources to develop such insights, but it is struggling to incorporate these into the habits of professional advisors (i.e. 59% of Dataminr\u2019s customers claim that \u2018workflow integration\u2019 is their biggest challenge). Given Goldman\u2019s relationships with professional investors, where it is strongly embedded in their investment workflow, it has all the structural advantages to monetize this market.<\/p>\n<p>When it comes to customer service, self-conversing bots are a major opportunity for ML to increase convenience and decrease SG&amp;A cost. In private wealth management, Goldman\u2019s clients are some of the wealthiest individuals in the world, but that is not a reason to overlook automation and bots. Using customer service bots would allow private bankers to spend more time with customers by automating basic support, data collection and automatic portfolio rebalancing based on customer requirement. Finally in lending, Goldman has proprietary data on millions of clients and their lending results. The potential of ML is to automate the underwriting process through stronger prediction algorithms that reduce bad debt expense, extend credit to higher-risk clients, and decrease administration costs.<\/p>\n<p>In the short and medium term, Goldman will continue to expand its internal center of excellence of ML, set up in early 2018. So far, it hired the head of ML from Amazon, acquired dozens of FinTech companies with consumer data, and replaced its infamous equity trading team with computer engineers (i.e. The 600 equity traders from the early 2000s are all gone). In trading, ML algorithms predict which clients might be interested in specific investments and send quotes in real time. To succeed in the long term however, Goldman has recognized that it needs to create more proprietary data. Indeed, most machine learning algorithms are open source today, and anyone can use them free of charge against public data. Consequently, Goldman is incubating several initiatives aimed at collecting data from consumers, who are often less reticent than institutions to share data. It acquired three consumer startups in 2018 and developed an open trading platform called Marquee, which will be released to retail investors in Q4 2018. In addition, the firm recently released a new consumer lending platform, Marcus, to help consumers consolidate their credit card balances under one contract. Both platforms are run entirely by software with no human intervention.<\/p>\n<p>For sustainable success in the next century, Goldman must not only develop proprietary insights from its proprietary data sets, but also incorporate these insights into the workflow of decision makers. Whereas machines are better at predicting very short term opportunities in reaction to tweets, earnings statements, or news, humans will maintain an edge in making long term predictions. As a result, humans will not be totally replaced and will need the ability to \u2018overrule\u2019 whatever the computer is saying.<\/p>\n<p>To conclude, I would like to share some open-ended questions for the industry. We have seen how banks like Goldman are developing new consumer services in view of getting usage data, but how can they similarly leverage their B2B data? In M&amp;A advisory for instance, Goldman mapped 146 distinct steps taken in any IPO, \u201csome of which are begging to be automated\u201d.<\/p>\n<p>Bibliography:<\/p>\n<ul>\n<li><u><a href=\"http:\/\/ezp-prod1.hul.harvard.edu\/login?url=http:\/\/search.ebscohost.com\/login.aspx?direct=true&amp;db=bth&amp;AN=120606181&amp;site=ehost-live&amp;scope=site\">How to tell if machine learning can solve your business problem\u00a0(Links to an external site.)Links to an external site.<\/a><\/u>\u00a0<em>性视界 Business Review Digital Articles <\/em>(November 15, 2016).<\/li>\n<li>Wilson, S. Sachdev, and A. Alter.\u00a0<u><a href=\"http:\/\/ezp-prod1.hul.harvard.edu\/login?url=http:\/\/search.ebscohost.com\/login.aspx?direct=true&amp;db=bth&amp;AN=118686115&amp;site=ehost-live&amp;scope=site\">How companies are using machine learning to get faster and more efficient(Links to an external site.)Links to an external site.<\/a><\/u>\u00a0<em>性视界 Business Review Digital Articles<\/em>\u00a0(May 3, 2016).<\/li>\n<li>Daniel Faggella. <a href=\"https:\/\/www.techemergence.com\/machine-learning-in-finance\/\">Machine Learning in Finance: Current and Future Applications<\/a>. <em>com<\/em> (November 5, 2018).<\/li>\n<li>Ed Oliver. <a href=\"https:\/\/www.dataminr.com\/resources\/report-investors-embrace-alternative-data\">The New Data Paradigm for Traders: Alternative Data in the Mainsteam<\/a>. <em>Dataminr<\/em> (Q3 2018)<\/li>\n<li>Ayn de Jesus. <a href=\"https:\/\/www.techemergence.com\/robo-advisors-artificial-intelligence-comparing-5-current-apps\/\">Robo Advisors and Artificial Intelligence<\/a>. <em>com<\/em> (July 6, 2018)<\/li>\n<li>Paul Clarke. <a href=\"https:\/\/www.fnlondon.com\/articles\/goldman-sachs-hunts-ai-experts-for-all-important-quant-team-20180130\">Goldman Sachs Hunts AI Experts for Quant Team<\/a>. <em>Financial News London<\/em> (January 30, 2018).<\/li>\n<li>Rob Daly. <a href=\"https:\/\/www.marketsmedia.com\/goldman-preps-marquee-upgrade\/\">Goldman Preps Marquee Upgrade<\/a>. <em>com<\/em> (October 16 2018)<\/li>\n<li>Liz Hoffman and Telis Demos. <a href=\"https:\/\/www.wsj.com\/articles\/wall-street-erases-the-line-between-its-jocks-and-nerds-1534564810?mod=djemCIO_h\">Wall Street Erases the Line Between its Jocks and Nerds<\/a>. <em>The Wall Street Journal<\/em> (August 18, 2018)<\/li>\n<li>Nanette Byrnes. <a href=\"https:\/\/www.technologyreview.com\/s\/603431\/as-goldman-embraces-automation-even-the-masters-of-the-universe-are-threatened\/\">As Goldman Embraces Automation, Even the Masters of the Universe Are Threatened<\/a>. <em>MIT Technology Review<\/em> (February 7, 2017)<\/li>\n<li>Charles Elkan. <a href=\"https:\/\/www.goldmansachs.com\/careers\/blog\/posts\/present-and-future-of-ai-charles-elkan.html\">The Present and Future of AI for Financial Services Firm<\/a>. <em>Goldman Sachs website<\/em> (September 7, 2018)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Goldman Sachs is a global investment bank, struggling to monetize customer data in a highly regulated environment. How are they getting around these restrictions?<\/p>\n","protected":false},"author":11298,"featured_media":32200,"comment_status":"open","ping_status":"closed","template":"","categories":[4365,1026,346],"class_list":["post-32089","hck-submission","type-hck-submission","status-publish","has-post-thumbnail","hentry","category-artifical-intelligence","category-investment-banking","category-machine-learning","hck-taxonomy-organization-goldman-sachs","hck-taxonomy-industry-banking","hck-taxonomy-country-united-states"],"connected_submission_link":"https:\/\/d3.harvard.edu\/platform-rctom\/assignment\/rc-tom-challenge-2018\/","yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management\" \/>\n<meta property=\"og:description\" content=\"Goldman Sachs is a global investment bank, struggling to monetize customer data in a highly regulated environment. How are they getting around these restrictions?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/\" \/>\n<meta property=\"og:site_name\" content=\"Technology and Operations Management\" \/>\n<meta property=\"article:modified_time\" content=\"2018-11-13T19:27:31+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/IMG011320_full4x3-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"960\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/\",\"name\":\"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/IMG011320_full4x3-1.jpg\",\"datePublished\":\"2018-11-13T19:20:17+00:00\",\"dateModified\":\"2018-11-13T19:27:31+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/#primaryimage\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/IMG011320_full4x3-1.jpg\",\"contentUrl\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/wp-content\\\/uploads\\\/sites\\\/4\\\/2018\\\/11\\\/IMG011320_full4x3-1.jpg\",\"width\":960,\"height\":720},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Submissions\",\"item\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/submission\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/#website\",\"url\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/\",\"name\":\"Technology and Operations Management\",\"description\":\"MBA Student Perspectives\",\"potentialAction\":[{\"@type\":\"性视界Action\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/d3.harvard.edu\\\/platform-rctom\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/","og_locale":"en_US","og_type":"article","og_title":"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management","og_description":"Goldman Sachs is a global investment bank, struggling to monetize customer data in a highly regulated environment. How are they getting around these restrictions?","og_url":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/","og_site_name":"Technology and Operations Management","article_modified_time":"2018-11-13T19:27:31+00:00","og_image":[{"width":960,"height":720,"url":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/IMG011320_full4x3-1.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/","url":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/","name":"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs - Technology and Operations Management","isPartOf":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/#website"},"primaryImageOfPage":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/#primaryimage"},"image":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/#primaryimage"},"thumbnailUrl":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/IMG011320_full4x3-1.jpg","datePublished":"2018-11-13T19:20:17+00:00","dateModified":"2018-11-13T19:27:31+00:00","breadcrumb":{"@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/#primaryimage","url":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/IMG011320_full4x3-1.jpg","contentUrl":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-content\/uploads\/sites\/4\/2018\/11\/IMG011320_full4x3-1.jpg","width":960,"height":720},{"@type":"BreadcrumbList","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/developing-machine-learning-and-proprietary-data-sets-at-goldman-sachs\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/d3.harvard.edu\/platform-rctom\/"},{"@type":"ListItem","position":2,"name":"Submissions","item":"https:\/\/d3.harvard.edu\/platform-rctom\/submission\/"},{"@type":"ListItem","position":3,"name":"Developing Machine Learning and Proprietary Data Sets at Goldman Sachs"}]},{"@type":"WebSite","@id":"https:\/\/d3.harvard.edu\/platform-rctom\/#website","url":"https:\/\/d3.harvard.edu\/platform-rctom\/","name":"Technology and Operations Management","description":"MBA Student Perspectives","potentialAction":[{"@type":"性视界Action","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/d3.harvard.edu\/platform-rctom\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission\/32089","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission"}],"about":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/types\/hck-submission"}],"author":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/users\/11298"}],"replies":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/comments?post=32089"}],"version-history":[{"count":0,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/hck-submission\/32089\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/media\/32200"}],"wp:attachment":[{"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/media?parent=32089"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/d3.harvard.edu\/platform-rctom\/wp-json\/wp\/v2\/categories?post=32089"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}