{"componentChunkName":"component---src-templates-blog-post-tsx","path":"/february-20-dvc-heartbeat","result":{"data":{"markdownRemark":{"id":"d8f97f40-5cba-5ee7-b900-e58cf7a57c76","excerpt":"<p>Welcome to the February Heartbeat! This month’s featured image is a DVC pipeline\n<a href=\"https://medium.com/nlp-trend-and-review-en/use-dvc-to-version-control-ml-dl-models-bef61dbfe477\">created by one of our users</a>,\nwhich <em>we</em> think resembles a…</p>","html":"<p>Welcome to the February Heartbeat! This month’s featured image is a DVC pipeline\n<a href=\"https://medium.com/nlp-trend-and-review-en/use-dvc-to-version-control-ml-dl-models-bef61dbfe477\">created by one of our users</a>,\nwhich <em>we</em> think resembles a valentine. Here are some more highlights from our\nteam and our community:</p>\n<h2>News</h2>\n<p><strong>Our team is growing!</strong> In early January, DVC gained two new folks: engineer\n<a href=\"https://github.com/skshetry\">Saugat Pachhai</a> and data scientist\n<a href=\"https://twitter.com/andronovhopf\">Elle O’Brien</a>. Saugat, based in Nepal, will\nbe contributing to core DVC. Elle (that’s me!), currently in San Francisco, will\nbe leading data science projects and outreach with DVC.</p>\n<p>We’re <strong>gearing up for a spring full of talks</strong> about DVC projects, including\nnew up-and-coming features for data cataloging and continuous integration. Here\nare just a few events that have been added to our schedule:</p>\n<p><html><head></head><body><html><head></head><body><section class=\"elp-content-holder\">\n      <a href=\"https://www.mlprague.com/#schedule-saturday\" class=\"external-link-preview\">\n          <div class=\"elp-description-holder\">\n            <h4 class=\"elp-title\">Machine Learning Prague - March 19</h4>\n            <div class=\"elp-description\">DVC engineer Pawel Redzynski will talk about open source tools for versioning machine learning projects.</div>\n            <div class=\"elp-link\">mlprague.com</div>\n          </div>\n           <div class=\"elp-image-holder\">\n                <img src=\"/uploads/images/2020-02-10/mlprague.jpg\" alt=\"Machine Learning Prague - March 19\">\n            </div>\n      </a>\n    </section>\n    </body></html></body></html></p>\n<p><html><head></head><body><html><head></head><body><section class=\"elp-content-holder\">\n      <a href=\"https://www.mlprague.com/#schedule-saturday\" class=\"external-link-preview\">\n          <div class=\"elp-description-holder\">\n            <h4 class=\"elp-title\">DivOps 2020 - March 24</h4>\n            <div class=\"elp-description\">Elle O'Brien is talking about open source software in the growing field of MLOps at this international, remote conference.</div>\n            <div class=\"elp-link\">https://divops.org/</div>\n          </div>\n           <div class=\"elp-image-holder\">\n                <img src=\"/uploads/images/2020-02-10/divops_logo.png\" alt=\"DivOps 2020 - March 24\">\n            </div>\n      </a>\n    </section>\n    </body></html></body></html></p>\n<p><html><head></head><body><html><head></head><body><section class=\"elp-content-holder\">\n      <a href=\"https://www.mlprague.com/#schedule-saturday\" class=\"external-link-preview\">\n          <div class=\"elp-description-holder\">\n            <h4 class=\"elp-title\">Women in Data Science San Diego - May 9</h4>\n            <div class=\"elp-description\">Elle O'Brien will be delivering a keynote talk about data catalogs and feature stores.</div>\n            <div class=\"elp-link\">https://www.widsconference.org/</div>\n          </div>\n           <div class=\"elp-image-holder\">\n                <img src=\"/uploads/images/2020-02-10/wids.jpeg\" alt=\"Women in Data Science San Diego - May 9\">\n            </div>\n      </a>\n    </section>\n    </body></html></body></html></p>\n<p>-Elle O’Brien was recently accepted to give a keynote at\n<a href=\"https://www.widsconference.org/\">Women in Data Science</a> San Diego on May 9. The\ntalk is called “Packaging data and machine learning models for sharing.”</p>\n<p>-Elle will also be speaking at <a href=\"https://divops.org/\">Div Ops</a>, a new online\nconference about (you guessed it) DevOps, on March 27.</p>\n<p>Look out for more conference announcements soon- in our <strong>brand new community\npage!</strong> We’ve <a href=\"https://dvc.org/community\">just launched a new hub</a> for sharing\nevents, goings-ons, and ways to contribute to DVC.</p>\n<h2>From the community</h2>\n<p>Our users continue to put awesome things on the internet. Like this AI blogger\nwho isn’t afraid to wear his heart on his sleeve.</p>\n<p><html><head></head><body><html><head></head><body><section class=\"elp-content-holder\">\n      <a href=\"https://medium.com/@matlihan/my-favorite-data-science-tool-is-dvc-data-version-control-e6ab8aed24d2\" class=\"external-link-preview\">\n          <div class=\"elp-description-holder\">\n            <h4 class=\"elp-title\">My favorite data science tool is DVC - Data Version Control</h4>\n            <div class=\"elp-description\">by Musa Atlıhan</div>\n            <div class=\"elp-link\">medium.com</div>\n          </div>\n           <div class=\"elp-image-holder\">\n                <img src=\"/uploads/images/2020-02-10/musa_atlihan.jpeg\" alt=\"My favorite data science tool is DVC - Data Version Control\">\n            </div>\n      </a>\n    </section>\n    </body></html></body></html></p>\n<p>Musa Atlihan writes:</p>\n<blockquote>\n<p>From my experience, whether it is a real-world data science project or it is a\ndata science competition, there are two major key components for success.\nThose components are API simplicity and reproducible pipelines. Since data\nscience means experimenting a lot in a limited time frame, first, we need\nmachine learning tools with simplicity and second, we need\nreliable/reproducible machine learning pipelines. Thanks to tools like Keras,\nLightGBM, and fastai we already have simple yet powerful tools for rapid model\ndevelopment. And thanks to DVC, we are building large projects with\nreproducible pipelines very easily.</p>\n</blockquote>\n<p>It’s cool how Musa puts DVC in context with libraries for model building. In a\nway, the libraries that have made it easier than ever to iterate through\ndifferent model architectures have increased the need for reproducibility in\nproportion.</p>\n<p>Meanwhile in Germany, superusers Marcel Mikl and Bert Besser wrote\n<a href=\"https://blog.codecentric.de/en/2019/03/walkthrough-dvc/\">another</a> seriously\ncomprehensive article about DVC for Codecentric. Marcel and Bert walk readers\nthrough the steps to <strong>build a custom machine learning training pipeline with\nremote computing resources</strong> like GCP and AWS. It’s an excellent guide to\nconfiguring model training with attention to <em>automation</em> and <em>collaboration</em>.\nWe give them 🦉🦉🦉🦉🦉 out of 5.</p>\n<p><html><head></head><body><html><head></head><body><section class=\"elp-content-holder\">\n      <a href=\"https://blog.codecentric.de/en/2020/01/remote-training-gitlab-ci-dvc/\" class=\"external-link-preview\">\n          <div class=\"elp-description-holder\">\n            <h4 class=\"elp-title\">Remote training with GitLab-CI and DVC</h4>\n            <div class=\"elp-description\">by Marcel Mikl and Bert Besser</div>\n            <div class=\"elp-link\">blog.codecentric.de</div>\n          </div>\n           <div class=\"elp-image-holder\">\n                <img src=\"/uploads/images/2020-02-10/marcel.png\" alt=\"Remote training with GitLab-CI and DVC\">\n            </div>\n      </a>\n    </section>\n    </body></html></body></html></p>\n<p>Here are a few more stories on our radar:</p>\n<ul>\n<li><strong>AI Singapore shares their method for AI development and deployment.</strong> This\n..\n<a href=\"https://makerspace.aisingapore.org/2020/01/agile-ai-engineering-in-aisg/\">blog about how Agile informs their processes</a>\nfor continuous integration and delivery includes data versioning.</li>\n<li><strong>Toucan AI dispenses advice for ML engineers.</strong> This ..\n<a href=\"https://toucanai.com/blog/post/building-production-ml/\">blog for practitioners</a>\ndiscusses questions like, “When to work on ML vs. the processes that surround\nML”. It covers how DVC is used for model versioning in the exploration stage\nof ML.</li>\n<li>\n<p><strong>DVC at the University.</strong> A recent ..\n<a href=\"https://arxiv.org/pdf/1912.01706.pdf\">pre-print from natural language processing researchers at Université Laval</a>\nexplains how DVC facilitated dataset access for collaborators.</p>\n<blockquote>\n<p>“In our case, the original dataset takes up to 6 Gigabytes. The previous way\nof retrieving the dataset over the network with a standard 20 Mbits/sec\ninternet connexion took up to an hour to complete (including uncompressing\nthe data). Using DVC reduced the retrieval time of the dataset to 3 minutes\nover the network with the same internet connexion.”</p>\n</blockquote>\n<p>Thanks for sharing- this is a lovely result. Oh, and last…</p>\n</li>\n<li><strong>DVC is a job requirement</strong>! We celebrated a small milestone when we stumbled\n.. across a listing for a data engineer to support R&#x26;D at\n<a href=\"https://www.elvie.com/en-us/\">Elvie</a>, a maker of tech for women’s health\n(pretty neat mission). The decorations on the job posting are ours 😎</li>\n</ul>\n<p><html><head></head><body><span class=\"gatsby-resp-image-wrapper\" style=\"position: relative; display: block; margin-left: auto; margin-right: auto;  max-width: 470px;\">\n      <a class=\"gatsby-resp-image-link\" href=\"/static/f0e8a9d4e7525ba2c56504833e14c3cd/4362d/elvie.png\" style=\"display: block\" target=\"_blank\" rel=\"noopener\">\n    <span class=\"gatsby-resp-image-background-image\" style=\"padding-bottom: 83.82978723404257%; position: relative; bottom: 0; left: 0; background-image: url(&#x27;data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABQAAAARCAYAAADdRIy+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAACVUlEQVQ4y6VUy47TQBDMN/EXHPgNfgQJCXFHnOECh9VeuLMckIggWtiEtXASO++337HHzxRdkzibXQSLYKRS2z097aqebjdKb4180kTSOgegsNtBsPsncDWCs4dwXzxA/P4p/ncxacM/fwT/7WOU8pKmObI0hVLqt0gP+1mWIUkSDb5XVbVnWAbfUcY2kqzCYjHHZrMRu8B0OsVkMsF8PsdyudSWWK1W2r9er3XMbDbT+3me7xNqqoI0VfB9H67rwnEcnZiWB5mEflr6GVfbXyRjV6EqS02dQfwaD9KSKUFG3KOfrOg7ZXi6GvXtkDIPjkYjfYB2OBwIhhjYNkZix+MxLMvSYExRFLp+k7mNzo8vSDN1kCxJWeQgCDQo1fM8+GGk4foBvCBEFCdwxM+9KIqw3W51qUy7jYtP7xAn0U1CfomsbGEzGAzQ65qwjW+wOpcwO19hXrVgXV+hd92WGAvdXg+GYWjJSRKLwnRfw1pyKXVkMsoaSuIo8FFMDWT9z8jHHWl+A7nVQjG4lK7woURRLqDsm1vBbclhGGopvkiinCQrNGKVIowVwm2MUPZZlloywec8zw5tw7GRpqRkMuv3+1oyC981TfRFmiU+R27Zk9Zh/dhCTEoCdd1v9aG+5aKUYPcYTLiHBAQngYfqKbk7QcdJ2SkHxcZAHLoYT+fHVmE9T7Fvo+GxB3kZ9eQQR4b56APS5jOo9iv4C0u3hus6unU4CWR6KvPen0NmvIFqPkd6+RLYTv/qj/LH31cxuoD6+ARZ5zUq5em55iXdd/Duc71+Asu4ECrn2prNAAAAAElFTkSuQmCC&#x27;); background-size: cover; display: block;\"></span>\n  <picture>\n        <source srcset=\"/static/f0e8a9d4e7525ba2c56504833e14c3cd/c54d4/elvie.webp 175w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/a3432/elvie.webp 350w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/426ac/elvie.webp 700w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/e8e7c/elvie.webp 940w\" sizes=\"(max-width: 700px) 100vw, 700px\" type=\"image/webp\">\n        <source srcset=\"/static/f0e8a9d4e7525ba2c56504833e14c3cd/17006/elvie.png 175w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/d6f3f/elvie.png 350w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/69344/elvie.png 700w, /static/f0e8a9d4e7525ba2c56504833e14c3cd/4362d/elvie.png 940w\" sizes=\"(max-width: 700px) 100vw, 700px\" type=\"image/png\">\n        <img class=\"gatsby-resp-image-image\" src=\"/static/f0e8a9d4e7525ba2c56504833e14c3cd/69344/elvie.png\" alt=\"elvie\" title=\"elvie\" loading=\"lazy\" style=\"width:100%;height:100%;margin:0;vertical-align:middle;position:absolute;top:0;left:0;\">\n      </picture>\n  </a>\n    </span></body></html><em>A\n<a href=\"https://www.jobstoday.co.uk/job/40530810/data-engineer/?TrackID=8\">job advertisement</a>\nfeaturing DVC.</em></p>","timeToRead":4,"fields":{"slug":"/february-20-dvc-heartbeat"},"frontmatter":{"title":"February '20 DVC❤️Heartbeat","date":"February 10, 2020","description":"DVC talks around the world,\nnew team members, and full-stack machine learning.\n","descriptionLong":"Every month we share news, findings, interesting reads,\ncommunity takeaways, and everything else along the way.\nLook here for updates about DVC, our journey as a startup, \nprojects by our users and big ideas about best practices in ML and data science.\n","tags":["Heartbeat","Continuous Integration","DVC"],"commentsUrl":"https://discuss.dvc.org/t/dvc-heartbeat-feburary-20/318","author":{"childMarkdownRemark":{"frontmatter":{"name":"Elle O'Brien","avatar":{"childImageSharp":{"fixed":{"base64":"data:image/jpeg;base64,/9j/2wBDABALDA4MChAODQ4SERATGCgaGBYWGDEjJR0oOjM9PDkzODdASFxOQERXRTc4UG1RV19iZ2hnPk1xeXBkeFxlZ2P/2wBDARESEhgVGC8aGi9jQjhCY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2P/wgARCAAUABQDASIAAhEBAxEB/8QAGQABAAIDAAAAAAAAAAAAAAAAAAMFAgQG/8QAFQEBAQAAAAAAAAAAAAAAAAAAAgP/2gAMAwEAAhADEAAAAZmtNOlyjIcrZgpiEP/EABsQAQACAgMAAAAAAAAAAAAAAAIBAxIhABEz/9oACAEBAAEFArV0dVzy942N41GdSpSt8A0D/8QAFxEAAwEAAAAAAAAAAAAAAAAAAQIgIf/aAAgBAwEBPwFRkf/EABURAQEAAAAAAAAAAAAAAAAAAAEg/9oACAECAQE/AWP/xAAfEAACAQIHAAAAAAAAAAAAAAAAARACEQMSUVJhcYH/2gAIAQEABj8CS3MzUexhvQ7h3FwWTP/EAB0QAAMBAAEFAAAAAAAAAAAAAAABESFRMUFxsdH/2gAIAQEAAT8hsbi0nApKKKoujQjiDTN+15X0ovgcWhPSElmTuf/aAAwDAQACAAMAAAAQHOi9/8QAGBEAAwEBAAAAAAAAAAAAAAAAAAExEBH/2gAIAQMBAT8QRRwUz//EABcRAAMBAAAAAAAAAAAAAAAAAAEQITH/2gAIAQIBAT8QOo6v/8QAHRABAAMBAAIDAAAAAAAAAAAAAQARITFxkUFhsf/aAAgBAQABPxB89x9U6Sn6EA69v7iEs5LB0aDWr38jGAHlgPfSAs0pXqIzhQ8QFPkh4ORpypLXVhnif//Z","width":40,"height":40,"src":"/static/1614906361c7d460137741db062e0c7e/d83e5/elle_obrien.jpg","srcSet":"/static/1614906361c7d460137741db062e0c7e/d83e5/elle_obrien.jpg 1x,\n/static/1614906361c7d460137741db062e0c7e/58860/elle_obrien.jpg 1.5x,\n/static/1614906361c7d460137741db062e0c7e/90ac5/elle_obrien.jpg 2x","srcWebp":"/static/1614906361c7d460137741db062e0c7e/e145b/elle_obrien.webp","srcSetWebp":"/static/1614906361c7d460137741db062e0c7e/e145b/elle_obrien.webp 1x,\n/static/1614906361c7d460137741db062e0c7e/0d42c/elle_obrien.webp 1.5x,\n/static/1614906361c7d460137741db062e0c7e/f46db/elle_obrien.webp 2x"}}}}}},"picture":{"childImageSharp":{"fluid":{"base64":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABQAAAAOCAYAAAAvxDzwAAAACXBIWXMAAAsSAAALEgHS3X78AAAB/ElEQVQ4y21T247TMBDNv2xjO3ESJ45zbbd0kVAlBLwBbzzwA3wBEl/uYc4kLlm0D1PXnts5ZyZZWZaxdXV0zWata6JLdz6NMVErFU1Vxe7952iKIn6Z5zi1Ibo6xLbuxSRO65jxDymlxPJcUVNXxE75r5SW/yY/kek85b/+0LevP+j3zVHXrnSd7rSGF/JuJNRBLArGgqsX3BldgKwsC7njXbMZjjHWRvXhY/x0/x5/3t/FpvZxDbe4Di+xbTyjU5IvCKvKku8c9b6l0Ldy4l5ZK521NsTFKTSWnLPUhZ7jOolLZm0psVKw5oLLFOh2Xej5PIlNgyfWd6etiRHTuo50vcz0zCdiLnyuy0Dz2D+av0IIHZOmPBAaQie6bCg169aIxsc4vCFftN41JO+dPCZhkyF4a8QsuNDANIE0xSUwPeenphmoNk1FT6cTqf0xBSMZWvGgBC10SpMXhGzIQ/489eLLoNPIeiGhru2jE4z3U7Q9s07nZRS6G9XND+2Rt+ld/NOwZCTQDA7QBBLsIegk2jJ1vp/yXPz8AdDEw0BekkEKPuDvyw09EQgKaPR0yoUSGiB2mYP4sSpp+Y8yZcchpC8EDqAAYtAENRSdRv/Yt9TgaK8K6sNAYLI6+zBgoAyN8X7M+b9o9laXraAWnS77os88nONqvVUM9hfwOqnOAgeLkwAAAABJRU5ErkJggg==","aspectRatio":1.4450867052023122,"src":"/static/5a98f92cf0ff6089ab3b066ce727cd94/286b3/heartbeat_black.png","srcSet":"/static/5a98f92cf0ff6089ab3b066ce727cd94/1f44b/heartbeat_black.png 213w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/3e433/heartbeat_black.png 425w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/286b3/heartbeat_black.png 850w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/9a739/heartbeat_black.png 1275w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/c47cc/heartbeat_black.png 1700w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/c5a1b/heartbeat_black.png 2000w","srcWebp":"/static/5a98f92cf0ff6089ab3b066ce727cd94/5c1d9/heartbeat_black.webp","srcSetWebp":"/static/5a98f92cf0ff6089ab3b066ce727cd94/99b2d/heartbeat_black.webp 213w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/23220/heartbeat_black.webp 425w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/5c1d9/heartbeat_black.webp 850w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/5e720/heartbeat_black.webp 1275w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/35cfd/heartbeat_black.webp 1700w,\n/static/5a98f92cf0ff6089ab3b066ce727cd94/37117/heartbeat_black.webp 2000w","sizes":"(max-width: 850px) 100vw, 850px","presentationWidth":850}}},"pictureComment":"Just in time for Valentine's day, here's a seasonally-relevant DVC pipeline."}}},"pageContext":{"next":null,"previous":{"fields":{"slug":"/gsoc-ideas-2020"},"frontmatter":{"title":"Join DVC for Google Summer of Code 2020"}},"currentPage":1,"slug":"/february-20-dvc-heartbeat"}}}