{"componentChunkName":"component---src-templates-blog-post-tsx","path":"/dvc-project-ideas-for-google-summer-of-docs-2019","result":{"data":{"markdownRemark":{"id":"bd3dea9e-6220-5967-b66c-af24479d3cb8","excerpt":"<p>We strongly believe that well-shaped documentation is key for making the product\ntruly open. We have been investing lots of time and energy…</p>","html":"<p>We strongly believe that well-shaped documentation is key for making the product\ntruly open. We have been investing lots of time and energy in improving our docs\nlately. Being a team of 90% engineers we are eager to welcome the writers into\nour team and our community. We are happy to share our experience, introduce them\nto the world of open source and machine learning best practices, guide through\nthe OS contribution process and work together on improving our documentation.</p>\n<p>DVC was started in late 2017 by a data scientist and an engineer. It is now\ngrowing pretty fast and though our in-house team is quite small, we have to\nthank our contributors (more than 80 in both code and docs) for developing DVC\nwith us. When working with DVC the technical writer will not only get lots of\nhands-on experience in writing technical docs, but will also immerse into DVC\ncommunity — a warm and welcoming gathering of ML and DS enthusiasts and an\ninvaluable source of inspiration and expertise in ML engineering.</p>\n<h3>About DVC</h3>\n<p>DVC is a brainchild of a data scientist and an engineer, that was created to\nfill in the gaps in the ML processes tooling and evolved into a successful open\nsource project.</p>\n<p>ML brings changes in development and research processes. These ML processes\nrequire new tools for data versioning, ML pipeline versioning, resource\nmanagement for model training and others that haven’t been formalized. The\ntraditional software development tools do not fully cover ML team’s needs but\nthere are no good alternatives. It makes engineers to custom develop a new\ntoolset to manage data files, keep track of ML experiments and connect data and\nsource code together. The ML process becomes very fragile and requires tons of\ntribal knowledge.</p>\n<p>We have been working on <a href=\"http://DVC.org\">DVC</a> by adopting best ML practices and\nturning them into Git-like command line tool. DVC versions multi-gigabyte\ndatasets and ML models, make them shareable and reproducible. The tool helps to\norganize a more rigorous process around datasets and the data derivatives. Your\nfavorite cloud storage (S3, GCS, or bare metal SSH server) could be used with\nDVC as a data file backend.</p>\n<p>If you are interested in learning a little bit more about DVC and its journey,\nhere is a great interview with DVC creator in the Episode 206 of\nPodcast.<strong>init</strong>. Listen to it\n<a href=\"https://www.pythonpodcast.com/data-version-control-episode-206/\">HERE </a>or read\nthe transcript\n<a href=\"https://towardsdatascience.com/data-version-control-with-dvc-what-do-the-authors-have-to-say-3c3b10f27ee\">HERE.</a></p>\n<h3>The state of DVC documentation</h3>\n<p>DVC is a pretty young project, developed and maintained solely by engineers. As\nmany OS projects we started from the bottom and for a long time our\n<a href=\"https://dvc.org/doc\">documentation</a> was a bunch of bits and pieces. Nowadays\nimproving documentation is one of our top priorities. We moved to the new\nin-house built documentation engine and started working with several technical\nwriters. Certain parts have been tremendously improved recently, e.g.\n<a href=\"https://dvc.org/doc/get-started\">Get Started</a> and\n<a href=\"https://dvc.org/doc/commands-reference/fetch\">certain parts of Commands Reference</a>\n. So far most of our documentation has been written majorly by the engineering\nteam and there is need for improving the overall structure and making some parts\nmore friendly from a new user perspective. We have mostly complete\n<a href=\"https://dvc.org/doc/commands-reference\">reference documentation</a> for each\ncommand, although some functions are missing good actionable examples. We also\nhave a <a href=\"https://dvc.org/doc/user-guide/dvc-files-and-directories\">User Guide</a>,\nhowever it is not in very good shape. We strive for making our documentation\nclear and comprehensive for users of various backgrounds and proficiency levels\nand this is where we do need some fresh perspective.</p>\n<h3>How DVC documentation is built</h3>\n<p>We have an open Github Apache-2 licensed repository for the\n<a href=\"https://github.com/iterative/dvc.org\">DVC website</a>, the documentation engine\nand the <a href=\"https://github.com/iterative/dvc.org\">documentation files</a>. The website\nis built with Node.js + React, including the documentation engine (built\nin-house).</p>\n<p>Each documentation page is a static Markdown file in the repository, e.g.\n<a href=\"https://github.com/iterative/dvc.org/blob/master/static/docs/get-started/example-versioning.md.\">example here</a>.\nIt is rendered dynamically in the browser, no preprocessing is required. It\nmeans that tech writers or contributors need to write/edit a Markdown file,\ncreate a pull request and merge it into the master branch of the\n<a href=\"https://github.com/iterative/dvc.org\">repository.</a> The complete\n<a href=\"https://github.com/iterative/dvc.org/blob/master/README.md#contributing\">documentation contributing guide</a>\ndescribes the directory structure and locations for the different documentation\nparts.</p>\n<h3>DVC’s approach to documentation work</h3>\n<p>Documentation tasks and issues are maintained on our doc’s GitHub\n<a href=\"https://github.com/iterative/dvc.org/issues\">issue tracker</a>. Changes to the\ndocumentation are made via pull requests on GitHub, and go through our standard\nreview process which is the same for documentation and code. A technical writer\nwould be trained in working with our current development process. It generally\nmeans that tech writers or contributors need to write/edit a Markdown file, use\ngit and Github to create a pull request and publish it. The documentation\n<a href=\"https://github.com/iterative/dvc.org/blob/master/README.md#contributing\">contributing guide</a>\nincludes style conventions and other details. Documentation is considered of the\nsame importance as code. Engineering team has a policy to write or update the\nrelevant sections if something new is released. If it’s something too involved\nengineers may create a ticket and ask for help. There is one maintainer who is\nresponsible for doing final reviews and merging the changes. In this sense, our\ndocumentation is very similar to any other open source project.</p>\n<h2>Project ideas for GSoD’19</h2>\n<p>We identified a number of ideas to work on and there are two major topics these\nideas fall into. Both topics are pretty broad and we don’t expect we can\ncompletely cover them during this GSoD but hopefully we can make certain\nprogress.</p>\n<p>First of all, we want to bring more structure and logic to our documentation to\nimprove user onboarding experience. The goal is for a new user to have a clear\npath they can follow and understand what takeaways each part of the\ndocumentation provides. In particular, improving how\n<a href=\"https://dvc.org/doc/get-started\">Get Started</a>,\n<a href=\"https://dvc.org/doc/tutorial\">Tutorials</a> and\n<a href=\"https://dvc.org/doc/get-started/example-versioning\">Examples</a> relate to each\nother, restructuring the existing <a href=\"https://dvc.org/doc/user-guide\">User Guide</a>\nto explain basic concepts, and writing more use cases that resonate with ML\nengineers and data scientists.</p>\n<p>The other issue we would like to tackle is improving and expanding the existing\nreference docs — commands descriptions, examples, etc. It involves filling in\nthe gaps and developing new sections, similar to\n<a href=\"https://dvc.org/doc/commands-reference/fetch\">this one</a>. We would also love to\nsee more illustrative materials.</p>\n<h3>Project 1: Improving and expanding User Guide</h3>\n<p><strong>Description and details:</strong> Reviewing, restructuring and filling major gaps in\nthe User Guide (introductory parts of the basic concepts of DVC), e.g. have a\nlook at <a href=\"https://github.com/iterative/dvc.org/issues/144\">this ticket</a> or\n<a href=\"https://github.com/iterative/dvc.org/issues/53\">this one</a>.</p>\n<p><strong>Mentors</strong>: <a href=\"https://github.com/shcheklein\">@shcheklein</a> and\n<a href=\"https://github.com/dmpetrov\">@dmpetrov</a></p>\n<h3>Project 2: Expanding and developing new tutorials and use cases.</h3>\n<p><strong>Description and details:</strong> We already have some requests for more tutorials,\ne.g. <a href=\"https://github.com/iterative/dvc.org/issues/96\">this ticket</a>. Here is\nanother good <a href=\"https://github.com/iterative/dvc.org/issues/194\">use case request</a>\n. If you are going to work on this project you would need some domain knowledge,\npreferably some basic ML or data science experience.</p>\n<p><strong>Mentors</strong>: <a href=\"https://github.com/shcheklein\">@shcheklein</a> and\n<a href=\"https://github.com/dmpetrov\">@dmpetrov</a></p>\n<h3>Project 3: Improving new user onboarding</h3>\n<p><strong>Description and details:</strong> Analyze and restructure user walkthrough across\n<a href=\"https://dvc.org/doc/get-started\">Get started</a>,\n<a href=\"https://dvc.org/doc/tutorial\">Tutorials</a> and\n<a href=\"https://dvc.org/doc/get-started/example-versioning\">Examples</a>. These three have\none thing in common — hands-on experience with DVC. If you choose this project,\nwe will work together to come up with a better location for the Examples (to\nmove them out of the Get Started shadow), and a better location for the\nTutorials (to reference external tutorials that were developed by our community\nmembers and published on different platforms).</p>\n<p><strong>Mentors</strong>: <a href=\"https://github.com/shcheklein\">@shcheklein</a> and\n<a href=\"https://github.com/dmpetrov\">@dmpetrov</a></p>\n<h3>Project 4: Improving commands reference</h3>\n<p><strong>Description and details:</strong> We will work on improving our\n<a href=\"https://dvc.org/doc/commands-reference\">Commands reference</a> section. This\nincludes expanding and filling in the gaps. One of the biggest pain points right\nnow are Examples. Users want them to be\n<a href=\"https://github.com/iterative/dvc.org/issues/198\">easy to run and try</a> and here\nis a lot to be done in terms of improvement. We have a good example of how is\nshould be done <a href=\"https://dvc.org/doc/commands-reference/fetch\">here</a>.</p>\n<p><strong>Mentors</strong>: <a href=\"https://github.com/shcheklein\">@shcheklein</a> and\n<a href=\"https://github.com/dmpetrov\">@dmpetrov</a></p>\n<h3>Project 5: Describe and integrate “DVC packages”</h3>\n<p><strong>Description and details:</strong> Describe the brand new feature “DVC packages” and\nintegrate it with the rest of the documentation. We have been working hard to\nrelease a few new commands to help with datasets management (have a look at\n<a href=\"https://github.com/iterative/dvc/issues/1487\">this ticket</a>). It’s a major\nfeature that deserves its place in the Get Started, Use cases, Commands\nReference, etc.</p>\n<p><strong>Mentors</strong>: <a href=\"https://github.com/shcheklein\">@shcheklein</a> and\n<a href=\"https://github.com/dmpetrov\">@dmpetrov</a></p>\n<p>The ideas we outline above are just an example of what we can work on. We are\nopen for any other suggestions and would like to work together with the\ntechnical writer to make the contribution experience both useful and enjoyable\nfor all parties involved. If you have any suggestions or questions we would love\nto hear from you => DVC.org/support and our DMs on\n<a href=\"https://twitter.com/DVCorg\">Twitter</a> are always open!</p>\n<html><head></head><body><hr></body></html>\n<p>Special thanks to the <a href=\"https://numfocus.org/\">NumFOCUS</a> for the ideas list\ninspiration.</p>\n<p>If you are a tech writer — check the\n<a href=\"https://developers.google.com/season-of-docs/docs/tech-writer-guide\">Technical writer guide</a>.\nFrom April 30, 2019 you can see the list of participating open source\norganizations on the <a href=\"https://g.co/seasonofdocs\">Season of Docs website</a>. The\napplication period for technical writers opens on <strong>May 29, 2019</strong> and ends on\nJune 28, 2019.</p>","timeToRead":8,"fields":{"slug":"/dvc-project-ideas-for-google-summer-of-docs-2019"},"frontmatter":{"title":"DVC project ideas for Google Season of Docs 2019","date":"April 23, 2019","description":"DVC.org is applying for Google Season of Docs — a new and unique program\nsponsored by Google that pairs technical writers with open source projects to\ncollaborate on the open source project documentation.\n","descriptionLong":"DVC.org is applying for Google Season of Docs — a new and unique program\nsponsored by Google that pairs technical writers with open source projects to\ncollaborate on the open source project documentation. It’s happening for the\nfirst time in 2019 and we are excited about the opportunity to be a part of\nit!\n","tags":["Google Season of Docs","Python","Documentation","DVC"],"commentsUrl":"https://discuss.dvc.org/t/dvc-project-ideas-for-google-season-of-docs-2019/291","author":{"childMarkdownRemark":{"frontmatter":{"name":"Svetlana Grinchenko","avatar":{"childImageSharp":{"fixed":{"base64":"data:image/jpeg;base64,/9j/2wBDABALDA4MChAODQ4SERATGCgaGBYWGDEjJR0oOjM9PDkzODdASFxOQERXRTc4UG1RV19iZ2hnPk1xeXBkeFxlZ2P/2wBDARESEhgVGC8aGi9jQjhCY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2NjY2P/wgARCAAUABQDASIAAhEBAxEB/8QAGAABAAMBAAAAAAAAAAAAAAAAAAIDBQT/xAAUAQEAAAAAAAAAAAAAAAAAAAAA/9oADAMBAAIQAxAAAAHRz7x3qhnQCoH/xAAaEAACAwEBAAAAAAAAAAAAAAABAgMEMxIU/9oACAEBAAEFArBJZ0aqyN0s+trCvhYchTKzj0SRD//EABQRAQAAAAAAAAAAAAAAAAAAACD/2gAIAQMBAT8BH//EABQRAQAAAAAAAAAAAAAAAAAAACD/2gAIAQIBAT8BH//EAB0QAAICAgMBAAAAAAAAAAAAAAECABExQQMSIVH/2gAIAQEABj8CXiVuvbcDK9jYMDDcRyLBFTHpIixRflQjAHyBVwJ//8QAGxABAAMAAwEAAAAAAAAAAAAAAQARQSExUXH/2gAIAQEAAT8ht1LLoIkF4B6wLgOVH8uRVjiTT5AoqBr2L1odagwijiyf/9oADAMBAAIAAwAAABBjDwD/xAAUEQEAAAAAAAAAAAAAAAAAAAAg/9oACAEDAQE/EB//xAAUEQEAAAAAAAAAAAAAAAAAAAAg/9oACAECAQE/EB//xAAeEAACAwADAAMAAAAAAAAAAAABEQAhMUFRYXGBkf/aAAgBAQABPxCmAzsDr5hIjqwzzzqjPahi5r8IQVrj2MCG47GtJ+o5KCJ7t+zbocIBE8rYP/mqRYTv5EF8QZIHU//Z","width":40,"height":40,"src":"/static/fcc8502faa36f9a989fa0651c3c21653/d83e5/svetlana_grinchenko.jpg","srcSet":"/static/fcc8502faa36f9a989fa0651c3c21653/d83e5/svetlana_grinchenko.jpg 1x,\n/static/fcc8502faa36f9a989fa0651c3c21653/58860/svetlana_grinchenko.jpg 1.5x,\n/static/fcc8502faa36f9a989fa0651c3c21653/90ac5/svetlana_grinchenko.jpg 2x","srcWebp":"/static/fcc8502faa36f9a989fa0651c3c21653/e145b/svetlana_grinchenko.webp","srcSetWebp":"/static/fcc8502faa36f9a989fa0651c3c21653/e145b/svetlana_grinchenko.webp 1x,\n/static/fcc8502faa36f9a989fa0651c3c21653/0d42c/svetlana_grinchenko.webp 1.5x,\n/static/fcc8502faa36f9a989fa0651c3c21653/f46db/svetlana_grinchenko.webp 2x"}}}}}},"picture":{"childImageSharp":{"fluid":{"base64":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABQAAAAPCAIAAABr+ngCAAAACXBIWXMAAAsSAAALEgHS3X78AAADUUlEQVQoz3WRy28TVxjFbwhINIRXFVO6Adp/gQWIBbugdtlm0VaqSIOSCLAQCEGkRASCSsOCpi2Ulqig4jgQGzuQMH6NnThjJ3ZIkxm/Z2x3HL8ytuOMJ6SyPQ+PH50J6qILjn469yGd+937XcAfVch8fFCEtZWSi43+WgrdLoVuvaMsc3Pb5WUxeFNIqevCekNWDfBH2rhjh0TFnvVvPl+2arDpMcw4gRo0XoPGY9RgRg1qfIFCExKYtGnRLuqfUu4fGyxZFxjAH94n5SXYI4dI9ZR/ORGcC4Tf4G/cfq/Th8/7ozPOOBZM+ongfMhjC9q0Ae/0vWrqt4r/AhCHh/gDu6TilcOtW6dP4sFV/2oug0f+sLtH4OWE7tna5Fg6EEygPsIbDSey89aAz/Sk6uvgIQCku4v3hoX9O/ljioqi5Z+vv4iu57JbxSdL+I3pxZX7D2ibNhqKedBY0EcmqZzLQeG6noa9STA1gUatJuX5h/fLilbho71C2+6tM2e2fvoeefUQXfk7shQmnY44kUC9zAq2KT0kbenlDDsq5iYJUK/XpXBug/3rzu9C+wnuw4PCqQ+qQ4B/CSJwdwxWbUAjOWySIsZp9HLZ/knVBETL/8PZRAExY/kYuTnYI/Tu5I4rKl/uq9oANwUEQ5Ng3lExgioEJOfNuwRTs2gG0in/VU6/XXjpKCTThYiqNgsqn7WKu/eKP7TVXC0NC5CBQR2W/d2ENzeXZj4F29/diOhn4D9nIy6Pz+ql59rFwT1UewehNYWRKdJyN264knjdlXh9NmnsJXSX5nXDVpvesEDI4YgGutX9aFCp7leq+pVjQ8pf7py93df7uK9ntK9z9HqX6vo5dd85lcQ1ia6x/u7xx+NenTMOisgc+d15590u5NFX0z1Kfedl04Nxuw11WBcd8CJilXAjsMsKITCEOK3uV3r7z6Mm7Wz4xRwJMjcGkhevrqu7uUAnrR5gUH+MxKlMKr2WzOWzVHatWCpmstRGIU8zG/l8ZjlAqqAl7Qw+6VwFKeXF1IVLqwPfxp91FhlCqDfogiyalsfNzbccyzEMw/MCWy6nqbza6HkKYc9txMQsCdhQiMVxzh8QU2ty67ab/z6xfDVTYLMMlymUM3TpXwMGgA/YGuToAAAAAElFTkSuQmCC","aspectRatio":1.34715821812596,"src":"/static/3f83b2f133ac48de03c7aa1b31e399d4/286b3/post-image.png","srcSet":"/static/3f83b2f133ac48de03c7aa1b31e399d4/1f44b/post-image.png 213w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/3e433/post-image.png 425w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/286b3/post-image.png 850w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/9a739/post-image.png 1275w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/c47cc/post-image.png 1700w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/fee50/post-image.png 1754w","srcWebp":"/static/3f83b2f133ac48de03c7aa1b31e399d4/5c1d9/post-image.webp","srcSetWebp":"/static/3f83b2f133ac48de03c7aa1b31e399d4/99b2d/post-image.webp 213w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/23220/post-image.webp 425w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/5c1d9/post-image.webp 850w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/5e720/post-image.webp 1275w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/35cfd/post-image.webp 1700w,\n/static/3f83b2f133ac48de03c7aa1b31e399d4/cf3ed/post-image.webp 1754w","sizes":"(max-width: 850px) 100vw, 850px","presentationWidth":850}}},"pictureComment":null}}},"pageContext":{"next":{"fields":{"slug":"/may-19-dvc-heartbeat"},"frontmatter":{"title":"May ’19 DVC❤️Heartbeat"}},"previous":{"fields":{"slug":"/april-19-dvc-heartbeat"},"frontmatter":{"title":"April ’19 DVC❤️Heartbeat"}},"currentPage":12,"slug":"/dvc-project-ideas-for-google-summer-of-docs-2019"}}}