• [Bite] Version Control for Data Scientists

  • 2022/05/05
  • 再生時間: 16 分
  • ポッドキャスト

[Bite] Version Control for Data Scientists

  • サマリー

  • Data scientists usually have to write code to prototype software, be it to preprocess and clean data, engineer features, build a model, or deploy a codebase into a production environment or other use case. The evolution of a codebase is important for a number of reasons which is where version control can help, such as:

    • collaborating with other code developers (due diligence in coordination and delegation)
    • generating backups
    • recording versions
    • tracking changes
    • experimenting and testing
    • and working with agility.

    In this bite episode of the DataCafé we talk about these motivators for version control and how it can strengthen your code development and teamwork in building a data science model, pipeline or product.

    Further reading:

    • Version control via Wikipedia https://en.wikipedia.org/wiki/Version_control
    • git-scm via https://git-scm.com/
    • "Version Control & Git" by Jason Byrne via Slideshare https://www.slideshare.net/JasonByrne6/version-control-git-86928367
    • "Learn git" via codecademy https://www.codecademy.com/learn/learn-git
    • "Become a git guru" via Atlassian https://www.atlassian.com/git/tutorials
    • Gitflow workflow via Atlassian https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow
    • "A successful git branching model" by Vincent Dressian https://nvie.com/posts/a-successful-git-branching-model/
    • Branching strategies via GitVersion https://gitversion.net/docs/learn/branching-strategies/

    Recording date: 21 April 2022

    Thanks for joining us in the DataCafé. You can follow us on twitter @DataCafePodcast and feel free to contact us about anything you've heard here or think would be an interesting topic in the future.

    続きを読む 一部表示

あらすじ・解説

Data scientists usually have to write code to prototype software, be it to preprocess and clean data, engineer features, build a model, or deploy a codebase into a production environment or other use case. The evolution of a codebase is important for a number of reasons which is where version control can help, such as:

  • collaborating with other code developers (due diligence in coordination and delegation)
  • generating backups
  • recording versions
  • tracking changes
  • experimenting and testing
  • and working with agility.

In this bite episode of the DataCafé we talk about these motivators for version control and how it can strengthen your code development and teamwork in building a data science model, pipeline or product.

Further reading:

  • Version control via Wikipedia https://en.wikipedia.org/wiki/Version_control
  • git-scm via https://git-scm.com/
  • "Version Control & Git" by Jason Byrne via Slideshare https://www.slideshare.net/JasonByrne6/version-control-git-86928367
  • "Learn git" via codecademy https://www.codecademy.com/learn/learn-git
  • "Become a git guru" via Atlassian https://www.atlassian.com/git/tutorials
  • Gitflow workflow via Atlassian https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow
  • "A successful git branching model" by Vincent Dressian https://nvie.com/posts/a-successful-git-branching-model/
  • Branching strategies via GitVersion https://gitversion.net/docs/learn/branching-strategies/

Recording date: 21 April 2022

Thanks for joining us in the DataCafé. You can follow us on twitter @DataCafePodcast and feel free to contact us about anything you've heard here or think would be an interesting topic in the future.

[Bite] Version Control for Data Scientistsに寄せられたリスナーの声

カスタマーレビュー:以下のタブを選択することで、他のサイトのレビューをご覧になれます。