• Building the Backend: Data Solutions that Power Leading Organizations

  • 著者: Travis Lawrence
  • ポッドキャスト

Building the Backend: Data Solutions that Power Leading Organizations

著者: Travis Lawrence
  • サマリー

  • Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights with what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span from big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, and those building the backend data solutions then HIT SUBSCRIBE!
    © 2023 Building the Backend: Data Solutions that Power Leading Organizations
    続きを読む 一部表示

あらすじ・解説

Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights with what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span from big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, and those building the backend data solutions then HIT SUBSCRIBE!
© 2023 Building the Backend: Data Solutions that Power Leading Organizations
エピソード
  • The Analytics Engine for All Your Data with Justin Borgman @ Starburst
    2022/03/15

    In this episode we speak with Justin Borgman, Chairman & CEO at Starburst, which is based on open source Trino (formerly PrestoSQL) and was recently valued at $3.35 billion after securing their series D funding.  In this episode we discuss convergence of DW’s / DL's, why data lakes fail and much much more. 

    Top 3 takeaways

    • The data mesh architecture is gaining adoption more quickly in Europe due to GDPR.
    • There were two main limitations of data lakes when comparing to DW’s, performance and CRUD operations. Performance has been resolved with query engines like Starburst and tools like Apache Iceberg, Apache Hudi and Delta Lake are starting to close the gap with CRUD operations. 
    • The principle of a single source of truth / storing everything in a single DL or DW is not always feasible or possible depending on regulations. Starburst is bridging that gap and enabling data mesh and data fabric architectures. 
    続きを読む 一部表示
    36 分
  • Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS
    2022/03/01

    In this episode we speak with Paul Singman Developer Advocate at Treeverse / LakeFS. LakeFS is an open source project  that allows you to transform your object storage into a Git-like repository. 

    Top 3 takeaways

    • LakeFS enables use cases like debugging to quickly view historical versions of your data at a specific point in time and running ML experiments over the same set of data with branching..
    • The current data landscape is very fragmented with many tools available.. Over the coming years there will most likely be consolidation of tools that are more open and integrated. 
    • Data quality and observability continue to be key components of successful data lakes and having visibility into job runs. 
    続きを読む 一部表示
    27 分
  • Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ Factset
    2022/02/01

    In this episode we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access. 

    Below are the top 3 value bombs:

    • Apache Arrow is an open-source in-memory columnar format that creates a standard way to share and process data structures.
    • Apache Arrow Flight eliminates serialization and deserialization which enables faster access to query results compared to traditional JDBC and ODBC interfaces.
    • Don’t put all your eggs in one basket, whether you're using commercial products or open source, make sure you design a modular architecture that does not tie you down to any one piece of technology.
    続きを読む 一部表示
    49 分

Building the Backend: Data Solutions that Power Leading Organizationsに寄せられたリスナーの声

カスタマーレビュー:以下のタブを選択することで、他のサイトのレビューをご覧になれます。