エピソード

  • Enterprise Data Observability and the Future of Agentic AI with Ramon Chen, Chief Product Officer at Acceldata
    2025/04/07

    In this thought-provoking episode of Data Hurdles, hosts Chris Detzel and Michael Burke welcome back Ramon Chen, Chief Product Officer at Acceldata, for an insightful discussion on the rapidly evolving world of enterprise data observability and agentic AI.

    Ramon shares how data observability has evolved from an emerging concept to a "full-blown tidal wave" in the industry, now widely recognized as a crucial component of data management that ensures proactive data quality and trustworthiness throughout the data supply chain. The conversation explores how data observability functions as a set of policies and rules that monitor data quality from inception, providing data engineers with timely alerts to resolve issues before they affect business users' reports or downstream AI applications.

    The episode dives deep into Acceldata's recent announcement of "Agentic AI data management" - a paradigm shift that applies AI agents to data management in a way similar to their application in customer support and sales. Ramon explains how this approach offers a chat-like interface that adapts to the user's role and intent, providing personalized insights and recommendations about data quality and reliability.

    The hosts and Ramon also discuss broader implications of AI advancement, including the changing nature of technical roles, the balance between automation and human oversight, and the emergence of AI observability as a natural extension of data observability. Ramon highlights the upcoming "Autonomous 25" conference on May 20th in San Francisco, where industry leaders will explore agentic AI and its impact on data management.

    This episode offers valuable insights for data professionals navigating the intersection of AI and data management in an era of unprecedented technological change.

    続きを読む 一部表示
    30 分
  • The Shield, Not the Weapon: Ethical AI Surveillance with Ram Bulusu of Warp9Ai
    2025/03/31

    In this thought-provoking episode of Data Hurdles, hosts Chris Detzel and Michael Burke speak with Ram Bulusu, Head of Applied Artificial Intelligence of Warp9Ai about his work developing advanced surveillance technologies for public safety applications. The conversation primarily explores Ram's development of an AI-enabled camera system designed for airports and border crossings that uses multimodal data inputs to identify potential security threats in real-time.

    Ram explains his concept of "benevolent monitoring" - using AI surveillance as a protective shield rather than a controlling weapon - and details how his proposed system could help prevent security breaches, traffic accidents, and crimes by detecting behavioral patterns before incidents occur. He discusses the technical challenges of creating real-time monitoring systems, including energy requirements and data management issues, while addressing concerns about privacy and government oversight.

    The discussion also touches on Ram's other AI projects, including an interactive AI psychotherapist designed to provide immediate mental health support for those in crisis. Throughout the episode, hosts Chris and Mike raise thoughtful questions about the ethical implications, privacy concerns, and potential benefits of these emerging surveillance technologies, creating a balanced exploration of how AI might transform public safety and security in the coming years.

    続きを読む 一部表示
    40 分
  • Breaking Data Silos: AI-Ready Data Strategies with Nishith Trivedi, Enterprise Data Governance and Global MDM Lead at Pfizer
    2025/03/17

    In this insightful episode of Data Hurdles, hosts Chris Detzel and Michael Burke sit down with Nishith Trivedi, Enterprise Data Governance and Global MDM Lead at Pfizer. Nishith shares his journey from chemical engineering to becoming a data expert, and details how his team is transforming Pfizer's data landscape to support AI initiatives.

    Nishith provides a fascinating look at how a pharmaceutical giant manages data across multiple verticals—from supply chain to R&D—while explaining the challenges of making data "AI-ready." He discusses the evolution from vector-based RAG to graph-based approaches, the importance of ontologies in preventing AI hallucinations, and how knowledge graphs help connect unstructured data.

    The conversation explores how Pfizer is navigating complex regulatory requirements across 150+ countries, the shift toward patient-centric approaches, and the vision for creating FAIR data (Findable, Accessible, Interoperable, and Reusable). Listeners will gain valuable insights into enterprise data governance, the future of agentic AI, and practical strategies for breaking down data silos in large organizations.


    続きを読む 一部表示
    44 分
  • DeepSeek's Cost-Efficient Model Training ($5M vs hundreds of millions for competitors)
    2025/02/22

    The episode features hosts Chris Detzel and Michael Burke discussing DeepSeek, a Chinese AI company making waves in the large language model (LLM) space. Here are the key discussion points:

    Major Breakthrough in Cost Efficiency:
    - DeepSeek claimed they trained their latest model for only $5 million, compared to hundreds of millions or billions spent by competitors like OpenAI
    - This cost efficiency created market disruption, particularly affecting NVIDIA's stock as it challenged assumptions about necessary GPU resources

    Mixture of Experts (MoE) Innovation:
    - Instead of using one large model, DeepSeek uses multiple specialized "expert" models
    - Each expert model focuses on specific areas/topics
    - Uses reinforcement learning to route queries to the appropriate expert model
    - This approach reduces both training and inference costs
    - DeepSeek notably open-sourced their MoE architecture, unlike other major companies

    Technical Infrastructure:
    - Discussion of how DeepSeek achieved results without access to NVIDIA's latest GPUs
    - Highlighted the dramatic price increase in NVIDIA GPUs (from $3,000 to $30,000-$50,000) due to AI demand
    - Explained how inference costs (serving the model) often exceed training costs

    Chain of Thought Reasoning:
    - DeepSeek open-sourced their chain of thought reasoning system
    - This allows models to break down complex questions into steps before answering
    - Improves accuracy on complicated queries, especially math problems
    - Comparable to Meta's LLAMA in terms of open-source contributions to the field

    Broader Industry Impact:
    - Discussion of how businesses are integrating AI into their products
    - Example of ZoomInfo using AI to aggregate business intelligence and automate sales communications
    - Noted how technical barriers to AI implementation are lowering through platforms like Databricks

    The hosts also touched on data privacy concerns regarding Chinese tech companies entering the US market, drawing parallels to TikTok discussions. They concluded by discussing how AI tools are making technical development more accessible to non-experts and mentioned the importance of being aware of how much personal information these models collect about users.

    続きを読む 一部表示
    25 分
  • Clean Data, Business Context, and the Future of Analytics - Featuring Noy Twerski, Sherloq Co-founder & CEO
    2025/02/17

    This episode of Data Hurdles features an in-depth conversation with Noy Twerski, CEO and Co-founder of Sherloq, a collaborative SQL repository platform. The discussion, hosted by Chris Detzel and Michael Burke, explores several key themes in data analytics and management.

    Key Topics Covered:

    1. Introduction to Sherloq
    - Sherloq is introduced as a plugin that integrates with various SQL editors including Databricks, Snowflake, and JetBrains editors
    - The platform serves as a centralized repository for SQL queries, addressing the common problem of scattered SQL code across organizations

    2. Origin Story
    - Twerski shares her background as a product manager who experienced firsthand the challenges of managing SQL queries
    - The company was founded about 2.5 years ago with her co-founder Nadav, whom she knew from computer science undergrad
    - They identified the problem through extensive user research, finding that 80% of data analysts struggled with locating their tables, fields, and SQL

    3. Business Context and AI Discussion
    - A significant portion of the conversation focuses on the relationship between SQL, business context, and AI
    - The hosts and guest discuss the challenges of automating SQL generation through AI, emphasizing the importance of business context
    - They explore why text-to-SQL solutions are more complex than they appear, particularly in enterprise settings

    4. Future Outlook
    - Discussion of Sherloq's future plans, focusing on deepening their collaborative SQL repository capabilities
    - Exploration of how the platform could serve as infrastructure for future AI capabilities
    - Consideration of data quality as an ongoing challenge in the enterprise data space

    5. Industry Insights
    - The conversation includes broader discussions about data quality, governance, and the evolution of data teams
    - Twerski shares insights about different user personas and how they approach the product differently

    Notable Aspects:
    - The podcast includes interesting perspectives on the future of data analytics and AI
    - There's a strong emphasis on practical business applications and real-world challenges
    - The hosts and guest share thoughtful insights about data quality as a persistent challenge in the industry

    The episode provides valuable insights for data professionals, particularly those interested in data management, SQL development, and the evolution of data tools in an AI-driven landscape.

    続きを読む 一部表示
    34 分
  • Top 10 MDM 2025 Platforms - Who's Rising, Who's Falling & Why It Matters
    2024/12/01

    The Data Hurdles Impact Index (DHII) provides a comprehensive analysis of the top Master Data Management platforms, evaluating vendors based on multi-domain capabilities, core features, AI enablement, data governance integration, architecture flexibility, total cost of ownership, market reach, and vendor stability. This inaugural DHII analysis covers ten leading MDM platforms that are shaping enterprise data management in 2025.

    The assessment, led by 20-year MDM veteran Rohit Singh Verma, Director - Data practice, Nvizion Solutions, examines market leaders and emerging players including Informatica, Stibo Systems, Profisee, Reltio, Ataccama, TIBCO EBX, IBM Infosphere MDM, SAP MDM, Syndigo, and Viamedic. Each vendor is evaluated through the lens of practical implementation experience, market presence, and technological innovation.

    Key findings reveal Informatica's continued dominance with their IDMC cloud offering, though facing increasing pressure in specific domains from specialists like Stibo Systems in product data management. The analysis highlights a significant market opportunity in the Middle East, where only select vendors have established strong presences. The DHII also identifies critical factors beyond technical capabilities, including the importance of system integrator networks, implementation speed, and regional market penetration.

    The evaluation exposes interesting market dynamics, such as the challenges faced by legacy vendors like IBM and SAP in keeping pace with cloud-native solutions, and the emergence of AI-enabled capabilities as a key differentiator. The analysis also addresses the persistent challenge of high implementation failure rates (estimated at 75%) and how vendors are evolving to address this through improved user interfaces, AI-assisted implementations, and stronger partner ecosystems.

    This groundbreaking DHII assessment serves as an essential guide for organizations navigating the complex MDM vendor landscape, offering insights that go beyond traditional analyst evaluations to provide a practical, implementation-focused perspective on the market's leading solutions.

    続きを読む 一部表示
    1 時間 7 分
  • The Future of Data Teams in the AI Era: Insights from Alex Welch, dbt Labs' Head of Data and Analytics
    2024/11/01

    In this insightful episode of Data Hurdles, hosts Chris Detzel and Michael Burke sit down with Alex Welch, Head of Data at dbt Labs, to explore the transformative impact of AI on data organizations and the future of analytics.

    With over a decade of experience in FinTech and now leading data initiatives at dbt Labs, Alex shares valuable perspectives on:

    • Data Quality & Governance:
    - The critical importance of establishing data quality frameworks
    - How to approach data governance without creating unnecessary friction
    - The balance between control and accessibility in data management

    • AI Implementation & Challenges:
    - Two major hurdles in AI adoption: data/tech debt and the skills/culture gap
    - Practical approaches to introducing AI into existing workflows
    - The importance of starting small rather than trying to "boil the ocean"

    • Future of Data Teams:
    - Emerging roles like prompt engineering specialists and AI ethics officers
    - The shift from hierarchical structures to dynamic pod-based teams
    - How human-AI collaboration will reshape organizational structures

    • Skills & Development:
    - Why traditional analytical skills remain crucial in the AI era
    - The importance of maintaining human judgment and expertise
    - How to prepare for an AI-augmented workplace

    The conversation takes an especially interesting turn when discussing practical applications of AI, including Alex's personal example of using AI for meal planning and grocery shopping automation. The hosts and guest also explore thought-provoking perspectives on maintaining human expertise while leveraging AI capabilities, emphasizing the importance of using AI to augment rather than replace human decision-making.

    The episode concludes with valuable insights about preparing organizations for emerging AI trends and the importance of considering security implications in an AI-enabled future.

    This episode is particularly relevant for:
    - Data leaders planning AI initiatives
    - Organizations navigating data quality challenges
    - Professionals interested in the future of data careers
    - Anyone looking to understand the practical implications of AI in business

    続きを読む 一部表示
    51 分
  • Data Mesh in Action: Challenges, Opportunities, and Real-World Examples with Willem Koenders
    2024/09/29

    In this comprehensive episode of Data Hurdles, hosts Chris Detzel and Michael Burke engage in a deep and insightful conversation with Willem Koenders, a global data strategy leader at ZS Associates, about the increasingly popular concept of data mesh.

    The episode begins with Willem providing his background and expertise in the data field, setting the stage for a rich discussion. He explains the core concept of data mesh, describing it as a domain-driven approach to data architecture that emphasizes decentralized ownership and governance of data across an organization.

    Throughout the conversation, Willem uses various analogies to make the concept more accessible, likening data mesh to a net with strategic data nodes, and comparing data assets to real estate properties that need proper management and care. These analogies help illustrate the shift from centralized data warehouses or lakes to a more distributed, domain-oriented approach.

    The hosts and guest delve into the challenges of implementing data mesh, including cultural shifts required within organizations. Willem emphasizes the importance of clear ownership, quality control, and the need for a product-oriented mindset when it comes to data assets. He discusses how data mesh can help solve long-standing issues of data quality and accessibility that many organizations face.

    Real-world examples and case studies are shared, providing listeners with practical insights into how data mesh principles are being applied across various industries. Willem talks about the financial sector's early adoption of similar concepts and how medical technology companies are now embracing data mesh to deal with evolving market demands and data-generating products.

    The conversation also covers the critical aspect of data governance in a mesh environment. Willem explains how governance needs to be balanced between centralized standards (especially for security) and domain-specific controls. He stresses the importance of enablement and providing the right tools for domain teams to manage their data effectively.

    Chris and Michael bring up the challenges of cross-functional collaboration and the often siloed nature of data work in organizations. Willem acknowledges these difficulties and discusses strategies for improving communication and alignment between different teams and roles.

    The episode explores how to measure the business impact of data mesh implementations. Willem advocates for a portfolio approach, where organizations track the value generated by specific data assets and their associated use cases, rather than focusing solely on technology investments.

    Looking to the future, the discussion touches on the potential for data mesh to become a dominant data architecture approach, especially for larger and more complex organizations. Willem expresses hope that evolving tools and technologies, including AI, will make data mesh implementation more accessible to a broader range of companies.

    Throughout the episode, the hosts and guest maintain a balanced view, acknowledging both the potential benefits and the significant challenges of adopting a data mesh approach. They emphasize that success depends not just on technology, but on organizational culture, trust, and effective communication.

    The conversation concludes with reflections on the importance of building trust between different parts of an organization and how frameworks like data mesh can facilitate better collaboration and data utilization when implemented thoughtfully.

    This episode provides listeners with a comprehensive overview of data mesh, blending theoretical concepts with practical insights and real-world examples. It offers valuable perspectives for data professionals, business leaders, and anyone interested in modern data architecture and management strategies.

    続きを読む 一部表示
    42 分