エピソード

  • OpenAI Tasks, Perplexity Data Integrations, Anyone Can Build, and More
    2024/12/30

    In this episode of "AI Want to Talk," host Jacob Norgord explores various new developments in AI, focusing on product updates and innovative applications from different companies. This episode emphasizes the evolving capabilities of AI and its potential impact on various industries.

    AI Product and Feature Updates:

    • OpenAI's Task Feature: Discussion of OpenAI's task automation feature, which allows users to create automations that enable AI to perform specific tasks at specific times, such as weekly weather forecasts. This is designed to enhance the value of AI for users with frequent, recurring requests.
    • TL Draw Computer: Introduction to TL Draw Computer, an online collaborative platform for brainstorming and creating AI and natural language processing workflows. Users can draw out workflows with components like text boxes, images, and audio clips to create complex automations.
    • Google's NotebookLM UI: Overview of the new UI for Google's NotebookLM, featuring a layout with sources on the left and document generation tools on the right, including options for creating podcasts, study guides, FAQs, and timelines. The new interface also provides a central pane for source summaries and question prompts.

    AI Model and Integration Developments:

    • Perplexity's Acquisition of Carbon: Details on Perplexity's acquisition of Carbon, a retrieval engine that connects external data sources to large language models. This will allow Perplexity to integrate data from apps like Notion and Google Docs, creating a centralized hub for user information.
    • Eleven Labs Flash Model: Announcement of Eleven Labs' new Flash model, which generates speech in 75 milliseconds, enabling more human-like interactions with AI voice models. This model aims for low latency and is being targeted for integration into various products, such as video games.
    • ChatGPT Integration: Discussion of ChatGPT's new feature allowing it to work directly with apps like Apple Notes and Notion, accessing all data within these applications rather than just screen displays. This feature positions ChatGPT as a central interface for interacting with data across various applications.

    AI Agents and Data:

    • Firecrawl and AI Agents: Introduction to Firecrawl, a company focused on providing AI models with high-quality data by scraping the web for specific data sets. They are also hiring AI agents (not humans) to work within their system and are paying $10,000 to $15,000 for the use of these agents.
    • Vertical AI Agents: Explanation of vertical AI agents, which are specialized AI systems designed for industry-specific tasks. Examples include agents for finance or law that can take action within their respective fields.
    • AI Software Creation: Discussion of platforms like Windsurf, Bolt, and Cursor, which allow users without coding experience to create software using AI.
    • Tempo Labs: Introduction to Tempo Labs, a code-first alternative to Figma, powered by AI. This platform generates functional code by prototyping user interfaces and allowing users to focus on core ideas rather than code.

    AI and Human Cognition:

    • Human Brain Processing Speed: Exploration of an article highlighting that the human brain processes information at a rate of only 10 bits per second. The podcast discusses how, despite this slow rate, humans are able to distill vast amounts of data efficiently.
    • AI Constraints: Speculation on the idea that mimicking human constraints in AI data processing may be key to achieving more nuanced and contextual understanding in AI.

    Other Interesting Points:

    • Fake AI Band: A story about a person who created a fake band with AI and made $10 million by also creating fake AI fans. This is framed as a humorous example of how AI can be used in unexpected and potentially fraudulent ways.
    • The focus on AI Agents: Going into 2025, there
    続きを読む 一部表示
    19 分
  • Multiple New Google Models, OpenAI Projects, Grok is Free, Particle News, and More
    2024/12/18

    In this episode of "AI Wanna Talk," host Jacob Norgord dives into the latest advancements in AI, exploring practical applications and significant announcements from major tech companies. This episode covers a range of new AI products, features, and research.

    OpenAI's Projects Feature:

    • Discussion of OpenAI's "Projects" feature, which allows users to upload files and provide explicit instructions to maintain context throughout a conversation with AI models, addressing the common issue of AI forgetting earlier parts of a conversation. This feature is available for ChatGPT Plus or Pro members, or those with a team account.

    Google's Gemini 2.0 Model and AI Agents:

    • An overview of Google's new Gemini 2.0 family of models, focusing on the "agentic era" where AI acts proactively on the user's behalf.
    • Details on the experimental Gemini 2.0 Flash model, a smaller model designed for low latency and integration into various Google experiences, such as Google Sheets, Docs, and Search.
    • Explanation of the "whisk" experiment, allowing users to combine objects from multiple images.
    • Information on Google's new state-of-the-art video model, Veo 2, a competitor to OpenAI’s Sora.

    XAI's Grok and Mainframe's AI Agents:

    • Announcement that XAI's Grok AI assistant is now free for all X users, highlighting its ability to access real-time data from the web and the X platform.
    • An introduction to Mainframe, a company developing AI agents that work without human intervention, focusing on their first stage rollout called "Cobbot," which will consist of a suite of AI agents to accelerate teams.

    AI and Employment:

    • Discussion of how the company CLA is using AI to boost productivity and potentially replace roles, and the potential implications for the job market and quality of output.

    Particle News App:

    • Highlight of the Particle News app, which uses a TikTok-like algorithm for personalized news feeds and includes features like article narration and AI-powered Q&A.

    Social Media and Teen Depression:

    • Exploration of data from Jonathan Height's "The Anxious Generation", revealing a correlation between the rise of smartphones and social media with the increase in teen depression.

    Links:

    • Google Labs
    • Veo 2 Waitlist
    • Grok
    • Particle News App
    • Twitter thread on "The Anxious Generation"
    続きを読む 一部表示
    19 分
  • Eleven Labs, Amazon’s “Nova” Model, ChatGPT Pro, Microsoft Copilot Vision, Llama 3.3 70B, OpenAI’s Sora, and More
    2024/12/10

    (Definitely meant 25th power not 25th degree toward the end of the episode)

    In this episode of "AI Wanta Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode covers several major developments in the AI landscape:

    Eleven Labs' Innovation in Audio AI

    • Eleven Labs has launched an AI Podcast Generator through their 11 Reader iOS app, enabling podcast creation from various text sources in 32 languages.
    • The company has also introduced a platform for building custom AI agents with configurable voices and response styles.

    Amazon and OpenAI's New Models

    • Amazon has introduced Nova, their foundational model focused on math, science, coding, and reasoning tasks.
    • OpenAI has launched a $200/month ChatGPT Pro tier, providing advanced access to GPT-4's capabilities.

    Microsoft and Meta's Developments

    • Microsoft's Copilot Vision enables screen-aware AI assistance within the Edge browser.
    • Meta's Llama 3 demonstrates improved efficiency through quality training data.

    Productivity and On-Device AI

    • The Twos app introduces PAL (Personal Active List) for AI-powered task management.
    • Apollo AI brings on-device AI capabilities to iOS devices.

    Google's Advances

    • Gemini exp-1206 features a 2 million token context window, surpassing ChatGPT 4.0 on LM Arena.
    • The Illuminate experiment enables podcast creation with customizable styles.

    Breakthrough Technologies

    • OpenAI's Sora introduces advanced text-to-video generation capabilities.
    • Google's Willow quantum computing chip achieves significant computational breakthroughs.
    • Sundar Pichai proposes space-based quantum computing collaboration with Elon Musk.

    Links:

    • Apollo AI
    • Copilot Vision
    • ElevenLabs
    • Google Illuminate
    • ElevenReader (iOS and Android)
    • OpenAI Sora
    • Twos
    続きを読む 一部表示
    21 分
  • OpenAI vs. Anthropic, Claude's "Styles" and "MCP", Microsoft AI's "Long-Term Memory", and Bronze's Chroma Acquisition
    2024/11/27

    In this episode of "AI Wanta Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode focuses on four major developments in the AI landscape:

    Anthropic AI's Claude New Features

    • Anthropic AI has introduced two new features for its Claude AI chatbot: Styles and Model Context Protocol (MCP).
    • Styles enables users to customize how Claude responds using presets like "concise," "explanatory," or "formal."
    • Model Context Protocol (MCP) acts as a "universal translator" for AI and data sources, allowing Claude to connect with external sources like files or websites and interact with them.
    • MCP enables Claude to perform complex tasks such as generating images based on user requests, writing code, and integrating images into websites.

    Microsoft AI's Apparent Long-Term Memory Breakthrough

    • Microsoft AI CEO Mustafa Suleyman believes long-term memory is the crucial missing element in current AI chatbots.
    • Microsoft AI is working on incorporating long-term memory into its Copilot chatbot, enabling it to retain information from previous conversations and use it to provide more personalized and accurate responses.
    • The goal is to eliminate the need for users to constantly re-explain context and make interactions with AI more natural and efficient.

    Bronze AI Acquires Chroma

    • Bronze AI, known for its innovative Bronze file format that creates dynamic music experiences, has acquired Chroma, a company specializing in audiovisual entertainment for mobile devices.
    • The acquisition suggests potential for combining Bronze's evolving music with Chroma's visual expertise, creating a multisensory experience for listeners.
    • The collaboration may push the boundaries of art by blending music and visuals in innovative ways.

    Links:

    • More info on Claude Styles
    • More info on Model Context Protocal (MCP)
    • Pi
    • Mustafa Suleyman Interview Excerpt
    • Dot by New Computer
    • Bronze
      • “Jasmine” by Jai Paul (Bronze Version)
    続きを読む 一部表示
    22 分
  • ChatGPT Now Works With Apps, Google's New Gemini App, Perplexity Shopping, and Hume AI's Storyteller
    2024/11/18

    In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. The episode covers four major developments in the AI landscape:

    ChatGPT App Integrations:

    • OpenAI has introduced a new feature allowing ChatGPT to interact with external applications.
    • Currently limited to coding apps like Xcode, TextEdit, iTerm2 Terminal, and VS Code.
    • Provides ChatGPT with context from the user's active code, improving code-related responses.
    • Although currently focused on coding, it has potential for wider application in the future.
    • Future possibilities include integration with design tools like Figma and music software like Ableton.

    The Google Gemini App:

    • Google has released a new app called Gemini, featuring their AI model of the same name.
    • The app allows users to message Gemini and request image generation.
    • Includes Gemini Live, enabling real-time conversational interaction with the AI.
    • Notable for high-quality voices, rapid response times, and internet search capability.
    • However, the live chat feature might provide inaccurate information for niche queries without citing sources.

    Perplexity Shopping:

    • Perplexity introduces Perplexity Shopping, a streamlined platform for product research and purchase.
    • Aggregates relevant product information for easy comparison and purchase without navigating multiple websites.
    • Requires a Perplexity Pro membership for direct purchases through the platform.

    Hume AI's Storyteller Feature:

    • Hume AI is an AI company specializing in voice AI with an emphasis on emotional understanding.
    • Their iPhone app features a storytelling AI that generates images to accompany its narratives.
    • Highlights the potential of AI for innovative storytelling through the combination of voice and image generation.

    Links:

    • ChatGPT Work with Apps
    • Google Gemini App (Apple)
    • Google Gemini App (Android)
    • Perplexity Shopping
    • Hume AI
    続きを読む 一部表示
    14 分
  • ChatGPT Search, Claude Visual PDFs, Ideogram Canvas, Runway Act-One, xAI's API, and Google Learn About
    2024/11/04

    In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. The episode covers six major developments in the AI landscape:

    ChatGPT Search:

    • Enables ChatGPT to search the web and inform answers, compensating for its limited knowledge cutoff date.
    • Offers faster search speeds compared to ChatGPT's previous web search feature.
    • Comparable to Perplexity in speed but potentially faster in the future.
    • Provides a Chrome extension to make it the default search engine.

    Claude's PDF Capabilities:

    • Introduces improved PDF handling, enabling Claude to understand PDFs with non-typed text, including handwritten notes.
    • Overcomes limitations of previous text-extraction methods.
    • Expands possibilities for using Claude with handwritten notes and other non-typed PDFs.

    Ideogram Canvas:

    • Launches a new feature called Canvas, described as an infinite creative board for organizing, generating, editing, and combining images.
    • Offers a gridless board for uploading and manipulating multiple images to inform AI image creation.
    • Potentially includes text functionality for creating custom fonts.

    Runway Act One:

    • Unveils Act One, a tool that allows users to record themselves and use their facial expressions to animate characters.
    • Eliminates the need for CGI or motion capture for creating animated characters.
    • Offers a quick and innovative way to create animated content for various media.

    xAI's API for Grok 2:

    • Releases an API for Grok 2, enabling developers to integrate xAI's Grok AI into their platforms.
    • Offers access to Grok, the chatbot accessible through Twitter or X, for building unique applications.
    • Provides developers with a more distinctive chatbot option for specific use cases.

    Google Learn About:

    • Introduces a new Google experiment called Learn About, designed for creating personalized learning curriculums.
    • Allows users to input text and images to generate a customized learning path.
    • Provides an AI-powered resource for effective learning, tailored to individual needs and preferences.

    Links mentioned in the episode:

    • Ideogram Canvas Video
      • Ideogram
    • Runway Act One Video
      • Runway
    • Google Learn About
    続きを読む 一部表示
    11 分
  • Anthropic's New Models, "Computer Use", Perplexity App, ProSearch, and ElevenLabs Voice Design
    2024/10/25

    In this episode of "AI I Want To Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications and potential impact on productivity. The episode covers three major developments in the AI landscape:

    Anthropic's Claude Update:

    • A new version of Claude with improved reasoning and coding capabilities.
    • Introduction of the "computer use" feature, allowing Claude to control a user's computer.
    • Discussion of potential applications in various industries and companies.
    • Acknowledgment of both the exciting possibilities and potential risks associated with this technology.

    Perplexity Pro Search Improvement:

    • Enhanced generative and multi-step reasoning capabilities.
    • Demonstration of its power through an example from Perplexity's CEO, creating a comprehensive table of key takeaways from Jeff Bezos' shareholder letters.

    ElevenLabs' Voice Design Feature:

    • Introduction of a new tool allowing users to create custom AI voices using prompts.
    • Brief overview of its potential applications and implications for voice technology.

    Links mentioned in the episode:

    • agent.exe
    • Perplexity Pro Search Example
    続きを読む 一部表示
    10 分
  • NotebookLM, Microsoft's New Copilot, and Prompt Engineering
    2024/10/25

    In the debut episode of “AI I Want To Talk,” host Jacob Norgord delves into the realms of AI and productivity, exploring their fascinating intersection. His goal: To simplify AI advancements and make them accessible for everyday use.

    • Jacob introduces NotebookLM with its latest updates, Google’s AI-powered note-taking tool. Unlike ChatGPT or Claude, it integrates various information sources, enabling AI interaction with personalized context.
    • NotebookLM’s “audio overview” feature creates insightful podcasts from user-provided information. Recent upgrades allow source-specific focus, enhancing learning experiences.
    • The episode covers Microsoft’s improved Copilot, developed by former Inflection co-founder Mustafa Suleyman. It shares similarities with Pi, known for concise responses and superior voice quality.
    • Jacob discusses prompt engineering’s evolution. He advocates for natural language communication with AI, moving away from rigid syntax. Generative AI’s probabilistic nature enables this shift.
    • He emphasizes explaining tasks to AI as you would to a human, providing context for better results. Jacob cites Ideogram as an example of effective AI interaction, stressing the importance of holistic task delegation.

    Links mentioned in the episode:

    • NotebookLM

    • Microsoft Copilot

    • Pi

    • Ideogram

    続きを読む 一部表示
    14 分