AI Wanna Talk

Episodes

Deepseek R1, ChatGPT Operator, Project Stargate, and More

2025/01/31
In this episode of AI Wanna Talk, host Jacob Norgord dives into a whirlwind of recent AI developments, covering everything from new models and massive investments to cutting-edge tools and applications. This episode is packed with insights about the ever-changing landscape of artificial intelligence.
Key topics covered include:
Project Stargate, a $500 billion US initiative to build AI infrastructure across the country, involving companies like OpenAI, SoftBank, Oracle, and Nvidia.
The groundbreaking Deepseek R1 model, which achieves performance on par with OpenAI's premier models at a fraction of the cost.
A discussion on how Deepseek R1 is open source and what this means for how it can be used, including through Perplexity.
How the release of Deepseek R1 has caused shifts in the perceived value of GPUs and other AI infrastructure.
Eleven Labs raising $180 million to enhance their AI audio technology, with the aim of making speech the standard way of interacting with devices.
OpenAI's "Operator" agent, which can control your browser to accomplish tasks and the early stages of its development.
Krea’s custom AI models and their real-time 3D capabilities.
Pika Labs' new 2.1 model, with a link to apply for early access.
Adam, a new product that is deemed to be the future of CAD with its ability to generate CAD models from simple prompts.
The release of another Chinese model, Qwen2.5-Max, which is said to perform better than GPT-4o and Claude 3.5 Sonnet.
An update on Grok, which is reportedly three times faster and has the ability to search the web and X (formerly Twitter).
A tutorial on how to accept payments from AI agents.
Deepseek's Janus-Pro 7B image model.
Kanye West hiring an AI team.
This episode emphasizes how quickly AI is evolving, with innovative models and tools being released at an impressive rate, potentially leading to significant shifts in how we interact with technology in the future.
Links & Resources:
Deepseek
Perplexity
Windsurf
Eleven Labs Reader App
Pika Labs 2.1 early access
Adam CAD
Qwen2.5-Max
Grok
Accepting Payments from AI Agents tutorial
Deepseek Janus-Pro 7B
Kanye West's Yeezy AI Team email
21 min

Grok Site, Perplexity Acquisitions, AI Tutor, and More

2025/01/22
In this episode of "AI Wanna Talk," host Jacob Norgord explores a wide range of AI developments, focusing on new tools, platforms, and applications. The episode discusses Perplexity's recent activities, including their bid for TikTok, their acquisition of read.cv, and their new sports feature. It also examines new AI models, such as OpenAI’s o3-mini, and Apple's AI-generated news summaries, and features a number of AI-powered tools including Windsurf, Synthesis Tutor 2.0, FASHN AI, Krea AI, LumaLabs' Ray2, and Grok.
Key topics covered include:
Perplexity's potential acquisition of TikTok and their purchase of read.cv, and how these might integrate with their search platform.
A look at Perplexity's new real-time sports update feature, which can be displayed on a lock screen and provide in-depth game statistics, potentially replacing apps like ESPN.
OpenAI’s o3-mini model and its advantages in speed and efficiency.
A critique of Apple's AI-generated news summaries and the features of the new iPhone, such as emoji creation and image playground.
An overview of Windsurf, an AI development platform that now has web searching capabilities, which allows it to research and implement APIs, and auto-generate memories to improve its programming efficiency.
An introduction to Synthesis Tutor 2.0, an AI-driven tutor for kids that provides visual content and adapts to individual learning styles.
A demonstration of FASHN AI’s technology, which allows users to see how clothing designs would look on a model.
A look at Krea AI’s 3D layering tool that modifies a static image in real-time based on the orientation of a 3D object.
An overview of LumaLabs' Ray2 video model.
An update on Grok, which now has its own standalone website, as well as an iOS app, and can search both the web and X (formerly Twitter).
The podcast emphasizes how AI is evolving, with Perplexity leading in innovation and companies constantly pushing the boundaries of AI tools and applications.
Links & Resources:
• Potential Perplexity Acquisition
• Read.cv
• OpenAI's o3-mini
• Apple Pulls Back Generative AI
• Synthesis Tutor 2.0 Demo
• FASHN AI
• Krea 3D
• LumaLabs Ray2
• Grok
17 min

xAI's New Grok App, Smaller Models, Hardware, and Time Allocation

2025/01/13
Correction: The Bee device records all participants in a conversation while being able to identify and distinguish your voice from other speakers.

In this episode of "AI Wanna Talk," host Jacob Norgord dives into the latest developments in AI, covering new apps, models, hardware, and even some thought-provoking ideas about the future of work and human interaction.
AI App and Model Updates:
xAI's Grok App: A detailed look at xAI's Grok app, including its web-searching capabilities and integration with Twitter, offering a real-time information experience. The app allows image uploads and generation and is currently available on iPhone, with a web app coming soon.
Bytedance AI Video Upscaler: Discussion of Bytedance’s new AI video upscaler, which uses stable diffusion techniques to enhance video quality, potentially impacting platforms like TikTok.
Cohere’s North Platform: An overview of Cohere’s North platform, an all-in-one AI workspace designed for enterprises. It integrates LLMs, search, and AI agents into workplace tools like Google Drive, Gmail, and GitHub.
Meta's Byte-Latent Transformer: Explanation of Meta's new byte-latent transformer, which processes data at the byte level, improving performance, reducing computational resources and handling multiple languages better. This approach bypasses traditional tokenization.
Microsoft's Phi Model: Introduction of Microsoft's smaller 14-billion-parameter Phi-4 model, noting its ability to run locally and achieve high performance on benchmarks like MMLU.
Kokoro 82M Model: Overview of the Kokoro 82M text-to-speech model, which can run locally with only 350MB of RAM, making it ideal for device integration.
AI Hardware and Wearables:
OMI Device: An exploration of the OMI wearable, a necklace-like device that provides contextually relevant information based on conversations and aims to connect directly to the brain in the future.
Bee Device: Discussion of the Bee wrist-worn device, which summarizes conversations, suggests to-dos, and creates daily memories while respecting privacy, priced at $50.
Future of AI and Society:
AI-Driven Advertising: Insight into how advertising might shift to target AI agents rather than humans, as suggested by Perplexity's CEO.
Changing Time Allocation: A discussion of Paul Graham's tweet about increased time spent at home since 2003, raising questions about the impact of AI on our lives and how we'll spend our time. The podcast also touches on the idea of intentional inconvenience in the future.
Links & Resources:
Adobe TransPixar AI
Aravind's Advertising Prediction
Bytedance's STAR AI
Cohere’s North
Meta's Byte-Latent Transformer
Time Spent at Home
xAI
Small Models
Microsoft Phi-4
Kokoro-82M
Wearables
omi
Bee
31 min

OpenAI Tasks, Perplexity Data Integrations, Anyone Can Build, and More

2024/12/30
In this episode of "AI Wanna Talk," host Jacob Norgord explores various new developments in AI, focusing on product updates and innovative applications from different companies. This episode emphasizes the evolving capabilities of AI and its potential impact on various industries.
AI Product and Feature Updates:
OpenAI's Task Feature: Discussion of OpenAI's task automation feature, which allows users to create automations that enable AI to perform specific tasks at specific times, such as weekly weather forecasts. This is designed to enhance the value of AI for users with frequent, recurring requests.
tldraw computer: Introduction to tldraw computer, an online collaborative platform for brainstorming and creating AI and natural language processing workflows. Users can draw out workflows with components like text boxes, images, and audio clips to create complex automations.
Google's NotebookLM UI: Overview of the new UI for Google's NotebookLM, featuring a layout with sources on the left and document generation tools on the right, including options for creating podcasts, study guides, FAQs, and timelines. The new interface also provides a central pane for source summaries and question prompts.
AI Model and Integration Developments:
Perplexity's Acquisition of Carbon: Details on Perplexity's acquisition of Carbon, a retrieval engine that connects external data sources to large language models. This will allow Perplexity to integrate data from apps like Notion and Google Docs, creating a centralized hub for user information.
Eleven Labs Flash Model: Announcement of Eleven Labs' new Flash model, which generates speech in 75 milliseconds, enabling more human-like interactions with AI voice models. This model aims for low latency and is being targeted for integration into various products, such as video games.
ChatGPT Integration: Discussion of ChatGPT's new feature allowing it to work directly with apps like Apple Notes and Notion, accessing all data within these applications rather than just screen displays. This feature positions ChatGPT as a central interface for interacting with data across various applications.
AI Agents and Data:
Firecrawl and AI Agents: Introduction to Firecrawl, a company focused on providing AI models with high-quality data by scraping the web for specific data sets. They are also hiring AI agents (not humans) to work within their system and are paying $10,000 to $15,000 for the use of these agents.
Vertical AI Agents: Explanation of vertical AI agents, which are specialized AI systems designed for industry-specific tasks. Examples include agents for finance or law that can take action within their respective fields.
AI Software Creation: Discussion of platforms like Windsurf, Bolt, and Cursor, which allow users without coding experience to create software using AI.
Tempo Labs: Introduction to Tempo Labs, a code-first alternative to Figma, powered by AI. This platform generates functional code by prototyping user interfaces and allowing users to focus on core ideas rather than code.
AI and Human Cognition:
Human Brain Processing Speed: Exploration of an article highlighting that the human brain processes information at a rate of only 10 bits per second. The podcast discusses how, despite this slow rate, humans are able to distill vast amounts of data efficiently.
AI Constraints: Speculation on the idea that mimicking human constraints in AI data processing may be key to achieving more nuanced and contextual understanding in AI.
Other Interesting Points:
Fake AI Band: A story about a person who created a fake band with AI and made $10 million by also creating fake AI fans. This is framed as a humorous example of how AI can be used in unexpected and potentially fraudulent ways.
The focus on AI Agents: Going into 2025, there
19 min

Multiple New Google Models, OpenAI Projects, Grok is Free, Particle News, and More

2024/12/18
In this episode of "AI Wanna Talk," host Jacob Norgord dives into the latest advancements in AI, exploring practical applications and significant announcements from major tech companies. This episode covers a range of new AI products, features, and research.
OpenAI's Projects Feature:
Discussion of OpenAI's "Projects" feature, which allows users to upload files and provide explicit instructions to maintain context throughout a conversation with AI models, addressing the common issue of AI forgetting earlier parts of a conversation. This feature is available for ChatGPT Plus or Pro members, or those with a team account.
Google's Gemini 2.0 Model and AI Agents:
An overview of Google's new Gemini 2.0 family of models, focusing on the "agentic era" where AI acts proactively on the user's behalf.
Details on the experimental Gemini 2.0 Flash model, a smaller model designed for low latency and integration into various Google experiences, such as Google Sheets, Docs, and Search.
Explanation of the "whisk" experiment, allowing users to combine objects from multiple images.
Information on Google's new state-of-the-art video model, Veo 2, a competitor to OpenAI’s Sora.
xAI's Grok and Mainframe's AI Agents:
Announcement that xAI's Grok AI assistant is now free for all X users, highlighting its ability to access real-time data from the web and the X platform.
An introduction to Mainframe, a company developing AI agents that work without human intervention, focusing on their first stage rollout called "Cobbot," which will consist of a suite of AI agents to accelerate teams.
AI and Employment:
Discussion of how the company CLA is using AI to boost productivity and potentially replace roles, and the potential implications for the job market and quality of output.
Particle News App:
Highlight of the Particle News app, which uses a TikTok-like algorithm for personalized news feeds and includes features like article narration and AI-powered Q&A.
Social Media and Teen Depression:
Exploration of data from Jonathan Haidt's "The Anxious Generation", revealing a correlation between the rise of smartphones and social media and the increase in teen depression.
Links:
Google Labs
Veo 2 Waitlist
Grok
Particle News App
Twitter thread on "The Anxious Generation"
19 min

Eleven Labs, Amazon’s “Nova” Model, ChatGPT Pro, Microsoft Copilot Vision, Llama 3.3 70B, OpenAI’s Sora, and More

2024/12/10
(Definitely meant 25th power not 25th degree toward the end of the episode)
In this episode of "AI Wanna Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode covers several major developments in the AI landscape:
Eleven Labs' Innovation in Audio AI
Eleven Labs has launched an AI Podcast Generator through their 11 Reader iOS app, enabling podcast creation from various text sources in 32 languages.
The company has also introduced a platform for building custom AI agents with configurable voices and response styles.
Amazon and OpenAI's New Models
Amazon has introduced Nova, their foundational model focused on math, science, coding, and reasoning tasks.
OpenAI has launched a $200/month ChatGPT Pro tier, providing advanced access to GPT-4's capabilities.
Microsoft and Meta's Developments
Microsoft's Copilot Vision enables screen-aware AI assistance within the Edge browser.
Meta's Llama 3.3 70B demonstrates improved efficiency through quality training data.
Productivity and On-Device AI
The Twos app introduces PAL (Personal Active List) for AI-powered task management.
Apollo AI brings on-device AI capabilities to iOS devices.
Google's Advances
Gemini exp-1206 features a 2 million token context window and surpasses GPT-4o on LM Arena.
The Illuminate experiment enables podcast creation with customizable styles.
Breakthrough Technologies
OpenAI's Sora introduces advanced text-to-video generation capabilities.
Google's Willow quantum computing chip achieves significant computational breakthroughs.
Sundar Pichai proposes space-based quantum computing collaboration with Elon Musk.
Links:
Apollo AI
Copilot Vision
ElevenLabs
Google Illuminate
ElevenReader (iOS and Android)
OpenAI Sora
Twos
21 min

OpenAI vs. Anthropic, Claude's "Styles" and "MCP", Microsoft AI's "Long-Term Memory", and Bronze's Chroma Acquisition

2024/11/27
In this episode of "AI Wanna Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. This episode focuses on four major developments in the AI landscape:
New Features for Anthropic's Claude
Anthropic has introduced two new features for its Claude AI chatbot: Styles and Model Context Protocol (MCP).
Styles enables users to customize how Claude responds using presets like "concise," "explanatory," or "formal."
Model Context Protocol (MCP) acts as a "universal translator" for AI and data sources, allowing Claude to connect with external sources like files or websites and interact with them.
MCP enables Claude to perform complex tasks such as generating images based on user requests, writing code, and integrating images into websites.
Microsoft AI's Apparent Long-Term Memory Breakthrough
Microsoft AI CEO Mustafa Suleyman believes long-term memory is the crucial missing element in current AI chatbots.
Microsoft AI is working on incorporating long-term memory into its Copilot chatbot, enabling it to retain information from previous conversations and use it to provide more personalized and accurate responses.
The goal is to eliminate the need for users to constantly re-explain context and make interactions with AI more natural and efficient.
Bronze AI Acquires Chroma
Bronze AI, known for its innovative Bronze file format that creates dynamic music experiences, has acquired Chroma, a company specializing in audiovisual entertainment for mobile devices.
The acquisition suggests potential for combining Bronze's evolving music with Chroma's visual expertise, creating a multisensory experience for listeners.
The collaboration may push the boundaries of art by blending music and visuals in innovative ways.
Links:
More info on Claude Styles
More info on Model Context Protocol (MCP)
Pi
Mustafa Suleyman Interview Excerpt
Dot by New Computer
Bronze
“Jasmine” by Jai Paul (Bronze Version)
22 min

ChatGPT Now Works With Apps, Google's New Gemini App, Perplexity Shopping, and Hume AI's Storyteller

2024/11/18
In this episode of "AI Wanna Talk," host Jacob Norgord explores recent AI advancements, focusing on their practical applications. The episode covers four major developments in the AI landscape:
ChatGPT App Integrations:
OpenAI has introduced a new feature allowing ChatGPT to interact with external applications.
Currently limited to coding apps like Xcode, TextEdit, Terminal, iTerm2, and VS Code.
Provides ChatGPT with context from the user's active code, improving code-related responses.
Although currently focused on coding, it has potential for wider application in the future.
Future possibilities include integration with design tools like Figma and music software like Ableton.
The Google Gemini App:
Google has released a new app called Gemini, featuring their AI model of the same name.
The app allows users to message Gemini and request image generation.
Includes Gemini Live, enabling real-time conversational interaction with the AI.
Notable for high-quality voices, rapid response times, and internet search capability.
However, the live chat feature might provide inaccurate information for niche queries without citing sources.
Perplexity Shopping:
Perplexity introduces Perplexity Shopping, a streamlined platform for product research and purchase.
Aggregates relevant product information for easy comparison and purchase without navigating multiple websites.
Requires a Perplexity Pro membership for direct purchases through the platform.
Hume AI's Storyteller Feature:
Hume AI is an AI company specializing in voice AI with an emphasis on emotional understanding.
Their iPhone app features a storytelling AI that generates images to accompany its narratives.
Highlights the potential of AI for innovative storytelling through the combination of voice and image generation.
Links:
ChatGPT Work with Apps
Google Gemini App (Apple)
Google Gemini App (Android)
Perplexity Shopping
Hume AI
14 min
