Big News of the Week
Google’s Gemini AI Upgrades Phone Calls with AI Mode
Google’s Gemini AI now enhances your phone calls by listening in real time and offering helpful suggestions like quick replies, summaries, and context-aware prompts. It can detect when you’re on a call, automatically pull up relevant info like contacts or calendar events, and even help manage your conversation flow. This means your calls become more efficient and less stressful, whether you’re scheduling, negotiating, or following up.
Google Adds AI Still-Image Animation with Veo
You can now turn your photos into 8-second videos with AI-generated sounds like speech and background noise using Google’s Veo 3 model. This lets you create dynamic video content easily from still images.
NotebookLM Adds Featured Notebooks from Major Publications
NotebookLM is featuring public notebooks from The Economist, The Atlantic, Oxford researchers, and more, and it’s so cool!
You can now:
- Ask questions about each notebook (with citations)
- Browse key topics using Mind Maps
- Listen to Audio Overviews for a quick summary
I’ve been enjoying this one:
Life guidance via Arthur C. Brooks’ The Atlantic columns
Over 140,000 public notebooks have already been shared since the feature launched last month.
Rollout starts now on desktop with more collections coming soon!
OpenAI Introduces ChatGPT Agent for Smarter Task Automation
OpenAI just launched ChatGPT Agent, a new feature that lets you create AI assistants to handle multi-step tasks automatically. You can customize these agents to manage workflows like scheduling, data retrieval, or customer support, freeing you from repetitive work. The agents learn from your instructions and adapt to complex requests, making your daily routines smoother and more efficient without needing constant input.
OpenAI Launches AI Web Browser
Aaaand… OpenAI just released a web browser that integrates ChatGPT directly into your browsing experience. This means as you surf the web, you get AI-generated summaries, real-time answers, and interactive help without switching apps.
Microsoft Copilot Vision AI Brings Screen and Desktop Scanning
Microsoft’s Copilot Vision AI now lets you scan your entire screen and desktop to quickly find information or complete tasks. Instead of switching between apps or manually searching, you can ask Copilot to analyze what’s on your screen—whether it’s documents, images, or windows—and get instant help. This saves you time and streamlines your workflow by making it easier to locate and act on what you need without breaking your focus.
Canva Integrates Anthropic Claude AI with MCP Support
Canva now lets you use Anthropic’s Claude AI directly within its platform.
Genspark Launches AI Pods to Create Professional Podcasts from One Prompt
You’ve seen this on NotebookLM, and now you can do it with Genspark!
With a single prompt, Genspark AI Pods turn any topic, webpage, video, or document into a polished podcast. It analyzes content, fact-checks, produces broadcast-quality audio, and creates dynamic hosts that sound natural. The voices are different from NotebookLM. Watch the demo here.
AWS to Launch AI Agent Marketplace with Anthropic as Partner
AWS is launching a new AI agent marketplace next week, partnering with Anthropic to offer you a wide range of AI agents for business tasks. This marketplace will let you discover, customize, and integrate AI-powered agents that automate workflows, handle customer service, analyze data, and more. This reminds me of when the Apple App Store opened up its shelves to developers.