Welcome back! Agents, supply chains and storyboards all meet in the same control room here. Voice helpers lean on outside brains, coding assistants reach into your own machine, regulators turn chip orders into a guessing game, and a video tool quietly spins product notes into global clips. The shared tension is simple: the more intelligence moves into the background, the more every hidden decision about where it runs and who owns it starts to matter.

In today’s Generative AI Newsletter:

  • Apple rebuilds Siri with Gemini-powered upgrade.

  • OpenAI Codex agent edits local code safely.

  • Alibaba plans NVIDIA H200 orders amid customs uncertainty.

  • FalcoCut turns product info into localized video ads.

Latest Developments

Apple is getting ready to bring Siri back, with a catch. Reports say Apple will rebuild Siri in two phases, starting with iOS 26.4 later this spring and culminating in a bigger, chatbot-style Siri in iOS 27. The reports point to Google Gemini models powering the new Siri, even as Apple frames it internally as its own system. That matters because if Apple cannot make Siri feel modern, Apple Intelligence becomes a marketing layer on top of a voice assistant people stopped trusting.

Here’s what the details show:

  • Timing: Apple plans to demo a major Siri update in the second half of February.

  • Phase 1: iOS 26.4 Siri aims to use personal data and on-screen context, built on “Apple Foundation Models version 10.”

  • Phase 2: iOS 27 and macOS 27 bring “Campos,” a conversational Siri powered by “version 11,” pitched as competitive with Gemini 3.

  • Risk: Apple may run early requests on Private Cloud Compute, but heavier queries could move to Google’s cloud later.

Apple is making the same bet as the rest of the industry: pick an AI supplier to ship the product, and negotiate control later. If a Gemini-backed Siri finally works, Apple will have a very polished front end for Google’s brain, and its willingness to push heavier queries to Google’s cloud shows it is prioritizing user experience over pride of ownership. If it fails, users will blame Apple, not Gemini.

Special highlight from our network

StudyFetch is an AI-powered learning platform that adapts to your individual learning style and pace, helping you master complex subjects more effectively.

It is trusted by law and medical exam takers, as well as professionals who need to quickly learn new domains and retain critical information.

Unlike one-size-fits-all explanations, StudyFetch personalizes how content is presented, so learning is more engaging, efficient, and actually sticks.

OpenAI showed how Codex CLI, its coding agent, thinks while it works on your machine, and explained the agent loop behind it: a simple cycle where the model either answers or calls a tool, and Codex executes the call, often through your local shell. The stakes are higher than with a chatbot. A normal chatbot can only hallucinate; a terminal agent can change or corrupt your repo, which is why Codex is engineered to make its code changes dependable.

Here’s what the breakdown reveals:

  • Loop: Codex repeats tool calls until it returns a final assistant message that ends the turn.

  • Endpoints: It can run against ChatGPT’s backend, the public Responses API, or a compatible local server.

  • Speed: It keeps the initial prompt stable to maximize cache hits, so repeated turns don’t reprocess the whole context from scratch.

  • Limits: As the context grows, Codex compacts it via /responses/compact, carrying forward “encrypted_content” instead of the entire history.
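The loop in the first bullet can be sketched in a few lines of Python. Everything here is illustrative: the `call_model` and `run_tool` helpers, the message shapes, and the fake single tool call are stand-ins for the pattern, not Codex’s actual implementation.

```python
# Minimal agent-loop sketch: the model either returns a final answer
# or requests a tool call, which the harness executes and feeds back.
# All names (call_model, run_tool, message shapes) are hypothetical.

def call_model(messages):
    # Stand-in for a real model endpoint. This fake model asks for one
    # shell command, then finishes once it sees the tool output.
    if not any(m["role"] == "tool" for m in messages):
        return {"type": "tool_call", "tool": "shell", "args": {"cmd": "ls"}}
    return {"type": "final", "content": "Listed the repo contents."}

def run_tool(tool, args):
    # Stand-in for actually executing a tool (e.g. via the local shell).
    return f"ran {tool} with {args}"

def agent_turn(user_prompt):
    messages = [{"role": "user", "content": user_prompt}]
    while True:
        reply = call_model(messages)
        if reply["type"] == "final":  # a final assistant message ends the turn
            return reply["content"]
        output = run_tool(reply["tool"], reply["args"])
        messages.append({"role": "tool", "content": output})

print(agent_turn("What files are in this repo?"))
```

The danger the article flags lives in `run_tool`: in a real harness that line touches your filesystem, which is why sandboxing and approvals matter more than the loop itself.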

Codex is a good snapshot of where agents are heading. The upside: portable endpoints, faster turns through caching, and compaction that keeps long tasks coherent. The downside: tool surfaces change, caches miss, and the agent is still one permission slip away from touching the wrong thing. The real shift is that agent work is moving from showcasing models to engineering for reliability.

China’s regulators are sending a mixed signal on NVIDIA’s H200 chips. Chinese officials reportedly told Alibaba, Tencent, and ByteDance to prepare orders, hinting that Beijing may be close to letting imports move again. However, another report says customs guidance told agents that the H200 is not permitted to enter China, describing it as “basically a ban for now.” That gap matters if you are leading an AI team: planning a cluster is easy; getting the hardware across the border is not.

Here’s what the reports show:

  • Approval: Regulators reportedly cleared firms to discuss quantities and prepare initial orders.

  • Condition: Beijing may require firms to buy domestic chips alongside any NVIDIA purchases.

  • Enforcement: Customs messaging could still block shipments even if other agencies have signed off.

  • Demand: Chinese firms reportedly want more than two million H200 chips at roughly $27,000 each, against an NVIDIA inventory of about 700,000 chips.
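The scale gap in that last bullet is worth doing the arithmetic on. These are the reported (and unconfirmed) figures, so treat the result as an order-of-magnitude estimate:

```python
# Back-of-envelope math on the reported H200 figures (all unconfirmed).
chips_ordered = 2_000_000   # reported Chinese demand: 2M+ chips
unit_price = 27_000         # reported price per chip, USD
inventory = 700_000         # NVIDIA's reported available stock

order_value = chips_ordered * unit_price
shortfall = chips_ordered - inventory

print(f"Order value: ${order_value / 1e9:.0f}B")       # prints "Order value: $54B"
print(f"Unfillable from stock: {shortfall:,} chips")   # prints "Unfillable from stock: 1,300,000 chips"
```

Roughly $54 billion of demand against stock that covers barely a third of it, and that is before any customs block or “buy local too” condition shrinks what actually ships.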

NVIDIA wins if orders turn into deliveries; buyers lose if they plan around tentative permission. Chip access is starting to resemble cloud credits in 2019, with customs as an extra failure mode: NVIDIA’s China wins may exist only on paper if shipments stall at the border. Going forward, watch who actually gets exemptions, and what “buy local too” really means for performance.

FalcoCut is a browser-based video tool built for turning product info into short ads and then translating those ads into new languages with subtitles, dubbing, and lip sync. It’s designed for simple workflows like “make a 20 second product clip,” “add a talking presenter,” and “localize it for another market” without opening a full video editor.

Core functions (and how to use them):

  • Text to video: Paste a product description and generate a 15–30s ad draft you can iterate on quickly.

  • Image to video: Upload a product photo and turn it into a moving sequence with transitions for landing pages and socials.

  • Video editing (simple swaps): Replace a person, product, or background to refresh an old creative without re-shooting.

  • Avatars for demos: Create a “talking presenter” clip for explainer-style ads and product walkthroughs.

  • Translation + lip sync: Localize an existing video into 30+ languages with auto-subtitles and synced mouth movement.

Try this yourself:
Take one existing product video (10-30 seconds). Export it once with on-screen captions in your original language, then run the Video Translator into one target language you care about. Watch both versions muted first. If the message still lands, you have a usable ad. If it doesn’t, rewrite the first 2 seconds as a simpler hook, regenerate, and compare the retention feel.
