👨‍🏭 This AI startup wants to replace all human labor, starting now

Plus: Google’s Split AI Strategy, AI Agent Fails, and Experience Based AI

Welcome, AI Insiders!

Google unveiled Gemini Flash for lightning-fast reasoning in the cloud and Gemma 3 QAT for running large models on consumer GPUs. Mechanize, a provocative new startup, is openly chasing full labor automation, training AI agents to do every job humans can. Meanwhile, code editor Cursor faced backlash after its AI agent confidently lied to users, exposing the trust risks in automated support. And in a bold shift, DeepMind researchers propose moving beyond static training and toward real-world experience as the future of learning.

In today’s Generative AI Newsletter:

• Google splits its AI muscle: Flash for speed, Gemma for accessibility
• Mechanize wants to replace all human labor, starting now
• Cursor’s AI agent lied—and users canceled in protest
• DeepMind proposes experience-first AI to replace static data

Special highlight from out Network

Compliance isn’t a checkbox. It’s your growth strategy.

If you’re moving fast, you can’t afford to slow down for audits, checklists, or regulatory headaches. That’s where Vanta steps in.

Vanta helps you lock in trust early—automating 35+ frameworks like SOC 2, ISO 27001, and the EU AI Act so your team can stay focused on building, shipping, and scaling.

Why it matters:

Startups that bake in security and compliance from day one build faster, win more deals, and stay ahead of shifting regulations without burning out the team.

What you’ll get with Vanta’s Compliance for Startups Bundle:

  • Plug-and-play checklists to know what matters

  • Case studies from fast-moving teams like yours

  • On-demand videos from startup security leaders who’ve been there

The best part? You don’t have to figure it all out yourself.

⚙️ Google’s AI Split: Speed in Cloud, Power on Device

Image Credit: Google Blog

Google just dropped two major upgrades across its AI portfolio: Gemini 2.5 Flash, a fast hybrid reasoner with built-in cost controls, and Gemma 3 QAT, a set of quantization-aware open models that bring 27B-scale performance to consumer GPUs.

What’s new:

  • Gemini 2.5 Flash
    • A lightweight, high-speed hybrid reasoner
    • Excels at STEM, logic, and visual tasks
    • Features a new “thinking budget” to balance latency, quality, and cost
    • Available now via AI Studio, Vertex AI, and the Gemini app

  • Gemma 3 QAT (Quantization-Aware Training)
    • Makes Gemma 3 27B run locally on consumer GPUs like RTX 3090
    • Uses int4 quantization to cut VRAM from 54GB ➝ 14.1GB with minimal quality loss
    • Trained to withstand quantization noise, preserving accuracy
    • Integrated with tools like Ollama, MLX, llama.cpp, and available on Hugging Face

Google is sharpening its dual strategy: Gemini Flash pushes smarter AI into real-time, cost-sensitive applications, while Gemma 3 QAT makes open, powerful models more accessible than ever. Whether you're building in the cloud or on a laptop, Google's betting on adaptive AI that scales both ways.

🤖 Startup Aims to Replace All Human Workers

Image Credit: Ideogram/GenAI

Tamay Besiroglu, co-founder of AI research institute Epoch, just unveiled Mechanize, a new startup building AI agents trained in simulated workplaces to take over every job done by humans. The stated goal? Full automation of all work. Everywhere.

Key developments:

  • Mechanize is creating digital environments to train agents in complex, long-horizon tasks—things like project management, collaboration, and adapting to interruptions

  • White-collar work is the first target, especially roles involving computer-based workflows

  • Backers include major names like Jeff Dean, Nat Friedman, and Patrick Collison

  • The market size is pegged at $60T, based on global wages across all industries

  • The announcement sparked criticism over the ethics of total automation and its conflict with Besiroglu’s leadership at Epoch, which positions itself as an impartial research group

Mechanize doesn’t hint at the future but states it outright. Mechanize is betting that whoever builds the first general labor agent will control the levers of the future economy. Whether society wants that future is a different question.

📞AI Agent Lies to Users: Mass Cancellations at Cursor

Agentic code editor Cursor is facing fallout after its AI support agent “Sam” hallucinated a fake security policy, convincing users they were restricted to a single device. The result? A wave of user anger, Reddit drama, and canceled subscriptions.

Key developments:

  • A user was repeatedly logged out when switching between devices and reached out to support

  • The AI agent falsely claimed that single-device access was a new security feature

  • The response, delivered confidently and without disclosure it came from a bot, triggered backlash on Reddit and Hacker News

  • Cursor’s co-founder confirmed the policy never existed and blamed a recent login security update

  • The company is now labeling AI responses more clearly and offering refunds to affected users

Hallucinations are still the Achilles' heel of AI support agents. While companies rush to automate customer service, this incident shows that even confident-sounding bots can quietly erode user trust. Cursor’s case wasn’t just a technical glitch but a breakdown in communication, accountability, and transparency. Until AI systems can say “I don’t know,” full automation might be a risk, not a feature.

đź§  DeepMind Proposes Shift to Experience-Based AI Learning

Image Credit: Google Deepmind

Two of reinforcement learning’s most influential minds, David Silver and Richard Sutton, have published a new paper calling for a major shift in AI development. They propose moving away from static, human-curated training data and toward systems that learn through real-world interaction.

The details:

  • The proposal introduces “streams” which are ongoing data feeds from the real world that enable continuous, open-ended learning instead of static, short-form training.

  • Instead of training on human outputs, agents would learn by interacting with environments and using feedback signals like health outcomes, academic scores, or environmental performance metrics.

  • This experiential model mirrors how AlphaZero mastered games, but aims to scale that approach to broader domains with unpredictable dynamics.

  • The authors argue that human supervision limits discovery, and that autonomous, feedback-driven learning can push AI beyond current human knowledge while maintaining safety and adaptability.

As today's most powerful AI models hit the ceiling of what human-curated data can offer, this proposal signals a pivotal shift. It moves from mimicking human intelligence to building agents that surpass it. DeepMind is no longer just training AI to reason. It is preparing AI to experience the world.

🚀 Boost your business with us—advertise where 10M+ AI leaders engage

🌟 Sign up for the first AI Hub in the world.

📲 Our Socials

Reply

or to participate.