Skip to content
Mobrief
Mobrief
Product 1h ago

Quantization-Aware Training in TorchAO (II)Quantization-Aware Training in TorchAO (II)

In our previous Quantization-Aware Training (QAT) blog , we introduced the initial QAT flow in TorchAO for large language models targeting edge devices with ExecuTorch . Since then, we extended…

Since then, we extended this flow to also target fast CUDA kernels like the ones in MSLK for fast inference in vLLM , and incorporated this flow into popular fine-tuning frameworks like Unsloth and…

PyTorch Blog
Tech 2h ago

Google faces wrongful death lawsuit after Gemini allegedly ‘coached’ man to die by suicideGoogle faces wrongful death lawsuit after Gemini allegedly ‘coached’ man to die by suicide

A lawsuit filed on Wednesday accuses Google's Gemini AI chatbot of trapping 36-year-old Jonathan Gavalas in a "collapsing reality" that involved a series of violent missions, ultimately ending with...

In the days leading up to his death, Gemini allegedly convinced Gavalas that he was "executing a covert plan to liberate his […]

The Verge Tech
Business 17h ago

Amazon Spends Another $21B to Beef up Spain's AI InfrastructureAmazon Spends Another $21B to Beef up Spain's AI Infrastructure

The latest round of funding signifies another escalation in Amazon's commitment to the country.

AI Business
Business 23h ago

Google releases Gemini 3.1 Flash Lite at 1/8th the cost of ProGoogle releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro

Google's newest AI model is here: Gemini 3.1 Flash-Lite , and the biggest improvements this time around come in cost…

search and cloud giant.

VentureBeat
Research 1h ago

Google faces wrongful death suit after Gemini allegedly convinced a man to die and…Google faces wrongful death suit after Gemini allegedly convinced a man to die and become digital

According to a lawsuit filed in a US federal court in Northern California on Wednesday, Google's chatbot Gemini…

The article Google faces wrongful death suit after Gemini allegedly convinced a man to die and become digital…

The Decoder
Research 2h ago

US military uses Anthropic's Claude for AI-driven strike planning in Iran warUS military uses Anthropic's Claude for AI-driven strike planning in Iran war

In the war against Iran, the US military is using generative AI at scale for target selection and strike planning for…

Of all models, it's the one from the company Washington just banned.

The Decoder
THE WIRE
Business 3h ago

OpenAI Says ChatGPT Instant 5.3 is Less Cringe, More AccurateOpenAI Says ChatGPT Instant 5.3 is Less Cringe, More Accurate

The AI model maker said it is responding to user criticisms.

AI Business
Product 6h ago

MCP Apps support on VercelMCP Apps support on Vercel

Teams can now build and deploy MCP Apps on Vercel with full support for Next.js.MCP…

They run inside iframes and communicate with any compatible host, such as ChatGPT,…

Vercel Blog
Business 5h ago

Pentagon vendor cutoff exposes the AI dependency map most enterprises never builtPentagon vendor cutoff exposes the AI dependency map most enterprises never built

The federal directive ordering all U.S. government agencies to cease using Anthropic…

government agencies to cease using Anthropic technology comes with a six-month phaseout…

VentureBeat
Labs 1h ago

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning modelPhi-4-reasoning-vision and the lessons of training a multimodal reasoning model

At a glance Phi-4-reasoning-vision-15B is a compact and smart open‑weight multimodal…

It is a broadly capable model that allows for natural interaction for a wide array of…

Microsoft Research
Business 8h ago

Anthropic’s Skyrocketing Revenue, A Contract Compromise?, Nvidia EarningsAnthropic’s Skyrocketing Revenue, A Contract Compromise?, Nvidia Earnings

Anthropic's enterprise business is reaching escape velocity, which increases the…

Then, agents dramatically increase demand for Nvidia chips, even if they threaten…

Stratechery
Labs 19h ago

Understanding AI and learning outcomesUnderstanding AI and learning outcomes

OpenAI introduces the Learning Outcomes Measurement Suite to assess AI’s impact on…

OpenAI News
Tech just now

Google’s AI-powered workspace is now available to more users in SearchGoogle’s AI-powered workspace is now available to more users in Search

Google is bringing Canvas to everyone in the US using AI Mode in Search. The feature…

The feature opens up a dedicated workspace within its AI-powered search tool, allowing…

The Verge Tech
Product 2h ago

Inside BMW Group’s experiments evaluating domain-specific language modelsSmall models, high quality: Inside BMW Group’s experiments evaluating domain-specific language models

A car you can talk to has been a longstanding dream, whether as the basis for…

One way of achieving better, more natural voice commands is by incorporating AI…

Google Cloud AI Blog
Labs 23h ago

cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to JuliacuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia

NVIDIA CUDA Tile is one of the most significant additions to NVIDIA CUDA programming…

NVIDIA Developer
Labs 9h ago

Extending single-minus amplitudes to gravitonsExtending single-minus amplitudes to gravitons

A new preprint extends single-minus amplitudes to gravitons, with GPT-5.2 Pro helping…

OpenAI News
Research 23h ago

Google's fastest and cheapest model Gemini 3.1 Flash-Lite got smarter but also tripled…Google's fastest and cheapest model Gemini 3.1 Flash-Lite got smarter but also tripled the price

Google Deepmind has released a preview of Gemini 3.1 Flash-Lite, the fastest and…

It's significantly more capable than its predecessor, but output costs have more than…

The Decoder
Research 22h ago

4 Proven AI Agent Use Cases for NonprofitsScaling Impact: 4 Proven AI Agent Use Cases for Nonprofits

When we partner with nonprofits, we ask ourselves a critical question: How can we give…

The root of this question comes from a long-standing reality: passion alone cannot…

Salesforce AI Research
Browse all stories
/ Search M Mode T Theme