Skip to content
PROVENANCE BRIEF
PROVENANCE BRIEF
Research 3h ago

Even with newer models like GPT-5.2 and Claude 4.6, AI chatbots still…Even frontier LLMs from GPT-5 onward lose up to 33% accuracy when you chat too long

Even with newer models like GPT-5.2 and Claude 4.6, AI chatbots still give worse answers the longer a conversation goes on.

Even with newer models like GPT-5.2 and Claude 4.6, AI chatbots still give worse answers the longer a conversation goes on.

Why it matters

Affects widely-used AI models.

The Decoder
Research just now

Industry expectations in Machine Learning Engineers in 2026Industry expectations in Machine Learning Engineers in 2026

Reddit MachineLearning: Industry expectations in Machine Learning Engineers in 2026

Find the core claim, method, and released artifacts.

Why it matters

Part of the evolving AI landscape.

Reddit MachineLearning
Research 5h ago

A Dream of Spring for Open-Weight LLMs: 10 Architectures from…A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026

Reddit MachineLearning: A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026

Find the core claim, method, and released artifacts.

Why it matters

Part of the evolving AI landscape.

Reddit MachineLearning
Community 2h ago

so we needed to fine tune on client datafine tuning on proprietary data is way harder to deploy than anyone tells you and most of it has nothing to do with the model

so we needed to fine tune on client data.

so we needed to fine tune on client data.

Reddit LocalLLaMA
Community 3h ago

There's been a lot of buzz about Qwen3.5 models being smarter than…Qwen3.5 35B-A3B replaced my 2-model agentic setup on M1 64GB

There's been a lot of buzz about Qwen3.5 models being smarter than all previous open-source models in the same size…

There's been a lot of buzz about Qwen3.5 models being smarter than all previous open-source models in the same size…

Reddit LocalLLaMA
Community 4h ago

If you've used multi-agent setups with LangChain, CrewAI, AutoGen,…What if LLM agents passed KV-cache to each other instead of text? I tried it -- 73-78% token savings across Qwen, Llama, and DeepSeek

If you've used multi-agent setups with LangChain, CrewAI, AutoGen, or Swarm, you've probably noticed: every agent…

If you've used multi-agent setups with LangChain, CrewAI, AutoGen, or Swarm, you've probably noticed: every agent…

Reddit LocalLLaMA
THE WIRE
Product 6h ago

In preparation for an XPU-specific backend for scaledmmv2 , move…: Factor out scaled_mm algo checks to non-CUDA ()

Summary: In preparation for an XPU-specific backend for scaledmmv2 , move some helpful…

Summary: In preparation for an XPU-specific backend for scaledmmv2 , move some helpful…

PyTorch Releases
Labs 2h ago

Feb 28 , 18:34 UTC Resolved - Between 9:50 PT / 17:50…Elevated errors on Claude Opus 4.6

Feb 28 , 18:34 UTC Resolved - Between 9:50 PT / 17:50 UTC and 10:12 PT / 18:12 UTC we…

Feb 28 , 18:34 UTC Resolved - Between 9:50 PT / 17:50 UTC and 10:12 PT / 18:12 UTC we…

Anthropic Status
Product 2h ago

Support for dict attribute is a little inconsistent in Dynamo: Support dict in NestedUserFunctionVariable ()

Support for dict attribute is a little inconsistent in Dynamo.

Support for dict attribute is a little inconsistent in Dynamo.

PyTorch Releases
Labs 5h ago

Feb 28 , 15:50 UTC Resolved - This incident has been resolvedElevated errors on claude.ai

Feb 28 , 15:50 UTC Resolved - This incident has been resolved.

Feb 28 , 15:50 UTC Resolved - This incident has been resolved.

Anthropic Status
Product 7h ago

#174500 Approved by: https://github.com/Lucaskabela ,…: [dynamo, bdb] test for empty command ()

Pull Request resolved: #174500 Approved by: https://github.com/Lucaskabela ,…

Pull Request resolved: #174500 Approved by: https://github.com/Lucaskabela ,…

PyTorch Releases
Browse all stories
/ Search M Mode T Theme