Frontier AI Research Digest
Weekly curated intelligence from the cutting edge of AI research. LLMs, reasoning, multimodal, agents, alignment — explained and connected.
-

Week 22, 2026 — Robotics & Embodied AI
Embodied AI had a defining week with the release of a unified foundation model spanning manipulation, navigation, and trajectory prediction — alongside critical benchmarks exposing brittleness in creative reasoning. Qwen-VLA: The First Embodied Foundation Model Qwen-VLA from Alibaba extends Qwen’s vision-language modeling stack to continuous action and trajectory generation via a DiT-based action decoder. Trained…
-

Week 22, 2026 — Vision & Multimodal Systems
Vision-language models made strides in high-resolution perception, 3D reasoning, video efficiency, and unified digital human generation. CVSearch: Cognitive Visual Search for High-Resolution MLLMs CVSearch by Liupeng Li et al. addresses the coverage-efficiency dilemma in high-resolution image perception for MLLMs. It dynamically schedules search strategies: first trying expert-assisted search, and only triggering a novel Semantic Guided…
-

Week 22, 2026 — Reasoning & Reinforcement Learning for LLMs
Test-time compute and reasoning methods dominated this week’s research, with breakthroughs in self-verification, efficient sampling, and working memory mechanisms. Self-Trained Verification Unlocks Both Test-Time and Training-Time Gains Self-Trained Verification (STV) by Chen Henry Wu and Aditi Raghunathan addresses the central bottleneck in LLM self-improvement: the verifier. The key insight is that while a model cannot…
-

Week 22, 2026 — LLM Training & Scaling Laws
This week brought transformative advances in understanding how large language models scale and train — from a unified theory of scaling failures to practical recipes for MoE hyperparameter transfer and data mixture auditing. Shannon Scaling Law Unifies Training Phenomena Xu Ouyang and colleagues proposed the Shannon Scaling Law, treating LLM training as information transmission over…
-

Scientific AI: When Models Become Scientists
42 papers surveyed, 12 core papers retained, 6 miscategorized filtered out | May 2025 – May 2026 — The story of Scientific AI in 2025–2026 isn’t just about applying machine learning to science problems — that’s been happening for years. What changed this year is that AI models began to look less like tools and…
-

Open Source Models: Four Breakthroughs That Changed What “Open” Means
60 papers surveyed, 36 core papers retained, 13 miscategorized filtered out | May 2025 – May 2026 — The “can open models match closed models?” debate is over. For many tasks, the answer is a clear yes. But the 2025–2026 research cycle asked a more interesting question: Now that open models work, what can we…
-

Code & Math AI: When Proving Programs Correct Became Practical
The year AI stopped guessing and started proving — how agentic theorem proving, I/O-optimal attention, and domain specialization converged (56 papers surveyed, 33 miscategorized filtered out) — In May 2025, if you asked an LLM to write a program and formally verify it, you’d get back plausible-looking code that probably didn’t compile and definitely hadn’t…
-

The Year Robots Learned to Learn — With 180 Demonstrations Instead of 10,000
How instrumentation, point tracking, and particle dynamics are rewriting the economics of robotic learning (53 papers surveyed, May 2025–May 2026) — Everyone knows the problem with robotics: robots can’t generalize. A factory arm that picks one part flawlessly for five years breaks when you move the bin six inches. A strawberry-picking robot that works in…
-

The Year AI Learned to Plan Before It Rendered: Video & Image Generation’s Architecture Shift
37 papers surveyed | May 2025 – May 2026 — Video and image generation research hit a turning point this year. The field moved past the era of “more compute, better pixels” and discovered something more fundamental: the hardest problems in visual generation aren’t about rendering — they’re about planning what to render. The central…