Scientific AI: When Models Become Scientists

Written by

in

42 papers surveyed, 12 core papers retained, 6 miscategorized filtered out | May 2025 – May 2026

The story of Scientific AI in 2025–2026 isn’t just about applying machine learning to science problems — that’s been happening for years. What changed this year is that AI models began to look less like tools and more like scientists. The most provocative question: can an AI discover a physical law that wasn’t already known to science? The evidence suggests we’re about to find out.

The Big Shift: Self-Supervised Physics Learning

The year’s most important development was the maturation of self-supervised physics learning: AI models learning physical laws from unlabeled observation, without being told what those laws are.

“LaMo: Self-Supervised Latent Motion Priors for Physical Realism in Video Generation” (Jiang et al., May 2026) extracts motion dynamics from unlabeled video without any physics supervision. The model learned a latent motion prior — a distribution over frame-to-frame changes — that encodes physical plausibility: object permanence, momentum, gravity. It learned these concepts without ever being told what physics is.

Think about what this means for science. Apply LaMo to fluid dynamics footage, biological motion, or planetary dynamics, and you have a physical simulator learned from raw observation — no governing equations needed. For domains where the physics is unknown or too complex to derive (turbulence, protein folding, ecosystem dynamics), this could revolutionize how we build scientific models.

“Learning a Particle Dynamics Model from Real-world Videos” (Kim, Sumukh & Fuxin, May 2026) pushes further: learning physics simulation models from real-world video without simulator access. The particle dynamics approach models scenes as interacting particles and learns interaction dynamics from observation alone.

Two theoretical frameworks complemented this data-driven work:

“Atmosphere as a steam engine” (Makarieva & Nefiodov, May 2026) provides a thermodynamic analysis of Earth’s atmosphere as a steam engine — theory-driven science that could either constrain or validate data-driven weather models.

“Soft Mobility Theory” (Eloy, May 2026) proposes a mathematical framework for deformable body motion in viscous flows, replacing expensive simulation with a configuration-dependent ODE for fluid-structure interaction problems.

The tension between theory-driven and data-driven approaches — atmosphere as steam engine vs. AI weather models learning physics from data — is one of the year’s most productive scientific debates.

Quantum Physics: From Practical Sensors to Black Holes

The quantum papers span the spectrum from practical to profound:

Rydberg atom sensing (Stumpf et al., May 2026) — analyzing electromagnetic scattering effects on quantum sensors, with computational frameworks suitable for AI-driven experimental design.

Analogies across physics (Cao, May 2026) — identifying formal connections between growth quenches (false vacuum destabilization) and operator growth dynamics. This is exactly the kind of cross-domain mathematical connection that AI systems could be trained to discover.

Interference-protected quantum dynamics (Kerschbaumer, Desaules & Serbyn, May 2026) — spin chains with nonthermal phenomena beyond many-body scars, advancing the theory of weak ergodicity breaking.

Better quantum error correction (Okada & Kasai, May 2026) — a mathematical construction for quantum LDPC codes, critical infrastructure for scalable quantum computing.

Biomedical and Demographic Science

“MuellerPT” (Tlemsani et al., May 2026) applies physics-guided pretraining to biomedical tissue analysis — learning transferable representations across specimens. The dataset is itself a significant contribution to biomedical polarimetric imaging.

“NeuralTFR” (Ciganda et al., May 2026) is fascinating for a different reason. It’s a neural network for global fertility forecasting that explicitly avoids embedding the recovery assumptions that differentiate existing demographic models. Instead, it provides an unbiased comparator — a machine learning reference point that doesn’t encode any particular theoretical tradition. For social science, this is a new epistemological role: the model that doesn’t take sides.

Optics: Order from Disorder

“Breaking order: Talbot effect with spinodal architectures” (Krüger et al., May 2026) demonstrates that stochastic, non-periodic structures can produce the same periodic pattern formation traditionally requiring diffraction gratings. This is a prime target for AI-driven inverse design — finding optical structures with desired properties that human intuition wouldn’t suggest.

Cross-Cutting Themes

Several threads connect Scientific AI to other topics:
SPACENUM (from Multimodal) revealed that VLM numerical outputs aren’t spatially grounded — critical for any scientific AI outputting quantitative predictions
Training-Free Looped Transformers offer extra inference-time compute for scientific reasoning without additional training
FM-CGM (Leveraging Foundation Models for Causal Generative Modeling) used pretrained models for visual causal reasoning

What’s Next

The maturation of self-supervised physics learning is the story of the year. AI models learning physical laws from unlabeled data, without being told what those laws are. If this scales, science transforms — not by replacing the scientific method, but by accelerating the discovery of patterns that human scientists then explain.

The open question: can AI move from assisting science to conducting science — forming hypotheses, designing experiments, interpreting results? The LaMo and particle dynamics papers suggest this transition is underway. The first AI-discovered physical law that wasn’t already known to science may come within the next year.

That will be the moment when “AI for science” becomes something much more interesting.

Part of the Frontier AI Research Digest backfill series (May 2025 – May 2026). 42 papers surveyed, 6 filtered as miscategorized. Core focus: self-supervised physics learning, quantum science, biomedical AI, and simulation infrastructure.

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *