Stay ahead of the generative AI revolution!Join the M5B Newsletter →

Welcome to M5BMachine 5-Minute Briefing

Your centralized dashboard for the generative AI revolution. Track the latest models, secure exclusive offers, and master the prompt.

Research• Feb 20, 2026

MMCAformer: Macro-Micro Cross-Attention Transformer for Traffic Speed Prediction with Microscopic Connected Vehicle Driving Behavior

arXiv:2602.16730v1 Announce Type: new Abstract: Accurate speed prediction is crucial for proactive traffic management to enhance traffic efficiency and safety. Existing studies have primarily relied on aggregated, macroscopic traffic flow data to predict future traffic trends, whereas road traffic ...

#ArXiv#Machine Learning#Academic

Research• Feb 19, 2026

Towards Efficient Constraint Handling in Neural Solvers for Routing Problems

arXiv:2602.16012v1 Announce Type: new Abstract: Neural solvers have achieved impressive progress in addressing simple routing problems, particularly excelling in computational efficiency. However, their advantages under complex constraints remain nascent, for which current constraint-handling schem...

#ArXiv#Machine Learning#Academic

Research• Feb 18, 2026

Learning Representations from Incomplete EHR Data with Dual-Masked Autoencoding

arXiv:2602.15159v1 Announce Type: new Abstract: Learning from electronic health records (EHRs) time series is challenging due to irregular sam- pling, heterogeneous missingness, and the resulting sparsity of observations. Prior self-supervised meth- ods either impute before learning, represent miss...

#ArXiv#Machine Learning#Academic

Research• Feb 16, 2026

Wireless TokenCom: RL-Based Tokenizer Agreement for Multi-User Wireless Token Communications

arXiv:2602.12338v1 Announce Type: new Abstract: Token Communications (TokenCom) has recently emerged as an effective new paradigm, where tokens are the unified units of multimodal communications and computations, enabling efficient digital semantic- and goal-oriented communications in future wirele...

#ArXiv#Machine Learning#Academic

Research• Feb 13, 2026

Explaining AI Without Code: A User Study on Explainable AI

arXiv:2602.11159v1 Announce Type: new Abstract: The increasing use of Machine Learning (ML) in sensitive domains such as healthcare, finance, and public policy has raised concerns about the transparency of automated decisions. Explainable AI (XAI) addresses this by clarifying how models generate pr...

#ArXiv#Machine Learning#Academic

Advertisement

Research• Feb 13, 2026

On Decision-Valued Maps and Representational Dependence

arXiv:2602.11295v1 Announce Type: new Abstract: A computational engine applied to different representations of the same data can produce different discrete outcomes, with some representations preserving the result and others changing it entirely. A decision-valued map records which representations ...

#ArXiv#Machine Learning#Academic

Research• Feb 13, 2026

KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models

arXiv:2602.11184v1 Announce Type: new Abstract: Mixture of Experts (MoE) models have achieved great success by significantly improving performance while maintaining computational efficiency through sparse expert activation. However, their enormous parameter sizes and memory demands pose major chall...

#ArXiv#Machine Learning#Academic

Research• Feb 12, 2026

Adaptive Optimization via Momentum on Variance-Normalized Gradients

arXiv:2602.10204v1 Announce Type: new Abstract: We introduce MVN-Grad (Momentum on Variance-Normalized Gradients), an Adam-style optimizer that improves stability and performance by combining two complementary ideas: variance-based normalization and momentum applied after normalization. MVN-Grad sc...

#ArXiv#Machine Learning#Academic

Research• Feb 12, 2026

Versor: A Geometric Sequence Architecture

arXiv:2602.10195v1 Announce Type: new Abstract: A novel sequence architecture design is introduced, Versor, which uses Conformal Geometric Algebra (CGA) in place of the traditional fundamental non-linear operations to achieve structural generalization and significant performance improvements on a v...

#ArXiv#Machine Learning#Academic

Research• Feb 10, 2026

Lagged backward-compatible physics-informed neural networks for unsaturated soil consolidation analysis

arXiv:2602.07031v1 Announce Type: new Abstract: This study develops a Lagged Backward-Compatible Physics-Informed Neural Network (LBC-PINN) for simulating and inverting one-dimensional unsaturated soil consolidation under long-term loading. To address the challenges of coupled air and water pressur...

#ArXiv#Machine Learning#Academic

Advertisement

Research• Feb 6, 2026

MINT: Minimal Information Neuro-Symbolic Tree for Objective-Driven Knowledge-Gap Reasoning and Active Elicitation

arXiv:2602.05048v1 Announce Type: new Abstract: Joint planning through language-based interactions is a key area of human-AI teaming. Planning problems in the open world often involve various aspects of incomplete information and unknowns, e.g., objects involved, human goals/intents -- thus leading...

#ArXiv#Machine Learning#Academic

Research• Feb 5, 2026

Active Epistemic Control for Query-Efficient Verified Planning

arXiv:2602.03974v1 Announce Type: new Abstract: Planning in interactive environments is challenging under partial observability: task-critical preconditions (e.g., object locations or container states) may be unknown at decision time, yet grounding them through interaction is costly. Learned world ...

#ArXiv#Machine Learning#Academic

Research• Feb 4, 2026

Learning ORDER-Aware Multimodal Representations for Composite Materials Design

arXiv:2602.02513v1 Announce Type: new Abstract: Artificial intelligence (AI) has shown remarkable success in materials discovery and property prediction, particularly for crystalline and polymer systems where material properties and structures are dominated by discrete graph representations. Such g...

#ArXiv#Machine Learning#Academic

Research• Feb 4, 2026

Sparse Adapter Fusion for Continual Learning in NLP

arXiv:2602.02502v1 Announce Type: new Abstract: Continual learning in natural language processing plays a crucial role in adapting to evolving data and preventing catastrophic forgetting. Despite significant progress, existing methods still face challenges, such as inefficient parameter reuse acros...

#ArXiv#Machine Learning#Academic

Research• Feb 4, 2026

UNSO: Unified Newton Schulz Orthogonalization

arXiv:2602.02500v1 Announce Type: new Abstract: The Newton-Schulz (NS) iteration has gained increasing interest for its role in the Muon optimizer and the Stiefel manifold. However, the conventional NS iteration suffers from inefficiency and instability. Although various improvements have been intr...

#ArXiv#Machine Learning#Academic

Advertisement

Research• Feb 4, 2026

PeerRank: Autonomous LLM Evaluation Through Web-Grounded, Bias-Controlled Peer Review

arXiv:2602.02589v1 Announce Type: new Abstract: Evaluating large language models typically relies on human-authored benchmarks, reference answers, and human or single-model judgments, approaches that scale poorly, become quickly outdated, and mismatch open-world deployments that depend on web retri...

#ArXiv#Machine Learning#Academic

Research• Feb 3, 2026

Complete Identification of Deep ReLU Neural Networks by Many-Valued Logic

arXiv:2602.00266v1 Announce Type: new Abstract: Deep ReLU neural networks admit nontrivial functional symmetries: vastly different architectures and parameters (weights and biases) can realize the same function. We address the complete identification problem -- given a function f, deriving the arch...

#ArXiv#Machine Learning#Academic

Research• Feb 3, 2026

Representation Learning Enhanced Deep Reinforcement Learning for Optimal Operation of Hydrogen-based Multi-Energy Systems

arXiv:2602.00027v1 Announce Type: new Abstract: Hydrogen-based multi-energy systems (HMES) have emerged as a promising low-carbon and energy-efficient solution, as it can enable the coordinated operation of electricity, heating and cooling supply and demand to enhance operational flexibility, impro...

#ArXiv#Machine Learning#Academic

Research• Feb 2, 2026

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

arXiv:2601.22311v1 Announce Type: new Abstract: Large language model (LLM)-based agents exhibit strong step-by-step reasoning capabilities over short horizons, yet often fail to sustain coherent behavior over long planning horizons. We argue that this failure reflects a fundamental mismatch: step-w...

#ArXiv#Machine Learning#Academic

Research• Feb 2, 2026

Causal Imitation Learning Under Measurement Error and Distribution Shift

arXiv:2601.22206v1 Announce Type: new Abstract: We study offline imitation learning (IL) when part of the decision-relevant state is observed only through noisy measurements and the distribution may change between training and deployment. Such settings induce spurious state-action correlations, so ...

#ArXiv#Machine Learning#Academic

Advertisement

Research• Jan 30, 2026

Is Parameter Isolation Better for Prompt-Based Continual Learning?

arXiv:2601.20894v1 Announce Type: new Abstract: Prompt-based continual learning methods effectively mitigate catastrophic forgetting. However, most existing methods assign a fixed set of prompts to each task, completely isolating knowledge across tasks and resulting in suboptimal parameter utilizat...

#ArXiv#Machine Learning#Academic

Research• Jan 30, 2026

QUARK: Robust Retrieval under Non-Faithful Queries via Query-Anchored Aggregation

arXiv:2601.21049v1 Announce Type: new Abstract: User queries in real-world retrieval are often non-faithful (noisy, incomplete, or distorted), causing retrievers to fail when key semantics are missing. We formalize this as retrieval under recall noise, where the observed query is drawn from a noisy...

#ArXiv#Machine Learning#Academic

Research• Jan 30, 2026

Do LLMs Favor LLMs? Quantifying Interaction Effects in Peer Review

arXiv:2601.20920v1 Announce Type: new Abstract: There are increasing indications that LLMs are not only used for producing scientific papers, but also as part of the peer review process. In this work, we provide the first comprehensive analysis of LLM use across the peer review pipeline, with parti...

#ArXiv#Machine Learning#Academic

Research• Jan 30, 2026

Faster Predictive Coding Networks via Better Initialization

arXiv:2601.20895v1 Announce Type: new Abstract: Research aimed at scaling up neuroscience inspired learning algorithms for neural networks is accelerating. Recently, a key research area has been the study of energy-based learning algorithms such as predictive coding, due to their versatility and ma...

#ArXiv#Machine Learning#Academic