Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations
arXiv:2605.13923v1 Announce Type: new
Abstract: We study certified runtime monitoring of past-time signal temporal logic (ptSTL) from visual observations under partial observability. The monitor must infer safety-relevant quantities from images and provide finite-sample guarantees, while being \emp...
CAWI: Copula-Aligned Weight Initialization for Randomized Neural Networks
arXiv:2605.12580v1 Announce Type: new
Abstract: Randomized neural networks (RdNNs) enable efficient, backpropagation-free training by freezing randomly initialized input-to-hidden weights, which permits a closed-form solution for the output layer. However, conventional random initialization is blin...
Towards Robust Federated Multimodal Graph Learning under Modality Heterogeneity
arXiv:2605.12584v1 Announce Type: new
Abstract: Recently, multimodal graph learning (MGL) has garnered significant attention for integrating diverse modality information and structured context to support various network applications. However, real-world graphs are often isolated due to data-sharing...
Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents
arXiv:2605.12620v1 Announce Type: new
Abstract: Building generalist embodied agents capable of solving complex real-world tasks remains a fundamental challenge in AI. Multimodal Large Language Models (MLLMs) have significantly advanced the reasoning capabilities of such agents through strong vision...
RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking
arXiv:2605.11151v1 Announce Type: new
Abstract: Offline-to-online reinforcement learning (RL) improves sample efficiency by leveraging pre-collected datasets prior to online interaction. A key challenge, however, is learning an accurate critic in large state--action spaces with limited dataset cove...
A Cascaded Generative Approach for e-Commerce Recommendations
arXiv:2605.11118v1 Announce Type: new
Abstract: Personalized storefronts in large e-commerce marketplaces are often assembled from many independent components: static themes per page section ("placement"), retrieval systems to fetch eligible products per placement, and pointwise rankers to order co...
Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation
arXiv:2605.10947v1 Announce Type: new
Abstract: EEG microstate analysis segments continuous brain electrical activity into brief, quasi-stable topographic configurations that reflect discrete functional brain states. Conventional approaches such as Modified K-Means operate directly in electrode spa...
arXiv:2605.10973v1 Announce Type: new
Abstract: Supervised fine-tuning (SFT) improves in-domain performance but can degrade out-of-domain (OOD) generalization. Prior work suggests that this degradation is related to changes in dominant singular subspaces of pretrained weight matrices. However, dire...
arXiv:2605.08360v1 Announce Type: new
Abstract: Modern AI is opening the door to collective decision-making in which participants express their views as free-form text rather than voting on a fixed set of candidates. A natural idea is to embed these opinions in a vector space so that the substantia...
Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits
arXiv:2605.08200v1 Announce Type: new
Abstract: A pervasive intuition holds that vision-language models (VLMs) are most trustworthy when their attention maps look sharp: concentrated attention on the queried region should imply a confident, calibrated answer. We test this Attention-Confidence Assum...
Distributional Reinforcement Learning via the Cram\'er Distance
arXiv:2605.08104v1 Announce Type: new
Abstract: This paper explores the application of the Soft Actor-Critic (SAC) algorithm within a Distributional Reinforcement Learning setting and introduces an implementation of such algorithm named Cram\'er-based Distributional Soft Actor-Critic (C-DSAC). The ...
On the Role of Strain and Vorticity in Numerical Integration Error for Flow Matching
arXiv:2605.06680v1 Announce Type: new
Abstract: Flow matching generates data by integrating a learned velocity field, where the number of integration steps (NFE) directly determines inference cost. We analyze which properties of the velocity field govern integration error by decomposing the velocit...
arXiv:2605.05365v1 Announce Type: new
Abstract: We present ZAYA1-8B, a reasoning-focused mixture-of-experts (MoE) model with 700M active and 8B total parameters, built on Zyphra's MoE++ architecture. ZAYA1-8B's core pretraining, midtraining, and supervised fine-tuning (SFT) were performed on a full...
arXiv:2605.05209v1 Announce Type: new
Abstract: Neural networks that land in flat regions of the loss landscape tend to generalise better than those in sharp regions. Sharpness-Aware Minimisation exploits this to improve generalisation. But function-preserving reparameterisation can inflate the Hes...
Horizon-Constrained Rashomon Sets for Chaotic Forecasting
arXiv:2605.05218v1 Announce Type: new
Abstract: Predictive multiplicity and chaotic dynamics represent two fundamental challenges in machine learning that have evolved independently despite their conceptual connections. We bridge this gap by introducing horizon-constrained Rashomon sets, a theoreti...
arXiv:2605.04100v1 Announce Type: new
Abstract: Off-policy temporal-difference (TD) learning with function approximation faces a structural tradeoff among stability, projection geometry, and variance control. Emphatic TD (ETD) improves the off-policy projection geometry through follow-on emphasis, ...
ANDRE: An Attention-based Neuro-symbolic Differentiable Rule Extractor
arXiv:2605.04193v1 Announce Type: new
Abstract: Inductive Logic Programming (ILP) aims to learn interpretable first-order rules from data, but existing symbolic and neuro-symbolic approaches struggle to scale to noisy and probabilistic settings. Classical ILP relies on discrete combinatorial rule s...
A Self-Attentive Meta-Optimizer with Group-Adaptive Learning Rates and Weight Decay
arXiv:2605.04055v1 Announce Type: new
Abstract: Adaptive optimizers like AdamW apply uniform hyperparameters across all parameter groups, ignoring heterogeneous optimization dynamics across layers and modules. We address this limitation by proposing MetaAdamW - a new optimizer that integrates a sel...
arXiv:2605.02907v1 Announce Type: new
Abstract: Softmax attention maps every query--key interaction into a probability distribution, but the underlying structure remains largely unexplored. We define the \emph{energy field}, the row-centered attention logit, and show that it exhibits invariant prop...
Making the Invisible Visible: Understanding the Mismatch Between Organizational Goals and Worker Experiences in AI Adoption
arXiv:2605.03078v1 Announce Type: new
Abstract: While AI is often introduced into organizations to drive innovation and efficiency, many adoption efforts fail as workers resist and struggle to integrate these systems. These failures point to a deeper issue: workers, the very people expected to coll...
Computing Thiele Rules on Interval Elections and their Generalizations
arXiv:2605.03067v1 Announce Type: new
Abstract: Approval-based committee voting has received significant attention in the social choice community. Among the studied rules, Thiele rules, and especially Proportional Approval Voting (PAV), stand out for desirable properties such as proportional repres...
Accelerating battery research with an AI interface between FINALES and Kadi4Mat
arXiv:2605.00909v1 Announce Type: new
Abstract: The time-consuming formation process critically impacts the longevity of sodium-ion coin cells and End Of Life (EOL) performance. This study aims to optimize formation protocols for duration efficiency, targeting high-performance outcomes while minimi...
Sparse Regression under Correlation and Weak Signals: A Reproducible Benchmark of Classical and Bayesian Methods
arXiv:2605.00835v1 Announce Type: new
Abstract: Choosing between classical and Bayesian sparse regression methods involves a real trade-off: penalized estimators like Lasso run in milliseconds but give no uncertainty estimates,while Horseshoe and Spike-and-Slab priors produce full posteriors but ne...
ClinicBot: A Guideline-Grounded Clinical Chatbot with Prioritized Evidence RAG and Verifiable Citations
arXiv:2605.00846v1 Announce Type: new
Abstract: Clinical diagnosis requires answers that are accurate, verifiable, and explicitly grounded in official guidelines. While large language models excel at natural language processing, their tendency to hallucinate undermines their utility in high-stakes ...