AI Agents for Sustainable SMEs: A Green ESG Assessment Framework
arXiv:2605.00841v1 Announce Type: new
Abstract: This study presents a novel, AI-driven framework for assessing Environmental, Social, and Governance (ESG) performance in European small and medium-sized enterprises (SMEs). An initial phase established expert-validated ESG baseline scores from a subs...
2026 Roadmap on Artificial Intelligence and Machine Learning for Smart Manufacturing
arXiv:2605.00839v1 Announce Type: new
Abstract: The evolution of artificial intelligence (AI) and machine learning (ML) is reshaping smart manufacturing by providing new capabilities for efficiency, adaptability, and autonomy across industrial value chains. However, the deployment of AI and ML in i...
Understanding Emergent Misalignment via Feature Superposition Geometry
arXiv:2605.00842v1 Announce Type: new
Abstract: Emergent misalignment, where fine-tuning on narrow, non-harmful tasks induces harmful behaviors, poses a key challenge for AI safety in LLMs. Despite growing empirical evidence, its underlying mechanism remains unclear. To uncover the reason behind th...
AirFM-DDA: Air-Interface Foundation Model in the Delay-Doppler-Angle Domain for AI-Native 6G
arXiv:2605.00020v1 Announce Type: new
Abstract: The success of large foundation models is catalyzing a new paradigm for AI-native 6G network design: wireless foundation models for physical layer design. However, existing models often operate on channel state information (CSI) in the space-time-freq...
Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference
arXiv:2605.00005v1 Announce Type: new
Abstract: The increasing deployment of deep neural networks (DNNs) in cyber-physical systems (CPS) enhances perception fidelity, but imposes substantial computational demands on execution platforms, posing challenges to real-time control deadlines. Traditional ...
What Physics do Data-Driven MoCap-to-Radar Models Learn?
arXiv:2605.00018v1 Announce Type: new
Abstract: Data-driven MoCap-to-radar models generate plausible micro-Doppler spectrograms, but do they actually learn the underlying physics? We introduce a physics-based interpretability framework to answer this question via two proposed complementary metrics:...
AgentReputation: A Decentralized Agentic AI Reputation Framework
arXiv:2605.00073v1 Announce Type: new
Abstract: Decentralized, agentic AI marketplaces are rapidly emerging to support software engineering tasks such as debugging, patch generation, and security auditing, often operating without centralized oversight. However, existing reputation mechanisms fail i...
Minimal, Local, Causal Explanations for Jailbreak Success in Large Language Models
arXiv:2605.00123v1 Announce Type: new
Abstract: Safety trained large language models (LLMs) can often be induced to answer harmful requests through jailbreak prompts. Because we lack a robust understanding of why LLMs are susceptible to jailbreaks, future frontier models operating more autonomously...
Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents
arXiv:2605.00136v1 Announce Type: new
Abstract: Tool-augmented reasoning has become a popular direction for LLM-based agents, and it is widely assumed to improve reasoning and reliability. However, we demonstrate that this consensus does not always hold: in the presence of semantic distractors, too...
TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization
arXiv:2605.00224v1 Announce Type: new
Abstract: Aligning large language models (LLMs) with human preferences is commonly done via reinforcement learning from human feedback (RLHF) with Proximal Policy Optimization (PPO) or, more simply, via Direct Preference Optimization (DPO). While DPO is stable ...
TADI: Tool-Augmented Drilling Intelligence via Agentic LLM Orchestration over Heterogeneous Wellsite Data
arXiv:2605.00060v1 Announce Type: new
Abstract: We present TADI (Tool-Augmented Drilling Intelligence), an agentic AI system that transforms drilling operational data into evidence-based analytical intelligence. Applied to the Equinor Volve Field dataset, TADI integrates 1,759 daily drilling report...
arXiv:2604.26991v1 Announce Type: new
Abstract: Recent advances in data-centric medical AI have produced highly accurate diagnostic systems, but the emphasis on data curation and performance metrics has not translated into widespread clinical adoption. We conjecture that this limited uptake stems f...
Simple Self-Conditioning Adaptation for Masked Diffusion Models
arXiv:2604.26985v1 Announce Type: new
Abstract: Masked diffusion models (MDMs) generate discrete sequences by iterative denoising under an absorbing masking process. In standard masked diffusion, if a token remains masked after a reverse update, the model discards its clean-state prediction for tha...
End-to-end autonomous scientific discovery on a real optical platform
arXiv:2604.27092v1 Announce Type: new
Abstract: Scientific research has long been human-led, driving new knowledge and transformative technologies through the continual revision of questions, methods and claims as evidence accumulates. Although large language model (LLM)-based agents are beginning ...
arXiv:2604.27007v1 Announce Type: new
Abstract: We provide a causal analysis of Binary Spiking Neural Networks (BSNNs) to explain their behavior. We formally define a BSNN and represent its spiking activity as a binary causal model. Thanks to this causal representation, we are able to explain the o...
When Your LLM Reaches End-of-Life: A Framework for Confident Model Migration in Production Systems
arXiv:2604.27082v1 Announce Type: new
Abstract: We present a framework for migrating production Large Language Model (LLM) based systems when the underlying model reaches end-of-life or requires replacement. The key contribution is a Bayesian statistical approach that calibrates automated evaluatio...
Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI
arXiv:2604.27096v1 Announce Type: new
Abstract: The purpose of our paper is to develop a unified multi-agent architecture that automates end-to-end machine learning (ML) pipeline generation from datasets and natural-language (NL) goals, improving efficiency, robustness and explainability. A five-ag...
A Survey of Multi-Agent Deep Reinforcement Learning with Graph Neural Network-Based Communication
arXiv:2604.25972v1 Announce Type: new
Abstract: In multi-agent reinforcement learning (MARL), the integration of a communication mechanism, allowing agents to better learn to coordinate their actions and converge on their objectives by sharing information. Based on an interaction graph, a subclass ...
Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective
arXiv:2604.25975v1 Announce Type: new
Abstract: Key-value (KV) caching is essential for large language model inference, yet its memory overhead poses a critical bottleneck for long-context generation. Existing eviction policies predominantly rely on empirical heuristics, lacking a rigorous theoreti...
A Multimodal and Explainable Machine Learning Approach to Diagnosing Multi-Class Ejection Fraction from Electrocardiograms
arXiv:2604.25942v1 Announce Type: new
Abstract: Left ventricular ejection fraction (LVEF) assessment depends on echocardiography, limiting access in primary care and resource-constrained settings. We developed a multimodal machine-learning framework that combines engineered 12-lead ECG timeseries f...
A Randomized PDE Energy driven Iterative Framework for Efficient and Stable PDE Solutions
arXiv:2604.25943v1 Announce Type: new
Abstract: Efficient and stable solution of partial differential equations (PDEs) is central to scientific and engineering applications, yet existing numerical solvers rely heavily on matrix based discretizations, while learning based methods require costly trai...
Evaluating Strategic Reasoning in Forecasting Agents
arXiv:2604.26106v1 Announce Type: new
Abstract: Forecasting benchmarks produce accuracy leaderboards but little insight into why some forecasters are more accurate than others. We introduce Bench to the Future 2 (BTF-2), 1,417 pastcasting questions with a frozen 15M-document research corpus in whic...
OMEGA: Optimizing Machine Learning by Evaluating Generated Algorithms
arXiv:2604.26211v1 Announce Type: new
Abstract: In order to automate AI research we introduce a full, end-to-end framework, OMEGA: Optimizing Machine learning by Evaluating Generated Algorithms, that starts at idea generation and ends with executable code. Our system combines structured meta-prompt...
Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields
arXiv:2604.26095v1 Announce Type: new
Abstract: {Closed-loop inverse source localization and characterization (ISLC) requires a mobile agent to select measurements that localize sources and infer latent field parameters under strict time constraints.} {The core challenge lies in the belief-space ob...