Neural Signals Generate Clinical Notes in the Wild
arXiv:2601.22197v1 Announce Type: new
Abstract: Generating clinical reports that summarize abnormal patterns, diagnostic findings, and clinical interpretations from long-term EEG recordings remains labor-intensive. We curate a large-scale clinical EEG dataset with $9{,}922$ reports paired with appr...
Multitask Learning for Earth Observation Data Classification with Hybrid Quantum Network
arXiv:2601.22195v1 Announce Type: new
Abstract: Quantum machine learning (QML) has gained increasing attention as a potential solution to address the challenges of computation requirements in the future. Earth observation (EO) has entered the era of Big Data, and the computational demands for effec...
arXiv:2601.22269v1 Announce Type: new
Abstract: Judge agents are fundamental to agentic AI frameworks: they provide automated evaluation, and enable iterative self-refinement of reasoning processes. We introduce JAF: Judge Agent Forest, a framework in which the judge agent conducts joint inference ...
Sparks of Rationality: Do Reasoning LLMs Align with Human Judgment and Choice?
arXiv:2601.22329v1 Announce Type: new
Abstract: Large Language Models (LLMs) are increasingly positioned as decision engines for hiring, healthcare, and economic judgment, yet real-world human judgment reflects a balance between rational deliberation and emotion-driven bias. If LLMs are to particip...
The Six Sigma Agent: Achieving Enterprise-Grade Reliability in LLM Systems Through Consensus-Driven Decomposed Execution
arXiv:2601.22290v1 Announce Type: new
Abstract: Large Language Models demonstrate remarkable capabilities yet remain fundamentally probabilistic, presenting critical reliability challenges for enterprise deployment. We introduce the Six Sigma Agent, a novel architecture that achieves enterprise-gra...
Learning Provably Correct Distributed Protocols Without Human Knowledge
arXiv:2601.22369v1 Announce Type: new
Abstract: Provably correct distributed protocols, which are a critical component of modern distributed systems, are highly challenging to design and have often required decades of human effort. These protocols allow multiple agents to coordinate to come to a co...
A generative machine learning model for designing metal hydrides applied to hydrogen storage
arXiv:2601.20892v1 Announce Type: new
Abstract: Developing new metal hydrides is a critical step toward efficient hydrogen storage in carbon-neutral energy systems. However, existing materials databases, such as the Materials Project, contain a limited number of well-characterized hydrides, which c...
arXiv:2601.20884v1 Announce Type: new
Abstract: Multimodal pretraining is effective for building general-purpose representations, but in many practical deployments, only one modality is heavily used during downstream fine-tuning. Standard pretraining strategies treat all modalities uniformly, which...
Rethinking LLM-Driven Heuristic Design: Generating Efficient and Specialized Solvers via Dynamics-Aware Optimization
arXiv:2601.20868v1 Announce Type: new
Abstract: Large Language Models (LLMs) have advanced the field of Combinatorial Optimization through automated heuristic generation. Instead of relying on manual design, this LLM-Driven Heuristic Design (LHD) process leverages LLMs to iteratively generate and r...
Bayesian-LoRA: Probabilistic Low-Rank Adaptation of Large Language Models
arXiv:2601.21003v1 Announce Type: new
Abstract: Large Language Models usually put more emphasis on accuracy and therefore, will guess even when not certain about the prediction, which is especially severe when fine-tuned on small datasets due to the inherent tendency toward miscalibration. In this ...
Latent Object Permanence: Topological Phase Transitions, Free-Energy Principles, and Renormalization Group Flows in Deep Transformer Manifolds
arXiv:2601.19942v1 Announce Type: new
Abstract: We study the emergence of multi-step reasoning in deep Transformer language models through a geometric and statistical-physics lens. Treating the hidden-state trajectory as a flow on an implicit Riemannian manifold, we analyze the layerwise covariance...
oculomix: Hierarchical Sampling for Retinal-Based Systemic Disease Prediction
arXiv:2601.19939v1 Announce Type: new
Abstract: Oculomics - the concept of predicting systemic diseases, such as cardiovascular disease and dementia, through retinal imaging - has advanced rapidly due to the data efficiency of transformer-based foundation models like RETFound. Image-level mixed sam...
Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data
arXiv:2601.19936v1 Announce Type: new
Abstract: The opacity of massive pretraining corpora in Large Language Models (LLMs) raises significant privacy and copyright concerns, making pretraining data detection a critical challenge. Existing state-of-the-art methods typically rely on token likelihoods...
Continuous-Flow Data-Rate-Aware CNN Inference on FPGA
arXiv:2601.19940v1 Announce Type: new
Abstract: Among hardware accelerators for deep-learning inference, data flow implementations offer low latency and high throughput capabilities. In these architectures, each neuron is mapped to a dedicated hardware unit, making them well-suited for field-progra...
DecHW: Heterogeneous Decentralized Federated Learning Exploiting Second-Order Information
arXiv:2601.19938v1 Announce Type: new
Abstract: Decentralized Federated Learning (DFL) is a serverless collaborative machine learning paradigm where devices collaborate directly with neighbouring devices to exchange model information for learning a generalized model. However, variations in individu...
Insight Agents: An LLM-Based Multi-Agent System for Data Insights
arXiv:2601.20048v1 Announce Type: new
Abstract: Today, E-commerce sellers face several key challenges, including difficulties in discovering and effectively utilizing available programs and tools, and struggling to understand and utilize rich data from various tools. We therefore aim to develop Ins...
Teaching LLMs to Ask: Self-Querying Category-Theoretic Planning for Under-Specified Reasoning
arXiv:2601.20014v1 Announce Type: new
Abstract: Inference-time planning with large language models frequently breaks under partial observability: when task-critical preconditions are not specified at query time, models tend to hallucinate missing facts or produce plans that violate hard constraints...
Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control
arXiv:2601.20090v1 Announce Type: new
Abstract: Large language model (LLM)-powered agents can translate high-level user intents into plans and actions in an environment. Yet after observing an outcome, users may wonder: What if I had phrased my intent differently? We introduce a framework that enab...
IPBC: An Interactive Projection-Based Framework for Human-in-the-Loop Semi-Supervised Clustering of High-Dimensional Data
arXiv:2601.18828v1 Announce Type: new
Abstract: High-dimensional datasets are increasingly common across scientific and industrial domains, yet they remain difficult to cluster effectively due to the diminishing usefulness of distance metrics and the tendency of clusters to collapse or overlap when...
Variational Quantum Circuit-Based Reinforcement Learning for Dynamic Portfolio Optimization
arXiv:2601.18811v1 Announce Type: new
Abstract: This paper presents a Quantum Reinforcement Learning (QRL) solution to the dynamic portfolio optimization problem based on Variational Quantum Circuits. The implemented QRL approaches are quantum analogues of the classical neural-network-based Deep De...
arXiv:2601.18833v1 Announce Type: new
Abstract: Since the early 90s, the evolution of the Business Process Management (BPM) discipline has been punctuated by successive waves of automation technologies. Some of these technologies enable the automation of individual tasks, while others focus on orch...
LLM Driven Design of Continuous Optimization Problems with Controllable High-level Properties
arXiv:2601.18846v1 Announce Type: new
Abstract: Benchmarking in continuous black-box optimisation is hindered by the limited structural diversity of existing test suites such as BBOB. We explore whether large language models embedded in an evolutionary loop can be used to design optimisation proble...
RIFT: Reordered Instruction Following Testbed To Evaluate Instruction Following in Singular Multistep Prompt Structures
arXiv:2601.18924v1 Announce Type: new
Abstract: Large Language Models (LLMs) are increasingly relied upon for complex workflows, yet their ability to maintain flow of instructions remains underexplored. Existing benchmarks conflate task complexity with structural ordering, making it difficult to is...