Stay ahead of the generative AI revolution!Join the M5B Newsletter →

AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

All Engineering Hardware Jobs News Research Tools Tutorials

News AI TechCrunch Analytics Vidhya Data Science Towards Data Science Medium GenAI Textual OpenAI Google MIT Microsoft HuggingFace OpenSource Models NVIDIA GPU Enterprise ArXiv

Tool• May 21, 2026

Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine

arXiv:2605.20235v1 Announce Type: new Abstract: Diffusion models generate high-dimensional data with remarkable quality, yet how their training efficiently learns the score function, bypassing the curse of dimensionality when data is supported on low-dimensional manifolds, remains theoretically une...

#ArXiv#Machine Learning#Academic

Tool• May 21, 2026

GraphDiffMed: Knowledge-Constrained Differential Attention with Pharmacological Graph Priors for Medication Recommendation

arXiv:2605.20188v1 Announce Type: new Abstract: Recommending safe and effective medication combinations from electronic health records (EHRs) is a core clinical AI problem, yet it remains difficult because patient trajectories are long, noisy, and clinically heterogeneous. Existing methods typicall...

#ArXiv#Machine Learning#Academic

Tool• May 21, 2026

Neural Estimation of Pairwise Mutual Information in Masked Discrete Sequence Models

arXiv:2605.20187v1 Announce Type: new Abstract: Understanding dependencies between variables is critical for interpretability and efficient generation in masked diffusion models (MDMs), yet these models primarily expose marginal conditional distributions and do not explicitly represent inter-variab...

#ArXiv#Machine Learning#Academic

Tool• May 21, 2026

SOLAR: A Self-Optimizing Open-Ended Autonomous Agent for Lifelong Learning and Continual Adaptation

arXiv:2605.20189v1 Announce Type: new Abstract: Despite the remarkable success of large language models (LLMs), they still face bottlenecks while deploying in dynamic, real-world settings with primary challenges being concept drift and the high cost of gradient-based adaptation. Traditional fine-tu...

#ArXiv#Machine Learning#Academic

Tool• May 21, 2026

Tool-Augmented Agent for Closed-loop Optimization,Simulation,and Modeling Orchestration

arXiv:2605.20190v1 Announce Type: new Abstract: Iterative industrial design-simulation optimization is bottlenecked by the CAD-CAE semantic gap: translating simulation feedback into valid geometric edits under diverse, coupled constraints. To fill this gap, we propose COSMO-Agent (Closed-loop Optim...

#ArXiv#Machine Learning#Academic

Tool• May 21, 2026

AgentCo-op: Retrieval-Based Synthesis of Interoperable Multi-Agent Workflows

arXiv:2605.20425v1 Announce Type: new Abstract: Designing multi-agent workflows is especially difficult in open-ended scientific settings where tasks lack curated training sets, reliable scalar evaluation metrics, and standardized interfaces between existing tools and agents. We propose AgentCo-op,...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

HELLoRA: Hot Experts Layer-Level Low-Rank Adaptation for Mixture-of-Experts Models

arXiv:2605.18795v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning of large language models, yet most variants target dense architectures. Mixture-of-Experts (MoE) models scale parameters at near-constant per-token compute, and their sparse activati...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Dimensional Balance Improves Large Scale Spatiotemporal Prediction Performance

arXiv:2605.18793v1 Announce Type: new Abstract: Accurate spatiotemporal pattern analysis is critical in fields such as urban traffic, meteorology, and public health monitoring. However, existing methods face performance bottlenecks, typically yielding only incremental gains and often exhibiting lim...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Robust Basis Spline Decoupling for the Compression of Transformer Models

arXiv:2605.18794v1 Announce Type: new Abstract: Decoupling is a powerful modeling paradigm for representing multivariate functions as compositions of linear transformations and univariate nonlinear functions. A single-layer decoupling can be viewed as a fully connected neural network with a single ...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing

arXiv:2605.18796v1 Announce Type: new Abstract: LLM cascades and model routing promise lower inference cost by sending easy queries to a small model and escalating hard ones to a large model, but most deployed routers use uncalibrated confidence scores and require per-workload threshold tuning. We ...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Simply Stabilizing the Loop via Fully Looped Transformer

arXiv:2605.18797v1 Announce Type: new Abstract: Scaling model performance typically requires increasing model size. Looped Transformer offers a compelling alternative by iteratively reusing the same Transformer blocks, trading additional computation for improved performance without increasing param...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

AgentNLQ: A General-Purpose Agent for Natural Language to SQL

arXiv:2605.19010v1 Announce Type: new Abstract: Natural language to SQL (NL2SQL) conversion is an important problem for researchers and enterprises due to the ubiquitous importance of relational databases in broad-ranging practical problems. Despite the rapid advancements in the capabilities of LLM...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance

arXiv:2605.18801v1 Announce Type: new Abstract: Data is fundamental to large language models (LLMs). However, understanding of what makes certain data useful for different stages of an LLM workflow, including training, tuning, alignment, in-context learning, etc., and why, remains an open question....

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiency

arXiv:2605.19008v1 Announce Type: new Abstract: Modern language-model training is increasingly exposed to instability, degraded runs, and wasted compute, especially under aggressive learning-rate, scale, and runtime-stress conditions. This paper introduces Learn-by-Wire Guard (LBW-Guard), a bounded...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Operationalizing Document AI: A Microservice Architecture for OCR and LLM Pipelines in Production

arXiv:2605.18818v1 Announce Type: new Abstract: Academic research tends to focus on new models for document understanding creating a wide gap in the literature between model definition and running models at production scale. To close that gap, we present a microservice architecture that encapsulate...

#ArXiv#Machine Learning#Academic

Tool• May 20, 2026

Evaluating the Utility of Personal Health Records in Personalized Health AI

arXiv:2605.18937v1 Announce Type: new Abstract: Patient-managed Personal Health Records (PHRs) promises to empower patients to better understand their health; but information in the record is complex, potentially hindering insights. In this study, we assess the potential of large language models (L...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra

arXiv:2605.16259v1 Announce Type: new Abstract: While real-time image generation using diffusion models has advanced rapidly on NVIDIA GPUs, systematic optimization research on non-CUDA platforms such as Apple Silicon remains extremely limited. In this study, we conducted comprehensive optimization...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

Reducing Credit Assignment Variance via Counterfactual Reasoning Paths

arXiv:2605.16302v1 Announce Type: new Abstract: Reinforcement learning for multi-step reasoning with large language models (LLMs) often relies on sparse terminal rewards, leading to poor credit assignment conditions where the final feedback is evenly propagated across all intermediate decisions. Th...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

arXiv:2605.16312v1 Announce Type: new Abstract: We study adversarial action masking in self-play reinforcement learning: an attacker selectively removes legal actions from a victim's action set. Unlike observation or action perturbations, removal eliminates decision options before the agent acts. A...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

SignMuon: Communication-Efficient Distributed Muon Optimization

arXiv:2605.16311v1 Announce Type: new Abstract: Distributed training of large neural networks is bottlenecked by full-precision gradient communication and by coordinatewise optimizers that ignore the matrix structure of weight tensors. We propose Sign-Muon, a 1-bit, matrix-aware optimizer that comb...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

Mirror Descent-Type Algorithms for the Variational Inequality Problem with Functional Constraints

arXiv:2605.16262v1 Announce Type: new Abstract: Variational inequalities play a key role in machine learning research, such as generative adversarial networks, reinforcement learning, adversarial training, and generative models. This paper is devoted to the constrained variational inequality proble...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

AgentWall: A Runtime Safety Layer for Local AI Agents

arXiv:2605.16265v1 Announce Type: new Abstract: The safety of autonomous AI agents is increasingly recognized as a critical open problem. As agents transition from passive text generators to active actors capable of executing shell commands, modifying files, calling APIs, and browsing the web, the ...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

From Prompts to Protocols: An AI Agent for Laboratory Automation

arXiv:2605.16552v1 Announce Type: new Abstract: Automating science laboratories enables faster, safer, more accurate, and more reproducible execution of protocols, accelerating the discovery and testing of new materials, drugs, and more. However, setting up and running autonomous labs requires coor...

#ArXiv#Machine Learning#Academic

Tool• May 19, 2026

Skim: Speculative Execution for Fast and Efficient Web Agents

arXiv:2605.16565v2 Announce Type: new Abstract: Skim is a speculative execution framework for web agents that exploits the predictable structure of purpose-built websites. Today's web-agent expense is not intrinsic to the tasks but a property of how agents are composed: frontier-model inference, br...

#ArXiv#Machine Learning#Academic

1...13 14 15 16 17...48