arXiv:2603.04448v1 Announce Type: new
Abstract: Current AI agents can flexibly invoke tools and execute complex tasks, yet their long-term advancement is hindered by the lack of systematic accumulation and transfer of skills. Without a unified mechanism for skill consolidation, agents frequently ``...
Thin Keys, Full Values: Reducing KV Cache via Low-Dimensional Attention Selection
arXiv:2603.04427v1 Announce Type: new
Abstract: Standard transformer attention uses identical dimensionality for queries, keys, and values ($d_q = d_k = d_v = d_{\text{model}}$). Our insight is that these components serve fundamentally different roles, and this symmetry is unnecessary. Queries and keys prod...
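The asymmetry this abstract proposes can be illustrated with a generic single-head attention sketch: queries and keys are projected to a thin dimension d_k used only for computing selection scores, while values keep full width. The dimensions and weight matrices below are illustrative choices, not the paper's architecture.

```python
import numpy as np

def attention_thin_keys(x, Wq, Wk, Wv, Wo):
    """Scaled dot-product attention with thin query/key projections
    (d_k << d_model) and full-width values.  Only the selection
    scores use the thin space; the K cache shrinks accordingly."""
    q = x @ Wq                                   # (n, d_k) thin queries
    k = x @ Wk                                   # (n, d_k) thin keys
    v = x @ Wv                                   # (n, d_v) full-width values
    scores = q @ k.T / np.sqrt(k.shape[-1])      # (n, n) selection scores
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)    # softmax over keys
    return (weights @ v) @ Wo                    # (n, d_model)

rng = np.random.default_rng(0)
d_model, d_k, d_v, n = 64, 16, 64, 10            # d_k << d_model
x  = rng.standard_normal((n, d_model))
Wq = 0.1 * rng.standard_normal((d_model, d_k))
Wk = 0.1 * rng.standard_normal((d_model, d_k))
Wv = 0.1 * rng.standard_normal((d_model, d_v))
Wo = 0.1 * rng.standard_normal((d_v, d_model))
out = attention_thin_keys(x, Wq, Wk, Wv, Wo)
print(out.shape)  # (10, 64)
```

With d_k = 16 instead of 64, the per-token key storage drops to a quarter while the value path, which carries the content, stays full width.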
Delta-Crosscoder: Robust Crosscoder Model Diffing in Narrow Fine-Tuning Regimes
arXiv:2603.04426v1 Announce Type: new
Abstract: Model diffing methods aim to identify how fine-tuning changes a model's internal representations. Crosscoders approach this by learning shared dictionaries of interpretable latent directions between base and fine-tuned models. However, existing formul...
Machine Learning for Complex Systems Dynamics: Detecting Bifurcations in Dynamical Systems with Deep Neural Networks
arXiv:2603.04420v1 Announce Type: new
Abstract: Critical transitions are the abrupt shifts between qualitatively different states of a system, and they are crucial to understanding tipping points in complex dynamical systems across ecology, climate science, and biology. Detecting these shifts typic...
Decorrelating the Future: Joint Frequency Domain Learning for Spatio-temporal Forecasting
arXiv:2603.04418v1 Announce Type: new
Abstract: Standard direct forecasting models typically rely on point-wise objectives such as Mean Squared Error, which fail to capture the complex spatio-temporal dependencies inherent in graph-structured signals. While recent frequency-domain approaches such a...
Multi-Frequency Fusion for Robust Video Face Forgery Detection
Current face video forgery detectors use wide or dual-stream backbones. We show that a single, lightweight fusion of two handcrafted cues can achieve higher accuracy with a much smaller model. Based on the Xception baseline model (21.9 million parameters), we build two detectors: LFWS, which adds a ...
Flow models parameterized as time-dependent velocity fields can generate data from noise by integrating an ODE. These models are often trained using flow matching, i.e. by sampling random pairs of noise and target points $(\mathbf{x}_0, \mathbf{x}_1)$ and ensuring that the velocity fie...
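The training recipe described here can be sketched as a Monte-Carlo loss under the common linear interpolation path; the function below is a generic illustration, not this post's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def flow_matching_loss(velocity_field, x1, n_samples=256):
    """Flow-matching loss with a straight path: sample noise x0 ~ N(0, I),
    a time t ~ U[0, 1], form x_t = (1 - t) x0 + t x1, and regress the
    model's velocity at (x_t, t) onto the path velocity x1 - x0."""
    idx = rng.integers(0, len(x1), n_samples)
    x1s = x1[idx]                                 # target points
    x0s = rng.standard_normal(x1s.shape)          # noise endpoints
    t = rng.uniform(0, 1, (n_samples, 1))
    xt = (1 - t) * x0s + t * x1s                  # point on the path
    target = x1s - x0s                            # constant path velocity
    pred = velocity_field(xt, t)
    return np.mean((pred - target) ** 2)

# toy target data and a trivial (untrained) velocity field
data = rng.standard_normal((1000, 2)) + 3.0
loss = flow_matching_loss(lambda x, t: np.zeros_like(x), data)
print(loss > 0.0)  # True: the zero field has not matched the flow
```

Minimizing this loss over a parameterized velocity field, then integrating the ODE from noise, yields samples from the data distribution.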
GenCtrl -- A Formal Controllability Toolkit for Generative Models
As generative models become ubiquitous, there is a critical need for fine-grained control over the generation process. Yet, while controlled generation methods from prompting to fine-tuning proliferate, a fundamental question remains unanswered: are these models truly controllable in the first place...
RAG that remembers: How AI is learning from every query
What if search systems didn’t just retrieve information, but remembered what worked? Expanded Relevance Memory (ERM) proves that query expansion and document expansion are mathematically equivalent, unlocking a powerful shift...
AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment
arXiv:2603.03686v1 Announce Type: new
Abstract: Automated design of chemical formulations is a cornerstone of materials science, yet it requires navigating a high-dimensional combinatorial space involving discrete compositional choices and continuous geometric constraints. Existing Large Language M...
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
arXiv:2603.03680v1 Announce Type: new
Abstract: Large Language Model (LLM) agents have demonstrated remarkable proficiency in learned tasks, yet they often struggle to adapt to non-stationary environments with feedback. While In-Context Learning and external memory offer some flexibility, they fail...
Mozi: Governed Autonomy for Drug Discovery LLM Agents
arXiv:2603.03655v1 Announce Type: new
Abstract: Tool-augmented large language model (LLM) agents promise to unify scientific reasoning with computation, yet their deployment in high-stakes domains like drug discovery is bottlenecked by two critical barriers: unconstrained tool-use governance and po...
Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants
arXiv:2603.03565v1 Announce Type: new
Abstract: Conversational shopping assistants (CSAs) represent a compelling application of agentic AI, but moving from prototype to production reveals two underexplored challenges: how to evaluate multi-turn interactions and how to optimize tightly coupled multi...
Asymmetric Goal Drift in Coding Agents Under Value Conflict
arXiv:2603.03456v1 Announce Type: new
Abstract: Agentic coding agents are increasingly deployed autonomously, at scale, and over long-context horizons. Throughout an agent's lifetime, it must navigate tensions between explicit instructions, learned values, and environmental pressures, often in cont...
Heterogeneous Time Constants Improve Stability in Equilibrium Propagation
arXiv:2603.03402v1 Announce Type: new
Abstract: Equilibrium propagation (EP) is a biologically plausible alternative to backpropagation for training neural networks. However, existing EP models use a uniform scalar time step dt, which corresponds biologically to a membrane time constant that is het...
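The heterogeneity the abstract argues for can be sketched as relaxation dynamics with a per-neuron time constant tau_i instead of one shared scalar step. This illustrates only the settling phase under assumed leaky-integrator dynamics, not EP's full two-phase training; all names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def relax(s0, W, tau, steps=400, dt=0.1):
    """Relax a recurrent network toward a fixed point with a
    heterogeneous per-neuron membrane time constant tau_i:
        ds_i/dt = (tanh(W s)_i - s_i) / tau_i
    rather than a single uniform scalar step size dt for all units."""
    s = s0.copy()
    for _ in range(steps):
        drive = np.tanh(W @ s)
        s += (dt / tau) * (drive - s)   # each neuron settles at its own rate
    return s

n = 8
W = 0.1 * rng.standard_normal((n, n))   # weak coupling: contraction mapping
tau = rng.uniform(0.5, 2.0, n)          # heterogeneous time constants
s = relax(rng.standard_normal(n), W, tau)
residual = np.max(np.abs(np.tanh(W @ s) - s))
print(residual < 1e-2)                  # True: state has settled
```

Faster and slower units reach the same fixed point; the claim in the abstract is that this heterogeneity improves the stability of the settling process EP relies on.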
Towards Improved Sentence Representations using Token Graphs
arXiv:2603.03389v1 Announce Type: new
Abstract: Obtaining a single-vector representation from a Large Language Model's (LLM) token-level outputs is a critical step for nearly all sentence-level tasks. However, standard pooling methods like mean or max aggregation treat tokens as an independent set,...
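The standard mean-pooling baseline the abstract contrasts against can be written in a few lines; the helper name and toy inputs below are illustrative.

```python
import numpy as np

def masked_mean_pool(token_embs, mask):
    """Collapse token-level outputs into one sentence vector by
    averaging over non-padding positions (mask == 1).  Tokens are
    treated as an independent set: no inter-token structure is used."""
    mask = mask[:, None].astype(float)            # (n_tokens, 1)
    return (token_embs * mask).sum(0) / mask.sum()

embs = np.array([[1.0, 2.0], [3.0, 4.0], [0.0, 0.0]])
mask = np.array([1, 1, 0])                        # last token is padding
print(masked_mean_pool(embs, mask))               # [2. 3.]
```

Because the average is order- and structure-agnostic, two sentences with the same bag of token embeddings pool to the same vector, which is the limitation graph-based aggregation aims to fix.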
RADAR: Learning to Route with Asymmetry-aware DistAnce Representations
arXiv:2603.03388v1 Announce Type: new
Abstract: Recent neural solvers have achieved strong performance on vehicle routing problems (VRPs), yet they mainly assume symmetric Euclidean distances, restricting applicability to real-world scenarios. A core challenge is encoding the relational features in...
AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis
arXiv:2603.03378v1 Announce Type: new
Abstract: Large language model (LLM) agents offer a promising data-driven approach to automating Site Reliability Engineering (SRE), yet their enterprise deployment is constrained by three challenges: restricted access to proprietary data, unsafe action executi...
Knowledge Graph and Hypergraph Transformers with Repository-Attention and Journey-Based Role Transport
arXiv:2603.03304v1 Announce Type: new
Abstract: We present a concise architecture for joint training on sentences and structured data while keeping knowledge and language representations separable. The model treats knowledge graphs and hypergraphs as structured instances with role slots and encodes...
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open-weight multimodal reasoning model, available through Microsoft Foundry, HuggingFace and GitHub. Phi-4-reasoning-vision-15B is a broadly capable model that can b...
Meta: From social platforms to systems architecture heavyweight
As Meta rebuilds its technical foundations to support multi-year model lifecycles, modular architectures, and reliability-first design, it is quietly reshaping how Silicon Valley thinks about production-grade AI.
SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning
arXiv:2603.02240v1 Announce Type: new
Abstract: We present SuperLocalMemory, a local-first memory system for multi-agent AI that defends against OWASP ASI06 memory poisoning through architectural isolation and Bayesian trust scoring, while personalizing retrieval through adaptive learning-to-rank -...
Engineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents
arXiv:2603.02239v1 Announce Type: new
Abstract: The Engineering Reasoning and Instruction (ERI) benchmark is a taxonomy-driven instruction dataset designed to train and evaluate engineering-capable large language models (LLMs) and agents. This dataset spans nine engineering fields (namely: civil, m...