AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Apr 6, 2026

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

arXiv:2604.02338v1 Announce Type: new Abstract: MoE-PEFT methods combine Mixture of Experts with parameter-efficient fine-tuning for multi-task adaptation, but require separate adapters per expert causing trainable parameters to scale linearly with expert count and limiting applicability to adapter...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

SIEVE: Sample-Efficient Parametric Learning from Natural Language

arXiv:2604.02339v1 Announce Type: new Abstract: Natural language context-such as instructions, knowledge, or feedback-contains rich signal for adapting language models. While in-context learning provides adaptation via the prompt, parametric learning persists into model weights and can improve perf...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

arXiv:2604.02340v1 Announce Type: new Abstract: Recent advances in masked diffusion language models (MDLMs) narrow the quality gap to autoregressive LMs, but their sampling remains expensive because generation requires many full-sequence denoising passes with a large Transformer and, unlike autoreg...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web

arXiv:2604.02334v1 Announce Type: new Abstract: As large language models (LLM)-driven agents transition from isolated task solvers to persistent digital entities, the emergence of the Agentic Web, an ecosystem where heterogeneous agents autonomously interact and co-evolve, marks a pivotal shift tow...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation

arXiv:2604.02368v1 Announce Type: new Abstract: As Large Language Models (LLMs) exhibit plateauing performance on conventional benchmarks, a pivotal challenge persists: evaluating their proficiency in complex, open-ended tasks characterizing genuine expert-level cognition. Existing frameworks suffe...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space

arXiv:2604.02476v1 Announce Type: new Abstract: This paper examines the role of threshold logic in understanding generative artificial intelligence. Threshold functions, originally studied in the 1960s in digital circuit synthesis, provide a structurally transparent model of neural computation: a w...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems

arXiv:2604.02478v1 Announce Type: new Abstract: Deep learning models excel at detecting anomaly patterns in normal data. However, they do not provide a direct solution for anomaly classification and scalability across diverse control systems, frequently failing to distinguish genuine faults from nu...

#ArXiv#Machine Learning#Academic

Tool• Apr 6, 2026

SQUIRE: Interactive UI Authoring via Slot QUery Intermediate REpresentations

Frontend developers create UI prototypes to evaluate alternatives, which is a time-consuming process of repeated iteration and refinement. Generative AI code assistants enable rapid prototyping simply by prompting through a chat interface rather than writing code. However, while this interaction giv...

#Apple#On-device AI

Tool• Apr 3, 2026

An Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance Analysis

arXiv:2604.01308v1 Announce Type: new Abstract: Designing reliable integrated energy systems for industrial processes requires optimization and verification models across multiple fidelities, from architecture-level sizing to high-fidelity dynamic operation. However, model mismatch across fidelitie...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Gemma 4: Byte for byte, the most capable open models

Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.

#DeepMind#Google#AGI

Tool• Apr 2, 2026

Two-Stage Optimizer-Aware Online Data Selection for Large Language Models

arXiv:2604.00001v1 Announce Type: new Abstract: Gradient-based data selection offers a principled framework for estimating sample utility in large language model (LLM) fine-tuning, but existing methods are mostly designed for offline settings. They are therefore less suited to online fine-tuning, w...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Task-Centric Personalized Federated Fine-Tuning of Language Models

arXiv:2604.00050v1 Announce Type: new Abstract: Federated Learning (FL) has emerged as a promising technique for training language models on distributed and private datasets of diverse tasks. However, aggregating models trained on heterogeneous tasks often degrades the overall performance of indivi...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Temporal Memory for Resource-Constrained Agents: Continual Learning via Stochastic Compress-Add-Smooth

arXiv:2604.00067v1 Announce Type: new Abstract: An agent that operates sequentially must incorporate new experience without forgetting old experience, under a fixed memory budget. We propose a framework in which memory is not a parameter vector but a stochastic process: a Bridge Diffusion on a repl...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction

arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consistent outputs, while complex cases produce divergent predictions under minor prompt changes. Existing single-agent strategies sample from on...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents

arXiv:2604.00137v1 Announce Type: new Abstract: Tool-integrated LLMs can retrieve, compute, and take real-world actions via external tools, but reliability remains a key bottleneck. We argue that failures stem from both tool-use accuracy (how well an agent invokes a tool) and intrinsic tool accurac...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

A Safety-Aware Role-Orchestrated Multi-Agent LLM Framework for Behavioral Health Communication Simulation

arXiv:2604.00249v1 Announce Type: new Abstract: Single-agent large language model (LLM) systems struggle to simultaneously support diverse conversational functions and maintain safety in behavioral health communication. We propose a safety-aware, role-orchestrated multi-agent LLM framework designed...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Human-in-the-Loop Control of Objective Drift in LLM-Assisted Computer Science Education

arXiv:2604.00281v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly embedded in computer science education through AI-assisted programming tools, yet such workflows often exhibit objective drift, in which locally plausible outputs diverge from stated task specifications. E...

#ArXiv#Machine Learning#Academic

Tool• Apr 2, 2026

Apr 2, 2026InterpretabilityEmotion concepts and their function in a large language model

#Anthropic#Claude#LLM

Tool• Apr 1, 2026

ADeLe: Predicting and explaining AI performance across tasks

AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their performance. They do not explain failures or reliably predict outcomes on new tasks. To address this, Microsoft researchers in collaboration ...

#Microsoft#Research

Tool• Apr 1, 2026

How AIRA2 breaks AI research bottlenecks

While we've seen remarkable progress in AI for coding and mathematics, creating agents that can navigate the messy, open-ended nature of real research (where things break for no obvious reason) has proven far more challenging.

#AI Accelerator Institute#AI#Research

Tool• Apr 1, 2026

OneComp: One-Line Revolution for Generative AI Model Compression

arXiv:2603.28845v1 Announce Type: new Abstract: Deploying foundation models is increasingly constrained by memory footprint, latency, and hardware costs. Post-training compression can mitigate these bottlenecks by reducing the precision of model parameters without significantly degrading performanc...

#ArXiv#Machine Learning#Academic

Tool• Apr 1, 2026

Beta-Scheduling: Momentum from Critical Damping as a Diagnostic and Correction Tool for Neural Network Training

arXiv:2603.28921v1 Announce Type: new Abstract: Standard neural network training uses constant momentum (typically 0.9), a convention dating to 1964 with limited theoretical justification for its optimality. We derive a time-varying momentum schedule from the critically damped harmonic oscillator...

#ArXiv#Machine Learning#Academic

Tool• Apr 1, 2026

A Neural Tension Operator for Curve Subdivision across Constant Curvature Geometries

arXiv:2603.28937v1 Announce Type: new Abstract: Interpolatory subdivision schemes generate smooth curves from piecewise-linear control polygons by repeatedly inserting new vertices. Classical schemes rely on a single global tension parameter and typically require separate formulations in Euclidean,...

#ArXiv#Machine Learning#Academic

Tool• Apr 1, 2026

Foundations of Polar Linear Algebra

arXiv:2603.28939v1 Announce Type: new Abstract: This work revisits operator learning from a spectral perspective by introducing Polar Linear Algebra, a structured framework based on polar geometry that combines a linear radial component with a periodic angular component. Starting from this formulat...

#ArXiv#Machine Learning#Academic

← Prev

1...5 6 7 8 9...37