AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Mar 24, 2026

Scaling Synthetic Task Generation for Agents via Exploration

Post-Training Multimodal Large Language Models (MLLMs) to build interactive agents holds promise across domains such as computer-use, web navigation, and robotics. A key challenge in scaling such post-training is lack of high-quality downstream agentic task datasets with tasks that are diverse, feas...

#Apple#On-device AI

Tool• Mar 23, 2026

Will machines ever be intelligent?

Are machines truly intelligent? AI researchers Subutai Ahmad and Nicolò Fusi join Doug Burger to compare transformer-based AI with the human brain, exploring continual learning, efficiency, and whether today’s models are on a path toward human intelligence. The post Will machines ever be intelligent...

#Microsoft#Research

Tool• Mar 23, 2026

Speculating Experts Accelerates Inference for Mixture-of-Experts

arXiv:2603.19289v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models have gained popularity as a means of scaling the capacity of large language models (LLMs) while maintaining sparse activations and reduced per-token compute. However, in memory-constrained inference settings, expert wei...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

A Visualization for Comparative Analysis of Regression Models

arXiv:2603.19291v1 Announce Type: new Abstract: As regression is a widely studied problem, many methods have been proposed to solve it, each of them often requiring setting different hyper-parameters. Therefore, selecting the proper method for a given application may be very difficult and relies on...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data

arXiv:2603.19294v1 Announce Type: new Abstract: While post-training has successfully improved large language models (LLMs) across a variety of domains, these gains heavily rely on human-labeled data or external verifiers. Existing data has already been exploited, and new high-quality data is expens...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly

arXiv:2603.19296v1 Announce Type: new Abstract: To tackle the huge computational demand of large foundation models, activation-aware compression techniques without retraining have been introduced. However, since these methods highly rely on calibration data, domain shift issues may arise for unseen...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)

arXiv:2603.19429v1 Announce Type: new Abstract: Classical planning problems are typically defined using lifted first-order representations, which offer compactness and generality. While most planners ground these representations to simplify reasoning, this can cause an exponential blowup in size. R...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

Hyperagents

arXiv:2603.19461v1 Announce Type: new Abstract: Self-improving AI systems aim to reduce reliance on human engineering by learning to improve their own learning and problem-solving processes. Existing approaches to self-improvement rely on fixed, handcrafted meta-level mechanisms, fundamentally limi...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

Teaching an Agent to Sketch One Part at a Time

arXiv:2603.19500v1 Announce Type: new Abstract: We develop a method for producing vector sketches one part at a time. To do this, we train a multi-modal language model-based agent using a novel multi-turn process-reward reinforcement learning following supervised fine-tuning. Our approach is enable...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

Learning to Disprove: Formal Counterexample Generation with Large Language Models

arXiv:2603.19514v1 Announce Type: new Abstract: Mathematical reasoning demands two critical, complementary skills: constructing rigorous proofs for true statements and discovering counterexamples that disprove false ones. However, current AI efforts in mathematics focus almost exclusively on proof ...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv:2603.19515v1 Announce Type: new Abstract: Large language models (LLMs) with advanced cognitive capabilities are emerging as agents for various reasoning and planning tasks. Traditional evaluations often focus on specific reasoning or planning questions within controlled environments. Recent s...

#ArXiv#Machine Learning#Academic

Tool• Mar 23, 2026

Optimal Splitting of Language Models from Mixtures to Specialized Domains

This paper was accepted at the Workshop on Navigating and Addressing Data Problems for Foundation Models at ICLR 2026. Language models achieve impressive performance on a variety of knowledge, language, and reasoning tasks due to the scale and diversity of pretraining data available. The standard tr...

#Apple#On-device AI

Tool• Mar 20, 2026

Frayed RoPE and Long Inputs: A Geometric Perspective

arXiv:2603.18017v1 Announce Type: new Abstract: Rotary Positional Embedding (RoPE) is a widely adopted technique for encoding position in language models, which, while effective, causes performance breakdown when input length exceeds training length. Prior analyses assert (rightly) that long inputs...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Engineering Verifiable Modularity in Transformers via Per-Layer Supervision

arXiv:2603.18029v1 Announce Type: new Abstract: Transformers resist surgical control. Ablating an attention head identified as critical for capitalization produces minimal behavioral change because distributed redundancy compensates for damage. This Hydra effect renders interpretability illusory: w...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

InfoMamba: An Attention-Free Hybrid Mamba-Transformer Model

arXiv:2603.18031v1 Announce Type: new Abstract: Balancing fine-grained local modeling with long-range dependency capture under computational constraints remains a central challenge in sequence modeling. While Transformers provide strong token mixing, they suffer from quadratic complexity, whereas M...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Taming Epilepsy: Mean Field Control of Whole-Brain Dynamics

arXiv:2603.18035v1 Announce Type: new Abstract: Controlling the high-dimensional neural dynamics during epileptic seizures remains a significant challenge due to the nonlinear characteristics and complex connectivity of the brain. In this paper, we propose a novel framework, namely Graph-Regularize...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

arXiv:2603.18048v1 Announce Type: new Abstract: Recent Audio Multimodal Large Language Models (Audio MLLMs) demonstrate impressive performance on speech benchmarks, yet it remains unclear whether these models genuinely process acoustic signals or rely on text-based semantic inference. To systematic...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Continually self-improving AI

arXiv:2603.18073v1 Announce Type: new Abstract: Modern language model-based AI systems are remarkably powerful, yet their capabilities remain fundamentally capped by their human creators in three key ways. First, although a model's weights can be updated via fine-tuning, acquiring new knowledge fro...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Multi-Trait Subspace Steering to Reveal the Dark Side of Human-AI Interaction

arXiv:2603.18085v1 Announce Type: new Abstract: Recent incidents have highlighted alarming cases where human-AI interactions led to negative psychological outcomes, including mental health crises and even user harm. As LLMs serve as sources of guidance, emotional support, and even informal therapy,...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

arXiv:2603.18104v1 Announce Type: new Abstract: Prevailing AI training infrastructure assumes reverse-mode automatic differentiation over IEEE-754 arithmetic. The memory overhead of training relative to inference, optimizer complexity, and structural degradation of geometric properties through trai...

#ArXiv#Machine Learning#Academic

Tool• Mar 20, 2026

Don't Vibe Code, Do Skele-Code: Interactive No-Code Notebooks for Subject Matter Experts to Build Lower-Cost Agentic Workflows

arXiv:2603.18122v1 Announce Type: new Abstract: Skele-Code is a natural-language and graph-based interface for building workflows with AI agents, designed especially for less or non-technical users. It supports incremental, interactive notebook-style development, and each step is converted to code ...

#ArXiv#Machine Learning#Academic

Tool• Mar 19, 2026

How AI in life sciences is reshaping healthcare

How can life sciences organizations overcome rising costs, regulatory complexity, and data silos to bring therapies to market faster, without compromising patient safety, and could AI be the key to doing so responsibly?

#AI Accelerator Institute#AI#Research

Tool• Mar 19, 2026

NVIDIA GTC 2026: The AI stack gets real

At NVIDIA’s GTC 2026, CEO Jensen Huang laid out a sweeping vision for AI’s next era. From chips and agent frameworks to robotics and real-time graphics, Huang’s keynote made one thing clear: The future of AI will be built on infrastructure, and NVIDIA intends to own it.

#AI Accelerator Institute#AI#Research

Tool• Mar 19, 2026

A foundation model for electrodermal activity data

arXiv:2603.16878v1 Announce Type: new Abstract: Foundation models have recently extended beyond natural language and vision to timeseries domains, including physiological signals. However, progress in electrodermal activity (EDA) modeling is hindered by the absence of large-scale, curated, and open...

#ArXiv#Machine Learning#Academic

← Prev

1...34 35 36 37 38...63