AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Apr 29, 2026

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

arXiv:2604.24987v1 Announce Type: new Abstract: Chart-to-table translation converts chart images into structured tabular data. Accurate translation is crucial for a Multimodal Language Model (MLM) to answer complex queries. We observe imbalances in the number of images across different aspects of the...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Collaborative Coding, Better Scaling, Health Tracking: HAI Awards $2.17M to Innovative Research

Seed grants will fund 29 research teams pursuing novel research ideas across disciplines.

#Stanford#HAI#Ethics

Tool• Apr 29, 2026

Adaptive Thinking: Large Language Models Know When to Think in Latent Space

Recent advances in large language models (LLMs) test-time computing have introduced the capability to perform intermediate chain-of-thought (CoT) reasoning (thinking) before generating answers. While increasing the thinking budget yields smooth performance improvements at inference time, the relatio...

#Apple#On-device AI

Tool• Apr 29, 2026

DSO: Direct Steering Optimization for Bias Mitigation

Generative models are often deployed to make decisions on behalf of users, such as vision-language models (VLMs) identifying which person in a room is a doctor to help visually impaired individuals. Yet, VLM decisions are influenced by the perceived demographic attributes of people in the input, whi...

#Apple#On-device AI

Tool• Apr 28, 2026

Are your agents quietly draining your budget?

Revenium's new research reveals the financial risks of autonomous AI agents and offers a practical framework for governing costs before they spiral.

#AI Accelerator Institute#AI#Research

Tool• Apr 28, 2026

AI Builders Summit: Healthcare Boston 2026

Build and deploy secure, clinical-grade AI in one of the world’s most complex domains: healthcare.

#AI Accelerator Institute#AI#Research

Tool• Apr 28, 2026

The Spectral Lifecycle of Transformer Training: Transient Compression Waves, Persistent Spectral Gradients, and the Q/K–V Asymmetry

arXiv:2604.22778v1 Announce Type: new Abstract: We present the first systematic study of weight matrix singular value spectra during transformer pretraining, tracking full SVD decompositions of every weight matrix at 25-step intervals across three model scales (30M–285M parameters). We disc...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

KARL: Mitigating Hallucinations in LLMs via Knowledge-Boundary-Aware Reinforcement Learning

arXiv:2604.22779v1 Announce Type: new Abstract: Enabling large language models (LLMs) to appropriately abstain from answering questions beyond their knowledge is crucial for mitigating hallucinations. While existing reinforcement learning methods foster autonomous abstention, they often compromise ...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

BiTA: Bidirectional Gated Recurrent Unit-Transformer Aggregator in a Temporal Graph Network Framework for Alert Prediction in Computer Networks

arXiv:2604.22781v1 Announce Type: new Abstract: Proactive alert prediction in computer networks is critical for mitigating evolving cyber threats and enabling timely defensive actions. Temporal Graph Neural Networks (TGNs) provide a principled framework for modeling time-evolving interactions; howe...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing

arXiv:2604.22782v1 Announce Type: new Abstract: Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during autoregressive generation. The memory footprint of KV caching is significant and heavily impacts serving costs. This work ...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation

arXiv:2604.22783v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) has become the standard for adapting large language models (LLMs). In this work we challenge the widespread assumption that parameter efficiency equates to memory efficiency and on-device adaptability. We show that...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

An Intelligent Fault Diagnosis Method for General Aviation Aircraft Based on Multi-Fidelity Digital Twin and FMEA Knowledge Enhancement

arXiv:2604.22777v1 Announce Type: new Abstract: Fault diagnosis of general aviation aircraft faces challenges including scarce real fault data, diverse fault types, and weak fault signatures. This paper proposes an intelligent fault diagnosis framework based on multi-fidelity digital twin, integrat...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

PExA: Parallel Exploration Agent for Complex Text-to-SQL

arXiv:2604.22934v1 Announce Type: new Abstract: LLM-based agents for text-to-SQL often struggle with a latency-performance trade-off, where performance improvements come at the cost of latency or vice versa. We reformulate text-to-SQL generation through the lens of software test coverage where the ori...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

The Power of Power Law: Asymmetry Enables Compositional Reasoning

arXiv:2604.22951v1 Announce Type: new Abstract: Natural language data follows a power-law distribution, with most knowledge and skills appearing at very low frequency. While a common intuition suggests that reweighting or curating data towards a uniform distribution may help models better learn the...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

On the Existence of an Inverse Solution for Preference-Based Reductions in Argumentation

arXiv:2604.22958v1 Announce Type: new Abstract: Preference-based argumentation frameworks (PAFs) extend Dung's approach to abstract argumentation (AAFs) by encoding preferences over arguments. Such preferences control the transformation of attacks into defeats, and different approaches to doing so ...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Towards Causally Interpretable Wi-Fi CSI-Based Human Activity Recognition with Discrete Latent Compression and LTL Rule Extraction

arXiv:2604.22979v1 Announce Type: new Abstract: We address Human Activity Recognition (HAR) utilizing Wi-Fi Channel State Information (CSI) under the joint requirements of causal interpretability, symbolic controllability, and direct operation on high-dimensional raw signals. Deep neural models ach...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

StereoFoley: Object-Aware Stereo Audio Generation from Video

We present StereoFoley, a video-to-audio generation framework that produces semantically aligned, temporally synchronized, and spatially accurate stereo sound at 48 kHz. While recent generative video-to-audio models achieve strong semantic and temporal fidelity, they largely remain limited to mono o...

#Apple#On-device AI

Tool• Apr 28, 2026

Local Mechanisms of Compositional Generalization in Conditional Diffusion

Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization, the ability to gene...

#Apple#On-device AI

Tool• Apr 28, 2026

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Large Language Models (LLMs) demonstrate their reasoning ability through chain-of-thought (CoT) generation. However, LLM’s autoregressive decoding may limit the ability to revisit and refine earlier tokens in a holistic manner, which can also lead to inefficient exploration for diverse solutions. In...

#Apple#On-device AI

Tool• Apr 27, 2026

Announcing our partnership with the Republic of Korea

Google DeepMind and Korea partner to accelerate scientific breakthroughs using frontier AI models

#DeepMind#Google#AGI

Tool• Apr 27, 2026

Focus Session: Hardware and Software Techniques for Accelerating Multimodal Foundation Models

arXiv:2604.21952v1 Announce Type: new Abstract: This work presents a multi-layered methodology for efficiently accelerating multimodal foundation models (MFMs). It combines hardware and software co-design of transformer blocks with an optimization pipeline that reduces computational and memory requ...

#ArXiv#Machine Learning#Academic

Tool• Apr 27, 2026

Conditional anomaly detection using soft harmonic functions: An application to clinical alerting

arXiv:2604.21956v1 Announce Type: new Abstract: Timely detection of concerning events is an important problem in clinical practice. In this paper, we consider the problem of conditional anomaly detection that aims to identify data instances with an unusual response, such as the omission of an impor...

#ArXiv#Machine Learning#Academic

Tool• Apr 27, 2026

When Quotes Crumble: Detecting Transient Mechanical Liquidity Erosion in Limit Order Books

arXiv:2604.21993v1 Announce Type: new Abstract: We study the detection of transient liquidity erosion ("crumbling quotes") in electronic limit order books, where observable quote deterioration may reflect either mechanical liquidity withdrawal or informational repricing. Using the ABIDES agent-base...

#ArXiv#Machine Learning#Academic

Tool• Apr 27, 2026

Math Takes Two: A test for emergent mathematical reasoning in communication

arXiv:2604.21935v1 Announce Type: new Abstract: Although language models demonstrate remarkable proficiency on mathematical benchmarks, it remains unclear whether this reflects true mathematical reasoning or statistical pattern matching over learning formal syntax. Most existing evaluations rely on...

#ArXiv#Machine Learning#Academic
