AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Apr 30, 2026

OMEGA: Optimizing Machine Learning by Evaluating Generated Algorithms

arXiv:2604.26211v1 Announce Type: new Abstract: In order to automate AI research we introduce a full, end-to-end framework, OMEGA: Optimizing Machine learning by Evaluating Generated Algorithms, that starts at idea generation and ends with executable code. Our system combines structured meta-prompt...

#ArXiv#Machine Learning#Academic

Tool• Apr 30, 2026

Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields

arXiv:2604.26095v1 Announce Type: new Abstract: Closed-loop inverse source localization and characterization (ISLC) requires a mobile agent to select measurements that localize sources and infer latent field parameters under strict time constraints. The core challenge lies in the belief-space ob...

#ArXiv#Machine Learning#Academic

Tool• Apr 30, 2026

Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

arXiv:2604.26091v1 Announce Type: new Abstract: We study reliability in autonomous language-model agents that translate user mandates into validated tool actions under real capital. The setting is DX Terminal Pro, a 21-day deployment in which 3,505 user-funded agents traded real ETH in a bounded on...

#ArXiv#Machine Learning#Academic

Tool• Apr 30, 2026

Hierarchical Multi-Persona Induction from User Behavioral Logs: Learning Evidence-Grounded and Truthful Personas

arXiv:2604.26120v1 Announce Type: new Abstract: Behavioral logs provide rich signals for user modeling, but are noisy and interleaved across diverse intents. Recent work uses LLMs to generate interpretable natural-language personas from user logs, yet evaluation often emphasizes downstream utility,...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Liquid Neural Network Models for Natural Gas Spot Price Time-Series Forecasting

arXiv:2604.24788v1 Announce Type: new Abstract: Natural gas is undoubtedly an essential component of the global energy system. Accurate short-term forecasting of natural gas price is challenging due to pronounced volatility driven by seasonal demand patterns, geopolitical developments, and shifting...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Architecture Determines Observability in Transformers

arXiv:2604.24801v1 Announce Type: new Abstract: Autoregressive transformers make confident errors, but activation monitoring can catch them only if the model preserves an internal signal that output confidence does not expose. This preservation is determined by architecture and training recipe. We ...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

GCA-BULF: A Bottom-Up Framework for Short-Term Load Forecasting Using Grouped Critical Appliances

arXiv:2604.24766v1 Announce Type: new Abstract: With the rise of time-of-use and tiered electricity pricing, energy consumers are encouraged to adopt peak-shifting strategies by automatically controlling high-power appliances. These help lower energy costs while enhancing the power grid's stability...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

S-SONDO: Self-Supervised Knowledge Distillation for General Audio Foundation Models

arXiv:2604.24933v1 Announce Type: new Abstract: General audio foundation models have recently achieved remarkable progress, enabling strong performance across diverse tasks. However, state-of-the-art models remain extremely large, often with hundreds of millions of parameters, leading to high infer...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Adaptive Prompt Embedding Optimization for LLM Jailbreaking

arXiv:2604.24983v1 Announce Type: new Abstract: Existing white-box jailbreak attacks against aligned LLMs typically append discrete adversarial suffixes to the user prompt, which visibly alters the prompt and operates in a combinatorial token space. Prior work has avoided directly optimizing the em...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate

arXiv:2604.24881v1 Announce Type: new Abstract: Multi-agent debate has been shown to improve reasoning in large language models (LLMs). However, it is compute-intensive, requiring generation of long transcripts before answering questions. To address this inefficiency, we develop a framework that di...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Co-Director: Agentic Generative Video Storytelling

arXiv:2604.24842v1 Announce Type: new Abstract: While diffusion models generate high-fidelity video clips, transforming them into coherent storytelling engines remains challenging. Current agentic pipelines automate this via chained modules but suffer from semantic drift and cascading failures due ...

#ArXiv#Machine Learning#Academic

Tool• Apr 29, 2026

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

arXiv:2604.24987v1 Announce Type: new Abstract: Chart-to-table translation converts chart images into structured tabular data. Accurate translation is crucial for a Multimodal Language Model (MLM) to answer complex queries. We observe imbalances in the number of images across different aspects of the...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

KARL: Mitigating Hallucinations in LLMs via Knowledge-Boundary-Aware Reinforcement Learning

arXiv:2604.22779v1 Announce Type: new Abstract: Enabling large language models (LLMs) to appropriately abstain from answering questions beyond their knowledge is crucial for mitigating hallucinations. While existing reinforcement learning methods foster autonomous abstention, they often compromise ...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation

arXiv:2604.22783v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) has become the standard for adapting large language models (LLMs). In this work we challenge the widespread assumption that parameter efficiency equates to memory efficiency and on-device adaptability. We show that...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

BiTA: Bidirectional Gated Recurrent Unit-Transformer Aggregator in a Temporal Graph Network Framework for Alert Prediction in Computer Networks

arXiv:2604.22781v1 Announce Type: new Abstract: Proactive alert prediction in computer networks is critical for mitigating evolving cyber threats and enabling timely defensive actions. Temporal Graph Neural Networks (TGNs) provide a principled framework for modeling time-evolving interactions; howe...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing

arXiv:2604.22782v1 Announce Type: new Abstract: Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during autoregressive generation. The memory footprint of KV caching is significant and heavily impacts serving costs. This work ...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

The Spectral Lifecycle of Transformer Training: Transient Compression Waves, Persistent Spectral Gradients, and the Q/K–V Asymmetry

arXiv:2604.22778v1 Announce Type: new Abstract: We present the first systematic study of weight matrix singular value spectra during transformer pretraining, tracking full SVD decompositions of every weight matrix at 25-step intervals across three model scales (30M–285M parameters). We disc...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

PExA: Parallel Exploration Agent for Complex Text-to-SQL

arXiv:2604.22934v1 Announce Type: new Abstract: LLM-based agents for text-to-SQL often struggle with a latency-performance trade-off, where performance improvements come at the cost of latency or vice versa. We reformulate text-to-SQL generation through the lens of software test coverage where the ori...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

An Intelligent Fault Diagnosis Method for General Aviation Aircraft Based on Multi-Fidelity Digital Twin and FMEA Knowledge Enhancement

arXiv:2604.22777v1 Announce Type: new Abstract: Fault diagnosis of general aviation aircraft faces challenges including scarce real fault data, diverse fault types, and weak fault signatures. This paper proposes an intelligent fault diagnosis framework based on multi-fidelity digital twin, integrat...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

Towards Causally Interpretable Wi-Fi CSI-Based Human Activity Recognition with Discrete Latent Compression and LTL Rule Extraction

arXiv:2604.22979v1 Announce Type: new Abstract: We address Human Activity Recognition (HAR) utilizing Wi-Fi Channel State Information (CSI) under the joint requirements of causal interpretability, symbolic controllability, and direct operation on high-dimensional raw signals. Deep neural models ach...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

The Power of Power Law: Asymmetry Enables Compositional Reasoning

arXiv:2604.22951v1 Announce Type: new Abstract: Natural language data follows a power-law distribution, with most knowledge and skills appearing at very low frequency. While a common intuition suggests that reweighting or curating data towards a uniform distribution may help models better learn the...

#ArXiv#Machine Learning#Academic

Tool• Apr 28, 2026

On the Existence of an Inverse Solution for Preference-Based Reductions in Argumentation

arXiv:2604.22958v1 Announce Type: new Abstract: Preference-based argumentation frameworks (PAFs) extend Dung's approach to abstract argumentation (AAFs) by encoding preferences over arguments. Such preferences control the transformation of attacks into defeats, and different approaches to doing so ...

#ArXiv#Machine Learning#Academic

Tool• Apr 27, 2026

Conditional anomaly detection using soft harmonic functions: An application to clinical alerting

arXiv:2604.21956v1 Announce Type: new Abstract: Timely detection of concerning events is an important problem in clinical practice. In this paper, we consider the problem of conditional anomaly detection that aims to identify data instances with an unusual response, such as the omission of an impor...

#ArXiv#Machine Learning#Academic

Tool• Apr 27, 2026

Focus Session: Hardware and Software Techniques for Accelerating Multimodal Foundation Models

arXiv:2604.21952v1 Announce Type: new Abstract: This work presents a multi-layered methodology for efficiently accelerating multimodal foundation models (MFMs). It combines hardware and software co-design of transformer blocks with an optimization pipeline that reduces computational and memory requ...

#ArXiv#Machine Learning#Academic
