AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Feb 25, 2026

DMCD: Semantic-Statistical Framework for Causal Discovery

arXiv:2602.20333v1 Announce Type: new Abstract: We present DMCD (DataMap Causal Discovery), a two-phase causal discovery framework that integrates LLM-based semantic drafting from variable metadata with statistical validation on observational data. In Phase I, a large language model proposes a spar...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

Implicit Intelligence -- Evaluating Agents on What Users Don't Say

arXiv:2602.20424v1 Announce Type: new Abstract: Real-world requests to AI agents are fundamentally underspecified. Natural human communication relies on shared context and unstated constraints that speakers expect listeners to infer. Current agentic benchmarks test explicit instruction-following bu...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

Diffusion Modulation via Environment Mechanism Modeling for Planning

arXiv:2602.20422v1 Announce Type: new Abstract: Diffusion models have shown promising capabilities in trajectory generation for planning in offline reinforcement learning (RL). However, conventional diffusion-based planning methods often fail to account for the fact that generating trajectories in ...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

arXiv:2602.20197v1 Announce Type: new Abstract: Reinforcement Learning with verifiable rewards (RLVR) has emerged as a primary learning paradigm for enhancing the reasoning capabilities of multi-modal large language models (MLLMs). However, during RL training, the enormous state space of MLLM and s...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

arXiv:2602.20175v1 Announce Type: new Abstract: We present an application of the tensor network generator-enhanced optimization (TN-GEO) framework to address the traveling salesman problem (TSP), a fundamental combinatorial optimization challenge. Our approach employs a tensor network Born machine ...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs

arXiv:2602.20191v1 Announce Type: new Abstract: Changing runtime complexity on cloud and edge devices necessitates elastic large language model (LLM) deployment, where an LLM can be inferred with various quantization precisions based on available computational resources. However, it has been observ...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

FedAvg-Based CTMC Hazard Model for Federated Bridge Deterioration Assessment

arXiv:2602.20194v1 Announce Type: new Abstract: Bridge periodic inspection records contain sensitive information about public infrastructure, making cross-organizational data sharing impractical under existing data governance constraints. We propose a federated framework for estimating a Continuous...

#ArXiv#Machine Learning#Academic

Tool• Feb 25, 2026

IMOVNO+: A Regional Partitioning and Meta-Heuristic Ensemble Framework for Imbalanced Multi-Class Learning

arXiv:2602.20199v1 Announce Type: new Abstract: Class imbalance, overlap, and noise degrade data quality, reduce model reliability, and limit generalization. Although widely studied in binary classification, these issues remain underexplored in multi-class settings, where complex inter-class relati...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning

arXiv:2602.18493v1 Announce Type: new Abstract: Long-context LLMs and Retrieval-Augmented Generation (RAG) systems process information passively, deferring state tracking, contradiction resolution, and evidence aggregation to query time, which becomes brittle under ultra long streams with frequent ...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Physiologically Informed Deep Learning: A Multi-Scale Framework for Next-Generation PBPK Modeling

arXiv:2602.18472v1 Announce Type: new Abstract: Physiologically Based Pharmacokinetic (PBPK) modeling is a cornerstone of model-informed drug development (MIDD), providing a mechanistic framework to predict drug absorption, distribution, metabolism, and excretion (ADME). Despite its utility, adopti...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Hierarchical Reward Design from Language: Enhancing Alignment of Agent Behavior with Human Specifications

arXiv:2602.18582v1 Announce Type: new Abstract: When training artificial intelligence (AI) to perform tasks, humans often care not only about whether a task is completed but also how it is performed. As AI agents tackle increasingly complex tasks, aligning their behavior with human-provided specifi...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Feedback-based Automated Verification in Vibe Coding of CAS Adaptation Built on Constraint Logic

arXiv:2602.18607v1 Announce Type: new Abstract: In CAS adaptation, a challenge is to define the dynamic architecture of the system and changes in its behavior. Implementation-wise, this is projected into an adaptation mechanism, typically realized as an Adaptation Manager (AM). With the advances of...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Decoding ML Decision: An Agentic Reasoning Framework for Large-Scale Ranking System

arXiv:2602.18640v1 Announce Type: new Abstract: Modern large-scale ranking systems operate within a sophisticated landscape of competing objectives, operational constraints, and evolving product requirements. Progress in this domain is increasingly bottlenecked by the engineering context constraint...

#ArXiv#Machine Learning#Academic

Tool• Feb 24, 2026

Spilled Energy in Large Language Models

arXiv:2602.18671v1 Announce Type: new Abstract: We reinterpret the final Large Language Model (LLM) softmax classifier as an Energy-Based Model (EBM), decomposing the sequence-to-sequence probability chain into multiple interacting EBMs at inference. This principled approach allows us to track "ene...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

The Token Games: Evaluating Language Model Reasoning with Puzzle Duels

arXiv:2602.17831v1 Announce Type: new Abstract: Evaluating the reasoning capabilities of Large Language Models is increasingly challenging as models improve. Human curation of hard questions is highly expensive, especially in recent benchmarks using PhD-level domain knowledge to challenge the most ...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

Duality Models: An Embarrassingly Simple One-step Generation Paradigm

arXiv:2602.17682v1 Announce Type: new Abstract: Consistency-based generative models like Shortcut and MeanFlow achieve impressive results via a target-aware design for solving the Probability Flow ODE (PF-ODE). Typically, such methods introduce a target time $r$ alongside the current time $t$ to mo...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

Reducing Text Bias in Synthetically Generated MCQAs for VLMs in Autonomous Driving

arXiv:2602.17677v1 Announce Type: new Abstract: Multiple Choice Question Answering (MCQA) benchmarks are an established standard for measuring Vision Language Model (VLM) performance in driving tasks. However, we observe the known phenomenon that synthetically generated MCQAs are highly susceptible...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

arXiv:2602.17681v1 Announce Type: new Abstract: Post-training quantization (PTQ) is a widely used approach for reducing the memory and compute costs of large language models (LLMs). Recent studies have shown that applying invertible transformations to activations can significantly improve quantizat...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

Joint Parameter and State-Space Bayesian Optimization: Using Process Expertise to Accelerate Manufacturing Optimization

arXiv:2602.17679v1 Announce Type: new Abstract: Bayesian optimization (BO) is a powerful method for optimizing black-box manufacturing processes, but its performance is often limited when dealing with high-dimensional multi-stage systems, where we can observe intermediate outputs. Standard BO model...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

El Agente Gráfico: Structured Execution Graphs for Scientific Agents

arXiv:2602.17902v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to automate scientific workflows, yet their integration with heterogeneous computational tools remains ad hoc and fragile. Current agentic approaches often rely on unstructured text to manage context ...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems

arXiv:2602.17910v1 Announce Type: new Abstract: Traditional AI alignment primarily focuses on individual model outputs; however, autonomous agents in long-horizon workflows require sustained reliability across entire interaction trajectories. We introduce APEMO (Affect-aware Peak-End Modulation for...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

Epistemic Traps: Rational Misalignment Driven by Model Misspecification

arXiv:2602.17676v1 Announce Type: new Abstract: The rapid deployment of Large Language Models and AI agents across critical societal and technical domains is hindered by persistent behavioral pathologies including sycophancy, hallucination, and strategic deception that resist mitigation via reinfor...

#ArXiv#Machine Learning#Academic

Tool• Feb 23, 2026

BioBridge: Bridging Proteins and Language for Enhanced Biological Reasoning with LLMs

arXiv:2602.17680v1 Announce Type: new Abstract: Existing Protein Language Models (PLMs) often suffer from limited adaptability to multiple tasks and exhibit poor generalization across diverse biological contexts. In contrast, general-purpose Large Language Models (LLMs) lack the capability to inter...

#ArXiv#Machine Learning#Academic

Tool• Feb 20, 2026

Quantifying LLM Attention-Head Stability: Implications for Circuit Universality

arXiv:2602.16740v1 Announce Type: new Abstract: In mechanistic interpretability, recent work scrutinizes transformer "circuits" - sparse, mono or multi layer sub computations, that may reflect human understandable functions. Yet, these network circuits are rarely acid-tested for their stability acr...

#ArXiv#Machine Learning#Academic
