AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool • Mar 2, 2026

Detoxifying LLMs via Representation Erasure-Based Preference Optimization

arXiv:2602.23391v1 Announce Type: new Abstract: Large language models (LLMs) trained on webscale data can produce toxic outputs, raising concerns for safe deployment. Prior defenses, based on applications of DPO, NPO, and similar algorithms, reduce the likelihood of harmful continuations, but not r...

#ArXiv #Machine Learning #Academic

Tool • Mar 2, 2026

U-CAN: Utility-Aware Contrastive Attenuation for Efficient Unlearning in Generative Recommendation

arXiv:2602.23400v1 Announce Type: new Abstract: Generative Recommendation (GenRec) typically leverages Large Language Models (LLMs) to redefine personalization as an instruction-driven sequence generation task. However, fine-tuning on user logs inadvertently encodes sensitive attributes into model ...

#ArXiv #Machine Learning #Academic

Tool • Mar 2, 2026

HumanMCP: A Human-Like Query Dataset for Evaluating MCP Tool Retrieval Performance

arXiv:2602.23367v1 Announce Type: new Abstract: Model Context Protocol (MCP) servers contain a collection of thousands of open-source standardized tools, linking LLMs to external systems; however, existing datasets and benchmarks lack realistic, human-like user queries, remaining a critical gap in ...

#ArXiv #Machine Learning #Academic

Tool • Mar 2, 2026

An Agentic LLM Framework for Adverse Media Screening in AML Compliance

arXiv:2602.23373v1 Announce Type: new Abstract: Adverse media screening is a critical component of anti-money laundering (AML) and know-your-customer (KYC) compliance processes in financial institutions. Traditional approaches rely on keyword-based searches that generate high false-positive rates o...

#ArXiv #Machine Learning #Academic

Tool • Mar 2, 2026

Planning under Distribution Shifts with Causal POMDPs

arXiv:2602.23545v1 Announce Type: new Abstract: In the real world, planning is often challenged by distribution shifts. As such, a model of the environment obtained under one set of conditions may no longer remain valid as the distribution of states or the environment dynamics change, which in turn...

#ArXiv #Machine Learning #Academic

Tool • Mar 2, 2026

Brain-OF: An Omnifunctional Foundation Model for fMRI, EEG and MEG

arXiv:2602.23410v1 Announce Type: new Abstract: Brain foundation models have achieved remarkable advances across a wide range of neuroscience tasks. However, most existing models are limited to a single functional modality, restricting their ability to exploit complementary spatiotemporal dynamics ...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Multi-Level Causal Embeddings

arXiv:2602.22287v1 Announce Type: new Abstract: Abstractions of causal models allow for the coarsening of models such that relations of cause and effect are preserved. Whereas abstractions focus on the relation between two models, in this paper we study a framework for causal embeddings which enabl...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

FIRE: A Comprehensive Benchmark for Financial Intelligence and Reasoning Evaluation

arXiv:2602.22273v1 Announce Type: new Abstract: We introduce FIRE, a comprehensive benchmark designed to evaluate both the theoretical financial knowledge of LLMs and their ability to handle practical business scenarios. For theoretical assessment, we curate a diverse set of examination questions d...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents

arXiv:2602.22302v1 Announce Type: new Abstract: Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural language instructions with no formal behavioral specification. This gap is th...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation

arXiv:2602.22215v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate potential in the field of scientific idea generation. However, the generated results often lack controllable academic context and traceable inspiration pathways. To bridge this gap, this paper proposes a scient...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?

arXiv:2602.22401v1 Announce Type: new Abstract: AI agents -- systems that execute multi-step reasoning workflows with persistent state, tool access, and specialist skills -- represent a qualitative shift from prior automation technologies in social science. Unlike chatbots that respond to isolated ...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Improving Spatial Allocation for Energy System Coupling with Graph Neural Networks

arXiv:2602.22249v1 Announce Type: new Abstract: In energy system analysis, coupling models with mismatched spatial resolutions is a significant challenge. A common solution is assigning weights to high-resolution geographic units for aggregation, but traditional models are limited by using only a s...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

Zatom-1: A Multimodal Flow Foundation Model for 3D Molecules and Materials

arXiv:2602.22251v1 Announce Type: new Abstract: General-purpose 3D chemical modeling encompasses molecules and materials, requiring both generative and predictive capabilities. However, most existing AI approaches are optimized for a single domain (molecules or materials) and a single task (generat...

#ArXiv #Machine Learning #Academic

Tool • Feb 27, 2026

To Deceive is to Teach? Forging Perceptual Robustness via Adversarial Reinforcement Learning

arXiv:2602.22227v1 Announce Type: new Abstract: Despite their impressive capabilities, Multimodal Large Language Models (MLLMs) exhibit perceptual fragility when confronted with visually complex scenes. This weakness stems from a reliance on finite training datasets, which are prohibitively expensi...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives

arXiv:2602.21351v1 Announce Type: new Abstract: The rapid accumulation of Earth science data has created a significant scalability challenge; while repositories like PANGAEA host vast collections of datasets, citation metrics indicate that a substantial portion remains underutilized, limiting data ...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

ACAR: Adaptive Complexity Routing for Multi-Model Ensembles with Auditable Decision Traces

arXiv:2602.21231v1 Announce Type: new Abstract: We present ACAR (Adaptive Complexity and Attribution Routing), a measurement framework for studying multi-model orchestration under auditable conditions. ACAR uses self-consistency variance (sigma) computed from N=3 probe samples to route tasks across...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

arXiv:2602.21233v1 Announce Type: new Abstract: This technical report introduces AngelSlim, a comprehensive and versatile toolkit for large model compression developed by the Tencent Hunyuan team. By consolidating cutting-edge algorithms, including quantization, speculative decoding, token pruning,...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

Group Orthogonalized Policy Optimization: Group Policy Optimization as Orthogonal Projection in Hilbert Space

arXiv:2602.21269v1 Announce Type: new Abstract: We present Group Orthogonalized Policy Optimization (GOPO), a new alignment algorithm for large language models derived from the geometry of Hilbert function spaces. Instead of optimizing on the probability simplex and inheriting the exponential curva...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

Latent Context Compilation: Distilling Long Context into Compact Portable Memory

arXiv:2602.21221v1 Announce Type: new Abstract: Efficient long-context LLM deployment is stalled by a dichotomy between amortized compression, which struggles with out-of-distribution generalization, and Test-Time Training, which incurs prohibitive synthetic data costs and requires modifying model ...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

A Dynamic Survey of Soft Set Theory and Its Extensions

arXiv:2602.21268v1 Announce Type: new Abstract: Soft set theory provides a direct framework for parameterized decision modeling by assigning to each attribute (parameter) a subset of a given universe, thereby representing uncertainty in a structured way [1, 2]. Over the past decades, the theory has...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

arXiv:2602.21534v1 Announce Type: new Abstract: Agentic reinforcement learning (ARL) has rapidly gained attention as a promising paradigm for training agents to solve complex, multi-step interactive tasks. Despite encouraging early results, ARL remains highly unstable, often leading to training col...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

Beyond Refusal: Probing the Limits of Agentic Self-Correction for Semantic Sensitive Information

arXiv:2602.21496v1 Announce Type: new Abstract: While defenses for structured PII are mature, Large Language Models (LLMs) pose a new threat: Semantic Sensitive Information (SemSI), where models infer sensitive identity attributes, generate reputation-harmful content, or hallucinate potentially wro...

#ArXiv #Machine Learning #Academic

Tool • Feb 26, 2026

Power and Limitations of Aggregation in Compound AI Systems

arXiv:2602.21556v1 Announce Type: new Abstract: When designing compound AI systems, a common approach is to query multiple copies of the same model and aggregate the responses to produce a synthesized output. Given the homogeneity of these models, this raises the question of whether aggregation unl...

#ArXiv #Machine Learning #Academic

Tool • Feb 25, 2026

Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

arXiv:2602.20175v1 Announce Type: new Abstract: We present an application of the tensor network generator-enhanced optimization (TN-GEO) framework to address the traveling salesman problem (TSP), a fundamental combinatorial optimization challenge. Our approach employs a tensor network Born machine ...

#ArXiv #Machine Learning #Academic
