AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Feb 17, 2026

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

arXiv:2602.13214v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in interactive environments requiring strategic decision-making, yet systematic evaluation of these capabilities remains challenging. Existing benchmarks for LLMs primarily assess static reasoning...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

arXiv:2602.13215v1 Announce Type: new Abstract: Transformers allocate uniform computation to every position, regardless of difficulty. State Space Models (SSMs) offer efficient alternatives but struggle with precise information retrieval over a long horizon. Inspired by dual-process theories of cog...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

arXiv:2602.13218v1 Announce Type: new Abstract: Scaling verifiable training signals remains a key bottleneck for Reinforcement Learning from Verifiable Rewards (RLVR). Logical reasoning is a natural substrate: constraints are formal and answers are programmatically checkable. However, prior synthes...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

VeRA: Verified Reasoning Data Augmentation at Scale

arXiv:2602.13217v1 Announce Type: new Abstract: The main issue with most evaluation schemes today is their "static" nature: the same problems are reused repeatedly, allowing for memorization, format exploitation, and eventual saturation. To measure genuine AI progress, we need evaluation that is ro...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

Exploring the Performance of ML/DL Architectures on the MNIST-1D Dataset

arXiv:2602.13348v1 Announce Type: new Abstract: Small datasets like MNIST have historically been instrumental in advancing machine learning research by providing a controlled environment for rapid experimentation and model evaluation. However, their simplicity often limits their utility for disting...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

arXiv:2602.13359v1 Announce Type: new Abstract: Machine learning models excel with abundant annotated data, but annotation is often costly and time-intensive. Active learning (AL) aims to improve the performance-to-annotation ratio by using query methods (QMs) to iteratively select the most informa...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

arXiv:2602.13345v1 Announce Type: new Abstract: Decades of engineering drawings and technical records remain locked in legacy archives with inconsistent or missing metadata, making retrieval difficult and often manual. We present Blueprint, a layout-aware multimodal retrieval system designed for la...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

arXiv:2602.13264v1 Announce Type: new Abstract: In the critical task of making generative models trustworthy and robust, methods for Uncertainty Quantification (UQ) have begun to show encouraging potential. However, many of these methods rely on rigid heuristics that fail to generalize across tasks...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

Accelerated Discovery of Cryoprotectant Cocktails via Multi-Objective Bayesian Optimization

arXiv:2602.13398v1 Announce Type: new Abstract: Designing cryoprotectant agent (CPA) cocktails for vitrification is challenging because formulations must be concentrated enough to suppress ice formation yet non-toxic enough to preserve cell viability. This tradeoff creates a large, multi-objective ...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

Evolving Beyond Snapshots: Harmonizing Structure and Sequence via Entity State Tuning for Temporal Knowledge Graph Forecasting

arXiv:2602.12389v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) forecasting requires predicting future facts by jointly modeling structural dependencies within each snapshot and temporal evolution across snapshots. However, most existing methods are stateless: they recompute entity r...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Theory

arXiv:2602.12316v1 Announce Type: new Abstract: Frontier AI systems are increasingly capable and deployed in high-stakes multi-agent environments. However, existing AI safety benchmarks largely evaluate single agents, leaving multi-agent risks such as coordination failure and conflict poorly unders...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

A Theoretical Framework for Adaptive Utility-Weighted Benchmarking

arXiv:2602.12356v1 Announce Type: new Abstract: Benchmarking has long served as a foundational practice in machine learning and, increasingly, in modern AI systems such as large language models, where shared tasks, metrics, and leaderboards offer a common basis for measuring progress and comparing ...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

Intent-Driven Smart Manufacturing Integrating Knowledge Graphs and Large Language Models

arXiv:2602.12419v1 Announce Type: new Abstract: The increasing complexity of smart manufacturing environments demands interfaces that can translate high-level human intents into machine-executable actions. This paper presents a unified framework that integrates instruction-tuned Large Language Mode...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation

arXiv:2602.12544v1 Announce Type: new Abstract: We present a scalable pipeline for automatically generating high-quality training data for web agents. In particular, a major challenge in identifying high-quality training instances is trajectory evaluation - quantifying how much progress was made to...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

The Appeal and Reality of Recycling LoRAs with Adaptive Merging

arXiv:2602.12323v1 Announce Type: new Abstract: The widespread availability of fine-tuned LoRA modules for open pre-trained models has led to an interest in methods that can adaptively merge LoRAs to improve performance. These methods typically include some way of selecting LoRAs from a pool and tu...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

Intrinsic Credit Assignment for Long Horizon Interaction

arXiv:2602.12342v1 Announce Type: new Abstract: How can we train agents to navigate uncertainty over long horizons? In this work, we propose ΔBelief-RL, which leverages a language model's own intrinsic beliefs to reward intermediate progress. Our method utilizes the change in the probability...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

arXiv:2602.12305v1 Announce Type: new Abstract: Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize functionally correct...

#ArXiv#Machine Learning#Academic

Tool• Feb 16, 2026

Abstractive Red-Teaming of Language Model Character

arXiv:2602.12318v1 Announce Type: new Abstract: We want language model assistants to conform to a character specification, which asserts how the model should act across diverse user interactions. While models typically follow these character specifications, they can occasionally violate them in lar...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

The PBSAI Governance Ecosystem: A Multi-Agent AI Reference Architecture for Securing Enterprise AI Estates

arXiv:2602.11301v1 Announce Type: new Abstract: Enterprises are rapidly deploying large language models, retrieval-augmented generation pipelines, and tool-using agents into production, often on shared high-performance computing clusters and cloud accelerator platforms that also support defensive a...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

GAC-KAN: An Ultra-Lightweight GNSS Interference Classifier for GenAI-Powered Consumer Edge Devices

arXiv:2602.11186v1 Announce Type: new Abstract: The integration of Generative AI (GenAI) into Consumer Electronics (CE)--from AI-powered assistants in wearables to generative planning in autonomous Uncrewed Aerial Vehicles (UAVs)--has revolutionized user experiences. However, these GenAI applicatio...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

Spectra: Rethinking Optimizers for LLMs Under Spectral Anisotropy

arXiv:2602.11185v1 Announce Type: new Abstract: Gradient signals in LLM training are highly anisotropic: recurrent linguistic structure concentrates energy into a small set of dominant spectral directions, while context specific information resides in a long tail. We show that this spike tail separ...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

Automated Optimization Modeling via a Localizable Error-Driven Perspective

arXiv:2602.11164v1 Announce Type: new Abstract: Automated optimization modeling via Large Language Models (LLMs) has emerged as a promising approach to assist complex human decision-making. While post-training has become a pivotal technique to enhance LLMs' capabilities in this domain, its effectiv...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

Voxtral Realtime

arXiv:2602.11298v1 Announce Type: new Abstract: We introduce Voxtral Realtime, a natively streaming automatic speech recognition model that matches offline transcription quality at sub-second latency. Unlike approaches that adapt offline models through chunking or sliding windows, Voxtral Realtime ...

#ArXiv#Machine Learning#Academic

Tool• Feb 13, 2026

TDPNavigator-Placer: Thermal- and Wirelength-Aware Chiplet Placement in 2.5D Systems Through Multi-Agent Reinforcement Learning

arXiv:2602.11187v1 Announce Type: new Abstract: The rapid growth of electronics has accelerated the adoption of 2.5D integrated circuits, where effective automated chiplet placement is essential as systems scale to larger and more heterogeneous chiplet assemblies. Existing placement methods typical...

#ArXiv#Machine Learning#Academic
