AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• Feb 20, 2026

Contextuality from Single-State Representations: An Information-Theoretic Principle for Adaptive Intelligence

arXiv:2602.16716v1 Announce Type: new Abstract: Adaptive systems often operate across multiple contexts while reusing a fixed internal state space due to constraints on memory, representation, or physical resources. Such single-state reuse is ubiquitous in natural and artificial intelligence, yet i...

#ArXiv#Machine Learning#Academic

Tool• Feb 20, 2026

Mobility-Aware Cache Framework for Scalable LLM-Based Human Mobility Simulation

arXiv:2602.16727v1 Announce Type: new Abstract: Large-scale human mobility simulation is critical for applications such as urban planning, epidemiology, and transportation analysis. Recent works treat large language models (LLMs) as human agents to simulate realistic mobility behaviors using struct...

#ArXiv#Machine Learning#Academic

Tool• Feb 20, 2026

DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning

arXiv:2602.16742v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has been shown effective in enhancing the visual reflection and reasoning capabilities of Large Multimodal Models (LMMs). However, existing datasets are predominantly derived from either small-scal...

#ArXiv#Machine Learning#Academic

Tool• Feb 20, 2026

Quantifying LLM Attention-Head Stability: Implications for Circuit Universality

arXiv:2602.16740v1 Announce Type: new Abstract: In mechanistic interpretability, recent work scrutinizes transformer "circuits" - sparse, mono or multi layer sub computations, that may reflect human understandable functions. Yet, these network circuits are rarely acid-tested for their stability acr...

#ArXiv#Machine Learning#Academic

Tool• Feb 20, 2026

A Few-Shot LLM Framework for Extreme Day Classification in Electricity Markets

arXiv:2602.16735v1 Announce Type: new Abstract: This paper proposes a few-shot classification framework based on Large Language Models (LLMs) to predict whether the next day will have spikes in real-time electricity prices. The approach aggregates system state information, including electricity dem...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Improving Interactive In-Context Learning from Natural Language Feedback

arXiv:2602.16066v1 Announce Type: new Abstract: Adapting one's thought process based on corrective feedback is an essential ability in human learning, particularly in collaborative settings. In contrast, the current large language model training paradigm relies heavily on modeling vast, static corp...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

arXiv:2602.16039v1 Announce Type: new Abstract: The rapid rise of large language models (LLMs) is reshaping the landscape of automatic assessment in education. While these systems demonstrate substantial advantages in adaptability to diverse question types and flexibility in output formats, they al...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination

arXiv:2602.16050v1 Announce Type: new Abstract: Background: Large language models have demonstrated strong performance on general medical examinations, but subspecialty clinical reasoning remains challenging due to rapidly evolving guidelines and nuanced evidence hierarchies. Methods: We evaluated ...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

arXiv:2602.16037v1 Announce Type: new Abstract: Autonomous agentic workflows that iteratively refine their own behavior hold considerable promise, yet their failure modes remain poorly characterized. We investigate optimization instability, a phenomenon in which continued autonomous improvement par...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

A Koopman-Bayesian Framework for High-Fidelity, Perceptually Optimized Haptic Surgical Simulation

arXiv:2602.15834v1 Announce Type: new Abstract: We introduce a unified framework that combines nonlinear dynamics, perceptual psychophysics and high frequency haptic rendering to enhance realism in surgical simulation. The interaction of the surgical device with soft tissue is elevated to an augmen...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Genetic Generalized Additive Models

arXiv:2602.15877v1 Announce Type: new Abstract: Generalized Additive Models (GAMs) balance predictive accuracy and interpretability, but manually configuring their structure is challenging. We propose using the multi-objective genetic algorithm NSGA-II to automatically optimize GAMs, jointly minimi...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Memes-as-Replies: Can Models Select Humorous Manga Panel Responses?

arXiv:2602.15842v1 Announce Type: new Abstract: Memes are a popular element of modern web communication, used not only as static artifacts but also as interactive replies within conversations. While computational research has focused on analyzing the intrinsic properties of memes, the dynamic and c...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation

arXiv:2602.15878v1 Announce Type: new Abstract: In industrial scenarios, data augmentation is an effective approach to improve model performance. However, its benefits are not unidirectionally beneficial. There is no theoretical research or established estimation for the optimal sample size (OSS) i...

#ArXiv#Machine Learning#Academic

Tool• Feb 19, 2026

Kalman-Inspired Runtime Stability and Recovery in Hybrid Reasoning Systems

arXiv:2602.15855v1 Announce Type: new Abstract: Hybrid reasoning systems that combine learned components with model-based inference are increasingly deployed in tool-augmented decision loops, yet their runtime behavior under partial observability and sustained evidence mismatch remains poorly under...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survival prognosis

arXiv:2602.15067v1 Announce Type: new Abstract: Gliomas, among the most common primary brain tumors, vary widely in aggressiveness, prognosis, and histology, making treatment challenging due to complex and time-intensive surgical interventions. This study presents an Attention-Gated Recurrent Resid...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields

arXiv:2602.15155v1 Announce Type: new Abstract: Implicit Neural Representations (INRs) have emerged as promising surrogates for large 3D scientific simulations due to their ability to continuously model spatial and conditional fields, yet they face a critical fidelity-speed dilemma: deep MLPs suffe...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

PolyNODE: Variable-dimension Neural ODEs on M-polyfolds

arXiv:2602.15128v1 Announce Type: new Abstract: Neural ordinary differential equations (NODEs) are geometric deep learning models based on dynamical systems and flows generated by vector fields on manifolds. Despite numerous successful applications, particularly within the flow matching paradigm, a...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Near-Optimal Sample Complexity for Online Constrained MDPs

arXiv:2602.15076v1 Announce Type: new Abstract: Safety is a fundamental challenge in reinforcement learning (RL), particularly in real-world applications such as autonomous driving, robotics, and healthcare. To address this, Constrained Markov Decision Processes (CMDPs) are commonly used to enforce...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction

arXiv:2602.15089v1 Announce Type: new Abstract: In predictive maintenance of equipment, deep learning-based time series anomaly detection has garnered significant attention; however, pure deep learning approaches often fail to achieve sufficient accuracy on real-world data. This study proposes a hy...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

arXiv:2602.15112v1 Announce Type: new Abstract: We introduce ResearchGym, a benchmark and execution environment for evaluating AI agents on end-to-end research. To instantiate this, we repurpose five oral and spotlight papers from ICML, ICLR, and ACL. From each paper's repository, we preserve the d...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Protecting Language Models Against Unauthorized Distillation through Trace Rewriting

arXiv:2602.15143v1 Announce Type: new Abstract: Knowledge distillation is a widely adopted technique for transferring capabilities from LLMs to smaller, more efficient student models. However, unauthorized use of knowledge distillation takes unfair advantage of the considerable effort and cost put ...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

Panini: Continual Learning in Token Space via Structured Memory

arXiv:2602.15156v1 Announce Type: new Abstract: Language models are increasingly used to reason over content they were not trained on, such as new documents, evolving knowledge, and user-specific data. A common approach is retrieval-augmented generation (RAG), which stores verbatim documents extern...

#ArXiv#Machine Learning#Academic

Tool• Feb 18, 2026

da Costa and Tarski meet Goguen and Carnap: a novel approach for ontological heterogeneity based on consequence systems

arXiv:2602.15158v1 Announce Type: new Abstract: This paper presents a novel approach for ontological heterogeneity that draws heavily from Carnapian-Goguenism, as presented by Kutz, Mossakowski and Lücke (2010). The approach is provisionally designated da Costian-Tarskianism, named after da Costa...

#ArXiv#Machine Learning#Academic

Tool• Feb 17, 2026

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

arXiv:2602.13359v1 Announce Type: new Abstract: Machine learning models excel with abundant annotated data, but annotation is often costly and time-intensive. Active learning (AL) aims to improve the performance-to-annotation ratio by using query methods (QMs) to iteratively select the most informa...

#ArXiv#Machine Learning#Academic
