Stay ahead of the generative AI revolution!Join the M5B Newsletter →

AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

All Engineering Hardware Jobs News Research Tools Tutorials

News AI TechCrunch Analytics Vidhya Data Science Towards Data Science Medium GenAI Textual OpenAI Google MIT Microsoft HuggingFace OpenSource Models NVIDIA GPU Enterprise ArXiv

Tool• May 29, 2026

Self-Play Reinforcement Learning under Imperfect Information in Big 2

arXiv:2605.28863v1 Announce Type: new Abstract: Imperfect-information multiplayer games test whether agents can act under hidden information, sparse rewards, and non-stationary opponents. We study these challenges in Big 2, a four-player imperfect-information card game. We develop a self-play RL fr...

#ArXiv#Machine Learning#Academic

Tool• May 29, 2026

Molecular Lead Optimization via Agentic Tool Planning

arXiv:2605.28862v1 Announce Type: new Abstract: Drug discovery is a lengthy and resource-intensive process composed of multiple stages. Among these stages, lead optimization plays a critical role in transforming early hit compounds into viable drug candidates. This stage requires improving ADMET-re...

#ArXiv#Machine Learning#Academic

Tool• May 29, 2026

Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents

arXiv:2605.28850v1 Announce Type: new Abstract: We study behavioral alignment and representation dynamics of large language model (LLM) agents in financial decision environments. Using TradeArena, an auditable trading-agent testbed with risk reports, execution simulation, memory, and replayable tra...

#ArXiv#Machine Learning#Academic

Tool• May 29, 2026

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

arXiv:2605.28864v1 Announce Type: new Abstract: The Cognitive Categorical Transformer (CCT) is a 306M-parameter architecture that augments a pretrained GPT-2 Small backbone with cognitively grounded components derived from category theory and several inspirations from cognitive science. Under a mat...

#ArXiv#Machine Learning#Academic

Tool• May 29, 2026

Ultra-Reduced-Impact-Encased-Logging (URIEL): propose a new method for selective sustainable logging and post-harvest silvicultural treatment in tropical forest using airborne robotics systems

arXiv:2605.28883v1 Announce Type: new Abstract: Tropical forests worldwide are under intense deforestation pressure driven by economic and political interests, and scientific evidence suggests this deforestation contributes to climate change. This paper proposes a novel logging method for tropical ...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

Soro: A Lightweight Foundation Model and Chatbot for Tajik

arXiv:2605.27379v1 Announce Type: new Abstract: We present Soro, a family of Tajik-specialized conversational large language models (LLMs) designed for real-world deployment under tight compute and connectivity constraints in Tajikistan. Starting from open-weight Gemma 3 checkpoints, we perform Taj...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

Identifying and Understanding Human Values in Text: A Tailorable LLM-based Architecture

arXiv:2605.27373v1 Announce Type: new Abstract: As intelligent systems become more autonomous, the scientific community focuses on creating decision-making mechanisms that include ethical and moral considerations, unlike traditional utility-maximisation models. To achieve this, a key aspect is asse...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents

arXiv:2605.27566v1 Announce Type: new Abstract: Progress in neural combinatorial optimization for Dynamic Flexible Job Shop Scheduling Problem (DFJSP) is currently hindered by a methodological tension: static benchmarks encourage benchmark overfitting, while uncalibrated generators obscure algorith...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity

arXiv:2605.27385v1 Announce Type: new Abstract: Federated reinforcement learning (FedRL) enables multiple agents to collaboratively train a global policy without sharing raw data, making it ideal for privacy-sensitive applications. However, FedRL faces challenges in heterogeneous environments where...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

A Simple State Space Model Excels at Multivariate Time Series Classification

arXiv:2605.27406v1 Announce Type: new Abstract: Structured state space models (SSMs) have recently emerged as a promising foundation for sequence modeling, with Mamba-based architectures demonstrating strong performance through input-dependent state transitions, albeit at considerable complexity. H...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference

arXiv:2605.27428v1 Announce Type: new Abstract: Edge deployments of generative inference increasingly face two practical realities: per-device per-model performance is often unknown at deployment time, and it is non-stationary due to user-driven semantic events, background load, and device churn. C...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

IGADA-IoT: IoT Sensor Energy Optimization in Wireless Sensor Networks Driven by Automatic Data Augmentation

arXiv:2605.27397v1 Announce Type: new Abstract: In wireless sensor networks (WSNs), data augmentation is a novel method to improve sampling-frequency decision performance, thereby enabling energy optimization for IoT (Internet of Things) sensors. However, existing methods rely on a single generator...

#ArXiv#Machine Learning#Academic

Tool• May 28, 2026

Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey

arXiv:2605.27431v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) presents a naturally compatible and scalable framework for multimodal learning, demonstrating strong adaptability across diverse modalities and tasks. Despite its growing success, a comprehensive and systematic review on the M...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

Neural Bayesian Sequential Routing

arXiv:2605.26147v1 Announce Type: new Abstract: Human decision-making is sequential and uncertainty-aware, yet standard neural networks often rely on static, dense forward computation with limited visibility into evidence acquisition, uncertainty evolution, or when computation should stop. We intro...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

arXiv:2605.26121v1 Announce Type: new Abstract: LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering fails to address embe...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

The Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Models

arXiv:2605.26128v1 Announce Type: new Abstract: Production LLM systems increasingly require machine-readable outputs: JSON objects, typed traces, regex-constrained fields, and tool-call schemas. This paper targets on-device and low-cost small language model (SLM) deployments, where sub-3B models ar...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion

arXiv:2605.26130v1 Announce Type: new Abstract: Operational weather prediction at kilometer scales remains computationally prohibitive for traditional numerical weather prediction (NWP) models, limiting forecast access for applications in energy, agriculture, and disaster management that require fi...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

Is Agent Memory a Database? Rethinking Data Foundations for Long-Term AI Agent Memory

arXiv:2605.26252v1 Announce Type: new Abstract: Long-running AI agents need persistent memory. Memory supports learning across sessions, reduces repeated context injection, and enables auditing of past decisions. Current agent memory systems and database paradigms treat memory as storage. They loca...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

BrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenization

arXiv:2605.26182v1 Announce Type: new Abstract: Generating physically buildable brick structures from 3D shapes requires more than geometric reconstruction: the output must also satisfy discrete part constraints and structural stability. Existing brick generation methods either rely on heuristic op...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions

arXiv:2605.26256v1 Announce Type: new Abstract: Multimodal large language model (MLLM)-based embodied agents have shown strong potential for solving complex tasks in physical environments. However, personalized assistance requires more than following generic instruction or recognizing object catego...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

Can LLMs Introspect? A Reality Check

arXiv:2605.26242v1 Announce Type: new Abstract: Can large language models detect and report their own internal states? A number of studies have argued that the answer to this question is yes. We argue, based on lessons from human metacognition research, that this conclusion may be premature: to be ...

#ArXiv#Machine Learning#Academic

Tool• May 27, 2026

Constraint acquisition needs better benchmarks

arXiv:2605.26279v1 Announce Type: new Abstract: Constraint Acquisition (CA) and related research on the validation and enhancement of Mathematical Programming (MP) models from domain knowledge artifacts are currently limited by inadequate benchmarks. This deficiency impedes reproducibility and cros...

#ArXiv#Machine Learning#Academic

Tool• May 26, 2026

Algometrics: Forecasting Under Algorithmic Feedback

arXiv:2605.23978v1 Announce Type: new Abstract: In algorithmic markets, predictive models become part of the data-generating process they aim to forecast. Once their outputs are converted into trades, allocations, execution schedules, or risk controls, they change the future data on which they are ...

#ArXiv#Machine Learning#Academic

Tool• May 26, 2026

Towards Verifiable Transformers: Solver-Checkable Circuit Explanations

arXiv:2605.24033v1 Announce Type: new Abstract: Mechanistic interpretability often identifies circuits inside Transformer models, but explanations of those circuits are usually validated through examples, ablations, and manual reasoning. This leaves a gap between finding a plausible circuit and pro...

#ArXiv#Machine Learning#Academic

1...11 12 13 14 15...48