AI Tools & Frameworks Directory

Discover essential tools, libraries, and frameworks to power your AI workflows.

Tool• May 7, 2026

Pro$^2$Assist: Continuous Step-Aware Proactive Assistance with Multimodal Egocentric Perception for Long-Horizon Procedural Tasks

arXiv:2605.04227v1 Announce Type: new Abstract: Procedural tasks with multiple ordered steps are ubiquitous in daily life. Recent advances in multimodal large language models (MLLMs) have enabled personal assistants that support daily activities. However, existing systems primarily provide reactive...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

Actionable Real-Time Modeling of Surgical Team Dynamics via Time-Expanded Interaction Graphs

arXiv:2605.04169v1 Announce Type: new Abstract: Surgical team performance arises from complex interactions between technical execution and non-technical skills, including communication and coordination dynamics. However, current surgical AI systems predominantly model visual workflow signals, lacki...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

LCM: Lossless Context Management

arXiv:2605.04050v1 Announce Type: new Abstract: We introduce Lossless Context Management (LCM), a deterministic architecture for LLM memory that outperforms Claude Code on long-context tasks. When benchmarked using Opus 4.6, our LCM-augmented coding agent, Volt, achieves higher scores than Claude C...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning

arXiv:2605.04058v1 Announce Type: new Abstract: Parameter-efficient transfer learning (PETL) has emerged as a pivotal paradigm for adapting pre-trained foundation models to downstream tasks, significantly reducing trainable parameters yet suffering from substantial memory overhead caused by gradien...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search

arXiv:2605.04057v1 Announce Type: new Abstract: This paper focuses on a key challenge in Neural Architecture Search (NAS): integrating established architectural knowledge while exploring new designs under expensive evaluations. Large language models (LLMs) are a promising assistant for NAS because ...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

Transformation Categorization Based on Group Decomposition Theory Using Parameter Division

arXiv:2605.04056v1 Announce Type: new Abstract: Representation learning seeks meaningful sensory representations without supervision and can model aspects of human development. Although many neural networks empirically learn useful features, a principled account of what makes a representation "good...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

Endogenous Regime Switching Driven by Scalar-Irreducible Learning Dynamics

arXiv:2605.04054v1 Announce Type: new Abstract: Achieving endogenous regime switching is crucial for the emergence of autonomous intelligence, yet remains a central challenge for existing machine learning frameworks, where such transitions are typically externally imposed. In this work, we introduc...

#ArXiv#Machine Learning#Academic

Tool• May 7, 2026

May 7, 2026AlignmentDonating our open-source alignment tool

#Anthropic#Claude#LLM

Tool• May 6, 2026

5 ways to prepare for physical AI, today

Jensen Huang called it "the ChatGPT moment for robotics." Deloitte says 80% of businesses plan to use physical AI within two years. Here is what you actually need to know, and do, to prepare…

#AI Accelerator Institute#AI#Research

Tool• May 6, 2026

AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields

Explore how AlphaEvolve's Gemini-powered algorithms are driving impact across business, infrastructure, and science.

#DeepMind#Google#AGI

Tool• May 6, 2026

Programmatic Context Augmentation for LLM-based Symbolic Regression

arXiv:2605.03101v1 Announce Type: new Abstract: Symbolic regression (SR), the task of discovering mathematical expressions that best describe a given dataset, remains a fundamental challenge in scientific discovery. Traditional approaches, primarily based on genetic algorithms and related evolution...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

arXiv:2605.03034v1 Announce Type: new Abstract: Agentic systems involved in high-stake decision-making under adversarial pressure need formal guarantees not offered by existing approaches. Motivated by the operational needs of security operations centers (SOCs) that must configure endpoint detectio...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

arXiv:2605.02910v2 Announce Type: new Abstract: Recent advances in large language models have led to strong performance on reasoning and environment-interaction tasks, yet their ability for creative problem-solving remains underexplored. We study this capability through the lens of creative tool us...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

Delay, Plateau, or Collapse: Evaluating the Impact of Systematic Verification Error on RLVR

arXiv:2605.02909v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become a powerful approach for improving the reasoning capabilities of large language models (LLMs). While RLVR is designed for tasks with verifiable ground-truth answers, real-world verifiers ...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

An End-to-End Framework for Building Large Language Models for Software Operations

arXiv:2605.02906v1 Announce Type: new Abstract: In the field of software operations, Large Language Models (LLMs) have attracted increasing attention. However, existing research has not yet achieved efficient and effective end-to-end intelligent operations due to low-quality data, fragmented knowle...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

eOptShrinkQ: Near-Lossless KV Cache Compression Through Optimal Spectral Denoising and Quantization

arXiv:2605.02905v1 Announce Type: new Abstract: We show that the key-value (KV) cache in transformer attention heads admits a natural decomposition into a low-rank \emph{shared context} component and a full-rank \emph{per-token} residual, well described by the spiked random matrix model. This obser...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

arXiv:2605.02904v1 Announce Type: new Abstract: We present StateSMix, a fully self-contained lossless compressor that couples an online-trained Mamba-style State Space Model (SSM) with sparse n-gram context mixing and arithmetic coding. The model is initialised from scratch and trained token-by-tok...

#ArXiv#Machine Learning#Academic

Tool• May 6, 2026

AI lets chemists design molecules by simply describing them

Creating complex molecules usually requires years of experience and countless decisions, but a new AI system is changing that. Synthegy lets chemists guide synthesis and reaction planning using simple language, while powerful algorithms generate and evaluate possible solutions. The AI doesn’t just c...

#Science Daily#AI#Research

Tool• May 6, 2026

Normalizing Flows with Iterative Denoising

Normalizing Flows (NFs) are a classical family of likelihood-based methods that have received revived attention. Recent efforts such as TARFlow have shown that NFs are capable of achieving promising performance on image modeling tasks, making them viable alternatives to other methods such as diffusi...

#Apple#On-device AI

Tool• May 6, 2026

From Where Things Are to What They’re For: Benchmarking Spatial–Functional Intelligence for Multimodal LLMs

True spatial intelligence for multimodal agents transcends low-level geometric perception, evolving from knowing where things are to understanding what they are for. While existing benchmarks, such as VSI-Bench, effectively evaluate this foundational geometric stage, they fall short of probing the h...

#Apple#On-device AI

Tool• May 5, 2026

Microsoft at NSDI 2026: Advances in large-scale networked systems

Microsoft researchers share advances in building and operating large-scale distributed systems, spanning datacenters, networking, and the growing intersection with AI during NSDI ’26. The post Microsoft at NSDI 2026: Advances in large-scale networked systems appeared first on Microsoft Research.

#Microsoft#Research

Tool• May 5, 2026

Understanding Emergent Misalignment via Feature Superposition Geometry

arXiv:2605.00842v1 Announce Type: new Abstract: Emergent misalignment, where fine-tuning on narrow, non-harmful tasks induces harmful behaviors, poses a key challenge for AI safety in LLMs. Despite growing empirical evidence, its underlying mechanism remains unclear. To uncover the reason behind th...

#ArXiv#Machine Learning#Academic

Tool• May 5, 2026

AI Agents for Sustainable SMEs: A Green ESG Assessment Framework

arXiv:2605.00841v1 Announce Type: new Abstract: This study presents a novel, AI-driven framework for assessing Environmental, Social, and Governance (ESG) performance in European small and medium-sized enterprises (SMEs). An initial phase established expert-validated ESG baseline scores from a subs...

#ArXiv#Machine Learning#Academic

Tool• May 5, 2026

2026 Roadmap on Artificial Intelligence and Machine Learning for Smart Manufacturing

arXiv:2605.00839v1 Announce Type: new Abstract: The evolution of artificial intelligence (AI) and machine learning (ML) is reshaping smart manufacturing by providing new capabilities for efficiency, adaptability, and autonomy across industrial value chains. However, the deployment of AI and ML in i...

#ArXiv#Machine Learning#Academic

← Prev

1...9 10 11 12 13...50