ActiveState Survey Flags Container Security Gaps and Breach Risk
The company’s 2026 State of Vulnerability Management & Remediation Report reveals the tension between the strategic intent and operational reality of open source in enterprise software development ActiveState, a global leader in open source language solutions and secure software supply chain managem...
Pentera Closes Record-Setting Year, Becomes First in AEV to Surpass $100M ARR
Pentera has set the standard for the Adversarial Exposure Validation (AEV) category, uniting AI-powered security testing and automated remediation workflows in one platform Pentera is celebrating its breakthrough as the first company in the adversarial exposure validation (AEV) market to reach Centa...
10Bridge Unveils AI Agents for Healthcare Data Interoperability Workflows
New AI Agent Automation eliminates manual reporting, automates complex interoperability tasks, and enables system integration when APIs or technical expertise are not available. 10Bridge (https://10bridge.io), a leader in healthcare data interoperability solutions, today announced the launch of its ...
Learning Python at the beginning feels deceptively simple. You write a few lines, the code runs, and it’s tempting to think you’ve got it. Then you try to build something on your own and… nothing works!? Turns out all the information you had learnt, didn’t find an outlet. That’s where challenging p...
arXiv:2601.02433v1 Announce Type: new
Abstract: Digital AI systems spanning large language models, vision models, and generative architectures that operate primarily in symbolic, linguistic, or pixel domains. They have achieved striking progress, but almost all of this progress lives in virtual spa...
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks
arXiv:2601.02439v1 Announce Type: new
Abstract: We present WebGym, the largest-to-date open-source environment for training realistic visual web agents. Real websites are non-stationary and diverse, making artificial or small-scale task sets insufficient for robust policy learning. WebGym contains ...
Polynomial Convergence of Riemannian Diffusion Models
arXiv:2601.02499v1 Announce Type: new
Abstract: Diffusion models have demonstrated remarkable empirical success in the recent years and are considered one of the state-of-the-art generative models in modern AI. These models consist of a forward process, which gradually diffuses the data distributio...
GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA
arXiv:2601.02500v1 Announce Type: new
Abstract: Full fine-tuning of Large Language Models (LLMs) is computationally costly, motivating Continual Learning (CL) approaches that utilize parameter-efficient adapters. We revisit Gradient Episodic Memory (GEM) within the Low-Rank Adapter (LoRA) subspace ...
Textual Explanations and Their Evaluations for Reinforcement Learning Policy
arXiv:2601.02514v1 Announce Type: new
Abstract: Understanding a Reinforcement Learning (RL) policy is crucial for ensuring that autonomous agents behave according to human expectations. This goal can be achieved using Explainable Reinforcement Learning (XRL) techniques. Although textual explanation...
SimpleMem: Efficient Lifelong Memory for LLM Agents
arXiv:2601.02553v1 Announce Type: new
Abstract: To support reliable long-term interaction in complex environments, LLM agents require memory systems that efficiently manage historical experiences. Existing approaches either retain full interaction histories via passive context extension, leading to...
Orchestral AI: A Framework for Agent Orchestration
arXiv:2601.02577v1 Announce Type: new
Abstract: The rapid proliferation of LLM agent frameworks has forced developers to choose between vendor lock-in through provider-specific SDKs and complex multi-package ecosystems that obscure control flow and hinder reproducibility. Integrating tool calling a...
AWARE-US: Benchmark for Preference-Aware Resolution in Tool-Calling Agents
arXiv:2601.02643v1 Announce Type: new
Abstract: Tool-calling conversational agents querying structured databases often face two linked failures: underspecification (missing constraints needed to run a precise query) and infeasibility (the fully specified query returns an empty set because no item s...
Implementing Softmax From Scratch: Avoiding the Numerical Stability Trap
In deep learning, classification models don’t just need to make predictions—they need to express confidence. That’s where the Softmax activation function comes in. Softmax takes the raw, unbounded scores produced by a neural network and transforms them into a well-defined probability distribution, m...
NVIDIA AI Released Nemotron Speech ASR: A New Open Source Transcription Model Designed from the Ground Up for Low-Latency Use Cases like Voice Agents
NVIDIA has just released its new streaming English transcription model (Nemotron Speech ASR) built specifically for low latency voice agents and live captioning. The checkpoint nvidia/nemotron-speech-streaming-en-0.6b on Hugging Face combines a cache aware FastConformer encoder with an RNNT decoder,...
A Gentle Introduction to Language Model Fine-tuning
This article is divided into four parts; they are: • The Reason for Fine-tuning a Model • Dataset for Fine-tuning • Fine-tuning Procedure • Other Fine-Tuning Techniques Once you train your decoder-only transformer model, you have a text generator.
What To Look For In A Cloud Services Provider (Sponsored)
Choosing a cloud services provider can feel a lot like dating: every vendor promises reliability, security, and support, but only a few truly live up to it. The wrong choice can lead to costly downtime, security headaches, or performance bottlenecks that ripple across your business.
California lawmaker proposes a four-year ban on AI chatbots in kid’s toys
“Our children cannot be used as lab rats for Big Tech to experiment on,” Senator Steve Padilla said. He just introduced a bill to ban AI chatbots in toys until safety regulations are developed.
Nvidia to accelerate Siemens chip-design tools using its GPUs
Nvidia announced today at CES 2026 that it would help Siemens’ electronic design automation (EDA) software run on its GPUs in an attempt to speed up the chip-design process. Nearly all computer chips today are designed using EDA tools, and as chip features get smaller and transistors more numerous, ...
Liquid AI Releases LFM2.5: A Compact AI Model Family For Real On Device Agents
Liquid AI has introduced LFM2.5, a new generation of small foundation models built on the LFM2 architecture and focused at on device and edge deployments. The model family includes LFM2.5-1.2B-Base and LFM2.5-1.2B-Instruct and extends to Japanese, vision language, and audio language variants. It is ...
A practical guide to observability, evaluations, and model comparisons
The post Measuring What Matters with NeMo Agent Toolkit appeared first on Towards Data Science.