LangSmith Fleet introduces two types of agent authorization: Assistants, which use the end user's own credentials, and Claws, which use a fixed set of credentials.
Polly is generally available everywhere you work in LangSmith
Debugging agents is different from debugging anything else you've built. Traces run hundreds of steps deep, prompts span thousands of lines, and when something goes wrong, the context that caused it is buried somewhere in the middle.We built Polly to be the AI assistant that can read
LangChain Announces Enterprise Agentic AI Platform Built with NVIDIA
Comprehensive agent engineering platform combined with NVIDIA AI enables enterprises to build, deploy, and monitor production-grade AI agents at scalePress ReleaseSAN FRANCISCO, March 16, 2026 /PRNewswire/ — LangChain, the agent engineering company behind LangSmith and open-source frameworks that ha...
Clarifai Reasoning Engine Achieves 414 Tokens Per Second on Kimi K2.5
Clarifai achieves 414 tokens per second on Kimi K2.5, one of the first providers to reach 400+ TPS on a trillion-parameter reasoning model running on Nvidia B200 GPUs.
We’re excited to introduce the deploy cli, a new set of commands within the langgraph-cli package that makes it simple to deploy and manage agents directly from the command line.The first command in this new set, langgraph deploy, lets you deploy an agent to LangSmith Deployment in
Clarifai 12.2: Three-Command CLI Workflow for Model Deployment
Clarifai 12.2 introduces a three-command CLI workflow for model deployment. Initialize, test locally, and deploy to production with automatic GPU selection and infrastructure provisioning.
TL;DR: We've added a tool to the Deep Agents SDK (Python) and CLI that allows models to compress their own context windows at opportune times.MotivationContext compression is an action that reduces the information in an agent’s working memory. Older messages are replaced by
By Vivek TrivedyTLDR: Agent = Model + Harness. Harness engineering is how we build systems around models to turn them into work engines. The model contains the intelligence and the harness makes that intelligence useful. We define what a harness is and derive the core components today's and tomor...
How Coding Agents Are Reshaping Engineering, Product and Design
EPD (Engineering, Product, and Design) at software company is about creating good software. Separate roles exist, but the end goal is functional software that solves a business problem that users can use. At the end of the day, this is just code. It is important to recognize that the output
By Robert XuRecently at LangChain we’ve been building skills to help coding agents like Codex, Claude Code, and Deep Agents CLI work with our ecosystem: namely, LangChain and LangSmith. This is not an effort unique to us - most (if not all) companies are exploring how to
We’re releasing a CLI along with our first set of skills to give AI coding agents expertise in the LangSmith ecosystem. This includes adding tracing to agents, understanding their execution, building test sets, and evaluating performance. On our eval set, this bumps Claude Code’s performance on
We’re releasing our first set of skills to give AI coding agents expertise in the open source LangChain ecosystem. This includes building agents with LangChain, LangGraph, and Deep Agents. On our eval set, this bumps Claude Code’s performance on these tasks from 29% to 95%.What
How Clay uses LangSmith to debug, evaluate, and monitor 300 million agents runs per month
Clay is the creative tool for growth — a platform where go-to-market teams build, enrich, and activate lists of companies and people. Sales teams use Clay to source target accounts, qualify leads with AI-powered research, draft personalized outreach, and route opportunities through their CRM. Clay's...
You don’t know what your agent will do until it’s in production
You can't monitor agents like traditional software. Inputs are infinite, behavior is non-deterministic, and quality lives in the conversations themselves. This article explains what to monitor, how to scale evaluation, and how production traces become the foundation for continuous improvement.