agent-debugging

Here are 32 public repositories matching this topic...

liaohch3 / claude-tap

Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.

Updated May 26, 2026
Python

najeed / ai-agent-eval-harness

The open-source MultiAgentOps evaluation and verification harness for any industry business workflow.

Updated May 23, 2026
Python

OthmanAdi / langsmith-fetch-skill

🔍 AI observability skill for Claude Code. Debug LangChain/LangGraph agents by fetching execution traces from LangSmith Studio directly in your terminal.

developer-tools observability ai-agents langchain langsmith llm-ops langsmith-tracing developer-tools-ai-agent claude-skills claude-skills-creator claude-skills-hub claude-skills-libary agent-debugging

Updated Apr 6, 2026

cylestio / agent-inspector

Local open-source dev tool to debug, secure, and evaluate LLM agents. Provides static analysis, dynamic security checks, and runtime monitoring - integrates with Cursor and Claude Code.

behavior-analysis agent-trace ai-security-tool agent-security cursor-integration claude-code-plugin agent-debugging

Updated Jan 15, 2026
Python

converra / agent-triage

Diagnose your AI agents in production. Extract policies from prompts, evaluate traces, generate diagnostic reports.

Updated Mar 10, 2026
TypeScript

Ylsssq926 / clawclip

Cut your OpenClaw / ZeroClaw token bill. Find which model earns its cost. Prove whether optimizations actually work. Local, no upload.

hermes ai-agent ai-observability cost-reduction local-ai agent-tools llm-cost token-optimization agent-debugging openclaw zeroclaw hermes-agent agent-analytics prompt-efficiency

Updated May 26, 2026
TypeScript

aaronlab / browsertrace

Local replay debugger for Browser Use failures with screenshots, model I/O, failed-step timelines, and public-safe HTML exports.

Updated May 14, 2026
Python

amitmishrg / agenticlens

Visual debugging, tracing, and replay for agent workflows.

nodejs ai reactjs devtools tracing developer-tools visualizations observability debugging-tools ai-agents log-visualization jsonl ai-observability llm agentic-ai agent-workflows workflow-visualization agent-debugging execution-tracing

Updated Mar 27, 2026
JavaScript

kangjinghang / agent-chatlens

🔍 A beautiful web viewer for AI agent session files. Browse Claude Code & OpenClaw conversations with chat-style UI, timeline visualization, and zero setup.

react visualization typescript developer-tools dark-mode chat-ui claude conversation-analysis jsonl vite ai-agent session-viewer claude-code agent-debugging openclaw jsonl-viewer tool-call-visualization

Updated May 19, 2026
TypeScript

ChainWatch is a flight data recorder for multi-step AI systems. It's a CLI-based tool that records every step in an AI decision chain, links them together in order, prevents tampering, and allows you to verify the chain's integrity and replay the full decision flow.

ai artificial-intelligence audit-log autonomous-agents ai-agents ai-engineering ai-observability llm llmops ai-tracing agent-observability ai-audit agent-debugging tool-using-agents decision-tracing

Updated Jan 22, 2026
Python

jigjoy-ai / kaleidoskop

Kaleidoskop — replay your baro/Mozaik agent runs visually. Audit log → hexagonal neural firing in your browser.

visualization typescript multi-agent replay mozaik observability ai-agents baro llm agent-orchestration agent-debugging jigjoy

Updated May 23, 2026
TypeScript

Exploreunive / agentlens

Explain why your agent failed — root-cause debugging, memory attribution, and run divergence for LLM agents.

python memory tracing developer-tools observability ai-agents llm agent-debugging

Updated Mar 31, 2026
Python

xiaoshuo1988130 / deepseek-compat-kit

Compatibility and diagnostics for DeepSeek V4 tool-calling agents

json-schema llm-proxy deepseek tool-calling openai-compatible agent-debugging deepseek-v4 reasoning-content

Updated May 27, 2026
JavaScript

joshualamerton / AgentLens

A real-time observability and debugging layer for AI agents.

python machine-learning ai machine-learning-algorithms devtools agents ai-agents machine-learning-projects llms ai-devtools agent-debugging

Updated Mar 11, 2026
Python

valani9 / vstack

AI agents fail like junior teammates—looping on bad ideas, ignoring feedback, escalating commitment. vstack ports 34 of the most-cited organizational-behavior frameworks so you can diagnose your agents the same way you'd diagnose your team.