See AI First โ The Opinionated AI Stack Guide
Curated directory of 66 AI developer tools across 13 categories.
Protocols โ Standardized Communication Layer
- MCP โ Model Context Protocol โ Anthropic's connection standard โ lets AI talk to any tool, database, or API through one unified protocol. Often called 'USB-C for AI'.
- A2A โ Agent2Agent Protocol โ Google's protocol โ lets AI agents communicate and collaborate with each other, regardless of their underlying framework.
Agent Frameworks โ Building AI Agents
- LangGraph โ Stateful agent orchestration framework from the LangChain ecosystem. The most widely adopted today.
- CrewAI โ Build role-based AI agent teams with minimal code. Each agent has a role, they coordinate automatically.
- AutoGen โ Microsoft's framework for multi-agent conversations. Flexible architecture, easy to extend.
- OpenAI Agents SDK โ OpenAI's lightweight Python SDK โ build agents fast with built-in safety guardrails.
- Google ADK โ Google's agent building toolkit โ with built-in Gemini, Vertex AI, and A2A protocol integration.
- LlamaIndex โ The RAG specialist โ connects AI to your private data for accurate answers from internal documents.
- Rasa โ Chatbot and voice AI platform โ runs on your own server with full data control.
- Browser-Use โ Browser automation framework for AI agents โ control the web with Python using any LLM
- Pydantic AI โ Type-safe agent framework in the style of FastAPI โ built by the Pydantic team, supports 40+ model providers
- Mastra โ TypeScript-first agent framework from the Gatsby team โ workflows, RAG, evals, and MCP built in
Platforms & Config โ AI Platforms & System Files
- Claude.ai (Web) โ Anthropic's chat workspace โ with memory, web search, Claude Cowork, and team plugins/connectors for enterprise workflows.
- ChatGPT (Web) โ OpenAI's most popular AI chat โ with GPTs, search, tools, and a built-in memory system.
Memory Systems โ Cross-Session Memory
- Claude-Mem โ Plugin that gives Claude Code persistent long-term memory โ automatically records experience, compresses intelligently, and remembers across sessions.
- Other Memory Systems โ Beyond Claude-Mem, several other memory systems exist: MemGPT/Letta โ self-manages context windows like an OS manages RAM, with a paging mechanism.
- MemOS โ Unified memory operating system for LLMs โ manages memory via MemCube with short-term and long-term support
Skills Ecosystem โ "Bigger than MCP"
- Universal Agent Skills โ Skills are .md files with instructions for AI agents โ simple yet effective, runnable on any AI coding tool.
- Notable Skill Categories โ Key categories: Engineering & DevOps (API testing, Docker, CI/CD), Web Development (React, Tailwind, animations), AI/ML Research (77 skills for paper reading, fine-tuning), Cross-Agent Orchestration (handoff between Claude Code โ Codex โ Cursor), Claude Code Ecosystem (Task Master, Claude Swarm, session management).
- Superpowers โ Agent-style skills framework for Claude Code โ automatically activates the right skills based on context.
- Agents (wshobson) โ Collection of 112 agents + 146 skills + 73 specialized plugins for Claude Code โ the largest to date.
- Planning with Files โ Markdown-file-based planning skill inspired by Manus โ maintains context across multiple work sessions.
Orchestration โ Multi-Agent Coordination
- Claude-Flow โ Agent orchestration platform with a Queen + Workers architecture โ manages Claude, GPT, and Gemini simultaneously via MCP.
- OpenClaw โ Evolution: Clawdbot โ Moltbot โ OpenClaw
- Claude Squad โ Terminal app for running multiple Claude Code and Codex sessions simultaneously, each in its own workspace.
AI Coding Agents โ AI-Powered Coding Tools
- Claude Code โ The leading CLI agent โ SWE-bench, outperforming all rivals.
- Cursor โ Smart IDE based on VS Code โ features Composer and Agent mode, best for daily coding.
- Codex CLI โ OpenAI's CLI agent โ runs in terminal or cloud, handles tasks sequentially and precisely.
- Windsurf โ AI IDE with Cascade โ a multi-step autonomous agent: reads code โ analyzes โ edits โ tests โ commits.
- GitHub Copilot โ Microsoft's code assistant built into VS Code โ features Agent mode, strong for enterprise.
- Gemini CLI โ Google's CLI agent โ free for individuals, context (the largest).
- Cline / RooCode โ Open-source VS Code extension โ supports multiple AI models and MCP, community-driven.
- Kiro (AWS) โ AWS's spec-first AI IDE โ from requirements โ design โ plan โ automated code generation.
- Replit Agent โ Cloud-based agent โ builds full-stack apps fast, no setup needed. Best for beginners.
- OpenCode โ Open-source coding agent running in terminal โ supports all LLM providers, MIT license
- Goose โ Open-source coding agent from Block (Square) โ doesn't just suggest but also installs, runs, and debugs with any LLM
AI Trends 2026 โ Trends Shaping the Future
- Multi-Agent Systems โ Gartner reports queries about multi-agent systems surged from Q1/2024 to Q2/2025.
- Mechanistic Interpretability โ Research into how AI 'thinks' internally โ enabling better understanding and control of AI behavior. 2026 breakthrough per MIT.
- World Models โ AI simulating the real world โ AMI Labs, Google, and startups are racing to develop it. Gaming is a major driver.
- Generative Coding
- Chinese Open-Source AI โ DeepSeek R1 shook the AI world โ open-source reasoning model from China. Technology gap shrinking from months to weeks.
- Post-Training > Pre-Training โ Trend toward post-training refinement instead of building larger models โ RL and fine-tuning become the key.
- AI Drug Discovery โ AI-discovered drugs entering mid/late-stage clinical trials. Focus: cancer and rare diseases.
- AI Infrastructure โ Massive data centers, even using nuclear power for AI. A core battleground in the AI race.
- AI Governance โ AI agents monitoring other AI agents โ setting safety guardrails and keeping humans in the loop.
Observability & Evaluation โ AI Monitoring & Assessment
- Langfuse โ Open-source observability platform for LLMs โ tracing, evals, prompt management
- DeepEval โ LLM evaluation framework like Pytest โ 50+ metrics, CI/CD integration, red-teaming
- Promptfoo โ CLI tool for prompt testing, model comparison, and LLM red-teaming/security scanning
- LangSmith โ LangChain's observability platform โ tracing, evaluation, prompt hub for LangChain/LangGraph
- Arize Phoenix โ Open-source AI observability platform โ tracing, evaluation, RAG debugging, embeddings analysis
AI Infrastructure โ Model Serving & Deployment
- Ollama โ The easiest way to run LLMs locally โ supports 100+ models, 3 commands to get started
- vLLM โ High-performance production inference engine โ PagedAttention, 2-4ร throughput over baseline
- LocalAI โ Self-hosted OpenAI alternative โ run LLMs, image gen, audio on consumer hardware, no GPU required
- LiteLLM โ Unified AI Gateway for 100+ LLM providers โ cost tracking, load balancing, unified API
RAG Systems โ Retrieval-Augmented Generation
- Dify โ Open-source platform for building AI apps with visual workflow builder and built-in RAG pipeline
- RAGFlow โ Open-source RAG engine specializing in deep document understanding โ smart chunking, grounded citations
- Flowise โ No-code tool for building RAG pipelines and LLM apps with drag-and-drop UI โ built on LangChain
- LightRAG โ Lightweight RAG framework based on knowledge graphs โ GraphRAG approach from HKU research (EMNLP 2025)
Vector Databases โ Vector Storage & Search
- Milvus โ Largest open-source vector database โ cloud-native, scales to billions of vectors, backed by LF AI Foundation
- Qdrant โ Vector database written in Rust โ fast, memory-efficient, best-in-class payload filtering
- Chroma โ Simplest vector database for AI apps โ embedded mode, zero-config, ideal for getting started with RAG
- pgvector โ Extension that turns PostgreSQL into a vector database โ no separate DB needed, uses familiar SQL
Security & Guardrails โ Protecting AI Applications
- garak โ LLM vulnerability scanner โ probes security flaws in AI models
- NeMo Guardrails โ Programmable guardrails for LLMs with Colang DSL โ dialog control and safety
- Presidio โ PII detection & anonymization for text and images โ protecting personal data
- Guardrails AI โ I/O validation framework for LLMs โ structured output + 100+ validators