Building Reliable Agents - Evaluation Challenges
Exploring the challenges of evaluating agent reliability and LLM performance.
Exploring the challenges of evaluating agent reliability and LLM performance.
Briefing document for LangChain Interrupt 2025.
BlackRock's approach to building Aladdin Copilot, a sophisticated assistant for investment management.
JPMorgan Chase's multi-agent system for investment research that integrates structured data, RAG, and analytics.
Quick digest table summarizing the LangChain Interrupt 2025 conference presentations.