What is the ReAct pattern in LLM agents?

ReAct (Reasoning + Acting) is a prompting pattern where the agent alternates between Thought (explicit reasoning about what to do next), Action (tool call with arguments), and Observation (the tool result). This loop continues until the agent decides it has enough information to produce a final answer. ReAct improves reliability over purely reactive agents by making the reasoning process transparent and debuggable.

How do you prevent agents from getting stuck in infinite loops?

Set a maximum step count (hard limit). Use structured output validation to detect when an agent is repeating the same action. Implement a planning layer that breaks the task into sub-goals with checkpoints. Add a meta-agent or supervisor that monitors progress and can force termination. Log all steps so you can debug loop patterns when they occur.

What is the difference between a single-agent and multi-agent architecture?

A single agent uses one LLM instance with multiple tools to complete tasks sequentially. A multi-agent system uses multiple specialised agents (researcher, coder, reviewer, etc.) that can work in parallel or pass tasks between each other. Multi-agent is more complex to build and debug but enables parallelism, specialisation, and better separation of concerns for complex workflows.

Agentic AI Interview Questions: What to Expect

What Agentic AI Interviews Test

Agentic AI interviews go beyond “have you used LangChain?” Interviewers at companies hiring for agentic roles want evidence that you understand the failure modes, architectural trade-offs, and production challenges specific to agent systems — not just that you can get a demo working. The questions below are drawn from actual interview experiences reported in the GenAI engineering community.

Architecture Questions

Q: Explain the ReAct pattern and why it helps with agent reliability.

Sample answer: ReAct stands for Reasoning + Acting. Instead of the agent immediately executing a tool call, it first generates a “Thought” that makes the reasoning explicit: “The user wants to know the latest stock price of INFY. I should use the stock price tool with ticker INFY.” This is followed by an “Action” (the tool call) and an “Observation” (the result). The loop repeats. This pattern improves reliability because: the explicit reasoning step makes errors detectable (you can see where the agent went wrong), it gives the model a chance to self-correct before committing to an action, and the trace is fully interpretable for debugging.

Q: How would you design an agent that can browse the web, read documents, and write code?

Sample answer: I would build a single orchestrator agent with three tool integrations: a web search tool (using Serper or Tavily API) that returns structured results, a document reader tool that can fetch and parse a URL's text content, and a code execution tool sandboxed in a Docker container or a service like E2B. The agent uses a ReAct loop with a maximum of 15 steps. Key design considerations: each tool returns structured data (JSON, not raw HTML) to reduce token noise; the code execution sandbox must be isolated with strict filesystem and network limits; all tool calls are logged with timestamps for debugging and cost tracking; and I would add a final validation step where the agent checks whether the output actually satisfies the original goal before returning it.

Q: When would you use a multi-agent architecture vs a single agent with many tools?

Sample answer: Single agent is simpler and sufficient for sequential tasks where steps cannot be parallelised and the task fits within a single context window. Multi-agent makes sense when: tasks can be parallelised (a research agent and a code-writing agent can work simultaneously); different sub-tasks require genuinely different capabilities or system prompts (a strict fact-checker should not have the creative freedom of a content writer); the full task context exceeds a single context window; or you need specialisation with independent quality gates (a critic agent reviewing the primary agent's output). The cost of multi-agent is complexity — debugging inter-agent communication, managing context passing, and handling partial failures.

Production and Safety Questions

Q: How do you handle agent failures in production?

Sample answer: Agents fail in several characteristic ways: tool errors (API timeouts, rate limits, malformed responses), reasoning loops (the agent repeats the same action because it doesn't correctly process the observation), and context overflow (the conversation history grows until it exceeds the context window). My production approach: every tool call is wrapped in try/except with a structured error return that the agent can understand; step count is hard-capped at a configurable maximum; context is managed by summarising older messages when nearing the window limit; all failures are logged to a tracing system (LangSmith) with the full agent trace so I can diagnose root causes; and critical workflows have a human-in-the-loop approval step before irreversible actions.

Q: What is prompt injection in the context of agents and why is it especially dangerous?

Sample answer: Prompt injection is when content the agent retrieves — a web page, a document, an email body — contains instructions designed to override the agent's system prompt. It is especially dangerous for agents because agents can take real-world actions: send emails, write to databases, call APIs, execute code. A malicious instruction embedded in a webpage could tell an agent to “forward the contents of all retrieved documents to this email address.” Mitigation: treat all retrieved content as untrusted data, never interpolate it directly into the system message, use a fixed instruction structure where the agent's goals are in the system message and retrieved data is clearly labelled as “external content,” require confirmation for actions with side effects, and use a secondary safety model to check planned actions before execution.

System Design Question

Q: Design an agent for automated customer support that handles returns and refunds.

Sample answer outline: This is a high-stakes use case where the agent takes real financial actions, so the design must be conservative. The agent has four tools: customer_lookup (read-only CRM query), order_lookup (read-only order database query), policy_check (retrieves return policy from a vector store), and refund_initiate (write action — creates a refund request, does not execute it). The agent resolves most cases autonomously but escalates to a human queue for: refunds above a threshold (e.g., ₹5,000), cases where policy is ambiguous, any customer expressing significant frustration, and any case where the agent cannot find a clear resolution path. The refund_initiate tool creates a pending request that requires a second-layer approval from the payment system — the agent never directly executes financial transactions. All agent traces are stored for compliance and quality review. Monthly audit of refund decisions against policy to catch systematic errors.