feat(live): Support live mode of sequential agent

Add a `task_completed` function to the agent so that when the model finishes its task, it can send a signal and the program knows it can move on to the next agent.

This CL includes:
* Implement `_run_live_impl` in `sequential_agent` so it can handle the live case.
* Add an example for the sequential agent.
* Improve the error message for the unimplemented `_run_live_impl` in other agents.

Note:
1. Compared to the non-live case, live agents process a continuous stream of audio
or video, so they have no native way to tell whether they are finished and should
hand off to the next agent. We therefore introduce a task_completed() function that
the model can call to signal that it has finished its task and that we
can move on to the next agent.

2. Live agents don't seem to be very useful or natural in parallel or loop agents, so we don't implement live mode for them for now. If there is user demand, we can implement it easily using a similar approach.
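
The handoff described in note 1 can be pictured with a small, self-contained simulation. This is only a sketch under stated assumptions: `FakeLiveAgent`, `run_sequential`, and the event strings are hypothetical stand-ins, not ADK classes, and the real change relies on the model calling the `task_completed` tool rather than on a scripted event list.

```python
import asyncio
from typing import AsyncGenerator, List

TASK_COMPLETED = "task_completed"  # Name of the signalling tool in this commit.


class FakeLiveAgent:
  """Hypothetical stand-in for a live sub-agent; not an ADK class."""

  def __init__(self, name: str, scripted_events: List[str]):
    self.name = name
    self.scripted_events = scripted_events

  async def run_live(self) -> AsyncGenerator[str, None]:
    # A live stream has no natural end, so the agent keeps emitting events
    # until the (simulated) model calls the completion tool.
    for event in self.scripted_events:
      yield f"{self.name}: {event}"
      if event == TASK_COMPLETED:
        return  # The completion signal ends this agent's turn.


async def run_sequential(agents: List[FakeLiveAgent]) -> List[str]:
  """Runs each agent in order, moving on once the previous one finishes."""
  seen = []
  for agent in agents:
    async for event in agent.run_live():
      seen.append(event)
  return seen


events = asyncio.run(
    run_sequential([
        FakeLiveAgent("greeter", ["audio chunk", TASK_COMPLETED, "never sent"]),
        FakeLiveAgent("planner", ["audio chunk", TASK_COMPLETED]),
    ])
)
```

In this simulation the "never sent" event is dropped because the first agent stops as soon as the completion signal appears, which is the behaviour the note asks the model to trigger explicitly.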

PiperOrigin-RevId: 758315430
Hangfei Lin
2025-05-13 11:55:50 -07:00
committed by Copybara-Service
parent 39f78dc28f
commit 4188673b0f
7 changed files with 180 additions and 19 deletions
+1 -1
@@ -58,5 +58,5 @@ class LoopAgent(BaseAgent):
   async def _run_live_impl(
       self, ctx: InvocationContext
   ) -> AsyncGenerator[Event, None]:
-    raise NotImplementedError('The behavior for run_live is not defined yet.')
+    raise NotImplementedError('This is not supported yet for LoopAgent.')
     yield  # AsyncGenerator requires having at least one yield statement
+7
@@ -94,3 +94,10 @@ class ParallelAgent(BaseAgent):
     agent_runs = [agent.run_async(ctx) for agent in self.sub_agents]
     async for event in _merge_agent_run(agent_runs):
       yield event
+
+  @override
+  async def _run_live_impl(
+      self, ctx: InvocationContext
+  ) -> AsyncGenerator[Event, None]:
+    raise NotImplementedError("This is not supported yet for ParallelAgent.")
+    yield  # AsyncGenerator requires having at least one yield statement
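
The trailing `yield` in these stubs is never reached, but its presence is what makes the function an async generator. A minimal sketch of that behaviour, using a hypothetical `not_supported` function rather than the agent classes themselves: calling it returns an async generator without raising, and the `NotImplementedError` surfaces only when the caller starts iterating.

```python
import asyncio
import inspect
from typing import AsyncGenerator


async def not_supported() -> AsyncGenerator[str, None]:
  raise NotImplementedError("This is not supported yet.")
  yield  # Unreachable, but its presence makes this an async generator.


async def first_event() -> str:
  gen = not_supported()  # No exception here; the body has not run yet.
  assert inspect.isasyncgen(gen)
  try:
    async for _ in gen:
      pass
  except NotImplementedError as e:
    return f"raised: {e}"
  return "no error"


result = asyncio.run(first_event())
```

Without the `yield`, the function would be a plain coroutine, and a caller using `async for` on its return value would fail with a `TypeError` instead of the intended message.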
+31
@@ -23,6 +23,7 @@ from typing_extensions import override
 from ..agents.invocation_context import InvocationContext
 from ..events.event import Event
 from .base_agent import BaseAgent
+from .llm_agent import LlmAgent
 class SequentialAgent(BaseAgent):
@@ -40,6 +41,36 @@ class SequentialAgent(BaseAgent):
   async def _run_live_impl(
       self, ctx: InvocationContext
   ) -> AsyncGenerator[Event, None]:
+    """Implementation for live SequentialAgent.
+
+    Compared to the non-live case, live agents process a continuous stream of
+    audio or video, so they have no way to tell whether they are finished and
+    should hand off to the next agent. We therefore introduce a
+    task_completed() function that the model can call to signal that it has
+    finished its task and that we can move on to the next agent.
+
+    Args:
+      ctx: The invocation context of the agent.
+    """
+    # There is no way to know whether live mode is in use during the init
+    # phase, so the tool has to be set up here.
+    for sub_agent in self.sub_agents:
+      # Add the task_completed tool.
+      def task_completed():
+        """
+        Signals that the model has successfully completed the user's question
+        or task.
+        """
+        return "Task completion signaled."
+
+      if isinstance(sub_agent, LlmAgent):
+        # Use function name to dedupe.
+        if task_completed.__name__ not in sub_agent.tools:
+          sub_agent.tools.append(task_completed)
+          sub_agent.instruction += f"""If you finished the user's request
+          according to its description, call the {task_completed.__name__} function
+          to exit so the next agents can take over. When calling this function,
+          do not generate any text other than the function call."""
+
     for sub_agent in self.sub_agents:
       async for event in sub_agent.run_live(ctx):
         yield event
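
One detail of the dedupe check may be worth a second look. Assuming `sub_agent.tools` holds callables (or tool objects) rather than names, comparing `task_completed.__name__`, a string, against that list never matches, so the tool and the extra instruction text would be appended again on every live run. The sketch below illustrates the difference with a plain list; `has_tool_named` and `add_once` are hypothetical helpers, not part of ADK, and they only handle entries that expose `__name__` or `name`.

```python
from typing import Any, Callable, List


def task_completed() -> str:
  """Signals that the model has completed the user's question or task."""
  return "Task completion signaled."


def has_tool_named(tools: List[Any], name: str) -> bool:
  """Hypothetical helper: True if any tool in the list goes by `name`."""
  return any(
      getattr(tool, "__name__", getattr(tool, "name", None)) == name
      for tool in tools
  )


def add_once(tools: List[Callable[..., Any]], tool: Callable[..., Any]) -> None:
  """Appends `tool` unless a tool with the same name is already present."""
  if not has_tool_named(tools, tool.__name__):
    tools.append(tool)


tools: List[Callable[..., Any]] = []
add_once(tools, task_completed)
add_once(tools, task_completed)  # Second call is a no-op.

# A string is never equal to a function object, so this check is always True.
string_check_misses = task_completed.__name__ not in tools
```

Comparing by name rather than by identity matters here because `task_completed` is redefined on each loop iteration, so two registrations of it would never be the same object.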