Implementing Complex Information Flows with LangGraph in Multi-Agent LLM Systems
This section focuses on implementing complex information flows using LangGraph in multi-agent LLM systems. In Part 1, we discussed the usefulness of multi-agent systems and how to implement them with AutoGen.
LangChain is a widely used framework for developing LLM applications. It serves as the basis for LangGraph and provides:
- A vast selection of pre-defined text extraction tools
- Language models
- Other resources
- A hierarchy of classes
Getting started with LangChain
Typically, LangChain and LangGraph are based on the concept of chains: a `PromptTemplate` is usually combined with an LLM and, optionally, an output parser that validates the result. The simplest way to achieve this is by using LCEL (the LangChain Expression Language). Although it may take some time to get used to, it allows for concise and standardised code.
```python
from langchain.output_parsers.boolean import BooleanOutputParser
from langchain.prompts import PromptTemplate
from langchain_openai import AzureChatOpenAI

prompt = PromptTemplate.from_template(
    """Decide whether the user question was sufficiently answered within the chat history. Answer only with YES or NO!
Sentences like "I don't know" or "There is no information" are not sufficient answers.
chat history: {messages}
user question: {user_question}
"""
)

llm = AzureChatOpenAI(
    openai_api_version="2023-12-01-preview",
    azure_deployment="gpt-35-turbo",
    streaming=True,
)

parser = BooleanOutputParser()

chain = prompt | llm | parser

# All prompt variables must be passed when invoking the chain
chain.invoke({
    "user_question": "What are the current trends in Italy?",
    "messages": ["the current trends in Italy today are Formula 1 and ChatGPT"],
})  # Returns True
```
Empowering Agents through Function Calling
LangChain supports the execution of functions or tools. To execute a function, it must first be converted into a LangChain tool, which can be done:
- Explicitly
- Via function annotation
The latter has the advantage that the function's docstring is converted directly into information for the language model, which avoids redundancy and is very straightforward, as shown in the following example:
```python
from pytrends.request import TrendReq

def get_google_trends(country_name='germany', num_trends=5):
    """
    Fetches the current top trending searches for a given country from Google Trends.

    Parameters:
    - country_name (str): The English name of the country, written in lowercase
    - num_trends (int): Number of top trends to fetch. Defaults to 5.

    Returns:
    - A list of the top trending searches.
    """
    pytrends = TrendReq(hl='en-US', tz=360)
    try:
        trending_searches_df = pytrends.trending_searches(pn=country_name)
        top_trends = trending_searches_df.head(num_trends)[0].to_list()
        return top_trends
    except Exception as e:
        print(f"An error occurred: {e}")

from langchain.tools import StructuredTool

google_trends_tool = StructuredTool.from_function(get_google_trends)
```
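For comparison, the annotation-based variant could look like this — a minimal sketch using LangChain's `@tool` decorator, with the function body elided:

```python
from langchain.tools import tool

# Sketch of the annotation-based variant: the @tool decorator turns the
# function (including its docstring) into a LangChain tool directly.
@tool
def get_google_trends(country_name: str = "germany", num_trends: int = 5) -> list:
    """Fetches the current top trending searches for a given country from Google Trends."""
    ...  # same implementation as above
```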
Once created, the tool must be passed to the model. If ChatGPT is used, the model supports native function calling, so the tool only needs to be activated via the `bind_functions` call. The model can then trigger a corresponding function call when necessary.
However, to execute the function automatically and return the results to the model, the chain must be wrapped in an agent. For this purpose, there is a separate class, the `AgentExecutor`, which requires:
- A name
- The necessary tools to execute the chain
```python
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain.agents import AgentExecutor, create_openai_functions_agent
from langchain.agents.output_parsers.openai_functions import OpenAIFunctionsAgentOutputParser
from langchain.agents.format_scratchpad.openai_functions import (
    format_to_openai_function_messages,
)
from langchain_core.utils.function_calling import convert_to_openai_function
from langchain_core.runnables import RunnablePassthrough
from langchain_community.tools.tavily_search import TavilySearchResults

tavily_tool = TavilySearchResults(max_results=5)
tools = [google_trends_tool, tavily_tool]

system_prompt = "Your task is to get information on the current trends by using your tools."
prompt = ChatPromptTemplate.from_messages([
    ("system", system_prompt),
    MessagesPlaceholder(variable_name="messages"),
    MessagesPlaceholder(variable_name="agent_scratchpad"),
])

# Convenience constructor ...
agent = create_openai_functions_agent(llm, tools, prompt)

# ... or the equivalent explicit construction: bind the tools to the model
# and format the intermediate steps into the agent scratchpad
llm_with_tools = llm.bind(functions=[convert_to_openai_function(t) for t in tools])
agent = (
    RunnablePassthrough.assign(
        agent_scratchpad=lambda x: format_to_openai_function_messages(
            x["intermediate_steps"]
        )
    )
    | prompt
    | llm_with_tools
    | OpenAIFunctionsAgentOutputParser()
)

executor = AgentExecutor(name="Analyzer", agent=agent, tools=tools)
```
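The executor can then be invoked like any other Runnable — a minimal sketch with an illustrative question:

```python
from langchain_core.messages import HumanMessage

# The prompt above expects a "messages" variable; the agent decides
# on its own which of the two tools to call.
result = executor.invoke({"messages": [HumanMessage(content="What is currently trending in Germany?")]})
print(result["output"])
```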
Extending Single LangChain Agents to Multi-Agent Systems in LangGraph
LangChain version 0.1 introduced LangGraph, a concept for implementing multi-agent systems. LangGraph organizes communication through a graph instead of a free exchange of messages and a shared chat history.
The interface is based on the NetworkX Python library, allowing for flexible composition of directed graphs, which may also be cyclical.
- First, a graph with a defined state is created.
- Then, nodes and edges are added to the graph, and a starting point is selected.
- The graph's dynamics are determined by either static or conditional edges.
- Both nodes and conditional edges can be simple Python functions or determined using an LLM call.
- These functions receive the current state and return a new state for the next node.
- Finally, all nodes and edges are compiled into a `Pregel` object (see the sketch below).
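A minimal sketch of these steps, reusing the `executor` from above; the state schema, node names, and termination condition are illustrative, not prescribed by LangGraph:

```python
import operator
from typing import Annotated, TypedDict

from langchain_core.messages import AIMessage, BaseMessage
from langgraph.graph import StateGraph, END

class AgentState(TypedDict):
    # operator.add tells LangGraph to append updates instead of replacing them
    messages: Annotated[list[BaseMessage], operator.add]

def analyze(state: AgentState) -> dict:
    # Nodes receive the current state and return a partial state update
    result = executor.invoke({"messages": state["messages"]})
    return {"messages": [AIMessage(content=result["output"], name="Analyzer")]}

def route(state: AgentState) -> str:
    # Conditional edges can be plain Python functions or an LLM call
    return "done" if len(state["messages"]) >= 4 else "analyze"

graph = StateGraph(AgentState)
graph.add_node("analyze", analyze)
graph.set_entry_point("analyze")
graph.add_conditional_edges("analyze", route, {"analyze": "analyze", "done": END})

app = graph.compile()  # yields a Pregel object
```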
The Pregel graph implements the LangChain Runnable Interface. It can be executed:
- Synchronously or asynchronously
- As a stream or batch operation
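For instance, streaming the intermediate state updates node by node — a sketch based on the standard Runnable interface, reusing `app` and `HumanMessage` from the example above:

```python
# invoke(), batch(), and the async variants work the same way
for step in app.stream({"messages": [HumanMessage(content="What is trending right now?")]}):
    print(step)
```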
Monitoring of Agents through LangSmith
LangChain aims to hide complexity from the user, which can make debugging more difficult. LangSmith, the monitoring solution developed by LangChain, helps in this regard. Integration requires only an API key.
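Assuming the standard LangSmith environment variables, the setup is a one-off configuration (key and project name are placeholders):

```python
import os

# Placeholders: substitute your own API key and project name
os.environ["LANGCHAIN_TRACING_V2"] = "true"
os.environ["LANGCHAIN_API_KEY"] = "<your-langsmith-api-key>"
os.environ["LANGCHAIN_PROJECT"] = "multi-agent-demo"  # optional
```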
After obtaining the key, all events are streamed to the cloud and displayed in a user-friendly WebUI. This provides quick insight into all executed operations, such as:
- LLM API calls
- Executed tools
- Errors that have occurred
The system tracks:
- Execution times
- Generated tokens and their costs
- Various other system metadata
Additionally, custom key-value pairs can be logged.
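For example — assuming the standard Runnable `config` argument is used for this — custom key-value pairs and tags can be attached to a traced run (the keys shown are illustrative):

```python
# Reusing app and HumanMessage from the graph sketch above
result = app.invoke(
    {"messages": [HumanMessage(content="What is trending in Italy?")]},
    config={"metadata": {"customer_id": "1234"}, "tags": ["trend-analysis"]},
)
```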
To gain more control, or to avoid using LangSmith, you can register your own callback in the workflow to handle the events that occur within it. Callbacks are also useful for integrating your own user interface.
The implementation can be complex, however, as there are 14 different event types (see the handler sketch after this list) that are relevant for:
- Start of various actions
- End of various actions
- Errors of various actions
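A minimal sketch of such a handler, implementing a few of LangChain's `BaseCallbackHandler` hooks; the print statements stand in for a real UI integration:

```python
from langchain_core.callbacks import BaseCallbackHandler

class UIEventHandler(BaseCallbackHandler):
    """Forwards selected events; a real handler would feed a UI instead of printing."""

    def on_llm_start(self, serialized, prompts, **kwargs):
        print(f"LLM call started with {len(prompts)} prompt(s)")

    def on_tool_start(self, serialized, input_str, **kwargs):
        print(f"Tool {serialized.get('name')} started with input: {input_str}")

    def on_tool_end(self, output, **kwargs):
        print(f"Tool finished: {output}")

    def on_chain_error(self, error, **kwargs):
        print(f"Chain failed: {error}")

# Handlers are passed via the config, e.g.:
# app.invoke(inputs, config={"callbacks": [UIEventHandler()]})
```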
Unfortunately, we couldn't extract all the desired information in every event. In some cases, we had to resort to linked parent events.
To address this issue, we developed a graph callback that can also help visualize the complexity of the hidden calls within a LangGraph call.
Comparison between LangGraph and AutoGen
| | AutoGen | LangGraph |
| --- | --- | --- |
| Project status | AutoGen is a popular framework for multi-agent systems. Microsoft is driving the project, which is based on a scientific paper. | In January 2024, the multi-agent solution for the popular LLM framework LangChain was released. It combines initial experience from AutoGen with approaches from established open-source projects such as NetworkX and Pregel. LangGraph will continue to be a component of the LangChain ecosystem. |
| Function calling | AutoGen uses a user proxy agent to execute all functions, either natively or within a Docker container for added security. | LangChain allows functions and agents to be transformed into executables, simplifying the structure. However, this feature does not currently provide any additional virtualization. |
| Message flow | Agents communicate through a free exchange of messages and a shared chat history. | Communication is represented by a graph. This makes it easy and intuitive to map specific communication paths. Conditional edges can also be used to map open group conversations between agents. |
| Usability | AutoGen makes it easy to use multiple agents with its examples and AutoGen Studio. However, modifying more than just prompts and tools requires extending the actual agent classes, which can make upgrades and maintenance more challenging. | LangChain is a powerful framework that tries to hide complexity from the user, but requires the user to learn many framework peculiarities. The high level of abstraction is often a barrier, especially at the beginning. However, once a user has understood the specifics of LangChain, using LangGraph is easy and intuitive. |
| Maturity | AutoGen is a good starting point for multi-agent projects. However, it may be challenging to implement productive use cases due to unreliable group conversations and a lack of monitoring support. The shared chat history of the agents can cause executed prompts to become lengthy, resulting in slow and costly processes. | LangGraph is young software with a solid foundation. The LangChain ecosystem provides various output parsers and error-management options. LangGraph allows for precise control over the availability of information for each node and can flexibly map business requirements for the communication flow. Additionally, it is supported by LangChain's serving and monitoring infrastructure. |
Conclusion
Multi-agent systems are useful for building complex autonomous or semi-autonomous systems. They allow for the definition of a specific agent for each subtask, including:
- Prompting
- Model selection
- Configuration
AutoGen has made a valuable contribution to multi-agent systems and is well-suited for initial experiments. However, if you require more precise control over agent communication or need to build a productive application, LangGraph is a better choice.
We recently converted an application from AutoGen to LangGraph:
- The implementation of the agents and tools was relatively simple.
- However, the biggest effort was migrating the UI connection to map all necessary information regarding tool and LLM usage via a LangGraph callback.
- In contrast, deployment through LangServe is very straightforward.
Additionally, neither framework natively supports parallel execution of agents if the results are to be merged afterwards.
LangGraph and LangSmith can be used to create and operate complex workflows involving LLMs. They offer a range of possibilities for managing and processing language data.