This guide helps you migrate existing agents from llama-3.2-90B-vision-instruct to gpt-oss-120b. In extensive testing across 256 production agents, GPT-OSS-120B delivered superior response times, error handling, conversational quality, and user experience when properly configured.

Why migrate to GPT-OSS-120B

GPT-OSS-120B offers significant advantages over Llama models:
  • Faster response times across all scenarios
  • Superior error handling with intelligent recovery and detailed explanations
  • Enhanced conversational efficiency through single-turn parameter collection
  • Better memory and context retention across multi-turn dialogues
  • Improved safety with robust out-of-scope request handling
  • More natural interactions that feel less robotic

Before you begin

Prerequisites

  • Access to watsonx Orchestrate (SaaS or Developer Edition)
  • Existing agents built with Llama models
  • Familiarity with agent configuration and prompt engineering

Understanding the differences

GPT-OSS-120B behaves differently from Llama models in several key ways:
| Aspect | Llama 3.2 90B | GPT-OSS-120B |
| --- | --- | --- |
| System prompt | Uses the standard watsonx Orchestrate system prompt | Does not use the standard system prompt |
| Knowledge preference | Balanced between internal and external knowledge | Prefers internal knowledge unless instructed otherwise |
| Parameter collection | Turn-by-turn questioning | Single-turn collection of multiple parameters |
| Error communication | Generic error messages | Context-aware, detailed explanations |
| Instruction following | Flexible interpretation | Literal, precise following of examples |
| Reasoning approach | Concise execution | More exploratory, with additional tool calls |

Migration process

Step 1: Review your agent configuration

Export your existing Llama-based agent to review its configuration:
orchestrate agents export -n <agent-name> -k native -o agent-backup.zip
Document the following elements:
  • Agent instructions and tone
  • Tool usage patterns
  • Knowledge base dependencies
  • Expected user interaction flows

Step 2: Update the LLM configuration

Modify your agent configuration to use GPT-OSS-120B:
llm: groq/openai/gpt-oss-120b
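In a full agent definition, the `llm` field sits alongside the agent's other metadata. The following is a minimal sketch of a native agent YAML; the field layout follows the native agent spec, and every value other than the `llm` line is an illustrative assumption, not taken from your configuration:

```yaml
# Hypothetical native agent definition; only the llm value is prescribed by this guide.
spec_version: v1
kind: native
name: hr_assistant                    # illustrative agent name
description: Answers HR questions using connected tools and knowledge bases.
llm: groq/openai/gpt-oss-120b         # updated from the previous Llama model
instructions: |
  # Add the GPT-OSS-120B-specific instruction blocks from Step 3 here.
```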

Step 3: Optimize agent instructions

GPT-OSS-120B requires explicit, model-specific instructions. Add the following blocks to your agent’s instructions based on your needs:

Essential instruction template

Use this comprehensive template as a starting point:
Behavior and sources:
- Identify as an agent operating within watsonx Orchestrate.
- Always use available tools to retrieve information before relying on your internal knowledge.
- If tools don't contain the answer, state "The available tools don't contain the answer" and ask one clarifying question.

Reasoning and brevity controls:
- Use concise reasoning with at most 3 reasoning steps before responding.
- Keep chit-chat to one sentence, then proceed.
- Limit final answers to ≤150 words unless the user requests detail.

Error handling:
- If required inputs are missing, ask for the minimum missing fields in a single question.
- When errors occur, explain what went wrong and suggest next steps.

Formatting:
- Use proper Markdown syntax for hyperlinks: [link text](url)
- Format responses clearly with appropriate structure.

Prioritize knowledge bases (if applicable)

If your agent uses knowledge bases, add this instruction:
Always check the connected knowledge base(s) first.
Prefer information retrieved from knowledge over your
own internal knowledge. If relevant content is found,
summarize it faithfully and cite the source title or document
name. If the knowledge base does not contain the answer, say
"I don't know based on the provided knowledge" and ask
a clarifying question.

Optimize tool usage

For agents with multiple tools, provide clear guidance:
Tool usage rules:
- Use the [tool_name] tool for [specific purpose].
- Call tools with all available information; don't ask for parameters you can infer.
- If a tool returns an error, analyze the error message and retry with corrections.
- If a tool fails after 2 attempts, inform the user and suggest alternatives.

Control agent routing (for supervisor agents)

If your agent delegates to other agents, use explicit action verbs:
Agent delegation:
- Call the [agent_name] agent when [specific condition].
- Execute the agent call immediately with available information.
- Do NOT ask for additional parameters before calling the agent unless absolutely required.

Step 4: Remove problematic patterns

GPT-OSS-120B can be over-constrained by overly specific examples. Review your instructions and update patterns like the following.
❌ Avoid:
When updating employee data, ask for:
1. Employee name
2. Department
3. Location
Then call the update_employee tool.
✅ Prefer:
When updating employee data, collect all required parameters
(name, department, location) in a single question, then call
the update_employee tool.
❌ Avoid:
If the user asks about weather, say "I cannot help with that."
If they ask about sports, say "That's outside my scope."
If they ask about news, say "I don't have access to that."
✅ Prefer:
If the user asks about topics outside your capabilities,
politely acknowledge the request and redirect to your
primary function.

Step 5: Test and validate

After updating your agent configuration:
  1. Test basic interactions:
    orchestrate chat ask --agent-name <agent-name> "Hello"
    
  2. Test tool calling: Verify that tools are called correctly with appropriate parameters.
  3. Test error scenarios: Ensure error messages are clear and recovery is intelligent.
  4. Test multi-turn conversations: Confirm context retention across multiple exchanges.
  5. Test edge cases: Validate behavior with incomplete information, out-of-scope requests, and ambiguous queries.

Step 6: Deploy and monitor

Import your updated agent:
orchestrate agents import -f updated-agent.yaml
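The `updated-agent.yaml` referenced above is whatever file holds your revised agent definition. A hedged sketch of what it might contain, assuming the native agent spec and reusing the instruction blocks from Step 3 (the name, description, and tool name are illustrative):

```yaml
spec_version: v1
kind: native
name: hr_assistant                    # illustrative
llm: groq/openai/gpt-oss-120b
description: Answers HR questions via connected tools and knowledge bases.
instructions: |
  Behavior and sources:
  - Identify as an agent operating within watsonx Orchestrate.
  - Always use available tools to retrieve information before relying on your internal knowledge.

  Reasoning and brevity controls:
  - Use concise reasoning with at most 3 reasoning steps before responding.
  - Limit final answers to ≤150 words unless the user requests detail.
tools:
  - update_employee                   # illustrative tool name
```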
Monitor initial usage for:
  • Response quality and accuracy
  • Tool call precision
  • User satisfaction
  • Error rates and recovery success

Common migration challenges

Challenge 1: Over-reliance on internal knowledge

Symptom: Agent provides answers from its training data instead of using tools or knowledge bases.
Solution: Add explicit knowledge prioritization instructions (see Step 3).

Challenge 2: Excessive parameter collection

Symptom: Agent asks for parameters before routing to specialized agents or calling tools.
Solution: Use strong negations in instructions:
Do NOT ask for [parameter_name] before calling [tool/agent].
Call immediately with available information.

Challenge 3: Literal example following

Symptom: Agent handles only the scenarios exactly as shown in examples.
Solution: Remove specific examples and use generic patterns instead.

Challenge 4: Tool call precision issues

Symptom: Agent makes irrelevant tool calls or misses required calls.
Solution:
  • Improve tool descriptions with clear use cases
  • Add explicit tool usage rules in instructions
  • Test iteratively and refine based on results

Challenge 5: Agent routing confusion

Symptom: Agent returns JSON instead of executing agent calls.
Solution: Change from implicit to explicit instructions:
❌ "Delegate to the appropriate agent"
✅ "Call the [agent_name] agent with the information you have"

Prompt engineering best practices

DO:

  • ✅ Use explicit action verbs (“Call”, “Execute”, “Use”)
  • ✅ Provide strong negations for unwanted behaviors
  • ✅ Keep examples generic and minimal
  • ✅ Trust the model’s reasoning capabilities
  • ✅ Leverage single-turn parameter collection
  • ✅ Test iteratively with real scenarios

DON’T:

  • ❌ Rely on implicit instructions (“delegate”, “route”)
  • ❌ Provide overly specific examples that constrain behavior
  • ❌ List exhaustive value options (the agent may limit itself)
  • ❌ Use weak negations for critical constraints
  • ❌ Force turn-by-turn parameter collection
  • ❌ Over-constrain conversational patterns

Performance optimization tips

Reduce response latency

Reasoning efficiency:
- Limit yourself to 3 reasoning steps maximum.
- Do not re-plan unless the last tool result contradicts prior assumptions.
- If you cannot progress after 3 steps, ask one focused question.

Improve conversational flow

Conversation style:
- Collect multiple related parameters in a single question.
- Avoid asking for information you can reasonably infer.
- Keep responses concise and action-oriented.

Enhance error recovery

Error handling strategy:
- When a tool fails, analyze the error message carefully.
- Retry with intelligent corrections based on the error.
- After 2 failed attempts, explain the issue and suggest alternatives.
- Extract useful information from partial failures when possible.

Validation checklist

Before considering your migration complete, verify:
  • Agent uses groq/openai/gpt-oss-120b as the LLM
  • Instructions include model-specific optimizations
  • Knowledge base prioritization is configured (if applicable)
  • Tool usage rules are explicit and clear
  • Agent routing uses explicit action verbs (if applicable)
  • Overly specific examples have been removed
  • Basic interactions work as expected
  • Tool calling is accurate and efficient
  • Error handling is clear and helpful
  • Multi-turn conversations maintain context
  • Edge cases are handled gracefully

Troubleshooting

If responses are too long or verbose, add output constraints to the instructions:
Target an answer length of 4-6 sentences (or ≤150 words).
Use bullet points only when they increase clarity.
Avoid repeating the prompt or restating obvious context.
If the agent relies on internal knowledge instead of retrieved content, strengthen knowledge prioritization:
CRITICAL: Always search the knowledge base first.
Never rely on internal knowledge when the knowledge base
might contain the answer. If you use internal knowledge
when the knowledge base was available, this is a failure.
If the agent makes excessive tool calls or responds slowly, add reasoning constraints:
Tool efficiency:
- Call only the minimum tools needed to answer the question.
- Do not explore alternative approaches unless the first fails.
- Stop after getting a successful result.
If problems persist, ensure your instructions are:
  • Explicit and unambiguous
  • Free from conflicting directives
  • Written in strong, clear language
  • Tested with multiple scenarios
