pocketflow/docs/guide.md

---
layout: default
title: "Design Guidance"
parent: "Apps"
nav_order: 1
---

# LLM System Design Guidance


## System Design Steps

1. **Project Requirements**: Understand what the project is for and what are required.

2. **Utility Functions**: LLM Systems are like the brain


   - Determine the utility functions on which this project depends (e.g., for LLM calls, web searches, file handling).
   - Implement these functions and write basic tests to confirm they work correctly.

> After this step, don't jump straight into building an LLM system.
>
> First, make sure you clearly understand the problem by manually solving it using some example inputs.
>
> It's always easier to first build a solid intuition about the problem and its solution, then focus on automating the process.
{: .warning }

3. **Flow Design**
   - Build a high-level design of the flow of nodes (for example, using a Mermaid diagram) to automate the solution.
   - For each node in your flow, specify:
     - **prep**: How data is accessed or retrieved.
     - **exec**: The specific utility function to use (ideally one function per node).
     - **post**: How data is updated or persisted.
   - Identify potential design patterns, such as Batch, Agent, or RAG.

4. **Data Structure**
   - Decide how you will store and update state (in memory for smaller applications or in a database for larger, persistent needs).
   - If it isn’t straightforward, define data schemas or models detailing how information is stored, accessed, and updated.
   - As you finalize your data structure, you may need to refine your flow design.

5. **Implementation**
   - For each node, implement the **prep**, **exec**, and **post** functions based on the flow design.
   - Start coding with a simple, direct approach (avoid over-engineering at first).
   - Add logging throughout the code to facilitate debugging.

6. **Optimization**
   - **Prompt Engineering**: Use clear, specific instructions with illustrative examples to reduce ambiguity.
   - **Task Decomposition**: Break large or complex tasks into manageable, logical steps.

7. **Reliability**
   - **Structured Output**: Ensure outputs conform to the required format. Consider increasing `max_retries` if needed.
   - **Test Cases**: Develop clear, reproducible tests for each part of the flow.
   - **Self-Evaluation**: Introduce an additional node (powered by LLMs) to review outputs when results are uncertain.

## Example LLM Project File Structure

```
my_project/
├── main.py
├── flow.py
├── utils/
│   ├── __init__.py
│   ├── call_llm.py
│   └── search_web.py
├── requirements.txt
└── docs/
    └── design.md
```

### `docs/`

Holds all project documentation. Include a `design.md` file covering:
- Project requirements
- Utility functions
- High-level flow (with a Mermaid diagram)
- Shared memory data structure
- Node designs:
  - Purpose and design (e.g., batch or async)
  - Data read (prep) and write (post)
  - Data processing (exec)

### `utils/`

Houses functions for external API calls (e.g., LLMs, web searches, etc.). It’s recommended to dedicate one Python file per API call, with names like `call_llm.py` or `search_web.py`. Each file should include:

- The function to call the API
- A main function to run that API call for testing

For instance, here’s a simplified `call_llm.py` example:

```python
from openai import OpenAI

def call_llm(prompt):
    client = OpenAI(api_key="YOUR_API_KEY_HERE")
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    prompt = "Hello, how are you?"
    print(call_llm(prompt))
```

### `main.py`

Serves as the project’s entry point.

### `flow.py`

Implements the application’s flow, starting with node followed by the flow structure.