# LLM Streaming and Interruption

Demonstrates real-time LLM response streaming with user interrupt capability.
## Features

- Real-time display of LLM responses as they're generated
- Interrupt generation at any time by pressing ENTER
## Run It

```bash
pip install -r requirements.txt
python main.py
```
## How It Works

`StreamNode`:

- Creates an interrupt listener thread
- Fetches content chunks from the LLM
- Displays chunks in real time
- Handles user interruption
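The flow above can be sketched as follows. This is a minimal illustration, not the repository's actual implementation: the `run` signature, the `interrupted` event, and the daemon listener thread are assumptions about how `StreamNode` might be structured.

```python
import sys
import threading


class StreamNode:
    """Prints streamed chunks until the user presses ENTER."""

    def __init__(self):
        self.interrupted = threading.Event()

    def _listen_for_interrupt(self):
        try:
            input()  # blocks until the user presses ENTER
        except EOFError:  # stdin closed (e.g. piped input); nothing to listen for
            return
        self.interrupted.set()

    def run(self, chunks):
        """Display chunks as they arrive; stop early if interrupted."""
        listener = threading.Thread(target=self._listen_for_interrupt, daemon=True)
        listener.start()
        shown = []
        for chunk in chunks:
            if self.interrupted.is_set():
                print("\n[Interrupted by user]")
                break
            sys.stdout.write(chunk)
            sys.stdout.flush()
            shown.append(chunk)
        return "".join(shown)
```

Making the listener a daemon thread means it never blocks program exit, and checking the event between chunks keeps the display loop responsive without locking.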
## API Key

By default, the demo uses fake streaming responses. To use real OpenAI streaming:

1. Edit `main.py` to replace `fake_stream_llm` with `stream_llm`:

   ```python
   # Change this line:
   chunks = fake_stream_llm(prompt)
   # To this:
   chunks = stream_llm(prompt)
   ```

2. Make sure your OpenAI API key is set:

   ```bash
   export OPENAI_API_KEY="your-api-key-here"
   ```
## Files

- `main.py`: `StreamNode` implementation
- `utils.py`: Real and fake LLM streaming functions
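A `utils.py` along these lines would provide the two streaming functions the README names. This is a hedged sketch, not the repository's actual code: the chunk delay, the canned response text, the `model` default, and the lazy `openai` import are all assumptions; the streaming call itself follows the official OpenAI Python SDK (`stream=True`, reading `chunk.choices[0].delta.content`).

```python
import os
import time


def fake_stream_llm(prompt):
    """Simulate streaming: yield a canned response one word at a time."""
    canned = "Here is a simulated answer to: " + prompt
    for word in canned.split():
        time.sleep(0.05)  # small delay to mimic network latency
        yield word + " "


def stream_llm(prompt, model="gpt-4o-mini"):
    """Stream a real completion from the OpenAI API, chunk by chunk."""
    from openai import OpenAI  # imported lazily so the fake path needs no SDK

    client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in response:
        delta = chunk.choices[0].delta.content
        if delta:
            yield delta
```

Both functions share the same generator interface, which is what lets `main.py` swap one for the other with a single-line change.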