Streaming

When streaming is enabled on a prediction request, tokens are sent as data-only server-sent events as they become available.

Using Python/TS Library

Flowise provides two libraries, one for Python and one for TypeScript. The following example uses the Python library:

from flowise import Flowise, PredictionData
 
def test_streaming():
    client = Flowise()
 
    # Test streaming prediction
    completion = client.create_prediction(
        PredictionData(
            chatflowId="<flow-id>",
            question="Tell me a joke!",
            streaming=True
        )
    )
 
    # Process and print each streamed chunk
    print("Streaming response:")
    for chunk in completion:
        # {event: "token", data: "hello"}
        print(chunk)
 
 
if __name__ == "__main__":
    test_streaming()

Each streamed event has the following form:

event: token
data: Once upon a time...
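
To make the wire format above concrete, here is a minimal sketch of how raw server-sent event lines could be grouped into events. It is not part of the Flowise library: the function name `parse_sse` is hypothetical, and the sketch assumes each event is a run of `event:` and `data:` lines terminated by a blank line, as in the sample above.

```python
from typing import Dict, Iterable, Iterator


def parse_sse(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Group raw SSE lines into {"event": ..., "data": ...} dicts (hypothetical helper)."""
    event: Dict[str, str] = {}
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line:
            # A blank line terminates the current event.
            if event:
                yield event
                event = {}
            continue
        if line.startswith(":"):
            continue  # SSE comment line, ignored
        field, _, value = line.partition(":")
        # The SSE format allows one optional space after the colon.
        value = value[1:] if value.startswith(" ") else value
        if field == "data" and "data" in event:
            # Several data lines in one event are joined with newlines.
            event["data"] += "\n" + value
        elif field in ("event", "data"):
            event[field] = value
    if event:
        yield event  # flush a trailing event that has no closing blank line


# Made-up sample input modeled on the event shown above.
sample = ["event: token", "data: Once upon a time...", ""]
for parsed in parse_sse(sample):
    print(parsed)
```

When using the Python library shown earlier, this step should not be needed, since the example iterates over chunks directly. The sketch is only relevant when reading the HTTP response yourself.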

A prediction’s event stream consists of the following event types:

| Event | Description |
| --- | --- |
| start | The start of streaming |
| token | Emitted when the prediction is streaming new token output |
| error | Emitted when the prediction returns an error |
| end | Emitted when the prediction finishes |
| metadata | All metadata of the related flow, such as chatId and messageId. Emitted after all tokens have finished streaming and before the end event |
| sourceDocuments | Emitted when the flow returns sources from a vector store |
| usedTools | Emitted when the flow used tools |
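
The event types above suggest a simple dispatch loop on the client side. The following is a minimal sketch, assuming each chunk has already been decoded into a dict with `event` and `data` keys (the shape hinted at by the comment in the Python example). `StreamResult` and `collect_stream` are hypothetical names, not part of the Flowise library, and the payload shapes for `metadata`, `sourceDocuments`, and `usedTools` are assumptions.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable, List, Optional


@dataclass
class StreamResult:
    """Accumulated state of one streamed prediction (hypothetical container)."""
    text: str = ""
    metadata: Optional[Any] = None
    source_documents: List[Any] = field(default_factory=list)
    used_tools: List[Any] = field(default_factory=list)
    error: Optional[Any] = None
    finished: bool = False


def collect_stream(events: Iterable[Dict[str, Any]]) -> StreamResult:
    """Fold a sequence of decoded events into a single result."""
    result = StreamResult()
    for ev in events:
        kind, data = ev.get("event"), ev.get("data")
        if kind == "token":
            result.text += data or ""          # append each new token
        elif kind == "metadata":
            result.metadata = data             # arrives after tokens, before end
        elif kind == "sourceDocuments":
            result.source_documents = data or []
        elif kind == "usedTools":
            result.used_tools = data or []
        elif kind == "error":
            result.error = data
            break                              # stop reading after an error
        elif kind == "end":
            result.finished = True
            break                              # stream is complete
        # "start" and unknown event types carry nothing to accumulate
    return result


# Made-up events for illustration only.
demo = [
    {"event": "start", "data": ""},
    {"event": "token", "data": "Once "},
    {"event": "token", "data": "upon"},
    {"event": "metadata", "data": {"chatId": "abc"}},
    {"event": "end", "data": ""},
]
print(collect_stream(demo).text)
```

Collecting everything into one object keeps the token text separate from the side-channel events, so a UI can render tokens as they arrive and attach sources or tool calls once the stream ends.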

Streamlit App

An example Streamlit app is available at: https://github.com/HenryHengZJ/flowise-streamlit