Documentation Index
Fetch the complete documentation index at: https://na-36-handover-docs-v2-into-docs-v2-dev-20260518.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
ComfyStream is a Livepeer-maintained extension of ComfyUI that swaps batch image generation for a real-time video loop. A ComfyUI workflow that takes an image and returns an image becomes a live-video-to-video pipeline when run through ComfyStream: video frames flow in over WebRTC, the workflow processes each frame, transformed frames flow back out at sub-second latency. Phase 4 (January 2026) hardened ComfyStream for production. The runtime added audio processing, data-channel output, dynamic workflow warm-up, and PyTrickle-based BYOC packaging. Daydream and Embody both run on ComfyStream infrastructure. The canonical install reference is
docs.comfystream.org. The repository is livepeer/comfystream. The current Docker image is livepeer/comfystream, with livepeer/comfyui-base:stable as the BYOC base.
Pipeline Modes
ComfyStream workflows produce one of four output types. Every workflow declares its output mode through the nodes it composes.| Mode | Input | Output | Representative node |
|---|---|---|---|
| Image-to-image (live) | Live video frames | Transformed video frames | StreamDiffusionSampler |
| Video-to-video | Video segment | Processed video | StreamDiffusion V2 |
| Audio processing | Audio track from stream | Audio (pass-through or transformed) | LoadAudioTensor |
| Data-channel output | Audio or video frames | Structured text alongside video | AudioTranscription + data output node |
Node Ecosystem
ComfyStream uses standard ComfyUI custom nodes. Any node that executes per-frame without maintaining incompatible state runs in a real-time workflow.Core I/O Nodes
Required for every ComfyStream workflow. They handle the real-time tensor handoff between the stream and the ComfyUI graph.| Node | Source | Purpose |
|---|---|---|
LoadTensor | livepeer/comfystream | Load a video frame tensor from the live stream |
LoadAudioTensor | livepeer/comfystream | Load an audio frame tensor for audio-aware processing |
Real-Time Control Nodes
These nodes update their output on every workflow execution, which makes them suitable for animating parameters across a continuous stream.| Node | Source | Purpose |
|---|---|---|
FloatControl | ComfyUI_RealtimeNodes | Outputs a float that changes over time (sine, bounce, random) |
IntControl | ComfyUI_RealtimeNodes | Same as FloatControl for integer values |
StringControl | ComfyUI_RealtimeNodes | Cycles through a list of strings per frame |
FloatSequence | ComfyUI_RealtimeNodes | Cycles through comma-separated float values |
IntSequence | ComfyUI_RealtimeNodes | Cycles through comma-separated integer values |
| Motion detection nodes | ComfyUI_RealtimeNodes | Detect motion between frames; can trigger parameter changes |
StreamDiffusion Nodes (Phase 4)
The primary generative video nodes. Ported from Daydream’s StreamDiffusion pipeline.| Node | Purpose |
|---|---|
StreamDiffusionCheckpoint | Loads a StreamDiffusion checkpoint model. Use with SD1.5 or SDXL |
StreamDiffusionConfig | Configures CFG, t-index, acceleration mode |
StreamDiffusionSampler | Runs StreamDiffusion inference per frame |
StreamDiffusionLPCheckpointLoader | Alternative checkpoint loader for Livepeer-hosted models |
StreamDiffusionTensorRTEngineLoader | Loads a pre-compiled TensorRT engine. Not compatible with all ControlNets |
Phase 4 Additions
- SuperResolution. Real-time video upscaling. Input: standard-resolution frame. Output: upscaled frame.
- AudioTranscription. Whisper-based real-time speech transcription. Two output modes: SRT subtitles burned into video, or text delivered to the application via WebRTC data channel.
Workflow Format
ComfyStream requires workflows in ComfyUI API format, not the default save format. The default ComfyUI export includes layout metadata that ComfyStream does not parse. To export a workflow in API format:- Enable Developer Mode in ComfyUI settings.
- Use Save (API Format) to produce the JSON file.
workflows/ directory. For Docker deployments, mount this directory as a volume. The canonical workspace layout is in docs.comfystream.org.
When the workflow loads in the ComfyStream UI, the server compiles TensorRT engines for the relevant nodes. First run takes between two and ten minutes depending on the model and the GPU. Subsequent loads skip compilation.
Data-Channel Output
Phase 4 added a structured-text output path that runs alongside video. The ComfyStream WebRTC connection extends with a data channel; workflows containing a data output node emit text to the browser or application that connects to the server. Use cases:- Real-time audio transcription delivered as text to a downstream application
- Frame-level metadata (object labels, confidence scores) delivered to an overlay UI
- Any workflow where the output is data, not video
Performance Characteristics
ComfyStream compiles TensorRT engines and runstorch.compile on model components at first run. This is a one-time cost per workflow on each machine.
| Operation | Duration | Frequency |
|---|---|---|
| TensorRT compilation | 2-10 minutes | First run per machine, per workflow |
torch.compile (ControlNet, VAE) | On first frame | First frame per session |
| Subsequent workflow loads | Immediate | All later runs |
- SD1.5 + DMD one-step + DepthControlNet workflow: 14-15 fps at 640x360 input
- StreamDiffusion with TensorRT: higher throughput at the same resolution
Hardware Requirements
ComfyStream requires an NVIDIA GPU. The server component runs on Linux only; Windows and macOS are not supported for the server, though the browser client runs anywhere.| Workload | Minimum VRAM | Recommended |
|---|---|---|
| Real-time AI (ComfyStream) | 12 GB | 16 GB+ |
Relationship to BYOC
ComfyStream is itself BYOC-compatible. Phase 4 integrated ComfyStream with PyTrickle, which means thelivepeer/comfystream Docker image can register directly as a BYOC capability on an orchestrator without rewriting the workflow as a custom container.
| If you want to | Use |
|---|---|
| Run a ComfyUI workflow as a real-time pipeline | ComfyStream directly |
| Run a custom Python model that isn’t a ComfyUI workflow | |
| Run multiple ComfyStream workflows from one orchestrator | ComfyStream’s multi-pipeline mode (Phase 4) |
| Earn fees from your ComfyStream instance | Register as a BYOC capability on go-livepeer |
Next Steps
ComfyStream Quickstart
Docker, RunPod, or local install. First real-time AI effect on a webcam in fifteen minutes.
Workflow Authoring
Build a custom workflow, configure StreamDiffusion, tune for latency.
ComfyStream as BYOC
Register a ComfyStream instance as a BYOC capability and earn fees.
docs.comfystream.org
Canonical install reference, hardware deep-dive, troubleshooting.