Gemini Model
Overview
Gemini is a multimodal model: it accepts text and reference media and returns text. In StoryFlow, Gemini 3 (Pro/Flash) acts as the “brain” for planning and prompt refinement.
Capabilities
- Reasoning & Story Creation: Turn ideas into outlines, shot lists, and character settings.
- Multimodal Understanding: Analyze reference images or video and produce detailed textual descriptions (reverse prompting).
- Low Latency (Flash): The Flash variant is optimized for fast responses.
Inputs
- Text (Prompt): Natural language from simple notes to complex scripts.
- Reference Image / Video: Upload media for analysis (e.g., “Describe this image’s style”).
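As a rough illustration of how the two input types fit together, the sketch below bundles a text prompt with optional reference media into one ordered list of parts. The names `GeminiNodeInput` and `build_parts` and the dictionary layout are hypothetical, chosen only for this example; they are not StoryFlow's or Gemini's actual API.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class GeminiNodeInput:
    """Hypothetical container for what a Gemini node receives."""
    prompt: str                                            # natural-language instruction
    media_paths: List[str] = field(default_factory=list)   # optional reference image/video files


def build_parts(node_input: GeminiNodeInput) -> List[Dict[str, str]]:
    """Return an ordered list of parts: reference media first, then the text prompt."""
    parts = [{"type": "media", "path": p} for p in node_input.media_paths]
    parts.append({"type": "text", "text": node_input.prompt.strip()})
    return parts


# Example: a reverse-prompting request for one reference image.
request = build_parts(
    GeminiNodeInput(prompt="Describe this image's style", media_paths=["ref.png"])
)
```

Placing the media before the instruction is a design choice made for this sketch, not a documented requirement.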
Parameters
The Gemini node is driven mainly by its text prompt; no complex settings are typically required.
Tips
- Use Gemini to draft prompts, then pass results into Banana or Sora.
- Combine multiple Text nodes upstream for richer context.
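The two tips above can be sketched as a small helper that merges the output of several upstream Text nodes into one context block and then wraps it in a drafting instruction for Gemini. The function names, the separator, and the instruction wording are assumptions made for illustration; StoryFlow may combine upstream text differently.

```python
from typing import Iterable


def combine_text_nodes(texts: Iterable[str], separator: str = "\n\n") -> str:
    """Join upstream text snippets, skipping empty or whitespace-only ones."""
    cleaned = [t.strip() for t in texts if t and t.strip()]
    return separator.join(cleaned)


def build_refinement_request(context: str, target: str = "image") -> str:
    """Wrap the combined context in an instruction asking for one generation prompt."""
    return (
        f"Using the context below, write one detailed {target} generation prompt.\n\n"
        f"Context:\n{context}"
    )


# Example: three upstream Text nodes (one empty) feeding a single Gemini request.
context = combine_text_nodes(
    ["A lighthouse keeper finds a message in a bottle.", "", "Style: muted watercolor."]
)
gemini_prompt = build_refinement_request(context, target="video")
```

The text Gemini returns for `gemini_prompt` is what would then be passed downstream to Banana or Sora.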