
Gemini Model

Overview

Gemini is a multimodal model that accepts text and images. In StoryFlow, Gemini 3 (Pro or Flash) acts as the “brain”: it plans the story and refines prompts before they are passed to downstream models such as Banana or Sora.

Capabilities

  • Reasoning & Story Creation: Turn ideas into outlines, shot lists, and character settings.
  • Multi‑modal Understanding: Analyze images; produce detailed textual descriptions (reverse prompting).
  • Low Latency (Flash): The Flash variant is optimized for fast responses.

Inputs

  • Text (Prompt): Natural language from simple notes to complex scripts.
  • Reference Image / Video: Upload media for analysis (e.g., “Describe this image’s style”).

Parameters

Gemini is controlled mainly through its text prompt, so it typically requires no complex settings.

Tips

  • Use Gemini to draft prompts, then pass results into Banana or Sora.
  • Combine multiple Text nodes upstream for richer context.
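
To illustrate the second tip, here is a minimal sketch of how several upstream text snippets might be merged into one prompt before it reaches Gemini. The function name combine_text_nodes, the "[Context N]" labels, and the "Task:" line are hypothetical choices made for this example; StoryFlow may merge Text nodes differently.

```python
def combine_text_nodes(snippets, task):
    """Merge upstream text snippets and a task into one prompt string.

    Hypothetical helper for illustration only; not part of StoryFlow.
    """
    # Drop empty or whitespace-only snippets and trim the rest.
    cleaned = [s.strip() for s in snippets if s and s.strip()]

    # Label each snippet so the model can tell the sources apart.
    context = "\n".join(
        f"[Context {i}] {text}" for i, text in enumerate(cleaned, start=1)
    )

    # With no usable context, send the task on its own.
    if not context:
        return f"Task: {task.strip()}"
    return f"{context}\n\nTask: {task.strip()}"


if __name__ == "__main__":
    prompt = combine_text_nodes(
        [
            "A lighthouse keeper finds a message in a bottle.",
            "Style: muted watercolor, 1920s coastal town.",
        ],
        "Write a five-shot list with one image prompt per shot.",
    )
    print(prompt)
```

Keeping each snippet labeled and placing the task last is one way to keep the request unambiguous when the context comes from several nodes; the drafted output can then be passed into Banana or Sora as described above.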