AI Scenario

How it works

The scenario generation uses Claude (via OpenRouter) to:

Analyze document structure — identify the logical flow of steps, sections, and key information

Interpret screenshots — understand what each screenshot shows and how it relates to the surrounding text

Write narration — create natural-sounding narration text for each slide, matching the tutorial tone

Set timing — assign duration and pacing for each slide based on content complexity

Field	Description
Narration text	What the TTS voice will say for this slide
Screenshot reference	Which screenshot to display
Duration	How long the slide is shown (driven by audio length)
Transition	How the slide enters/exits

Field

Description

Narration text

What the TTS voice will say for this slide

Screenshot reference

Which screenshot to display

Duration

How long the slide is shown (driven by audio length)

Transition

How the slide enters/exits

Tips for better scenarios

Write clear headings

Headings help AI understand the tutorial structure. Use them to mark distinct steps or sections.

Add descriptive text

The more context you provide around screenshots, the better the AI narration. Brief captions or explanations significantly improve output.

Use logical ordering

Arrange blocks in the order you want them narrated. AI follows the document flow top to bottom.

Keep screenshots focused

Crop screenshots to show only the relevant area. AI narration is more precise when screenshots are focused.

The AI uses up to 8,192 tokens for scenario generation. Very long documents with many screenshots are supported, but keeping documents focused produces better narration quality.

Language handling

The narration language is set when you start generation, not determined by the document text. The AI will:

Generate narration in the selected language regardless of the document language

Handle mixed-language documents (e.g., English UI with Polish explanations)

Use proper SSML markup for natural prosody in the target language

Polish text with special characters is handled automatically. The AI pipeline includes quote repair and Unicode sanitization to prevent JSON parsing errors.

Can I edit the AI scenario before rendering?

Currently, the scenario is generated and rendered in a single pipeline. To change the narration, edit your document content and regenerate.

Why does the narration not match my text exactly?

AI rewrites your content into spoken narration. It paraphrases, simplifies, and adds transitions to create a natural video flow. The meaning is preserved.

What AI model is used?

The scenario is generated using Claude via OpenRouter. The model selection is optimized for quality and cost.

What if the AI generates incorrect narration?

If the narration is inaccurate, try adding more descriptive text to your document. The AI relies on context — more detail produces better results.

What is an AI Scenario?

How it works

Scenario structure

Tips for better scenarios

Language handling

​What is an AI Scenario?

​How it works

​Scenario structure

​Tips for better scenarios

​Language handling

What is an AI Scenario?

How it works

Scenario structure

Tips for better scenarios

Language handling