What is AI Video?
AI Video automatically turns any clarife document into a fully narrated video. The AI reads your screenshots and text, writes a narration script, synthesizes speech in your chosen language, and renders the final video with subtitles. The entire pipeline runs in the cloud, so you don't need any video editing software.
How it works
Document analysis
The AI reads your document content (headings, paragraphs, code blocks, tables, and screenshots) to understand the tutorial flow.
Scenario generation
Using Claude (via OpenRouter), the AI generates a narration script with timing cues for each step. The script follows your document's structure.
Text-to-speech
Google Cloud TTS converts the script into natural-sounding audio using SSML markup. 15 languages are supported.
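
To make the SSML step more concrete, here is a minimal sketch of how narration steps might be wrapped in SSML before being handed to a TTS engine. `<speak>`, `<p>`, and `<break>` are standard SSML elements. The function name `build_ssml`, the 500 ms pause, and the list-of-strings input are assumptions made for illustration and do not describe clarife's actual implementation.

```python
from xml.sax.saxutils import escape


def build_ssml(steps: list[str], pause_ms: int = 500) -> str:
    """Wrap narration steps in SSML with a short pause between steps.

    Hypothetical helper: the name, input shape, and pause length are
    illustrative assumptions, not part of the product.
    """
    parts = []
    for text in steps:
        # Escape &, <, and > so narration text cannot break the markup.
        parts.append(f"<p>{escape(text)}</p>")
    pause = f'<break time="{pause_ms}ms"/>'
    return "<speak>" + pause.join(parts) + "</speak>"


ssml = build_ssml(["Open the Settings page.", "Click Save & Close."])
print(ssml)
```

Escaping the text matters here because characters such as `&` in a step description would otherwise produce invalid markup.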
Video rendering
Remotion on AWS Lambda composites your screenshots, audio, and subtitles into a final MP4 video. Branding (watermark, intro/outro) is applied based on your plan.
Supported languages
AI Video supports 15 languages for text-to-speech narration:
| Language | Code | Language | Code |
|---|---|---|---|
| English | en | Polish | pl |
| German | de | French | fr |
| Spanish | es | Italian | it |
| Portuguese | pt | Dutch | nl |
| Swedish | sv | Norwegian | nb |
| Danish | da | Finnish | fi |
| Czech | cs | Romanian | ro |
| Turkish | tr | | |
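
If you handle language codes in your own tooling, the table above can be mirrored as a simple lookup. This is only a sketch: the names `SUPPORTED_LANGUAGES` and `resolve_language` are hypothetical, and the codes are copied from the table rather than from any published API.

```python
# Codes and names copied from the supported-languages table above.
SUPPORTED_LANGUAGES = {
    "en": "English", "pl": "Polish",
    "de": "German", "fr": "French",
    "es": "Spanish", "it": "Italian",
    "pt": "Portuguese", "nl": "Dutch",
    "sv": "Swedish", "nb": "Norwegian",
    "da": "Danish", "fi": "Finnish",
    "cs": "Czech", "ro": "Romanian",
    "tr": "Turkish",
}


def resolve_language(code: str) -> str:
    """Return the language name for a supported code, or raise ValueError."""
    normalized = code.strip().lower()
    if normalized not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported narration language: {code!r}")
    return SUPPORTED_LANGUAGES[normalized]


print(resolve_language("nb"))
```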
Requirements
- A clarife document with at least one screenshot block
- AI credits (3 free during trial, then purchased in packs)
- An internet connection (rendering happens in the cloud)
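
The first two requirements can be pictured as a simple pre-flight check. The sketch below assumes a document is a list of blocks, each a dictionary with a `type` field, and that the credit balance is an integer. Both shapes, and the function name `missing_requirements`, are hypothetical and chosen only for illustration.

```python
def missing_requirements(blocks: list[dict], credits: int) -> list[str]:
    """List the requirements a document fails; empty means ready.

    Assumed data model: each block is a dict with a "type" key, and
    screenshot blocks use the type "screenshot".
    """
    problems = []
    if not any(block.get("type") == "screenshot" for block in blocks):
        problems.append("Document needs at least one screenshot block.")
    if credits < 1:
        problems.append("No AI credits available.")
    return problems


text_only = [{"type": "heading"}, {"type": "paragraph"}]
print(missing_requirements(text_only, credits=3))
```

Returning a list of problems rather than a single boolean lets a caller show every unmet requirement at once.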
Output format
| Property | Value |
|---|---|
| Video format | MP4 (H.264) |
| Subtitles | SRT file |
| Resolution | Matches your screenshot dimensions |
| Audio | Synthesized TTS narration |
| Duration | Varies by document length (typically 1-5 minutes) |
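
For readers unfamiliar with SRT, each subtitle entry consists of a sequence number, a time range written as `HH:MM:SS,mmm --> HH:MM:SS,mmm`, and the subtitle text, followed by a blank line. The sketch below shows how timed narration cues could be serialized into that format. The cue tuples and the function names are assumptions for illustration, not the format of clarife's internal script.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02}:{minutes:02}:{secs:02},{ms:03}"


def to_srt(cues: list[tuple[float, float, str]]) -> str:
    """Serialize (start, end, text) cues into SRT text.

    Hypothetical cue shape: start and end are seconds from video start.
    """
    entries = []
    for index, (start, end, text) in enumerate(cues, start=1):
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        entries.append(f"{index}\n{time_range}\n{text}\n")
    # Entries are separated by one blank line.
    return "\n".join(entries)


srt = to_srt([
    (0.0, 2.5, "Open the Settings page."),
    (2.5, 5.0, "Click Save."),
])
print(srt)
```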
How long does generation take?
Typically 2-5 minutes depending on document length and the number of screenshots. You can close the editor and come back — the generation continues in the background.
Can I edit the generated video?
The generated video is a final MP4 file. If you want changes, edit the source document and regenerate. You can also download the SRT subtitle file separately.
What happens if I have no screenshots?
AI Video requires at least one screenshot block in your document. Text-only documents cannot be converted to video.
Is there a limit on document length?
There is no hard limit, but very long documents (50+ blocks) may produce longer videos and use more rendering resources. For best results, keep documents focused.