Veo 3 is an AI model for creating cinematic 8-second videos directly on Telegram
- By text description — simply write what you want to see
- Based on an image — upload a photo and describe what should happen in it
How to Create a Video?
1️⃣ Go to @GPT4Telegrambot
2️⃣ Enter the command /video → Veo 3
3️⃣ Fill in the parameters in the menu that opens:
1. Prompt
Describe what should happen in the video. Use English. The more precise and visually detailed your description is, the better the result.
2. Image
Upload a photo that you want to:
3. Duration
The video always lasts exactly 8 seconds. If you set a scene or dialogue that’s too long, it may get cut off. Use every second wisely.
4. Veo 3 Versions
The bot offers two generation options:
- Veo 3 Fast — faster and cheaper. Generally follows the prompt accurately, with realistic physics and sound effects. Perfect for tests and drafts.
- Veo 3 — produces highly realistic videos with sound effects and speech. Best for physics, camera work, and atmosphere. Uses two generations.
Aspect Ratio
Veo 3 generates videos in 16:9. If you use an image with a different aspect ratio, the final video will still be 16:9 — with black bars on the sides to preserve proportions.
After you set the parameters, click “Start Generation.”
⏳ Generation takes about 5 minutes.
Mastering Veo 3
Want the character to speak?
Don’t include actions in the same sentence. Write it briefly and clearly: “Speaks in clear English.” If you add something like “jumps and speaks,” the model may focus on the action and ignore the speech.
You can set an accent for characters. Simply specify the desired accent in the prompt, and the AI model will voice the character accordingly.
Want to keep the same character across different videos?
Write a detailed description or use the same image. For example, take a screenshot of the last frame from a previous video and use it as the base for the next one — this helps preserve the character and overall atmosphere.
Specify everything that matters
You can specify everything down to the smallest details: music, background sounds, accents, emotions. The prompt can be long — the key is making sure everything fits within 8 seconds.
How to Write an Effective Prompt?
It’s important to understand: you only have 8 seconds to fit in the storyline, movement, atmosphere, dialogue, and effects. Your prompt should be precise, logical, and detailed.
1. Start with the main point: Who is doing what?
- Specify the character: man, woman, child, cat, robot, etc.
- What are they doing? Use one clear verb: walking, speaking, looking, waving, sitting, drawing.
❌ Bad: “A lady is walking on the beach, singing a song, smiling, dancing, and talking.”
✅ Good: “A lady is slowly walking on the beach, speaking in clear Spanish.”
2. Describe the setting (if it matters)
Where is the action: in a desert, in a big city, by the sea, in space. What does the scene look like: lighting, colors, style.
Example: “A dark room with purple neon lights, the camera slowly zooms in.”
Example prompt: The video begins with a person walking through snow; the camera is on their boots. Tense music and the crunch of footsteps in a blizzard. The camera slowly pulls back as the person walks into the mountains, then they stop and say in clear English: “Try Veo 3 now.”
3. Sound effects
- Atmosphere: “tense music playing,” “sound of footsteps in the snow.”
- Effects: “sound of wind,” “object falling,” “city background noise.”
4. Define the style and mood
Do you want the video to look cinematic, futuristic, or like an animated film? Use words such as: cinematic, realistic, surreal, 3D render, three-dimensional visualization.
Now it’s your turn to write a prompt and see what Veo 3 can do. Good luck!