Hasta la semana pasada todo el mundo hablaba de OpenAI. Sora 2. Mientras todos esperamos el llegada en la India, Google acaba de realizar un shock profesor. De la sombra a la mañana, no sólo anunciaron el potente Veo 3.1, sino que igualmente lo lanzaron de forma gratuita en Flow y a través de su suite Gemini. ¿Cómo no arrobar a Google por esto? En lado de una demostración a puerta cerrada, nos entregaron las llaves de un estudio de vídeo de última gestación y es hora de explorar. En este artículo te contaré todo sobre Veo 3.1.
¿Qué hay de nuevo en Veo 3.1 y Flow?
Anunciada el 15 de octubre de 2025, esta no es solo una puesta al día beocio. Veo 3.1 ofrece un conjunto de mejoras centradas en cumplimentar a los creadores más control y resultados de maduro calidad. Aquí está el desglose:
- Audio más rico en todos los ámbitos: Funciones como “Ingredientes para video”, “Marcos para video” y “Extender” ahora vienen con audio enriquecido generado por IA. Tus videos ya no estarán en silencio; Tendrán sonidos ambientales y partituras que darán vida a las escenas.
- Realismo mejorado y añadidura rápida: Veo 3.1 se pedestal en su predecesor y se centra en capturar texturas reales y comprender mejor exactamente lo que pides en tus indicaciones.
- Herramientas de estampado de precisión: Flow se está convirtiendo en una suite de estampado completa. La nueva útil «Insertar» te permite sumar cualquier cosa, desde un pícaro callejero hasta un automóvil volátil, a una panorama existente. Próximamente habrá una útil «Eliminar» para eliminar sin problemas objetos no deseados.
- Más control narrativo: Utilice múltiples «Ingredientes» para aconsejar el estilo, proporcione una imagen auténtico y final para una «transición épica» con «Cuadros a video» o cree clips fluidos de un minuto «Extendiendo» sus tomas.
¿Cómo aceptar a Veo 3.1?
¿La mejor parte? No es necesario que esté en una cinta de aplazamiento.
- Para todos: Dirígete a Laboratorios.Google/Flow/Gemini/AI Studio para despuntar a crear injusto.
- En tu teléfono: Acceda a Veo 3.1 directamente desde la aplicación Gemini.
- Para desarrolladores y empresas: El maniquí está arreglado a través de Gemini API y Vertex AI para crear aplicaciones y uso empresarial.
Nota: Sugiero usar Flow sobre la aplicación Gemini/AI Studio o Vertex AI. Flow proporciona una experiencia creativa superior con diversas herramientas de estampado, lo que le permite ampliar historias fácilmente, establecer fotogramas de inicio/final personalizados y controlar mejor los medios de la panorama. (Estas funciones no están disponibles actualmente en otras plataformas de Google).

6 sugerencias interesantes para probar con Veo 3.1
Posteriormente de acontecer utilizado ampliamente todas las funciones de Veo 3.1, desde vlogging hasta mundos futuristas, he recopilado los mejores videos que creé. En la próximo sección, lo guiaré a través del alucinación para crearlos.
Tarea 1: crear un anuncio de marketing
Para esta tarea, le pedí a Veo 3.1 que creara un video de marketing para un impermeable adhesivo siguiendo el próximo mensaje:
(Cinematography) A dynamic, chaotic wide shot with a shaky handheld feel, quickly panning and zooming slightly to follow the action, ending on a close-up of the Fix-It packaging. (Subject) The same young Gen Z male blogger, in modern attire, looking slightly out of place, stands on the sloping deck of the Titanic. Passengers in historical 1912 clothing are running in panic and disarray around him. The Captain (in full naval uniform, looking frantic and desperate) spots the blogger. (Action) The blogger, holding up a small, red "Fix-It" tube/canister, is vlogging with a concerned but casual expression, saying: "Seriously, guys, why don't you just use Fix-It to save the ship?" The Captain, with wide, desperate eyes, lunges forward and snatches the Fix-It tube from the blogger's hand, yelling: "Give it to us!"
(Cut/Transition - Implied by rapid action)
(Cinematography) A smooth, triumphant wide shot showing the Titanic now fully afloat and stable, sailing calmly on the ocean under a clear sky. (Cut) A clean, well-lit close-up shot of "Fix-It" packaging (a sleek, modern red tube/canister with clear branding) rotates slowly into the frame, with a subtle sparkle effect.
(Context) The upper deck of the Titanic during its sinking, with lifeboats being lowered in the background, steam rising, and a sense of impending doom. Then, the Titanic peacefully sailing on the ocean.
(Style & Ambiance) Hyper-realistic, cinematic drama transitioning to a clean, commercial aesthetic. Initially, panicked, chaotic, and cold blue lighting of a disaster, then bright, clear, and hopeful lighting for the saved ship and product shot. Audio: Initial chaos, screams, creaking of the ship, the blogger's voice, the Captain's frantic shout, then sudden calm with gentle ocean waves and a triumphant, subtle jingle as the Fix-It packaging appears. 1080p, 16:9, 8s.
Producción:
El resultado fue impresionante. Si acertadamente el video fue excelente, noté un problema popular: la distorsión del texto en el empaque del producto al final. La esencia para solucionar este problema y crear anuncios impecables es proporcionar una imagen del producto como relato en su mensaje.
Tarea 2: Esencia de anime
Inmediato:
0-4s: Medium wide shot, slightly low angle, on two anime schoolgirls (early teens, wearing cute, modern Japanese school uniforms). They are sitting on swings next to each other in a sunny park. Both are gently swinging, with cherry blossoms falling softly around them. The first girl (with a slightly worried expression) is looking down. First Girl (softly, with a sigh): "Ugh, I still have to prepare for tomorrow's exam."
4-7s: The camera smoothly tracks/dolly-in to a medium close-up of both girls. The second girl (with a confident, cheerful smile) turns to face the first girl. Second Girl (brightly): "Don't worry! Use NotebookLM to learn, revise, and test your knowledge. That's what I do all the time to top the class!"
7-8s: The camera briefly holds on both girls, who are now smiling confidently, then slowly fades to a clean white background.
(Context & Ambiance) A beautiful, well-maintained park with lush green trees, bright flowers, and a clear blue sky. The overall mood is light, hopeful, and friendly.
Audio: Gentle creaking of the swings, soft, cheerful anime background music (light piano and strings), both girls' clear, expressive voices. Subtle ambient park sounds (distant birdsong).
Tarea 3: retroceder en el tiempo
Si usa Veo 3.1 a través de Flow, puede crear videos más largos. Eso es exactamente lo que hice para el video a continuación. Creé un mensaje en 3 partes, que representa escenas de los 18 días del Mahabharata.
Verás cómo estos videos se combinan perfectamente y se ven naturales.
Aquí están las indicaciones para las tres escenas de video y el video combinado.
Primera panorama:
(Cinematography) A poignant medium close-up, slightly high angle, with a slow, empathetic dolly-out, emphasizing the tragic scene. (Subject) The same young Gen Z male blogger, in modern attire, smartphone in hand, looking down at Duryodhana (lying wounded and dying on the ground, his armor dented and scuffed). (Action) The blogger has a somber, almost "I told you so" expression, while Duryodhana is weak and barely conscious, struggling to breathe. Blogger (with a mix of sadness and resignation): "Bro, I told you to call it off. Look what you have done!" (Context) The aftermath of battle on Kurukshetra, with a few scattered dead warriors around, a broken chariot wheel nearby, and the setting sun casting long, orange shadows over the desolate landscape. (Style & Ambiance) Hyper-realistic, cinematic, melancholic mood, dramatic golden hour lighting, conveying the tragedy of war with the blogger's casual commentary creating a darkly comedic moment. Audio: Blogger's mournful voice, faint groans from Duryodhana, the eerie silence of a battlefield after the fighting, distant wind. 1080p, 16:9, 8s.
Segunda panorama:
(Cinematography) A poignant medium close-up, slightly high angle, with a slow, empathetic dolly-out, emphasizing the tragic scene. (Subject) The same young Gen Z male blogger, in modern attire, smartphone in hand, looking down at Duryodhana (lying wounded and dying on the ground, his armor dented and scuffed). (Action) The blogger has a somber, almost "I told you so" expression, while Duryodhana is weak and barely conscious, struggling to breathe. Blogger (with a mix of sadness and resignation): "Bro, I told you to call it off. Look what you have done!" (Context) The aftermath of battle on Kurukshetra, with a few scattered dead warriors around, a broken chariot wheel nearby, and the setting sun casting long, orange shadows over the desolate landscape. (Style & Ambiance) Hyper-realistic, cinematic, melancholic mood, dramatic golden hour lighting, conveying the tragedy of war with the blogger's casual commentary creating a darkly comedic moment. Audio: Blogger's mournful voice, faint groans from Duryodhana, the eerie silence of a battlefield after the fighting, distant wind. 1080p, 16:9, 8s.
Tercera panorama:
(Cinematography) A medium two-shot, slightly low angle, with a smooth, subtle push-in on the subjects. (Subject) A young Gen Z male blogger, early 20s, wearing a modern graphic t-shirt, skinny jeans, and stylish sneakers, holding a modern smartphone in selfie mode, facing Lord Krishna (traditional blue skin, ornate gold jewelry, and royal attire) and Arjuna (in traditional warrior armor, bow and quiver on his back). (Action) The blogger is speaking directly to the camera/phone (as if vlogging), then turns slightly to Krishna and Arjuna, who are looking amused. Blogger (looking at camera/phone): "Yo, Krinsha and Arjuna... quick question, are you guys using ChatGPT for your battle strategies?" Krishna and Arjuna (in unison, with a knowing smile): "Of course, bro." (Context) Set on the ancient, dusty battlefield of Kurukshetra, with distant chariots and warriors visible in the background (slightly out of focus due to shallow depth of field). (Style & Ambiance) Hyper-realistic, cinematic, warm natural daylight, epic scale, yet with a humorous undertone. Audio: The blogger's clear voice, Krishna and Arjuna's synchronized reply, distant sounds of battle (clashing swords, faint shouts). 1080p, 16:9, 8s.
Producción:
¡La salida me dejó alucinado! La sincronización labial con diálogo, la precisión del fondo y la coherencia de los rasgos faciales del vlogger contribuyen a crear un vídeo utópico. Esto convierte a Veo en una útil poderosa que los influencers pueden utilizar para crear contenido divertido y muy atractivo.
Tarea 4: Vlogs promocionales
Para esta tarea, le pediré a Veo 3.1 que cree un vlog y promueva el curso de balde de Analytics Vidhya sobre IA agente.
Inmediato:
(Cinematography) A medium shot, slightly wide angle, with a slow, eerie push-in towards the sarcophagus. As the mummy emerges, the camera smoothly pans up and slightly backs away to include both characters, then cuts to a static shot of a QR code. (Subject) The same young Gen Z male blogger, in modern adventure attire (e.g., cargo shorts, t-shirt, small backpack), holding his smartphone in selfie mode, standing reverently in an ancient Egyptian pyramid burial chamber. Behind him is an ornate stone sarcophagus. (Action) The blogger is speaking directly to the camera/phone with an enthusiastic whisper: "Hey guys! Do you know Analytics Vidhya is teaching how to build AI Agents for FREE?" Suddenly, the lid of the sarcophagus begins to slide open, and a mummy (wrapped in decaying linen, with glowing eyes) slowly sits up and emerges, looking directly at the blogger. Mummy (with a raspy, ancient voice): "I also want to enroll."
(Cut)
(Cinematography) A static, well-lit shot of a clear, scannable QR code centered on a dark, stone-like background. (Text Overlay) Below the QR code, in clear white text: "Scan the Code to Enroll for Free."
(Context) The dimly lit, dusty interior of an ancient Egyptian pyramid chamber, with hieroglyphs on the walls and golden sarcophagus details.
(Style & Ambiance) Hyper-realistic with a touch of mysterious fantasy. Dark, atmospheric lighting with shafts of light highlighting dust motes, conveying an ancient, undiscovered feel. The mummy's emergence should be spooky yet slightly humorous. Subtle, ambient Egyptian music (e.g., flutes, distant percussion, string instruments) plays throughout, intensifying slightly as the mummy speaks, then becoming clean and clear for the QR code screen. 1080p, 16:9, 8s.
Tarea 5: Escenas tipo película
Esta tarea es una conversación entre presidente y empleado sobre la abaratamiento de licencias. El empleado quiere una semana atrevido para asistir a la Cumbre DataHack y el presidente no aprueba las licencias oportuno a la carga de trabajo extrema por parte del cliente. Aquí está el mensaje para lo mismo:
0-2.5s: Medium two-shot, over-the-shoulder perspective, focused on the boss. Setting is a modern corporate office, clean and minimalist, with a large glass window in the background showing a city view. The employee (young professional, formal shirt and tie) stands opposite the boss (older, more serious, wearing a suit) across a desk. Employee (casual, confident tone): "I need one week off in August. Letting you know already."
2.5-5s: The camera quickly cuts to a close-up of the Boss's face (looking stern and annoyed). Boss (firm, stressed tone): "You can't go anywhere in August! We have major client deliverables."
5-8s: The camera cuts back to the medium two-shot, with a subtle teleobjetivo/dolly-in on the Employee (who has a determined, unwavering expression). Employee (unfazed, passionate tone): "I don't care. India's most futuristic conference, the DataHack Summit in Bangalore, is happening. I can't miss it."
(Context & Ambiance) Tense, professional atmosphere contrasting with the employee's casual defiance. Sharp, cool office lighting.
Audio: Employee's voice is clear, Boss's voice is slightly frustrated and louder. Subtle, ambient office noise (keyboard clicks, distant phone rings). A light, triumphant, energetic synth track swells slightly as the employee delivers the final line, then cuts out.
Este vídeo es muy bueno. Los personajes y la panorama parecen muy reales. Él definitivamente no se siente como fue IA generada en total”.
Tarea 6: Vídeos animados
Inmediato:
Total Duration: 8 seconds. 1080p, 16:9. Style: Hyper-realistic, scientifically accurate yet visually awe-inspiring, cosmic.
0-1s: Extreme close-up on an infinitesimally small, impossibly bright, pure white singularity at the center of the frame, pulsating with immense energy. The background is absolute, inky blackness.
1-4s: The singularity explodes outwards with incredible force and speed. The camera rapidly pulls back and expands with the explosion, showing a dramatic, fast-paced expansion of superheated plasma and fundamental particles. Vibrant blues, yellows, and whites dominate, rapidly cooling and spreading into a vast, glowing cosmic fog.
4-7s: The expansion continues, slowing slightly. The cosmic fog begins to swirl and coalesce, with pockets of slightly denser energy forming. Early, rudimentary structures of the nascent universe start to become visible within the glowing gas.
7-8s: The camera reveals the first hints of distinct cosmic structures, like proto-galaxies and swirling nebulae, still glowing intensely but showing clear patterns. The expansion continues into the deep, dark vastness of space.
(Context & Ambiance) A depiction of the birth of the universe. The mood is one of immense power, mystery, and awe.
Audio:
0-1s: A deep, intense, low-frequency hum/drone that rapidly builds in pitch and intensity, with sharp, high-frequency crackles hinting at immense energy.
1-4s: A massive, deafening "BOOM" sound, immediately followed by a whoosh of rapidly expanding, fiery plasma, accompanied by a high-pitched, ethereal "shimmering" sound as particles form.
4-7s: The sound mellows into a vast, echoing cosmic hum, with subtle, deep rumbling tones signifying the formation of gravity wells and early structures.
7-8s: The cosmic hum continues, now with faint, distant twinkles and ethereal chimes, suggesting the nascent formation of stars and galaxies, fading slightly at the very end.
Funciones adicionales que vienen con el uso de Veo 3.1 en Flow
El equipo de Veo está implementando nuevas y potentes funciones de estampado en Flow, su útil cinematográfica de IA.
Sumar o quitar objetos
Los usuarios ahora pueden sumar cualquier sujeto que puedan imaginar a una panorama, desde objetos realistas hasta criaturas fantásticas, con Veo manejando inteligentemente las sombras y la iluminación para una apariencia natural.
Fotogramas a vídeo
En una potente puesta al día, Veo ofrece a los creadores un control narrativo total sobre sus clips con la nueva función «Frames to Video» en Flow. Ahora puede explicar la imagen auténtico y final de una panorama, y Veo generará un video de transición fluido que une perfectamente las dos, creando un movimiento ingenioso y épico.
Ingredientes para el vídeo
En una potente puesta al día, Veo ofrece a los creadores un control narrativo total sobre sus clips con la nueva función «Frames to Video» en Flow. Ahora puede explicar la imagen auténtico y final de una panorama, y Veo generará un video de transición fluido que une perfectamente las dos, creando un movimiento ingenioso y épico.
Lea igualmente:
Conclusión
Veo 3.1 marca un importante brinco delante, transformando Flow de Google en un estudio de vídeo verdaderamente de última gestación. La combinación de realismo mejorado, gestación de audio nativo y poderosas herramientas de estampado como Frames to Video y la capacidad de sumar y eliminar objetos lo convierten en un herramienta creativo formidable. Al ofrecer este maniquí reformista de llegada de balde, Google ha democratizado la gestación de vídeos de suscripción serie. Nuestras pruebas confirman que la calidad y la consistencia están listas para uso profesional o personal.
Ahora que tienes el conocimiento y las indicaciones, ¡es hora de probarlo!
Inicie sesión para continuar leyendo y disfrutar de contenido seleccionado por expertos.