Loading alternatives...

Discover 620 tools with similar features and pricing
We're showing tools from all 4 categories that Captions belongs to. This gives you a wider range of alternatives across different use cases.
Want alternatives from a specific category only? Use the Filters sidebar to uncheck categories you don't need. You can also filter by pricing, ratings, and features for more precise results.

AdCreative AI is an advertising-focused generative platform that converts static product images into short, UGC-style social videos optimized for formats like Instagram, TikTok, and Facebook.

Pdfgear is a PDF utility that lets users edit, merge, split, convert, annotate, and manage PDF documents online or offline across Windows, macOS, iOS, and Android.

Hypergro is an AI video platform that creates video and image ads from text, product descriptions, and URLs, and generates scripts for those ads.

Mirage by Captions is a web-based AI tool for generating, editing, and stylizing images and videos from text prompts and user-provided media.

Neuralspace AI is a platform that enables AI-powered dubbing, subtitling, and data-driven ideation to help users create and localize multimedia content efficiently.

Pyramid Flow is a framework for building and executing multimodal, agentic AI workflows using hierarchical flow diagrams with integrated tools, memory, and language models.

Hipclip is an AI-assisted editing tool that converts long-form video and podcast content into short clips, blog posts, and advertising assets.

Boldvoice is an accent training app that uses video lessons and AI-powered speech feedback to help non-native English speakers improve clarity and pronunciation.

WellSaid Labs is an AI-powered text-to-speech platform focused on producing natural, production-read

ElevenLabs is an AI platform for generating, editing, and managing natural-sounding multilingual speech and custom voice clones via web tools and developer APIs.

Personal Avatars Synthesia provides a platform for creating, customizing, and deploying AI video avatars that replicate a personβs likeness and voice for scalable video content.

Saydi provides instant, bidirectional voice translation for live meetings, events, and conferences, enabling participants to speak and listen in their preferred languages in real time.
Showing 1 - 24 of 620 alternatives