Resources ⟶ AI ⟶ Image, Video & Audio

Multimodal Media

Adobe Firefly

Website

Adobe Firefly is a generative AI solution for creating images, video, audio, and vector graphics, along with Firefly Boards for collaboration. It uses commercially-safe AI models, making it ideal for professional use. Key features include text-to-image and text-to-video generation, integration with Adobe Creative Cloud, and tools like Photoshop's Generative Fill. Users can create videos from text prompts or images, add motion, and translate audio and video clips into various languages, making it a versatile tool for designers, marketers, and content creators.

Artflow

Website

ArtFlow is an AI creative platform that lets users generate, animate, and personalize visual content using text prompts or reference images. It allows creators to design consistent avatars, craft illustrated scenes, and produce short videos from a browser-based studio. Users can upload their faces to create lifelike characters, control poses and styles, and animate avatars for storytelling or social media. ArtFlow offers a free plan with limited credits and paid tiers for higher-quality outputs and advanced features, making it perfect for creators and marketers.

Comfy

ComfyUI is a free, open-source node-based interface for creating AI-generated content like video, images, 3D models, and audio. It enables users to build workflows visually by connecting nodes, facilitating real-time adjustments, and live previews. The platform emphasizes reusability and sharing, allowing exported files for easy workflow rebuilding. It runs locally on users' machines, ensuring faster iteration, lower costs, and complete data control, while also supporting custom nodes without subscriptions or hidden fees.

DigitalMagicWand

Website

DigitalMagicWand is an AI-powered platform that enables users to create visuals, sound, and videos with no prior experience. It transforms static images into dynamic videos using text prompts and caters to various fields like education and marketing. The platform also includes an AI Humanizer tool, which converts robotic content into fluent language for essays and blogs. Positioned as a democratizing force in creative technology, DigitalMagicWand offers sophisticated multimedia creation tools on a credit-based system, making them accessible to all.

getimg.ai

Website

GetImg.ai is an AI platform offering tools for creating, editing, and transforming images and videos with advanced models like Qwen and Seedream 4.0. Users can generate original images, modify existing ones, and expand pictures easily, eliminating the need for layers or masks. Its real-time AI generator creates high-quality images in milliseconds, while the Model Trainer allows users to develop custom models from their photos. The requires no downloads, offers free usage for up to 100 images/month, and provides access to top-tier AI models.

Imagine.art

Website

Imagine.art is a leading AI art generation platform with over 30 million users and 100+ million downloads. It transforms creative workflows with its AI Tools Suite, allowing instant conversion of text descriptions into stunning artwork and HD videos. Features include Text-to-Image generation, Real-Time Creation, an AI Video Generator for 4K videos, Ideate for description-based painting, a Creative Upscaler for image enhancement, and Character Consistency for visual storytelling. Its vibrant Discord community of over 63,000 supports creatives in generating graphics, designs, and more through an intuitive interface.

Kaiber

Website

Kaiber is an AI video creation platform that transforms text prompts, images, and audio into dynamic videos. It features creative modes like Flipbook for animation, Motion for transitions, and Transform for artistic styles. The platform offers audio-reactive visuals to sync with music or voiceovers and a Superstudio for designing narratives and camera movements. Users can upscale videos to 4K resolution and customize aspect ratios for different platforms. With a user-friendly interface available on web and mobile, Kaiber is ideal for musicians, artists, marketers, and content creators looking to produce high-quality videos.

KingAI

Website

Kling AI is an innovative AI creative studio by Kuaishou that enables users to create imaginative images and videos using advanced generative AI. With over 10 million videos produced since its launch, the platform offers text-to-video capabilities that generate high-quality visuals from text prompts. Key features include motion brush, video extension, face modeling, and camera movement. Kling AI provides an SD version for faster video generation and a Pro version for higher-quality outputs, utilizing dynamic-resolution training for versatile content creation. Accessible via modern browsers, it offers a free plan with daily credits and various subscription options, catering to content creators, marketers, educators, and professionals needing ultra-realistic animations and cinema-quality videos.

NightCafe Studio

Website

NightCafe Studio is a leading free AI image generator that allows users to create stunning art from simple text prompts, aiming to democratize creativity for all. With user-friendly tools and features, it accommodates both beginners and experienced artists, requiring no coding skills. NightCafe emphasizes the joy of creativity as a form of therapy and expression, eliminating the need for years of practice. The platform fosters a vibrant community where creators share their work, explore diverse applications, and engage in creative challenges, making it both a powerful tool and a social hub for AI art enthusiasts.

Pollo AI

Website

Pollo AI is a comprehensive AI video and image creation platform that utilizes its flagship Pollo 1.6 to transform text prompts, images, and chat conversations into vivid visuals. It features advanced models like GPT-4o, over 2,000 specialized LoRA models, and 100+ image generators. Users can create dynamic videos from images with audio, employ features like image-to-video and text-to-video generation, and use over 40 unique video effects. Designed for creators and businesses, Pollo AI streamlines video production, allowing for rapid creation of engaging content without the need for pretraining - upload a photo, add a prompt, and let it generate stunning AI videos.

PromeAI

Website

PromeAI is a leading AI-powered design tool catering to diverse industries like architecture, interior design, product development, and game design. Trusted by millions of designers, it allows users to upload drafts or photos to create realistic renders, generate stunning images from text, and use Consistency Rendering to train custom AI models with a single image for cohesive designs. Features like Creative Fusion enable blending artistic styles while maintaining control over ideas, and AI-powered image conversion transforms photos into drawings. This user-friendly, web-based platform streamlines design with tools like sketch rendering and erase/replace functionality, helping creative professionals automate tasks and focus on high-quality graphics, videos, and animations.

Stability AI

Website

Stability AI is an enterprise-focused company that offers advanced AI tools for creative professionals in marketing, entertainment, and gaming. Their solutions include image generation, video production, and 3D/4D media tools, designed to enhance creative workflows. With their Dream Studio application and flexible deployment options - such as self-hosted models and API integration - Stability AI provides customizable, enterprise-grade tools. These tools include features like brand safety and compliance support, serving major clients like AWS, Microsoft Azure, Nvidia, and Lenovo.

starryai

Website

Starry AI is a user-friendly, free AI art generator that converts text prompts into unique visuals in seconds, allowing users to create up to 25 images daily without charge. Users can opt for unlimited generation with a Pro Unlimited subscription. It features four AI methods - Art, Photos, Illustrations, and Custom Styles - and offers preset themes from CyberPunk to Portraits, along with a Prompt Builder tool. Utilizing advanced neural style transfer and machine learning, Starry AI lets users customize image resolutions and canvas sizes, storing all creations in "My Creations" for easy future access. With its high-quality output and intuitive mobile design, it's ideal for artists and content creators looking to produce professional-grade artwork for various platforms.

Tangra

Website

Tangra is a web-based immersive platform that transforms virtual training, events, and collaboration into engaging AI-powered experiences. It offers solutions for employee training and onboarding, enhancing engagement through personalized, gamified experiences. Users can access both 2D and 3D AI chatbots for various functions like training and tours. With Tangra Immersive Learning, students are engaged, and learning is enjoyable, tackling the issues of ineffective virtual sessions. Additionally, Tangra AI provides high-quality visuals and integrates with platforms like Canva, making it a versatile tool for immersive events, team collaboration, and product showcases in an accessible online environment.

Images, Graphics & Art

Artbreeder

Website

Artbreeder is an AI platform that allows users to create and manipulate images by blending existing ones and adjusting features, making it popular among artists, game developers, and writers seeking quick visual inspiration. Its user-friendly web interface lets users modify "genes" like color and texture, mix images, and create new compositions. Artbreeder specializes in photorealistic character portraits and enables alterations to features such as age and skin tone through sliders. Recent updates include Stability AI's SD-XL, ControlNet functionality for specific poses, AI pattern generation, and Outpainting to extend images. With a tiered subscription model that includes a free tier, the platform supports collaborative image creation for all skill levels.

Artistly

Website

Artistly AI is an all-in-one AI image generation and editing suite that provides unlimited creative freedom through a one-time payment. Unlike subscription-based platforms, it offers a range of powerful features, including text-to-image generation, background replacement, image expansion, AI clothing try-on, and object replacement. The platform is ideal for tasks like creating children's storybooks, logos, t-shirt graphics, YouTube thumbnails, book covers, pet portraits, and bulk clipart - all with a fair-use policy of 400 image generations per day and full commercial rights. It also comes with a 30-day money-back guarantee and ongoing free updates, making it perfect for marketers, content creators, and entrepreneurs seeking professional visuals without the recurring fees.

ArtSmart AI

Website

ArtSmart.ai is an AI art generator that allows users to turn text into images for just $0.00542 each - 10,000 times cheaper than hiring a graphic designer. It specializes in photorealistic art, including faces and landscapes. It features an AI avatar system called "Tunes" for personalized artistic creations. With built-in social sharing and a comprehensive gallery system, users can easily navigate and manage their images. The central Playground offers a blank canvas for exploring AI-generated art, while its developer-friendly API provides high-resolution images with customizable settings, making it accessible for both casual creators and professional marketers.

character.ai

website

character.ai is an AI chatbot platform that lets users create and interact with personalized AI characters. Built on advanced language models, it enables conversations with AI personalities ranging from historical figures and fictional characters to custom-created companions with unique traits and backstories. Users can design their own characters by defining their personality, speaking style, and knowledge base, or chat with thousands of community-created characters. The platform supports creative writing, language learning, entertainment, and brainstorming through engaging, human-like dialogue. character.ai offers both free access and a subscription tier with enhanced features like priority access and faster response times, making AI interaction accessible and entertaining.

Clipdrop

Website

Clipdrop is an AI-enabled visual toolkit that makes advanced image editing and generation simple for creators and professionals. It offers a powerful combination of tools such as background removal, object cleanup, image upscaling, generative fill, and uncropping. Users can erase unwanted elements, extend scenes beyond original borders, change lighting, or generate new visuals from prompts - all in one browser or via API. Clipdrop supports seamless integration with design workflows (like plugins and APIs), and comes in free and subscription tiers, unlocking higher-resolution outputs, priority access, and advanced capabilities.

Craiyon

Website

Craiyon is a free AI image generator that transforms text prompts into unique visuals within seconds. Originally launched as DALL·E Mini, it allows users to describe any concept and instantly see multiple AI-generated interpretations. The platform offers an optional Expert Mode for refining results with advanced prompt controls, including negative prompts to exclude unwanted elements. Craiyon runs entirely in the browser and includes both free and paid plans, with premium tiers offering faster generation speeds, ad-free usage, and priority access. Designed for artists, creators, and casual users alike, Craiyon makes AI-driven image generation simple, quick, and accessible to everyone.

Deep Dream Generator

Website

Deep Dream Generator is an AI art platform that turns ideas and images into dreamlike visuals. Users can input text or images and choose from styles like "Deep," "Text 2 Dream," or "DreamFusion" to create surreal artworks. It also offers enhancements like upscaling and video creation, making it easy for artists and enthusiasts to explore AI art directly in their browser.

Dzine

Website

Dzine.ai (formerly Stylar AI) is an AI-powered design platform that integrates image generation and editing into a single workflow, eliminating the need for multiple design tools. It specializes in transforming sketches into polished artwork. It offers features like AI photo filters, generative image merging, AI object and watermark removal, and precise background removal. Users can also create logos, convert 2D images to 3D, and make virtual outfit changes. With a user-friendly interface, no coding required, and built-in prompt generation, Dzine.ai streamlines the design process for designers, game developers, and e-commerce professionals, offering 100 free credits upon registration and premium subscriptions for high-volume users.

Fooocus

Website

Fooocus is a free, offline, open-source image generation software based on Stable Diffusion XL. It simplifies the image generation process by allowing users to focus on prompts and images without manual parameter adjustments, similar to Midjourney. The installation is quick, needing fewer than three clicks, and it runs on a minimal GPU requirement of just 4GB VRAM. Key features include an offline GPT-2 prompt processor, inpainting algorithms, and support for various input methods like image prompts and upscaling. Currently in long-term support mode, Fooocus offers preset configurations for different styles and supports Windows, Linux, Mac, and Colab, making AI image generation accessible to users with basic hardware and technical knowledge.

Fotor

Website

Fotor's AI Art Generator is a free, user-friendly online platform that allows anyone to create stunning artwork from text prompts or photos without needing to sign up or deal with watermarks. It offers extensive customization options, such as a wide range of artistic styles, adjustable aspect ratios, the ability to generate up to six images at once, and negative prompts to exclude unwanted content. Users can earn free credits through daily check-ins. Additionally, Fotor includes a suite of AI-powered editing tools, including photo upscaling and background removal, making it a comprehensive creative solution for both professionals and beginners.

Gemini Nano Banana

Website

Gemini 2.5 Flash Image (codenamed "Nano Banana) is Google's advanced image generation and editing model, featuring lower latency and a native multimodal architecture. It processes text and images in a unified step, allowing users to merge multiple photos, restyle scenes, and fuse visuals from a single prompt. Designed for fast, conversational creative workflows, it excels with descriptive narratives. It is now available via the Gemini API on Google AI Studio and Vertex AI for enterprise use. The model supports 10 aspect ratios, making it suitable for a variety of formats, while delivering high-quality results for both image generation and editing. It's ideal for creative professionals and businesses seeking efficient, high-quality AI-driven image solutions.

ImageColorizer

Website

ImageColorizer is an automatic, AI-driven, cloud-based tool that restores and enhances photos effortlessly. It uses advanced AI trained on millions of images to add realistic colors to black and white photos, fix faded colors, and repair damage like scratches and stains. The platform also sharpens details, improves brightness and clarity, removes unwanted objects, and enhances portraits by smoothing skin and brightening eyes. With a user-friendly interface, users can easily upload photos with just one click. This allows the AI to transform old images into vibrant, high-quality pictures quickly and affordably, preserving cherished memories for future generations.

Imagifly

Website

Imagifly is a prompt management tool that simplifies AI image generation. It allows users to create, organize, and save customizable prompts for use in generative AI tools like Midjourney, DALL-E, or Photoshop. This helps maintain consistency across projects, builds a reusable prompt collection, and streamlines the creative workflow for creators using multiple AI platforms.

ImgLarger

Website

ImgLarger is an AI-powered image enhancement platform that allows users to upscale, restore, and optimize photos instantly. It offers a suite of tools to enlarge images up to 8x without losing quality, reduce noise, sharpen details, and colorize black-and-white or old photos. Users can also uncrop images by extending backgrounds, convert between popular formats (JPEG, PNG, WebP), and batch-process multiple files efficiently. Designed for photographers, designers, and everyday users, ImgLarger delivers professional-grade image enhancement directly in the browser - with secure processing and automatic image deletion after 24 hours.

Jasper Art

website

Jasper is an AI-powered platform that combines content creation with visual art generation. Its art tools transform text prompts into unique, high-quality images in seconds, making it ideal for marketers, content creators, and businesses looking to enhance their visual content. Users can generate custom artwork by describing their vision, choosing from various artistic styles, and adjusting parameters like mood and medium. Jasper Art produces royalty-free images suitable for blogs, social media, advertisements, and presentations. The platform integrates seamlessly with Jasper's broader AI writing tools, enabling users to create cohesive content campaigns with both compelling copy and eye-catching visuals.

Midjourney

Website

Midjourney is a research lab that explores new ways of thinking and enhances human creativity. It is a small, self-funded team focused on design and AI. The lab has developed an AI image generator that turns text descriptions into high-quality artwork, allowing users to create stunning images quickly. Users can access the platform on midjourney.com, enter their prompt in an "Imagine bar," and receive four generated images. This makes it a popular tool for both creative professionals and casual users looking to visualize their ideas.

OpenArt

Website

OpenArt is an AI-powered art platform for users of all skill levels to create art without prompts. It offers tools for character generation, image-to-video conversion, advanced inpainting, object removal, and extensive style customization via a diverse Style Palette. With educational resources like the Prompt Book and YouTube tutorials, OpenArt supports AI art creation. The free plan allows generating images up to 512x512 pixels with Stable Diffusion XL, provides new users with 20 bonus credits, and rewards community participation. It features high-resolution upscaling and sketch-to-image conversion for artistic collaboration.

PixAI

Website

PixAI is a free AI-powered anime art generator that allows users to create customizable anime-style artwork easily. With features like text-to-image generation, image enhancement tools, a Model Market for exclusive LoRA models, and editing options like inpainting and outpainting, it caters to both beginners and experienced artists. The platform also includes an Artists' Marketplace and Gallery for sharing and exploring art within a vibrant community.

Playform

Website

Playform is a privacy-focused AI art generation platform that offers unlimited free image generation with tools for face remixing, sketch-to-image conversion, and custom AI model training. It ensures all creations remain private, with no hidden sharing or paywalls. Key features include FaceCraft for consistent AI characters, a real-time drawing canvas, Freeform Diffusion for visual asset generation, and style transfer capabilities. Tailored for artists and designers, it offers 100 free daily generations (with watermark), while users can pay to download their favorites and remove watermarks. Premium tiers allow unlimited downloads and bonuses, empowering creators to explore and produce copyright-free art.

ShutterStock AI Image Generator

Website

Shutterstock's AI Image Generator uses advanced models like Google's Gemini 2.5 Flash and OpenAI's GPT Image to create high-quality images from simple prompts. Users can access the "Generate" tool via the Launchpad menu or the search bar, receiving unique pictures in seconds. What sets Shutterstock apart is its commitment to compensating artists, making it both a creative tool and an ethically-minded platform that integrates with its extensive royalty-free content library.

Video

D-ID

Website

D-ID is a generative AI platform that transforms static images, text, and voice into lifelike talking avatars and video agents. It lets users upload a photo or video to create AI-powered digital people, add natural speech in over 120 languages, animate realistic facial expressions and lip-sync, and repurpose those for marketing, training, customer service, or storytelling. Accessible via a web-based studio or API, D-ID enables businesses and creators to scale engaging, personalized video content without traditional production costs.

Descript

Website

Descript is an AI-powered video and podcast editing tool that simplifies audio and video editing. You can upload or record media, which is automatically transcribed, allowing you to make edits through text changes. Key features include filler-word removal, noise cleanup, voice cloning (Overdub), auto captions, background removal, and "eye contact" correction. With both browser and desktop interfaces, it streamlines production workflows for creators, helping them quickly turn ideas into polished content.

Google Veo 3

Website

Veo 3 is Google's advanced AI video generation model that creates high-fidelity, 8-second videos at 720p or 1080p resolution from text prompts. Accessible through Google AI Studio, it offers best-in-class realism and natively generated audio, including sound effects and dialogue. The model excels in understanding nuanced prompts and cinematic language, producing realistic motion and sound across various visual styles. Available to Google AI Pro and Ultra subscribers, Veo 3 enables creators and businesses to turn text descriptions into professional-quality videos with synchronized audio.

InVideo

Website

InVideo is an AI-powered video creation platform that enables users to turn ideas into professional videos using text prompts. It uses advanced AI tools for script generation, visual selection, voiceovers, and video production tailored for platforms like YouTube and TikTok. Users can customize over 7,000 templates, adjust formats, and translate videos into various languages. With a web-based interface and AI-driven automation, InVideo simplifies video production for marketers, educators, and content creators of all skill levels.

Lumen5

Website

Lumen5 is an AI-driven video creation platform that simplifies the process of turning text into engaging videos. It automatically generates storyboards, selects relevant visuals and music, and offers an intuitive drag-and-drop interface suitable for users without video editing experience. With a vast library of stock images, video clips, and music, Lumen5 allows for easy customization. It also includes features like voiceovers, captioning, and social media optimization for efficient, professional-quality video production.

OpenAI Sora

Website

Sora 2 is OpenAI's advanced video and audio generation model, offering more realistic and controllable output than previous systems. Users can create videos from text prompts with synchronized dialogue and sound effects, using a diffusion model that transforms static noise into complete videos. Sora supports various durations, aspect ratios, and resolutions, offering up to a minute of high-definition video, along with a storyboard feature for selecting specific frames. Available through the Sora app with C2PA metadata, it marks a significant advancement in text-to-video technology, with potential for higher resolutions and longer durations in the future.

Pictory

Website

Pictory is an AI-powered video creation platform that transforms text, blog posts, and scripts into professional-quality videos within minutes. It uses advanced AI to match content with relevant visuals, music, and voiceovers, making video production easy for anyone. Key features include script-to-video conversion, blog-to-video transformation, automatic captioning, and video summarization. Users can edit videos with text, create highlights from long-form content, and turn PowerPoint presentations into engaging videos. With a user-friendly interface and cloud accessibility, Pictory streamlines the video creation process for creators, marketers, and educators.

Runway

Website

Runway is an AI research and technology company known for its Gen-4 video generation model. This model allows users to create videos using simple prompts, generating consistent characters, locations, and objects while maintaining a coherent style and mood across scenes. Users can easily adjust lighting, restyle shots, and alter elements by directly communicating their needs. In addition, Runway operates Runway Studios for producing films and music videos and Runway Academy, which offers tutorials and resources for creators to integrate AI into their projects, providing a comprehensive solution for professional video production.

Synthesia

Website

Synthesia is an AI-powered video creation platform that allows users to generate professional-quality videos from text in over 140 languages without the need for cameras or studios. It features lifelike AI avatars, voiceovers, customizable templates, and integration with Learning Management Systems (LMS). The platform also offers one-click translation, AI dubbing, and multilingual content support, making it suitable for training, marketing, and communications. With real-time collaboration tools and analytics, teams can efficiently manage video content while adhering to ethical AI practices.

Synthesys

Website

Synthesys is an AI video creation platform that allows users to produce high-quality videos from text in over 140 languages. It features hyper-realistic avatars, voice cloning, text-to-speech, and customizable options, enabling content creation for marketing, training, and more without traditional filming equipment. With 600+ voices and multilingual support, Synthesys makes video production accessible for both beginners and professionals. Its user-friendly interface ensures quick video rendering.

Audio, Music & Speech

15.dev

Website

Originally called 15.ai, 15.dev is a text-to-speech system designed to create realistic voices with minimal training data. It started in 2016 as a deep learning research project by a developer during their first year at MIT. The platform aims to democratize voice synthesis, allowing users to generate high-quality synthetic speech easily. This is especially beneficial for content creators, game developers, and storytellers looking to enhance their projects with voice-over capabilities.

AIVA

Website

AIVA (Artificial Intelligence Virtual Artist) is an AI music generation tool that allows users to create songs in over 250 styles within seconds, suitable for both beginners and professionals. The platform offers customizability, letting users upload audio or MIDI references, edit tracks, and download compositions in various formats. AIVA has a tiered pricing structure: a free plan for non-commercial use (3 downloads/month), a Standard plan for limited monetization (15 downloads/month), and a Pro plan with full copyright ownership (300 downloads/month). With over 150 showcase tracks, AIVA provides royalty-free music for content creators, filmmakers, and advertisers, along with discounts for students and educational institutions.

LiveKit

Website

LiveKit is an open-source Voice AI platform that allows developers to create and scale real-time voice agents using powerful APIs. It integrates with AI services like Deepgram, OpenAI's GPT-4, Cartesia, and Silero, enabling the development of sophisticated voice agents quickly. With ultra-low 100ms global latency, strong compliance with GDPR and HIPAA, and 99.99% uptime, LiveKit supports over 100,000 developers and handles 3 billion calls annually. It offers web and mobile integration, telephony connectivity, and a testing playground, all with 50GB of free monthly usage without a credit card requirement.

Mubert

Website

Mubert is an AI-driven music platform that allows creators to generate royalty-free soundtracks tailored to their content. With tools like Mubert Render, users can create music for videos and social media by selecting genre and tempo. Mubert Studio enables musicians to upload samples and collaborate with AI on new tracks while earning revenue. The platform also offers an API for developers to integrate custom AI music into apps and games. Additionally, Mubert Play provides personalized, infinite music streams that adapt to user preferences for various activities.

ABOV

Image, Video & Audio