Advanced AI Models
Discover and use the world's most advanced AI models

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
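
As a rough sketch of how a 4-step model like this can be called through the Replicate Python client; the model slug and input field names below are assumptions, so check the model's page for its actual schema.

```python
# Minimal sketch: calling a fast 4-step text-to-image model via the
# Replicate Python client (requires REPLICATE_API_TOKEN in the environment).
# The model slug and input keys are assumptions; consult the model page.
import replicate

output = replicate.run(
    "bytedance/sdxl-lightning-4step",   # assumed model identifier
    input={
        "prompt": "a lighthouse at dawn, watercolor",
        "num_inference_steps": 4,       # the model is tuned for 4 steps
    },
)
print(output)  # typically one or more image URLs / file objects
```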

The fastest image generation model tailored for local development and personal use

An 8 billion parameter language model from Meta, fine tuned for chat completions

Generate image captions

A 70 billion parameter language model from Meta, fine tuned for chat completions

Convert speech in audio to text

Return CLIP features for the clip-vit-large-patch14 model

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

Practical face restoration algorithm for *old photos* or *AI-generated faces*

A simple OCR Model that can easily extract text from an image.

A text-to-image generative AI model that creates beautiful images

Real-ESRGAN with optional face correction and adjustable upscale

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Base version of Llama 3, an 8 billion parameter language model from Meta.

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Robust face restoration algorithm for old photos / AI-generated faces

Google's latest image editing model in Gemini 2.5

image tagger

Generate detailed images from scribbled drawings

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

multilingual-e5-large: A multi-language text embedding model

This is the fastest Flux endpoint in the world.

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Answers questions about images

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Detect everything with language!

Fill in masked parts of images with Stable Diffusion

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

Hyper FLUX 8-step by ByteDance

A 7 billion parameter language model from Meta, fine tuned for chat completions

Real-ESRGAN for image upscaling on an A100

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

openai/clip-vit-large-patch14 with Transformers

Unified text-to-image generation and precise single-sentence editing at up to 4K resolution

🦙 LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.

Remove background from an image

Stable Diffusion fine tuned on Midjourney v4 images.

Ultra fast flux kontext endpoint

An SDXL fine-tune based on Apple Emojis

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.

Remove backgrounds from images.

Run any image through the Stable Diffusion content filter

Remove images background

multilingual text2image latent diffusion model

A 70 billion parameter language model from Meta, fine tuned for chat completions

Modify images using M-LSD line detection

A model for text, audio, and image embeddings in one space

Anime-themed text-to-image stable diffusion model

chameleonn: one-click face swap (formerly roop)

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

whisper-large-v3, incredibly fast, with video transcription

Create photos, paintings and avatars for anyone in any style within seconds.

Robust face restoration algorithm for old photos/AI-generated faces

Generate Pokémon from a text description

Hyper FLUX 16-step by ByteDance

Practical Image Restoration Algorithms for General/Anime Images

GFPGAN aims at developing Practical Algorithms for Real-world Face and Object Restoration

Control diffusion models

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

Meina Mix V11 Model (Text2Img, Img2Img and Inpainting)

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
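
The guide linked above covers the full input schema; as a hedged sketch, a workflow exported from ComfyUI in API format might be submitted like this (the model slug and the `workflow_json` input name are assumptions).

```python
# Sketch: submitting a ComfyUI workflow (API-format JSON) to a cog-comfyui
# deployment on Replicate. Model slug and input key are assumptions; see
# https://github.com/replicate/cog-comfyui for the real schema.
import json
import replicate

with open("workflow_api.json") as f:    # workflow exported from ComfyUI
    workflow = json.load(f)

output = replicate.run(
    "fofr/any-comfyui-workflow",        # assumed model identifier
    input={"workflow_json": json.dumps(workflow)},
)
print(output)                           # URLs of the generated outputs
```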

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

A powerful AI model.

SDXL based text-to-image model applying Distribution Matching Distillation, supporting zero-shot identity generation in 2-5s. https://ai-visionboard.com

Image Restoration Using Swin Transformer

text2img model trained on LAION HighRes and fine-tuned on internal datasets

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Google's Imagen 4 flagship model

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency

controlnet 1.1 lineart x realistic-vision-v2.0 (updated to v5)

An optimised version of the hidream-l1 model, built with the Pruna AI optimisation toolkit!

moondream2 is a small vision language model designed to run efficiently on edge devices

A background removal model enhanced with better matting

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3
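
A hedged sketch of a transcription call with diarization; the model slug, input keys, and the shape of the returned segments are assumptions, so inspect the real output before relying on specific fields.

```python
# Sketch: transcription with word-level timestamps and speaker diarization.
# Model slug, input keys, and output structure are assumptions.
import replicate

result = replicate.run(
    "victor-upmeet/whisperx",                            # assumed model identifier
    input={
        "audio_file": "https://example.com/meeting.mp3", # hypothetical audio URL
        "diarization": True,
    },
)

# Assuming the output is a dict with a "segments" list of timed utterances.
for seg in result.get("segments", []):
    speaker = seg.get("speaker", "?")
    print(f'{speaker} [{seg["start"]:.1f}-{seg["end"]:.1f}s]: {seg["text"]}')
```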

LLaVA v1.6: Large Language and Vision Assistant (Mistral-7B)

A 13 billion parameter language model from Meta, fine tuned for chat completions

ProteusV0.3: The Anime Update

Low latency, low cost version of OpenAI's GPT-4o model

The CLIP Interrogator is a prompt engineering tool that combines OpenAI's CLIP and Salesforce's BLIP to optimize text prompts to match a given image. Use the resulting prompts with text-to-image models like Stable Diffusion to create cool art!
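
In practice that round trip (image in, prompt out, prompt back into a text-to-image model) might look roughly like the sketch below; both model identifiers and input keys are assumptions.

```python
# Sketch: interrogate an image to recover a prompt, then reuse that prompt
# with a text-to-image model. Model identifiers and input keys are assumptions.
import replicate

prompt = replicate.run(
    "pharmapsychotic/clip-interrogator",        # assumed model identifier
    input={"image": "https://example.com/reference.jpg"},  # hypothetical URL
)
prompt = prompt if isinstance(prompt, str) else "".join(prompt)
print("Recovered prompt:", prompt)

images = replicate.run(
    "stability-ai/stable-diffusion",            # assumed model identifier
    input={"prompt": prompt},
)
print(images)                                   # URL(s) of the generated art
```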

ZoeDepth: Combining relative and metric depth

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Implementation of Realistic Vision v5.1 with VAE

FLUX.1-Dev LoRA Explorer (DEPRECATED: please use black-forest-labs/flux-dev-lora)

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

Add sound to video using the MMAudio V2 model. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation.

The latest Qwen-Image iteration, with improved multi-image editing, single-image consistency, and native ControlNet support

Implementation of Realistic Vision v5.1 to conjure up images of the potential baby using a single photo from each parent

Bilateral Reference for High-Resolution Dichotomous Image Segmentation (CAAI AIR 2024)

LLaVA v1.6: Large Language and Vision Assistant (Vicuna-13B)

Open-weight version of FLUX.1 Kontext

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

3 Million Runs! AI Photorealistic Image Super-Resolution and Restoration

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt

high-quality, highly detailed anime style stable-diffusion with better VAE

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video

The fastest image generation model tailored for fine-tuned use

SDXL ControlNet - Canny

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

A 17 billion parameter model with 16 experts

high-quality, highly detailed anime-style Stable Diffusion models

The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)

@pharmapsychotic's CLIP-Interrogator, but 3x faster and more accurate. Specialized for SDXL.

Generate music from a prompt or melody

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

SDXL Inpainting by the HF Diffusers team

A text-to-image model with support for native high-resolution (2K) image generation

Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)

A fast image model with state of the art inpainting, prompt comprehension and text rendering.

Playground v2.5 is the state-of-the-art open-source model in aesthetic quality

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
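
A hedged image-to-video sketch; the model slug and the `start_image`/`duration`/`mode` input names are assumptions drawn from the description, not the model's documented schema.

```python
# Sketch: animating a still image into a short clip. Model slug and input
# keys are assumptions; check the model page for real parameter names.
import replicate

video = replicate.run(
    "kwaivgi/kling-v2.1",                                     # assumed model identifier
    input={
        "start_image": "https://example.com/first-frame.png",  # hypothetical URL
        "prompt": "the camera slowly pans across a misty harbor",
        "duration": 5,        # the description mentions 5s and 10s clips
        "mode": "standard",   # e.g. a 720p vs 1080p tier
    },
)
print(video)                  # URL of the rendered clip
```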

Get an approximate text prompt, with style, matching an image. (Optimized for stable-diffusion (clip ViT-L/14))

Real-ESRGAN: Real-World Blind Super-Resolution

A flux lora fine-tuned on black light images

A powerful AI model.

An excellent image model with state of the art inpainting, prompt comprehension and text rendering

⚡️ FLUX PuLID: FLUX-dev-based Pure and Lightning ID Customization via Contrastive Alignment

Reliberate v3 Model (Text2Img, Img2Img and Inpainting)

This is a language model that can be used to obtain document embeddings suitable for downstream tasks like semantic search and clustering.

This is a 3x faster FLUX.1 [schnell] model from Black Forest Labs, optimised with pruna with minimal quality loss. Contact us for more at pruna.ai

Stable diffusion fork for generating tileable outputs using v1.5 model

Fastest, most cost-effective GPT-5 model from OpenAI

Use this fast version of Imagen 4 when speed and cost are more important than quality

A reasoning model trained with reinforcement learning, on par with OpenAI o1

Minimax's first image model, with character reference support

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

The DeepFloyd IF model has been initially released as a non-commercial research-only model. Please make sure you read and abide by the license before using it.

FLUX.1-Schnell LoRA Explorer

An efficient, intelligent, and truly open-source language model

Like Ideogram v2, but faster and cheaper

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting

Segment Anything with prompts

A 7 billion parameter language model from Mistral.

Determines the toxicity of text-to-image prompts; a llama-13b fine-tune. Returns a [SAFETY_RANKING] between 0 (safe) and 10 (toxic)
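
A hedged sketch of calling the classifier and pulling the 0-10 score out of its response; the model slug and the exact response format are assumptions.

```python
# Sketch: scoring a text-to-image prompt and parsing the [SAFETY_RANKING]
# value from the response. Model slug and response format are assumptions.
import re
import replicate

raw = replicate.run(
    "fofr/prompt-classifier",       # assumed model identifier
    input={"prompt": "a serene mountain lake at sunrise"},
)
text = raw if isinstance(raw, str) else "".join(raw)

match = re.search(r"\[SAFETY_RANKING\][:\s]*(\d+)", text)
score = int(match.group(1)) if match else None
print("Safety ranking (0 = safe, 10 = toxic):", score)
```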

A powerful AI model.

A model which generates text in response to an input image and prompt.

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

Official CLIP models, generate CLIP (clip-vit-large-patch14) text & image embeddings
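
As a sketch of what this implies, one could embed an image and a caption and compare them with cosine similarity; the model slug and the input/output shape are assumptions.

```python
# Sketch: image-text similarity in CLIP embedding space. Model slug and
# input/output schema are assumptions; inspect the real response shape.
import numpy as np
import replicate

def clip_embed(value: str) -> np.ndarray:
    out = replicate.run(
        "andreasjansson/clip-features",   # assumed model identifier
        input={"inputs": value},          # assumed input key
    )
    return np.asarray(out[0]["embedding"], dtype=float)  # assumed output shape

image_vec = clip_embed("https://example.com/cat.jpg")    # hypothetical image URL
text_vec = clip_embed("a photo of a cat")

# Cosine similarity between the image and the caption.
similarity = image_vec @ text_vec / (np.linalg.norm(image_vec) * np.linalg.norm(text_vec))
print(f"CLIP similarity: {similarity:.3f}")
```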

The LaMa (Large Mask Inpainting) model is an advanced image inpainting system designed to address the challenges of handling large missing areas, complex geometric structures, and high-resolution images.

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Realistic interior design with text and image inputs

Make stickers with AI. Generates graphics with transparent backgrounds.

FLUX.1-Dev Multi LoRA Explorer

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

Simple image captioning model using CLIP and GPT-2

An experimental model with FLUX Kontext Pro that can combine two input images

Turn a face into a sticker

A 17 billion parameter model with 128 experts

A powerful AI model.

Granite-3.3-8B-Instruct is an 8-billion-parameter, 128K-context-length language model fine-tuned for improved reasoning and instruction-following capabilities.

Inpainting using RunwayML's stable-diffusion-inpainting checkpoint

Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)

A Strong Image Tagging Model with Segment Anything

Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet

Predicts the value of a domain name.

openai/whisper with exposed settings for word_timestamps

Semantic Segmentation

powerful open-source visual language model

A powerful AI model.

Third-party Fooocus Replicate model

Uses pixray to generate an image from text prompt

Nonlinear Activation Free Network for Image Restoration

A deep learning approach to removing the background and adding a new background image

Generate 5s and 10s videos in 720p resolution at 30fps

Generate a new image from an input image with Babes 2.0

✨ Prompts to auto-magically relight your images

Create images of a given character in different poses

Image to image face swapping

Blip 3 / XGen-MM answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Dream Shaper stable diffusion

An opinionated text-to-image model from Black Forest Labs in collaboration with Krea that excels in photorealism. Creates images that avoid the oversaturated "AI look".

Capture a website screenshot

Turn any image into a video

Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.

Fast, affordable version of GPT-4.1

Text-Driven Manipulation of StyleGAN Imagery

Towards Photo-Realistic Image Colorization via Dual Decoders

Create song covers with any RVC v2 trained AI voice from audio files.

Real-Time Open-Vocabulary Object Detection using the xl weights

Speech to speech with any RVC v2 trained AI voice

Deployment of Realistic vision v5.0 with xformers for fast inference

Transfer the style of one image to another

Add a watermark to your videos using the power of Replicate, brought to you by your friends at FullJourney.AI

Best-in-class clothing virtual try-on in the wild (non-commercial use only)

Create music for your content

Bringing Old Photos Back to Life

Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].

Synthesizing High-Resolution Images with Few-Step Inference

General Text Embeddings (GTE) model.

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

Use this ultra version of Imagen 4 when quality matters more than speed and cost

Stable Diffusion on Danbooru images

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

Embed text with Qwen2-7b-Instruct

Modify images with canny edge detection and Deliberate model twitter: @philz1337x

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

A powerful AI model.

Stable diffusion for real-time music generation

Edit images using a prompt. This model extends Qwen-Image's unique text rendering capabilities to image editing tasks, enabling precise text editing

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.

Generate a new image from an input image with Stable Diffusion

Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities

Anything V4.5 Model (Text2Img, Img2Img and Inpainting)

batch inference for dreambooth trainings

FLUX.1-dev with XLabs-AI's realism lora

Professional-grade image upscaling, from Topaz Labs

Make realistic images of real people instantly

SDXL Canny controlnet with LoRA support. Last update: now supports img2img.

Third-party Fooocus Replicate model with preset 'realistic'

A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.

A better alternative to SDXL refiners, providing a lot of quality and detail. Can also be used for inpainting or upscaling.

Edit images with human instructions

A powerful AI model.

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

allenai/Molmo-7B-D-0924 answers questions about images and captions them

The Qwen3 Embedding model series is specifically designed for text embedding and ranking tasks

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files

Fast sdxl with higher quality

Mask prompting based on Grounding DINO & Segment Anything | Integral cog of doiwear.it

Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

CLIP Interrogator for SDXL optimizes text prompts to match a given image

Segments an audio recording based on who is speaking

Base version of Llama 3, a 70 billion parameter language model from Meta.

Fine-tune FLUX.1-dev using ai-toolkit

Demucs is an audio source separator created by Facebook Research.

Modify images using canny edge detection

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Models fine-tuned from Pony-XL series.

Google's state-of-the-art image generation and editing model

Spleeter is Deezer's source separation library, written in Python with pretrained TensorFlow models.

This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package

Amazing photorealism with RealVisXL_V3.0, based on SDXL, trainable

Take a video and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training.

A powerful AI model.

Generate 5s and 10s videos in 1080p resolution

Video Upscaling from Topaz Labs

ControlNet QR Code Generator: Simplify QR code creation for various needs using ControlNet's user-friendly neural interface, making integration a breeze. Just key in the URL!
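
A hedged sketch of "keying in the URL": the model slug and input keys below are assumptions.

```python
# Sketch: generating a stylized, scannable QR code from a URL plus a style
# prompt. Model slug and input keys are assumptions.
import replicate

output = replicate.run(
    "zylim0702/qr_code_controlnet",    # assumed model identifier
    input={
        "url": "https://example.com",  # the link the QR code should encode
        "prompt": "an autumn forest seen from above, soft light",
    },
)
print(output)                          # URL(s) of the generated QR image
```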

Granite-3.1-8B-Instruct is a lightweight, open-source 8B-parameter model designed to excel at instruction-following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Turn your image into a cartoon

Robust Monocular Depth Estimation

Adapt any picture of a face into another image

A llama-3 based moderation and safeguarding language model

Extract the first or last frame from any video file as a high-quality image

Make Emoji with AI.

Segment foreground objects with high resolution and matting, using InSPyReNet

Juggernaut XL v7 Model (Text2Img, Img2Img and Inpainting)

Realistic Inpainting with ControlNET (M-LSD + SEG)

Modify images using depth maps

Generate a new image given any input text with Deliberate v2

CogVLM2: Visual Language Models for Image and Video Understanding

The current model is used for graphics replacement processing

Inpainting || multi-controlnet || single-controlnet || ip-adapter || ip adapter face || ip adapter plus || No ip adapter

Base version of Llama 2 7B, a 7 billion parameter language model

Modify images using depth maps

Create Pixar poster easily with SDXL Pixar.

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions

A powerful AI model.

UPDATE: new upscaling algorithm for much improved image quality. Fermat.app open-source implementation of an efficient ControlNet 1.1 tile for high-quality upscales. Increase the creativity to encourage hallucination.

Text-guided image editing model that preserves original details while making targeted modifications like lighting changes, object removal, and style conversion

Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos

Colorization using a Generative Color Prior for Natural Images

Product advertising image generator

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.

OpenAI's new model excelling at coding, writing, and reasoning.

Ghiblify any image, 10x cheaper/faster than GPT 4o

Modify images using HED maps

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

Realistic Vision V3.0 Inpainting

Thin-Plate Spline Motion Model for Image Animation

Demucs Music Source Separation

Generate a new image from an input image with Edge Of Realism - EOR v2.0

Real-ESRGAN Upscale with AI Face Correction

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Fastest, most cost-effective GPT-4.1 model from OpenAI

A powerful AI model.

Multi-Axis MLP for Image Processing

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community

SD1.5 Canny controlnet with LoRA support.

Photorealism with RealVisXL V3.0 Turbo based on SDXL

Generate a new image given any input text with Realistic Vision V2.0

Create tileable animations with seamless transitions

Transform an empty room into fabulous interior design

Fast, minimal port of DALL·E Mini to PyTorch

A powerful 3D asset generation model

Playground v2 is a diffusion-based text-to-image generative model trained from scratch by the research team at Playground

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

JoJoGAN: One Shot Face Stylization

SDXL v1.0 - A text-to-image generative AI model that creates beautiful images

Real-ESRGAN super-resolution model from ruDALL-E

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Generate consistent characters from a single reference image. Outputs can be in many styles. You can also use inpainting to add your character to an existing image.

Arbitrary Neural Style Transfer

SDXL Canny controlnet with LoRA support.

A powerful AI model.

Image and text recognition

Granite-3.2-8B-Instruct is an 8-billion-parameter, 128K-context-length language model fine-tuned for reasoning and instruction-following capabilities.

Add colours to old images

This is a 3x faster FLUX.1 [dev] model from Black Forest Labs, optimised with pruna with minimal quality loss.

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Use a mask to inpaint the image or generate a prompt based on the mask.

Generate a new image given any input text with DreamShaper V6

Faster version of OpenAI's flagship GPT-5 model

Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.

Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Undi95's FlatDolphinMaid 8x7B Mixtral Merge, GGUF Q5_K_M quantized by TheBloke.

Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5

Monster Labs QrCode ControlNet on top of SD Realistic Vision v5.1

Product advertising image generator using SDXL

snowflake-arctic-embed is a suite of text embedding models that focuses on creating high-quality retrieval models optimized for performance

Latent Consistency Model (LCM): SDXL, distills the original model into a version that requires fewer steps (4 to 8 instead of the original 25 to 50)

Realism XL Model (Text2Img, Img2Img and Inpainting)

Image generation. Added: inpaint_strength, loras_custom_urls

🎨 Fill in masked parts of images with FLUX.1-dev

Stable diffusion fork for generating tileable outputs

Remove background from image

Create a waveform video from audio

Professional edge-guided image generation. Control structure and composition using Canny edge detection

Train your own custom RVC model

Practical face restoration algorithm for *old photos* or *AI-generated faces* (for larger images)

Image Variations with Stable Diffusion

Like Ideogram v2 turbo, but now faster and cheaper

Quickly edit the expression of a face

Base version of Llama 2, a 70 billion parameter language model from Meta.

OpenAI's fast, lightweight reasoning model

A Llama-3.1-8B pretrained model, fine-tuned for content safety classification

high-quality, highly detailed anime style stable-diffusion

Generate a logo using text.

Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets

Fine-tuned to generate awesome app icons, by aistartupkit.com

Generate a new image from an input image with AbsoluteReality v1.0

Third-party Fooocus Replicate model with preset 'anime'

An Adversarial Approach for Picture Colorization

Modify images using normal maps

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

A powerful AI model.

Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.

Generate a new image given any input text with Dreamshaper v7

An EfficientNet for music style classification across 400 styles from the Discogs taxonomy

Animate Your Personalized Text-to-Image Diffusion Models

Use a face to make images. Uses SDXL fine-tuned checkpoints.

Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high-quality SVG images, including logotypes and icons. The model supports a wide range of styles.

The Yi series models are large language models trained from scratch by developers at 01.AI.

A powerful AI model.

InstantID, ControlNets, more base SDXL models, and ByteDance's latest ⚡️SDXL-Lightning⚡️

The fastest Wan 2.2 text-to-image and image-to-video model

🔊 Text-Prompted Generative Audio Model

Blue Pencil XL v2 Model (Text2Img, Img2Img and Inpainting)

Zeroscope V2 XL & 576w

Design Your Hair by Text and Reference Image

BAAI's bge-en-large-v1.5 for embedding text sequences

Train your own custom Stable Diffusion model using a small set of images

Fine-grained Image Captioning with CLIP Reward

RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

Affordable and fast images

Real-ESRGAN Video Upscaler

Generate a new image given any input text with AbsoluteReality v1.0

Implementation of SDXL RealVisXL_V2.0

sd-v2 with diffusers, test version!

OpenAI's high-intelligence chat model

A powerful AI model.

Frame Interpolation for Large Scene Motion

WhisperX transcription with initial_prompt

Professional depth-aware image generation. Edit images while preserving spatial relationships.

Latest model in the Qwen family for chatting with video and image inputs

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view LRMs

Open-weight image variation model. Create new versions while preserving key elements of your original.

GPT-5 with support for structured outputs, web search and custom tools

Animating prompts with stable diffusion

XLabs v3 canny, depth and soft edge controlnets for Flux.1 Dev

Google's latest image generation model in Gemini 2.5

A powerful AI model.

Fast FLUX DEV with Flux ControlNet Canny, ControlNet Depth, ControlNet Line Art, and ControlNet Upscaler. Use just one ControlNet or all of them. LoRAs: HyperFlex LoRA, Add Details LoRA, Realism LoRA

A powerful AI model.

A powerful AI model.

Prompt Parrot generates text2image prompts from a fine-tuned distilgpt2

SD 1.5 trained with +124k MJv4 images by PromptHero

Generate beautiful images with simple prompts

Claude Sonnet 4.5 is the best coding model to date, with significant improvements across the entire development lifecycle

Llama2 13B with embedding output

OpenAI's Flagship GPT model for complex tasks.

Omni-Zero: A diffusion pipeline for zero-shot stylized portrait creation.

An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.

a dreambooth model trained on a diverse set of analog photographs

Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real-world physics.

🥯 ByteDance Seed's Bagel: unified multimodal AI that generates, edits, and understands images in one 7B-parameter model 🥯

multilingual-e5-large-instruct: A multi-language text embedding model with custom query instructions.

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

AI-driven audio enhancement for your audio files, powered by Resemble AI

Fashion Diffusion by Dreamshot

Uses pixray to generate an image from text prompt

Image Super-Resolution

flux_schnell model img2img inference

Age prediction using CLIP - Patched version of `https://replicate.com/andreasjansson/clip-age-predictor` that works with the new version of cog!

Generates game icons. For full use: appiconlab.com

A powerful AI model.

A fast image model with wide artistic range and resolutions up to 4096x4096

Ghiblify your image: ChatGPT-level quality, 10× faster and cheaper.

Multi-controlnet, lora loading, img2img, inpainting

Newest reranker model from BAAI (https://huggingface.co/BAAI/bge-reranker-v2-m3). FP16 inference enabled. Normalize param available.
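
A hedged reranking sketch; the model slug and the input/output schema (query plus a list of documents in, a list of relevance scores out) are assumptions.

```python
# Sketch: reranking candidate passages against a query with a BGE reranker.
# Model slug and input/output schema are assumptions.
import replicate

query = "How do I rotate an image with Pillow?"
docs = [
    "Pillow's Image.rotate() returns a rotated copy of the image.",
    "NumPy arrays can be transposed with ndarray.T.",
    "Use CSS transform: rotate(90deg) to rotate an element.",
]

scores = replicate.run(
    "xenova/bge-reranker-v2-m3",   # assumed model identifier
    input={"query": query, "documents": docs, "normalize": True},
)

# Pair each document with its score and print best-first (assuming one
# relevance score per document, in order).
for score, doc in sorted(zip(list(scores), docs), reverse=True):
    print(f"{score:.3f}  {doc}")
```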

A powerful AI model.

Removes defocus blur in an image

Base version of Llama 2 13B, a 13 billion parameter language model

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.

An experimental FLUX Kontext model that can combine two input images

OmniParser is a screen parsing tool to convert general GUI screen to structured elements.

Ultra-fast, customizable speech-to-text and speaker diarization for noisy, multi-speaker audio. Includes advanced noise reduction, stereo channel support, and flexible audio preprocessing; ideal for call centers, meetings, and podcasts.

stable-diffusion-v2-inpainting

Generate TikTok-style captions powered by Whisper (GPU)

High-precision image upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity AI. X:https://x.com/philz1337x

Sound on: Google's flagship Veo 3 text-to-video model, with audio

A powerful AI model.

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

A powerful AI model.

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

Accelerated variant of Photon prioritizing speed while maintaining quality

Realistic Vision v5.0 Image 2 Image

Morphs vector paths towards a text prompt

Generate t-shirt logos with stable-diffusion

A powerful AI model.

Granite-3.0-8B-Instruct is a lightweight, open-source 8B-parameter model designed to excel at instruction-following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

SDXL fine-tuned on photos of freshly inked tattoos

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Image 4x super-resolution

InstantID: Zero-shot Identity-Preserving Generation in Seconds with ⚡️LCM-LoRA⚡️. Uses AlbedoBase-XL v2.0 as the base model.

Modify images with humans using pose detection

Bria AI's remove background model

ComfyUI with the Flux model

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Fast and high quality lightning model, epiCRealismXL-Lightning Hades

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

A powerful AI model.

Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.

Generate a new image from an input image with DreamShaper V6

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

NeonAI Coqui AI TTS Plugin.

Modify images using semantic segmentation

Ultimate SD Upscale with ControlNet Tile

Blind Face Restoration in the Wild

Advanced text-image comprehension and composition based on InternLM

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.

The Yi series models are large language models trained from scratch by developers at 01.AI.

A powerful AI model.

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition

A 34 billion parameter Llama tuned for coding and conversation

FLUX Kontext max with list input for multiple images

Match facial expression using a driving image using LivePortrait as a base

SOTA object removal: enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use

Multi-stage text-to-video generation

A faster and cheaper version of Seedance 1 Pro

Assess the quality of an image

A language model by Google for tasks like classification, summarization, and more

Flux Content Filter - Check for public figures and copyright concerns

Uses pixray to generate an image from text prompt

Stylized Audio-Driven Single Image Talking Face Animation

Turns your audio/video/images into professional-quality animated videos

Tango 2: Use text prompts to make sound effects

Turn any text into 768-dimensional vectors for search, classification, and AI apps 🧠✨
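
A hedged sketch of the search use case: embed a query and a few documents, then rank by cosine similarity. The model slug and input/output schema are assumptions.

```python
# Sketch: semantic search with a 768-dimensional text embedding model.
# Model slug and input/output schema are assumptions.
import numpy as np
import replicate

def embed(text: str) -> np.ndarray:
    vec = replicate.run(
        "replicate/all-mpnet-base-v2",   # assumed model identifier
        input={"text": text},            # assumed input key
    )
    return np.asarray(vec, dtype=float)

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

docs = ["How to bake sourdough bread", "Intro to gradient descent", "Training a puppy"]
query_vec = embed("machine learning optimisation basics")

# Rank documents by similarity to the query, most similar first.
ranked = sorted(docs, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
print(ranked)
```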

Modify images using canny edges

Use a face to instantly make images. Uses SDXL Lightning checkpoints.

7 billion parameter version of Stability AI's language model

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

dreamshaper-xl-lightning is a Stable Diffusion model that has been fine-tuned on SDXL

Just some good ole BeautifulSoup scraping URL magic. (Some sites don't work because they block scraping, but still useful.)
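
For reference, the kind of scraping this refers to looks roughly like the sketch below; the target URL is hypothetical, and, as noted above, some sites block scraping.

```python
# Sketch: fetch a page and pull out its title and links with BeautifulSoup.
# The target URL is hypothetical; some sites block scraping.
import requests
from bs4 import BeautifulSoup

resp = requests.get("https://example.com", timeout=10)
resp.raise_for_status()

soup = BeautifulSoup(resp.text, "html.parser")
print("Title:", soup.title.string if soup.title else "(none)")

# Collect the href of every anchor tag on the page.
links = [a["href"] for a in soup.find_all("a", href=True)]
print(f"Found {len(links)} links; first few:", links[:5])
```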

FLUX.1 Kontext[dev] image editing model for running lora finetunes

✨ DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

A powerful AI model.

Detect hate speech or toxic comments in tweets/texts

Better than SDXL at both prompt adherence and image quality, by dataautogpt3

2B instruct version of Google's Gemma model

Color match and white balance fixes for images

Quickly change someone's hair style and hair color, powered by FLUX.1 Kontext [pro]

Generate a painting using text.

Generate a new image given any input text with Edge Of Realism - EOR v2.0

A unique fusion that showcases exceptional prompt adherence and semantic understanding; it seems to be a step above base SDXL and a step closer to DALL·E 3 in prompt comprehension

Generates speech from text

VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing

ProteusV0.4: The Style Update - enhances stylistic capabilities, similar to Midjourney's approach, rather than advancing prompt comprehension

LoRA Inference model with Stable Diffusion

A faster and cheaper version of Google's Veo 3 video model, with audio

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video

Updated Qwen3 model for instruction following

RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

120b open-weight language model from OpenAI

CLIP Interrogator (for faster inference)

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models