	---
	base_model:
	- black-forest-labs/FLUX.1-dev
	library_name: diffusers
	license: other
	license_name: flux-1-dev-non-commercial-license
	license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
	pipeline_tag: image-to-image
	tags:
	- ControlNet
	---
	# ⚡ Flux.1-dev: Depth ControlNet ⚡

	This is a depth-map ControlNet for [Flux.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev), developed by the Jasper research team.

	<p align="center">
	<img style="width:700px;" src="examples/showcase.jpg">
	</p>

	# How to use
	This model can be used directly with the `diffusers` library:

	```python
	import torch
	from diffusers.utils import load_image
	from diffusers import FluxControlNetModel
	from diffusers.pipelines import FluxControlNetPipeline

	# Load the ControlNet weights, then the base pipeline with the ControlNet attached
	controlnet = FluxControlNetModel.from_pretrained(
	    "jasperai/Flux.1-dev-Controlnet-Depth",
	    torch_dtype=torch.bfloat16,
	)
	pipe = FluxControlNetPipeline.from_pretrained(
	    "black-forest-labs/FLUX.1-dev",
	    controlnet=controlnet,
	    torch_dtype=torch.bfloat16,
	)
	pipe.to("cuda")

	# Load a control image (a depth map)
	control_image = load_image(
	    "https://huggingface.co/jasperai/Flux.1-dev-Controlnet-Depth/resolve/main/examples/depth.jpg"
	)

	prompt = "a statue of a gnome in a field of purple tulips"

	# PIL reports size as (width, height), hence the index order below
	image = pipe(
	    prompt,
	    control_image=control_image,
	    controlnet_conditioning_scale=0.6,
	    num_inference_steps=28,
	    guidance_scale=3.5,
	    height=control_image.size[1],
	    width=control_image.size[0],
	).images[0]
	image
	```

	<p align="center">
	<img style="width:500px;" src="examples/output.jpg">
	</p>
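The call above passes the control image's own size as `height` and `width`. As a minimal sketch, assuming the pipeline expects dimensions divisible by 16 (an assumption about the FLUX pipeline that may vary between `diffusers` versions), a small helper can round an arbitrary size down before generation. The name `snap_to_multiple` is hypothetical and not part of `diffusers`.

```python
def snap_to_multiple(width: int, height: int, multiple: int = 16) -> tuple[int, int]:
    """Round each dimension down to the nearest multiple, keeping at least one multiple."""
    snapped_w = max(multiple, (width // multiple) * multiple)
    snapped_h = max(multiple, (height // multiple) * multiple)
    return snapped_w, snapped_h


# Example with a hypothetical 1023x769 control image
width, height = snap_to_multiple(1023, 769)
print(width, height)  # 1008 768
```

With a real control image, one option is `control_image = control_image.resize(snap_to_multiple(*control_image.size))` before calling the pipeline, since PIL's `size` and `resize` both use `(width, height)` order.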

	💡 Note: You can compute the conditioning map with, for instance, the `MidasDetector` from the `controlnet_aux` library:

	```python
	from controlnet_aux import MidasDetector
	from diffusers.utils import load_image

	# Load the MiDaS depth estimator and move it to the GPU
	midas = MidasDetector.from_pretrained("lllyasviel/Annotators")
	midas.to("cuda")

	# Load an image
	im = load_image(
	    "https://huggingface.co/jasperai/Flux.1-dev-Controlnet-Depth/resolve/main/examples/output.jpg"
	)

	# Estimate the depth map to use as the control image
	depth = midas(im)
	```
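To illustrate what a depth conditioning map contains, here is a minimal pure-Python sketch that scales raw depth values into the 0-255 range of a grayscale image. It assumes simple min-max normalization, which is a common convention but not necessarily what `MidasDetector` does internally; the detector already returns a ready-to-use image, so this step is illustrative only. The function name `normalize_depth` is hypothetical.

```python
def normalize_depth(depth_rows):
    """Min-max normalize a 2D list of raw depth values to 8-bit integers (0-255)."""
    flat = [value for row in depth_rows for value in row]
    lo, hi = min(flat), max(flat)
    span = hi - lo
    if span == 0:
        # A constant map carries no depth information; return all zeros.
        return [[0 for _ in row] for row in depth_rows]
    return [[round(255 * (value - lo) / span) for value in row] for row in depth_rows]


# Toy 2x2 "depth map" with made-up values
raw = [[0.0, 1.0], [3.0, 5.0]]
print(normalize_depth(raw))  # [[0, 51], [153, 255]]
```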

	# Training
	This model was trained on depth maps computed with [Clipdrop's depth estimator model](https://clipdrop.co/apis/docs/portrait-depth-estimation) as well as open-source depth estimation models such as MiDaS and LeReS.

	# Licence
	This model falls under the [Flux.1-dev licence](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).