[Inference.Core] MiDaS Depth Map¶
Documentation¶
- Class name:
Inference_Core_MiDaS-DepthMapPreprocessor
- Category:
ControlNet Preprocessors/Normal and Depth Estimators
- Output node:
False
The MiDaS Depth Map Preprocessor node transforms input images into depth maps using the MiDaS monocular depth estimation model. The resulting per-pixel depth estimates are typically used to condition a depth ControlNet, and are also useful for 3D modeling, augmented reality, and similar applications.
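The node wraps the MidasDetector from the controlnet_aux package (see the source code below). As a minimal standalone sketch of the same depth estimation outside ComfyUI, assuming controlnet_aux is installed and using the lllyasviel/Annotators checkpoint from that package's examples (the checkpoint name is an assumption, not taken from this page):

from PIL import Image
from controlnet_aux import MidasDetector

# Checkpoint repo is an assumption based on the controlnet_aux examples.
midas = MidasDetector.from_pretrained("lllyasviel/Annotators")

img = Image.open("input.png").convert("RGB")
depth = midas(img)  # returns a PIL image containing the estimated depth map
depth.save("depth.png")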
Input types¶
Required¶
image
- The 'image' parameter is the input image from which the depth map is estimated.
- Comfy dtype:
IMAGE
- Python dtype:
torch.Tensor
Optional¶
a
- The 'a' parameter scales the z-component used when surface normals are derived from the depth gradient, affecting how pronounced depth and texture appear in the output (a simplified sketch of this computation follows the parameter list).
- Comfy dtype:
FLOAT
- Python dtype:
float
bg_threshold
- The 'bg_threshold' parameter sets the depth value below which pixels are treated as background, suppressing background noise so foreground elements stand out.
- Comfy dtype:
FLOAT
- Python dtype:
float
resolution
- The 'resolution' parameter sets the resolution at which depth estimation runs, determining the level of detail and the size of the generated depth map.
- Comfy dtype:
INT
- Python dtype:
int
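For intuition about how 'a' and 'bg_threshold' behave, the ControlNet reference implementation linked in the source code below derives surface normals from the depth gradient roughly as follows. This is a simplified sketch of that reference code, not this node's exact implementation:

import cv2
import numpy as np

def depth_to_normals(depth, a=np.pi * 2.0, bg_th=0.1):
    # depth: float32 array normalized to [0, 1]
    x = cv2.Sobel(depth, cv2.CV_32F, 1, 0, ksize=3)  # horizontal depth gradient
    y = cv2.Sobel(depth, cv2.CV_32F, 0, 1, ksize=3)  # vertical depth gradient
    # Pixels whose depth falls below bg_th are treated as background:
    # their gradients are zeroed so they contribute no surface detail.
    x[depth < bg_th] = 0
    y[depth < bg_th] = 0
    # 'a' supplies the z-component; larger values flatten the normals.
    z = np.ones_like(x) * a
    normal = np.stack([x, y, z], axis=2)
    normal /= np.sum(normal ** 2.0, axis=2, keepdims=True) ** 0.5
    return normal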
Output types¶
image
- The output is a depth map image, providing a pixel-wise depth estimation of the input image.
- Comfy dtype:
IMAGE
- Python dtype:
torch.Tensor
Usage tips¶
- Infra type:
GPU
- Common nodes: unknown
Source code¶
import numpy as np

import comfy.model_management as model_management
from ..utils import common_annotator_call, create_node_input_types


class MIDAS_Depth_Map_Preprocessor:
    @classmethod
    def INPUT_TYPES(s):
        return create_node_input_types(
            a=("FLOAT", {"default": np.pi * 2.0, "min": 0.0, "max": np.pi * 5.0, "step": 0.05}),
            bg_threshold=("FLOAT", {"default": 0.1, "min": 0, "max": 1, "step": 0.05})
        )

    RETURN_TYPES = ("IMAGE",)
    FUNCTION = "execute"
    CATEGORY = "ControlNet Preprocessors/Normal and Depth Estimators"

    def execute(self, image, a, bg_threshold, resolution=512, **kwargs):
        from controlnet_aux.midas import MidasDetector

        # Ref: https://github.com/lllyasviel/ControlNet/blob/main/gradio_depth2image.py
        # Load the MiDaS detector onto the active torch device for this call.
        model = MidasDetector.from_pretrained().to(model_management.get_torch_device())
        # Run the annotator over the image batch; 'bg_threshold' is forwarded
        # to the detector as 'bg_th'.
        out = common_annotator_call(model, image, resolution=resolution, a=a, bg_th=bg_threshold)
        # Drop the model immediately so it is not held between executions.
        del model
        return (out,)
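A hypothetical standalone invocation of the node, assuming a running ComfyUI environment (so that comfy.model_management resolves) and a BHWC float image batch in [0, 1], which is ComfyUI's IMAGE convention:

import math
import torch

image_batch = torch.rand(1, 512, 512, 3)  # placeholder batch: one 512x512 RGB image

node = MIDAS_Depth_Map_Preprocessor()
(depth,) = node.execute(image_batch, a=math.pi * 2.0, bg_threshold=0.1, resolution=512)
print(depth.shape)  # depth map batch in the same BHWC layout

Because the detector is instantiated inside execute and deleted before returning, each invocation pays the model load cost again, but no VRAM is held between calls.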