Shortcuts

mmagic.models.editors.stylegan2.stylegan2_modules

Module Contents

Classes

_FusedBiasLeakyReLU

Wrap FusedBiasLeakyReLU to support FP16 training.

UpsampleUpFIRDn

UpFIRDn for Upsampling.

DownsampleUpFIRDn

UpFIRDn for Downsampling.

ModulatedConv2d

Modulated Conv2d in StyleGANv2.

ModulatedStyleConv

Modulated Style Convolution.

ModulatedToRGB

To RGB layer.

ConvDownLayer

Convolution and Downsampling layer.

ResBlock

Residual block used in the discriminator of StyleGAN2.

ModMBStddevLayer

Modified MiniBatch Stddev Layer.

Attributes

conv2d

mmagic.models.editors.stylegan2.stylegan2_modules.conv2d[source]
class mmagic.models.editors.stylegan2.stylegan2_modules._FusedBiasLeakyReLU(num_channels: int, negative_slope: float = 0.2, scale: float = 2 ** 0.5)[source]

Bases: mmcv.ops.fused_bias_leakyrelu.FusedBiasLeakyReLU

Wrap FusedBiasLeakyReLU to support FP16 training.

forward(x)[source]

Forward function.

Parameters

x (Tensor) – Input feature map with shape of (N, C, …).

Returns

Output feature map.

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.UpsampleUpFIRDn(kernel, factor=2)[source]

Bases: mmengine.model.BaseModule

UpFIRDn for Upsampling.

This module is used in the to_rgb layers in StyleGAN2 for upsampling the images.

Parameters
  • kernel (Array) – Blur kernel/filter used in UpFIRDn.

  • factor (int, optional) – Upsampling factor. Defaults to 2.

forward(x)[source]

Forward function.

Parameters

x (Tensor) – Input feature map with shape of (N, C, H, W).

Returns

Output feature map.

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.DownsampleUpFIRDn(kernel, factor=2)[source]

Bases: mmengine.model.BaseModule

UpFIRDn for Downsampling.

This module is mentioned in StyleGAN2 for dowampling the feature maps.

Parameters
  • kernel (Array) – Blur kernel/filter used in UpFIRDn.

  • factor (int, optional) – Downsampling factor. Defaults to 2.

forward(input)[source]

Forward function.

Parameters

input (Tensor) – Input feature map with shape of (N, C, H, W).

Returns

Output feature map.

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.ModulatedConv2d(in_channels, out_channels, kernel_size, style_channels, demodulate=True, upsample=False, downsample=False, blur_kernel=[1, 3, 3, 1], equalized_lr_cfg=dict(mode='fan_in', lr_mul=1.0, gain=1.0), style_mod_cfg=dict(bias_init=1.0), style_bias=0.0, padding=None, eps=1e-08, fp16_enabled=False)[source]

Bases: mmengine.model.BaseModule

Modulated Conv2d in StyleGANv2.

This module implements the modulated convolution layers proposed in StyleGAN2. Details can be found in Analyzing and Improving the Image Quality of StyleGAN, CVPR2020.

Parameters
  • in_channels (int) – Input channels.

  • out_channels (int) – Output channels.

  • kernel_size (int) – Kernel size, same as nn.Con2d.

  • style_channels (int) – Channels for the style codes.

  • demodulate (bool, optional) – Whether to adopt demodulation. Defaults to True.

  • upsample (bool, optional) – Whether to adopt upsampling in features. Defaults to False.

  • downsample (bool, optional) – Whether to adopt downsampling in features. Defaults to False.

  • blur_kernel (list[int], optional) – Blurry kernel. Defaults to [1, 3, 3, 1].

  • equalized_lr_cfg (dict | None, optional) – Configs for equalized lr. Defaults to dict(mode=’fan_in’, lr_mul=1., gain=1.).

  • style_mod_cfg (dict, optional) – Configs for style modulation module. Defaults to dict(bias_init=1.).

  • style_bias (float, optional) – Bias value for style code. Defaults to 0..

  • eps (float, optional) – Epsilon value to avoid computation error. Defaults to 1e-8.

forward(x, style, input_gain=None)[source]
class mmagic.models.editors.stylegan2.stylegan2_modules.ModulatedStyleConv(in_channels, out_channels, kernel_size, style_channels, upsample=False, blur_kernel=[1, 3, 3, 1], demodulate=True, style_mod_cfg=dict(bias_init=1.0), style_bias=0.0, fp16_enabled=False, conv_clamp=256, fixed_noise=False)[source]

Bases: mmengine.model.BaseModule

Modulated Style Convolution.

In this module, we integrate the modulated conv2d, noise injector and activation layers into together.

Parameters
  • in_channels (int) – Input channels.

  • out_channels (int) – Output channels.

  • kernel_size (int) – Kernel size, same as nn.Con2d.

  • style_channels (int) – Channels for the style codes.

  • demodulate (bool, optional) – Whether to adopt demodulation. Defaults to True.

  • upsample (bool, optional) – Whether to adopt upsampling in features. Defaults to False.

  • downsample (bool, optional) – Whether to adopt downsampling in features. Defaults to False.

  • blur_kernel (list[int], optional) – Blurry kernel. Defaults to [1, 3, 3, 1].

  • equalized_lr_cfg (dict | None, optional) – Configs for equalized lr. Defaults to dict(mode=’fan_in’, lr_mul=1., gain=1.).

  • style_mod_cfg (dict, optional) – Configs for style modulation module. Defaults to dict(bias_init=1.).

  • style_bias (float, optional) – Bias value for style code. Defaults to 0..

  • fp16_enabled (bool, optional) – Whether to use fp16 training in this module. Defaults to False.

  • conv_clamp (float, optional) – Clamp the convolutional layer results to avoid gradient overflow. Defaults to 256.0.

forward(x, style, noise=None, add_noise=True, return_noise=False)[source]

Forward Function.

Parameters
  • x ([Tensor) – Input features with shape of (N, C, H, W).

  • style (Tensor) – Style latent with shape of (N, C).

  • noise (Tensor, optional) – Noise for injection. Defaults to None.

  • add_noise (bool, optional) – Whether apply noise injection to feature. Defaults to True.

  • return_noise (bool, optional) – Whether to return noise tensors. Defaults to False.

Returns

Output features with shape of (N, C, H, W)

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.ModulatedToRGB(in_channels, style_channels, out_channels=3, upsample=True, blur_kernel=[1, 3, 3, 1], style_mod_cfg=dict(bias_init=1.0), style_bias=0.0, fp16_enabled=False, conv_clamp=256, out_fp32=True)[source]

Bases: mmengine.model.BaseModule

To RGB layer.

This module is designed to output image tensor in StyleGAN2.

Parameters
  • in_channels (int) – Input channels.

  • style_channels (int) – Channels for the style codes.

  • out_channels (int, optional) – Output channels. Defaults to 3.

  • upsample (bool, optional) – Whether to adopt upsampling in features. Defaults to False.

  • blur_kernel (list[int], optional) – Blurry kernel. Defaults to [1, 3, 3, 1].

  • style_mod_cfg (dict, optional) – Configs for style modulation module. Defaults to dict(bias_init=1.).

  • style_bias (float, optional) – Bias value for style code. Defaults to 0..

  • fp16_enabled (bool, optional) – Whether to use fp16 training in this module. Defaults to False.

  • conv_clamp (float, optional) – Clamp the convolutional layer results to avoid gradient overflow. Defaults to 256.0.

  • out_fp32 (bool, optional) – Whether to convert the output feature map to torch.float32. Defaults to True.

forward(x, style, skip=None)[source]

Forward Function.

Parameters
  • x ([Tensor) – Input features with shape of (N, C, H, W).

  • style (Tensor) – Style latent with shape of (N, C).

  • skip (Tensor, optional) – Tensor for skip link. Defaults to None.

Returns

Output features with shape of (N, C, H, W)

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.ConvDownLayer(in_channels, out_channels, kernel_size, downsample=False, blur_kernel=[1, 3, 3, 1], bias=True, act_cfg=dict(type='fused_bias'), fp16_enabled=False, conv_clamp=256.0)[source]

Bases: torch.nn.Sequential

Convolution and Downsampling layer.

Parameters
  • in_channels (int) – Input channels.

  • out_channels (int) – Output channels.

  • kernel_size (int) – Kernel size, same as nn.Con2d.

  • downsample (bool, optional) – Whether to adopt downsampling in features. Defaults to False.

  • blur_kernel (list[int], optional) – Blurry kernel. Defaults to [1, 3, 3, 1].

  • bias (bool, optional) – Whether to use bias parameter. Defaults to True.

  • act_cfg (dict, optional) – Activation configs. Defaults to dict(type=’fused_bias’).

  • fp16_enabled (bool, optional) – Whether to use fp16 training in this module. Defaults to False.

  • conv_clamp (float, optional) – Clamp the convolutional layer results to avoid gradient overflow. Defaults to 256.0.

forward(x)[source]
class mmagic.models.editors.stylegan2.stylegan2_modules.ResBlock(in_channels, out_channels, blur_kernel=[1, 3, 3, 1], fp16_enabled=False, convert_input_fp32=True)[source]

Bases: mmengine.model.BaseModule

Residual block used in the discriminator of StyleGAN2.

Parameters
  • in_channels (int) – Input channels.

  • out_channels (int) – Output channels.

  • kernel_size (int) – Kernel size, same as nn.Con2d.

  • fp16_enabled (bool, optional) – Whether to use fp16 training in this module. Defaults to False.

  • convert_input_fp32 (bool, optional) – Whether to convert input type to fp32 if not fp16_enabled. This argument is designed to deal with the cases where some modules are run in FP16 and others in FP32. Defaults to True.

forward(input)[source]

Forward function.

Parameters

input (Tensor) – Input feature map with shape of (N, C, H, W).

Returns

Output feature map.

Return type

Tensor

class mmagic.models.editors.stylegan2.stylegan2_modules.ModMBStddevLayer(group_size=4, channel_groups=1, sync_std=False, sync_groups=None, eps=1e-08)[source]

Bases: mmengine.model.BaseModule

Modified MiniBatch Stddev Layer.

This layer is modified from MiniBatchStddevLayer used in PGGAN. In StyleGAN2, the authors add a new feature, channel_groups, into this layer.

Note that to accelerate the training procedure, we also add a new feature of sync_std to achieve multi-nodes/machine training. This feature is still in beta version and we have tested it on 256 scales.

Parameters
  • group_size (int, optional) – The size of groups in batch dimension. Defaults to 4.

  • channel_groups (int, optional) – The size of groups in channel dimension. Defaults to 1.

  • sync_std (bool, optional) – Whether to use synchronized std feature. Defaults to False.

  • sync_groups (int | None, optional) – The size of groups in node dimension. Defaults to None.

  • eps (float, optional) – Epsilon value to avoid computation error. Defaults to 1e-8.

forward(x)[source]

Forward function.

Parameters

x (Tensor) – Input feature map with shape of (N, C, H, W).

Returns

Output feature map with shape of (N, C+1, H, W).

Return type

Tensor

Read the Docs v: latest
Versions
latest
stable
0.x
Downloads
pdf
epub
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.