`mmagic.models.editors.stylegan1`¶

Package Contents¶

Classes¶

`StyleGAN1`	Implementation of `A Style-Based Generator Architecture for Generative
`StyleGAN1Discriminator`	StyleGAN1 Discriminator.
`StyleGAN1Generator`	StyleGAN1 Generator.
`Blur`	Blur module.
`ConstantInput`	Constant Input.
`EqualLinearActModule`	Equalized LR Linear Module with Activation Layer.
`NoiseInjection`	Noise Injection Module.

Functions¶

`make_kernel`(k)
`get_mean_latent`(generator[, num_samples, bs_per_repeat])	Get mean latent of W space in Style-based GANs.
`style_mixing`(generator, n_source, n_target[, ...])

class mmagic.models.editors.stylegan1.StyleGAN1(generator: ModelType, discriminator: Optional[ModelType] = None, data_preprocessor: Optional[Union[dict, mmengine.Config]] = None, style_channels: int = 512, nkimgs_per_scale: dict = {}, interp_real: Optional[dict] = None, transition_kimgs: int = 600, prev_stage: int = 0, ema_config: Optional[Dict] = None)[源代码]¶

Bases: mmagic.models.editors.pggan.ProgressiveGrowingGAN

Implementation of A Style-Based Generator Architecture for Generative Adversarial Networks.

<https://openaccess.thecvf.com/content_CVPR_2019/html/Karras_A_Style-Based_Generator_Architecture_for_Generative_Adversarial_Networks_CVPR_2019_paper.html>`_ # noqa (StyleGANv1). This class is inherited from ProgressiveGrowingGAN to support progressive training.

Detailed architecture can be found in StyleGAN1Generator and StyleGAN1Discriminator

参数

generator (ModelType) – The config or model of the generator.
discriminator (Optional[ModelType]) – The config or model of the discriminator. Defaults to None.
data_preprocessor (Optional[Union[dict, Config]]) – The pre-process config or DataPreprocessor.
style_channels (int) – The number of channels for style code. Defaults to 128.
nkimgs_per_scale (dict) – The number of images need for each resolution’s training. Defaults to {}.
intep_real (dict, optional) – The config of interpolation method for real images. If not passed, bilinear interpolation with align_corners will be used. Defaults to None.
transition_kimgs (int, optional) – The number of images during used to transit from the previous torgb layer to newer torgb layer. Defaults to 600.
prev_stage (int, optional) – The resolution of previous stage. Used for resume training. Defaults to 0.
ema_config (Optional[Dict]) – The config for generator’s exponential moving average setting. Defaults to None.

disc_loss(disc_pred_fake: torch.Tensor, disc_pred_real: torch.Tensor, fake_data: torch.Tensor, real_data: torch.Tensor) → Tuple[torch.Tensor, dict]¶

Get disc loss. StyleGANv1 use non-saturating gan loss and R1 gradient penalty. loss to train the discriminator.

参数

disc_pred_fake (Tensor) – Discriminator’s prediction of the fake images.
disc_pred_real (Tensor) – Discriminator’s prediction of the real images.
fake_data (Tensor) – Generated images, used to calculate gradient penalty.
real_data (Tensor) – Real images, used to calculate gradient penalty.

返回

Loss value and a dict of log variables.

返回类型

Tuple[Tensor, dict]

gen_loss(disc_pred_fake: torch.Tensor) → Tuple[torch.Tensor, dict]¶

Generator loss for PGGAN. PGGAN use WGAN’s loss to train the generator.

参数: disc_pred_fake (Tensor) – Discriminator’s prediction of the fake images.
返回: Loss value and a dict of log variables.
返回类型: Tuple[Tensor, dict]

class mmagic.models.editors.stylegan1.StyleGAN1Discriminator(in_size, blur_kernel=[1, 2, 1], mbstd_cfg=dict(group_size=4))[源代码]¶

Bases: mmengine.model.BaseModule

StyleGAN1 Discriminator.

The architecture of this discriminator is proposed in StyleGAN1. More details can be found in: A Style-Based Generator Architecture for Generative Adversarial Networks CVPR2019.

参数

in_size (int) – The input size of images.
blur_kernel (list, optional) – The blurry kernel. Defaults to [1, 2, 1].
mbstd_cfg (dict, optional) – Configs for minibatch-stddev layer. Defaults to dict(group_size=4).

forward(input, transition_weight=1.0, curr_scale=- 1)¶

Forward function.

参数

input (torch.Tensor) – Input image tensor.
transition_weight (float, optional) – The weight used in resolution transition. Defaults to 1..
curr_scale (int, optional) – The resolution scale of image tensor. -1 means the max resolution scale of the StyleGAN1. Defaults to -1.

返回

Predict score for the input image.

返回类型

torch.Tensor

class mmagic.models.editors.stylegan1.StyleGAN1Generator(out_size, style_channels, num_mlps=8, blur_kernel=[1, 2, 1], lr_mlp=0.01, default_style_mode='mix', eval_style_mode='single', mix_prob=0.9)[源代码]¶

Bases: mmengine.model.BaseModule

StyleGAN1 Generator.

In StyleGAN1, we use a progressive growing architecture composing of a style mapping module and number of convolutional style blocks. More details can be found in: A Style-Based Generator Architecture for Generative Adversarial Networks CVPR2019.

参数

out_size (int) – The output size of the StyleGAN1 generator.
style_channels (int) – The number of channels for style code.
num_mlps (int, optional) – The number of MLP layers. Defaults to 8.
blur_kernel (list, optional) – The blurry kernel. Defaults to [1, 2, 1].
lr_mlp (float, optional) – The learning rate for the style mapping layer. Defaults to 0.01.
default_style_mode (str, optional) – The default mode of style mixing. In training, we adopt mixing style mode in default. However, in the evaluation, we use ‘single’ style mode. [‘mix’, ‘single’] are currently supported. Defaults to ‘mix’.
eval_style_mode (str, optional) – The evaluation mode of style mixing. Defaults to ‘single’.
mix_prob (float, optional) – Mixing probability. The value should be in range of [0, 1]. Defaults to 0.9.

train(mode=True)¶

Sets the module in training mode.

This has any effect only on certain modules. See documentations of particular modules for details of their behaviors in training/evaluation mode, if they are affected, e.g. Dropout, BatchNorm, etc.

参数: mode (bool) – whether to set training mode (True) or evaluation mode (False). Default: True.
返回: self
返回类型: Module

make_injected_noise()¶

make noises that will be injected into feature maps.

返回: List of layer-wise noise tensor.
返回类型: list[Tensor]

get_mean_latent(num_samples=4096, **kwargs)¶

Get mean latent of W space in this generator.

参数: num_samples (int, optional) – Number of sample times. Defaults to 4096.
返回: Mean latent of this generator.
返回类型: Tensor

style_mixing(n_source, n_target, inject_index=1, truncation_latent=None, truncation=0.7, curr_scale=- 1, transition_weight=1)¶

forward(styles, num_batches=- 1, return_noise=False, return_latents=False, inject_index=None, truncation=1, truncation_latent=None, input_is_latent=False, injected_noise=None, randomize_noise=True, transition_weight=1.0, curr_scale=- 1)¶

Forward function.

This function has been integrated with the truncation trick. Please refer to the usage of truncation and truncation_latent.

参数

styles (torch.Tensor | list[torch.Tensor] | callable | None) – In StyleGAN1, you can provide noise tensor or latent tensor. Given a list containing more than one noise or latent tensors, style mixing trick will be used in training. Of course, You can directly give a batch of noise through a torch.Tensor or offer a callable function to sample a batch of noise data. Otherwise, the None indicates to use the default noise sampler.
num_batches (int, optional) – The number of batch size. Defaults to 0.
return_noise (bool, optional) – If True, noise_batch will be returned in a dict with fake_img. Defaults to False.
return_latents (bool, optional) – If True, latent will be returned in a dict with fake_img. Defaults to False.
inject_index (int | None, optional) – The index number for mixing style codes. Defaults to None.
truncation (float, optional) – Truncation factor. Give value less than 1., the truncation trick will be adopted. Defaults to 1.
truncation_latent (torch.Tensor, optional) – Mean truncation latent. Defaults to None.
input_is_latent (bool, optional) – If True, the input tensor is the latent tensor. Defaults to False.
injected_noise (torch.Tensor | None, optional) – Given a tensor, the random noise will be fixed as this input injected noise. Defaults to None.
randomize_noise (bool, optional) – If False, images are sampled with the buffered noise tensor injected to the style conv block. Defaults to True.
transition_weight (float, optional) – The weight used in resolution transition. Defaults to 1..
curr_scale (int, optional) – The resolution scale of generated image tensor. -1 means the max resolution scale of the StyleGAN1. Defaults to -1.

返回

Generated image tensor or dictionary containing more data.

返回类型

torch.Tensor | dict

class mmagic.models.editors.stylegan1.Blur(kernel, pad, upsample_factor=1)[源代码]¶

Bases: mmengine.model.BaseModule

Blur module.

This module is adopted rightly after upsampling operation in StyleGAN2.

参数

kernel (Array) – Blur kernel/filter used in UpFIRDn.
pad (list[int]) – Padding for features.
upsample_factor (int, optional) – Upsampling factor. Defaults to 1.

forward(x)¶

Forward function.

参数: x (Tensor) – Input feature map with shape of (N, C, H, W).
返回: Output feature map.
返回类型: Tensor

class mmagic.models.editors.stylegan1.ConstantInput(channel, size=4)[源代码]¶

Bases: mmengine.model.BaseModule

Constant Input.

In StyleGAN2, they substitute the original head noise input with such a constant input module.

参数

channel (int) – Channels for the constant input tensor.
size (int, optional) – Spatial size for the constant input. Defaults to 4.

forward(x)¶

Forward function.

参数: x (Tensor) – Input feature map with shape of (N, C, …).
返回: Output feature map.
返回类型: Tensor

class mmagic.models.editors.stylegan1.EqualLinearActModule(*args, equalized_lr_cfg=dict(gain=1.0, lr_mul=1.0), bias=True, bias_init=0.0, act_cfg=None, **kwargs)[源代码]¶

Bases: mmengine.model.BaseModule

Equalized LR Linear Module with Activation Layer.

This module is modified from EqualizedLRLinearModule defined in PGGAN. The major features updated in this module is adding support for activation layers used in StyleGAN2.

参数

equalized_lr_cfg (dict | None, optional) – Config for equalized lr. Defaults to dict(gain=1., lr_mul=1.).
bias (bool, optional) – Whether to use bias item. Defaults to True.
bias_init (float, optional) – The value for bias initialization. Defaults to 0..
act_cfg (dict | None, optional) – Config for activation layer. Defaults to None.

forward(x)¶

Forward function.

参数: x (Tensor) – Input feature map with shape of (N, C, …).
返回: Output feature map.
返回类型: Tensor

class mmagic.models.editors.stylegan1.NoiseInjection(noise_weight_init=0.0, fixed_noise=False)[源代码]¶

Bases: mmengine.model.BaseModule

Noise Injection Module.

In StyleGAN2, they adopt this module to inject spatial random noise map in the generators.

参数

noise_weight_init (float, optional) – Initialization weight for noise injection. Defaults to 0..
fixed_noise (bool, optional) – Whether to inject a fixed noise. Defaults
False. (to) –

forward(image, noise=None, return_noise=False)¶

Forward Function.

参数

image (Tensor) – Spatial features with a shape of (N, C, H, W).
noise (Tensor, optional) – Noises from the outside. Defaults to None.
return_noise (bool, optional) – Whether to return noise tensor. Defaults to False.

返回

Output features.

返回类型

Tensor

mmagic.models.editors.stylegan1.make_kernel(k)[源代码]¶

mmagic.models.editors.stylegan1.get_mean_latent(generator, num_samples=4096, bs_per_repeat=1024)[源代码]¶

Get mean latent of W space in Style-based GANs.

参数

generator (BaseModule) – Generator of a Style-based GAN.
num_samples (int, optional) – Number of sample times. Defaults to 4096.
bs_per_repeat (int, optional) – Batch size of noises per sample. Defaults to 1024.

返回

Mean latent of this generator.

返回类型

Tensor

mmagic.models.editors.stylegan1.style_mixing(generator, n_source, n_target, inject_index=1, truncation_latent=None, truncation=0.7, style_channels=512, **kwargs)[源代码]¶

mmagic.models.editors.stylegan1¶

Package Contents¶

Classes¶

Functions¶

`mmagic.models.editors.stylegan1`¶