► Keras 3 API 文档 / 层 API / 卷积层 / DepthwiseConv2D 层

DepthwiseConv2D 层

`DepthwiseConv2D` 类

keras.layers.DepthwiseConv2D(
    kernel_size,
    strides=(1, 1),
    padding="valid",
    depth_multiplier=1,
    data_format=None,
    dilation_rate=(1, 1),
    activation=None,
    use_bias=True,
    depthwise_initializer="glorot_uniform",
    bias_initializer="zeros",
    depthwise_regularizer=None,
    bias_regularizer=None,
    activity_regularizer=None,
    depthwise_constraint=None,
    bias_constraint=None,
    **kwargs
)

2D 深度可分离卷积层。

深度卷积是一种卷积类型，其中每个输入通道都与不同的核（称为深度核）进行卷积。你可以将深度卷积理解为深度可分离卷积的第一步。

它通过以下步骤实现：

将输入分割成单独的通道。
将每个通道与具有 depth_multiplier 输出通道的单独深度核进行卷积。
沿通道轴连接卷积输出。

与常规 2D 卷积不同，深度可分离卷积不会混合不同输入通道的信息。

depth_multiplier 参数决定了一个输入通道应用多少个过滤器。因此，它控制了在深度可分离步骤中每个输入通道生成的输出通道的数量。

参数

kernel_size: int 或 2 个整数的元组/列表，指定深度可分离卷积窗口的大小。
strides: int 或 2 个整数的元组/列表，指定深度可分离卷积的步长。strides > 1 与 dilation_rate > 1 不兼容。
padding: 字符串，"valid" 或 "same"（不区分大小写）。"valid" 表示不进行填充。"same" 会在输入的左/右或上/下均匀填充。当 padding="same" 且 strides=1 时，输出与输入具有相同的大小。
depth_multiplier：每个输入通道的深度卷积输出通道数。深度卷积输出通道的总数将等于 input_channel * depth_multiplier。
data_format: 字符串，"channels_last" 或 "channels_first"。输入中维度的顺序。"channels_last" 对应于形状为 (batch, height, width, channels) 的输入，而 "channels_first" 对应于形状为 (batch, channels, height, width) 的输入。它默认为你的 Keras 配置文件 ~/.keras/keras.json 中 image_data_format 的值。如果你从未设置过，则为 "channels_last"。
dilation_rate：整数或包含 2 个整数的元组/列表，指定用于扩张卷积的扩张率。
activation：激活函数。如果为 None，则不应用激活。
use_bias：bool，如果为 True，则会将偏置添加到输出中。
depthwise_initializer: 卷积核的初始化器。如果为 None，将使用默认初始化器 ("glorot_uniform")。
bias_initializer：偏置向量的初始化器。如果为 None，将使用默认初始化器（"zeros"）。
depthwise_regularizer: 卷积核的可选正则化器。
bias_regularizer：偏置向量的可选正则化器。
activity_regularizer：输出的可选正则化函数。
depthwise_constraint: 应用于经 Optimizer 更新后核的可选投影函数（例如，用于实现层权重的范数约束或值约束）。该函数必须将未投影的变量作为输入，并且必须返回投影的变量（其必须具有相同的形状）。在进行异步分布式训练时，使用约束是不安全的。
bias_constraint：可选的投影函数，用于在 Optimizer 更新偏置后应用于偏置。

输入形状

如果 data_format="channels_last"：形状为 (batch_size, height, width, channels) 的 4D 张量
如果 data_format="channels_first"：形状为 (batch_size, channels, height, width) 的 4D 张量

输出形状

如果 data_format="channels_last"：一个 4D 张量，形状为：(batch_size, new_height, new_width, channels * depth_multiplier)
如果 data_format="channels_first"：一个 4D 张量，形状为：(batch_size, channels * depth_multiplier, new_height, new_width)

一个 4D 张量，表示 activation(depthwise_conv2d(inputs, kernel) + bias)。

引发

ValueError：当 strides > 1 和 dilation_rate > 1 都存在时。

示例

>>> x = np.random.rand(4, 10, 10, 12)
>>> y = keras.layers.DepthwiseConv2D(kernel_size=3, activation='relu')(x)
>>> print(y.shape)
(4, 8, 8, 12)

DepthwiseConv2D 层

DepthwiseConv2D 类

DepthwiseConv2D 层

DepthwiseConv2D 类

`DepthwiseConv2D` 类