► Keras 2 API 文档 / 层 API / 池化层 / MaxPooling2D 层

MaxPooling2D 层

`MaxPooling2D` 类

tf_keras.layers.MaxPooling2D(
    pool_size=(2, 2), strides=None, padding="valid", data_format=None, **kwargs
)

用于 2D 空间数据的最大池化操作。

通过对输入在每个通道上取输入窗口（由 pool_size 定义大小）的最大值，来下采样输入的空间维度（高度和宽度）。窗口沿着每个维度以 strides 的步长移动。

当使用 "valid" 填充选项时，输出的空间形状（行数或列数）为：output_shape = math.floor((input_shape - pool_size) / strides) + 1（当 input_shape >= pool_size 时）

当使用 "same" 填充选项时，结果输出形状为：output_shape = math.floor((input_shape - 1) / strides) + 1

例如，对于 strides=(1, 1) 和 padding="valid"

>>> x = tf.constant([[1., 2., 3.],
...                  [4., 5., 6.],
...                  [7., 8., 9.]])
>>> x = tf.reshape(x, [1, 3, 3, 1])
>>> max_pool_2d = tf.keras.layers.MaxPooling2D(pool_size=(2, 2),
...    strides=(1, 1), padding='valid')
>>> max_pool_2d(x)
<tf.Tensor: shape=(1, 2, 2, 1), dtype=float32, numpy=
  array([[[[5.],
           [6.]],
          [[8.],
           [9.]]]], dtype=float32)>

例如，对于 strides=(2, 2) 和 padding="valid"

>>> x = tf.constant([[1., 2., 3., 4.],
...                  [5., 6., 7., 8.],
...                  [9., 10., 11., 12.]])
>>> x = tf.reshape(x, [1, 3, 4, 1])
>>> max_pool_2d = tf.keras.layers.MaxPooling2D(pool_size=(2, 2),
...    strides=(2, 2), padding='valid')
>>> max_pool_2d(x)
<tf.Tensor: shape=(1, 1, 2, 1), dtype=float32, numpy=
  array([[[[6.],
           [8.]]]], dtype=float32)>

用法 # 示例

>>> input_image = tf.constant([[[[1.], [1.], [2.], [4.]],
...                            [[2.], [2.], [3.], [2.]],
...                            [[4.], [1.], [1.], [1.]],
...                            [[2.], [2.], [1.], [4.]]]])
>>> output = tf.constant([[[[1], [0]],
...                       [[0], [1]]]])
>>> model = tf.keras.models.Sequential()
>>> model.add(tf.keras.layers.MaxPooling2D(pool_size=(2, 2),
...    input_shape=(4, 4, 1)))
>>> model.compile('adam', 'mean_squared_error')
>>> model.predict(input_image, steps=1)
array([[[[2.],
         [4.]],
        [[4.],
         [4.]]]], dtype=float32)

例如，对于 stride=(1, 1) 和 padding="same"

>>> x = tf.constant([[1., 2., 3.],
...                  [4., 5., 6.],
...                  [7., 8., 9.]])
>>> x = tf.reshape(x, [1, 3, 3, 1])
>>> max_pool_2d = tf.keras.layers.MaxPooling2D(pool_size=(2, 2),
...    strides=(1, 1), padding='same')
>>> max_pool_2d(x)
<tf.Tensor: shape=(1, 3, 3, 1), dtype=float32, numpy=
  array([[[[5.],
           [6.],
           [6.]],
          [[8.],
           [9.],
           [9.]],
          [[8.],
           [9.],
           [9.]]]], dtype=float32)>

参数

pool_size：整数或 2 个整数的元组，表示取最大值的窗口大小。(2, 2) 将在 2x2 的池化窗口内取最大值。如果只指定了一个整数，则两个维度将使用相同的窗口长度。
strides：整数、2 个整数的元组或 None。步长值。指定了每个池化步骤中池化窗口的移动距离。如果为 None，则默认为 pool_size。
padding："valid" 或 "same"（不区分大小写）之一。"valid" 表示无填充。"same" 会在输入的左/右或上/下进行均匀填充，使得输出与输入具有相同的高度/宽度维度。
data_format：字符串，channels_last（默认）或 channels_first 之一。输入中维度的顺序。channels_last 对应于形状为 (batch, height, width, channels) 的输入，而 channels_first 对应于形状为 (batch, channels, height, width) 的输入。当未指定时，使用您 TF-Keras 配置文件 ~/.keras/keras.json 中的 image_data_format 值（如果存在），否则默认为 'channels_last'。默认为 'channels_last'。

输入形状

如果 data_format='channels_last'：形状为 (batch_size, rows, cols, channels) 的 4D 张量。
如果 data_format='channels_first'：形状为 (batch_size, channels, rows, cols) 的 4D 张量。

输出形状

如果 data_format='channels_last'：形状为 (batch_size, pooled_rows, pooled_cols, channels) 的 4D 张量。
如果 data_format='channels_first'：形状为 (batch_size, channels, pooled_rows, pooled_cols) 的 4D 张量。

一个秩为 4 的张量，表示最大池化值。有关输出形状，请参见上文。

MaxPooling2D 层

MaxPooling2D 类

MaxPooling2D 层

MaxPooling2D 类

`MaxPooling2D` 类