GlobalAveragePooling1D data_format Question

My rig
- Ubuntu 24.04 VM , RTX3060Ti with driver nvidia 535
- tensorflow-2.14-gpu/tensorflow-2.18 , both pull from docker
- Nvidia Container Toolkit if running in gpu version

About[ this example](https://keras.io/examples/timeseries/timeseries_classification_transformer/)

The transformer blocks of this example contain 2 Conv1D layer, and therefore we have to reshape the input matrix to add the channel dimension at the end.
There is a GlobalAveragePooling1D layer after the transformer blocks:
x = layers.GlobalAveragePooling1D(data_format="channels_last")(x)

which should be correct since our channel is added at the last.

However, if running these example, the summary at the last third line will not have 64,128 Params
dense (Dense)       │ (None, 128)       │  64,128 │ global_average_pool… 

Instead it will just have 256 parameters and making the total params way less, the model will also have an accuracy of ~50% only
![Screenshot from 2024-12-11 13-32-38](https://github.com/user-attachments/assets/9ced2781-0edd-47aa-8fab-03aba3185fbf)

this happen no matter i am running tensorflow-2.14-gpu, or just using the CPU version tensorflow-2.18

However, if changing the data_format="channels_first" everything become fine. The number of params in the GlobalAveragePooling1D layer become 64,128. The total params also match. The training accuracy also more than 90%.

I discover that as i find a very similar model [here](https://github.com/mxochicale/intro-to-transformers/blob/main/tutorials/time-series-classification/timeseries_transformer_classification.ipynb).
The only difference is the data_format

But isn't data_format="channels_last" is the right choice ?

So whats wrong ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GlobalAveragePooling1D data_format Question #20627

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

GlobalAveragePooling1D data_format Question #20627

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions