MNIST - the world’s most famous computer vision benchmark dataset. From a human perspective, the only thing we should detect on MNIST is shape. However, it seems like neural networks are much more sensitive to texture than shape. I debugged many neural networks that ran on videos, and texture sensitivity was the number one failure case. Therefore, I introduce CMNIST. CMNIST combines existing MNIST-like datasets to generate arbitrary challenging datasets using 784 pixels only. I hope this helps to develop better neural network architectures instead of overfitting to existing datasets.

Version 2

Release planned for Q2/2023

WIP

Version 1

How to cite CMNIST

If you use any of the CMNIST subsets in your research, please use the following bibtex entry:

@Misc{wenkel2019cmnist,
  author = {Simon Wenkel},
  title  = {Concatenated MNIST (CMNIST). Making 784 pixels challenging again.},
  year   = {2019},
  url    = {https://www.simonwenkel.com/publications/articles/pdf/20190924_CMNIST.pdf},
}