Fig. 5From: A survey of Transformer applications for histopathological image analysis: New developments and future directionsA schematic diagram of a standard ViT model. Sequential image patches are used as the input, which is then processed with a transformer encoder and uses an MLP head module to generate a class predictionBack to article page