Fine-tune ViT model on our dataset