I remember I once gone through vision transformer paper (An image is worth 16x16…) explained by aman arora. Now I can’t find that lecture. Can anyone here let me know about that lecture, or recommend this explanation by some other who explained this paper in easy way.