Adopting multiple vision transformer layers for fine-grained image representation