Now showing items 1-2 of 2

    • Improving local features with relevant spatial information by vision transformer for crowd counting 

      Nguyen, H. Tran; Ta, Duc Huy; Duong, T. M. Soan; Nguyen, Phan; Dao, Huu Hung; Nguyen, D. Tr. Chanh; Bui, Trung; Truong, Q. H. Steven (2022)
      Vision Transformer (ViT) variants have demonstrated state-of-the-art performances in plenty of computer vision benchmarks, including crowd counting. Although Transformer-based models have shown breakthroughs in crowd ...
    • LOGOVIT: Local-global vision transformer for object re-identification 

      Phan, Nguyen; Tran, Sam; Nguyen, Tran Hoang; Ta, Duc Huy; Duong, T. M. Soan; Nguyen, D. Tr. Chanh; Dao, Huu Hung; Bui, Trung; Truong, Q. H. Steven (2023-06)
      Object re-identification (ReID) is prone to errors under variations in scale, illumination, complex background, and object occlusion scenarios. To overcome these challenges, attention mechanisms are employed to concentrate ...

      Vin University Library
      Da Ton, Gia Lam
      Vinhomes Oceanpark, Ha Noi, Viet Nam
      Phone: +84-2471-089-779 | 1800-8189
      Contact: library@vinuni.edu.vn