A versatile pipeline for generating and editing 3D head avatars with textual prompts.
A versatile and efficient multi-task model for fashion-focused V+L tasks.
The second place solution for 2nd eBay eProduct Visual Search Challenge (FGVC9-CVPR2022).
A versatile and flexible framework for fashion-focused V+L representation learning.
A unified framework and benchmark for two interactive garment retrieval tasks.
Solve the data scarcity problem in TBPS through transfer learning and contrastive learning.
Interdisciplinary AI application in metasurface designing.