Go with Your Gut: Scaling Confidence for Autoregressive Image Generation
Confidence scaling for better autoregressive image generation.
Wenjie Shu is a Research Assistant at The Hong Kong University of Science and Technology (HKUST), supervised by Prof. Harry Yang and Prof. Qifeng Chen. He obtained his B.E. degree from the University of Electronic Science and Technology of China (UESTC) in 2025, where he worked closely with Prof. Liangjian Deng. He is also fortunate to collaborate with Dr. Xiaogang Xu and Prof. Ser-Nam Lim.
He is always open to research collaborations and is currently applying to PhD programs, actively seeking opportunities for the next intake. Feel free to get in touch if you are interested in working with him! His research interests include Video Generation & Understanding, Reinforcement Learning and Computer Vision.
B.E. in Information Engineering
University of Electronic Science and Technology of China (UESTC)
Visiting Student, Generative AI
The Hong Kong University of Science and Technology (HKUST)
Visiting Student, Video Generation
The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen)
My research lies at the intersection of generative modeling and reinforcement learning, aiming for controllable, reliable, and efficient image/video generation.
I actively collaborate across academia and industry. If you’re interested in collaboration, feel free to reach out.
Confidence scaling for better autoregressive image generation.
Region-focused RL for human-centric diffusion generation.
Benchmarking visual reasoning in video generation.
Cross Modulation Transformer with frequency-domain hybrid loss for pansharpening.
We explore frequency behavior in super-resolution models and its impact on reconstruction performance.