Xuan Shen
I am a Ph.D. student in ECE Department of Northeastern University at Boston, advised by Yanzhi Wang. Previously, I received my M.S. degree at Northeastern University in 2020 and my B.S. degree at Nanjing University of Science and Technology in 2018.
My research interest is Efficient AI including pruning, quantization, NAS, and distillation with Software Hardware Co-Design on Mobile, FGPA, and ASIC.
I work closely with Jiuxiang Gu, Prof. Pu Zhao and Prof. Wei Niu. I was fortunate to work with Ming Lin.
As a final-year Ph.D. candidate, I am actively pursuing post-doctoral and full-time research opportunities. I would welcome the chance to connect and discuss potential collaborations if our research interests align.
News
Dec 09, 2024 | Got Adobe Reward: 2024 Key Innovations (Tech Transfer Small LLM on Acrobat). |
---|---|
Dec 09, 2024 | Got three papers accepted in AAAI 2025. |
Nov 19, 2024 | Multimodal Opioid Benchmark released on HuggingFace: opioidarchive/oida-qa. |
Oct 30, 2024 | Our paper about PTQ of LLMs on Mobile and FPGA has been accepted to TCAD. |
Sep 25, 2024 | Got two papers accepted in NeurIPS 2024. |
Selected Publications
- AAAINumerical Pruning for Efficient Autoregressive ModelsThe Association for the Advancement of Artificial Intelligence, 2025
- AAAILazyDiT: Lazy Learning for the Acceleration of Diffusion TransformersThe Association for the Advancement of Artificial Intelligence, 2025
- TCADHotaQ: Hardware Oriented Token Adaptive Quantization for Large Language ModelsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024