Publications
* Equal Contribution, † Corresponding Author(s)
2025
- AAAINumerical Pruning for Efficient Autoregressive ModelsThe Association for the Advancement of Artificial Intelligence, 2025
- AAAILazyDiT: Lazy Learning for the Acceleration of Diffusion TransformersThe Association for the Advancement of Artificial Intelligence, 2025
2024
- TCADHotaQ: Hardware Oriented Token Adaptive Quantization for Large Language ModelsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024