I currently work at AGI startup Moonshot AI (ζδΉζι’).
My research interests are LLM & MLLM Scaling, Efficient Deep Learning Methods, etc.
We are hiring!
If you are passionate about developing next generation of AI,
or if you're keen on crafting the next super app,
or if you see yourself fitting into a role at Moonshot AI,
don't hesitate to get in touch with me!
Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection
[EMNLP'24] Z. Chen, Z. Zhu, X. Zhuang, Z. Huang, Y. Zou
Divide and Conquer: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language Understanding
[CIKM'24] Z. Zhu, X. Cheng, Z. Chen, Z. Wang, Z. Huang, Y. Zou
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model
[ACMMM'24] Z. Yao, X. Cheng, Z. Huang
Mixture of Bidirectional Adapter for Multi-modal Sarcasm Detection
[ACMMM'24] Y. Xie, Z. Zhu, X. Chen, Z. Chen, Z. Huang
Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation
[MICCAI'24] Z. Zhu, X. Cheng, Y. Zhang, Z. Chen, Q. Long, H. Li, Z. Huang, X. Wu, Y. Zheng
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
[ACL'24] X. Cheng, Z. Zhu, X. Zhuang, Z. Chen, Z. Huang, Y. Zou
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Leve Alignment
[ACL'24] Z. Zhu, X. Cheng, Z. Chen, X. Zhuang, Z. Huang, Y. Zou
KC-Prompt: End-to-end Knowledge-Complementary Prompting for Rehearsal-free Continual Learning
[ICASSP'24] Y. Li, Y. Liu, X. Cheng, Z. Zhu, H. Li, B. Yang, Z. Huang
Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics
[COLING'24] Z. Zhu, Y. Zhang, X. Cheng, Z. Huang, D, Xu, X. Wu, Y. Zheng
Towards Unified SLU Decoding via Label-aware Compact Linguistics Representations
[ACL'23] Z. Zhu, X. Cheng, Z. Huang, D. Chen, Y. Zou
A Multi-grained Contrastive Learning Framework for ASR-robust Language Understanding
[EMNLP'23] Z. Huang, D. Chen, Z. Zhu, X. Cheng
Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- & Intra-modality Attention
[AAAI'21] Z. Huang, F. Liu, X. Wu, S. Ge, H. Wang, W. Fan, Y. Zou
GhostBERT: Generate More Features with Cheap Operations for BERT
[ACL'21] Z. Huang, L. Hou, L. Shang, X. Chen, X. Jiang, Q. Liu
DynaBERT: Dynamic BERT with Adaptive Width and Depth
[NeurIPS'20] L. Hou, Z. Huang, L. Shang, X. Jiang, X. Chen, Q. Liu