publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. ICLR
    MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling
    Rachel S.Y. Teo, and Tan M. Nguyen
    International Conference on Learning Representations (ICLR), 2025, Feb 2025
  2. ICLR
    Tight Clusters Make Specialized Experts
    Stefan Nielsen*, Rachel S.Y. Teo*, Laziz Abdullaev, and Tan M. Nguyen
    International Conference on Learning Representations (ICLR), 2025, Feb 2025
  3. ICLR
    CAMEx: Curvature-aware Merging of Experts
    Viet Dung Nguyen, Minh Nguyen Hoang, Rachel S.Y. Teo, Luc Nguyen, and 2 more authors
    International Conference on Learning Representations (ICLR), 2025, Feb 2025

2024

  1. NeurIPS
    MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
    Rachel S.Y. Teo, and Tan M. Nguyen
    Conference on Neural Information Processing Systems (NeurIPS), 2024, Oct 2024
  2. NeurIPS
    Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
    Rachel S.Y. Teo, and Tan M. Nguyen
    Conference on Neural Information Processing Systems (NeurIPS), 2024, Oct 2024
  3. NeurIPS
    Elliptical Attention
    Stefan Nielsen*, Laziz Abdullaev*, Rachel S.Y. Teo, and Tan M. Nguyen
    Conference on Neural Information Processing Systems (NeurIPS), 2024, Oct 2024