Loading paper
MAR: Efficient Large Language Models via Module-aware Architecture Refinement | Tomesphere