MIDUS: Memory-Infused Depth Up-Scaling

Published in Under Review, 2025

We propose MIDUS, a depth up-scaling method for large language models that infuses memory mechanisms into newly inserted layers. MIDUS enables efficient model depth expansion while preserving pretrained knowledge and improving parameter utilization.