MIDUS: Memory-Infused Depth Up-Scaling
Published in Under Review, 2025
We propose MIDUS, a depth up-scaling method for large language models that infuses memory mechanisms into newly inserted layers. MIDUS enables efficient model depth expansion while preserving pretrained knowledge and improving parameter utilization.
