摘要:
大规模预训练模型已经在自然语言处理等领域展现出强大的能力. 为更好地适配下游任务, 微调预训练模型是一个常用的方法. 然而, 大模型的全参数微调面临计算成本高昂、存储需求巨大等严峻挑战. 参数高效微调(PEFT)作为解决这些问题的关键技术范式, 仅引入或选择极少量可训练参数, 在显著降低计算和存储开销的同时, 有效保持模型的能力. 该综述系统梳理PEFT领域的主流方法体系、关键技术进展与发展趋势. 首先, 将现有方法归纳为四大范式: 添加式、局部式、重参数化式以及融合式, 并深入剖析各类方法的核心机理、性能特征、应用场景及策略优势. 进而, 重点探讨PEFT的技术演进, 从技术变化中分析出现该发展的内在本质规律, 总结出PEFT 方法从单一方法创新向存储、计算、性能三元权衡, 以及自动化、智能化、软硬件协同等统一框架发展的技术趋势. 更进一步, 该综述对各类PEFT中的代表性方法进行系统性的定量比较, 在统一的模型与数据集上评估其性能与参数效率. 此外, 本综述还涵盖PEFT 技术在视觉、语音及跨模态模型等领域的拓展应用, 展现其广泛的适用性. 最后, 总结并探讨未来研究方向, 以推动更高效、更适应多样化任务的大型模型微调技术的发展.
Abstract:
Large-scale pre-trained models have demonstrated remarkable capabilities in fields such as natural language processing. Fine-tuning these models is a common approach to adapt them to downstream tasks. However, full fine-tuning of large models faces severe challenges, including high computational costs and substantial storage requirements. Parameter-efficient fine-tuning (PEFT) has emerged as a key technical paradigm to address these issues by introducing or selecting a minimal number of trainable parameters, significantly reducing computation and storage overhead while effectively preserving the core capabilities of models. This paper provides a systematic review of the mainstream methodologies, key technological advances, and development trends within the PEFT field. First, we categorize existing methods into four major paradigms: Additive, selective, reparameterized, and hybrid fine-tuning, offering an in-depth analysis of their core mechanisms, performance characteristics, application scenarios, and strategic advantages. Furthermore, we focus on the technical evolution of PEFT, analyzing the intrinsic principles behind its development. We summarize a clear technical trajectory: The field is moving from isolated method innovations toward a sophisticated tri-balance of storage, computation, and performance, and further advancing into unified frameworks emphasizing automation, intelligence, and hardware-software co-design. Furthermore, we conduct a systematic quantitative comparison of representative methods from various PEFT categories, evaluating their performance and parameter efficiency on uniform models and datasets. In addition, this survey also covers the extended applications of PEFT technology in fields such as vision, speech, and cross-modal models, demonstrating its broad applicability. Finally, we discuss promising future research directions to facilitate the development of more efficient and adaptable fine-tuning techniques for large-scale models across diverse tasks.