(2024)
Introduces evolutionary optimisation of model merging recipes to evolve pre-trained finetuned models at low computational cost
Unlock new capabilities by simple merging of abundantly available fine-tuned models without gradient-based optimisation
Can classical model merging techniques be automated via evolutionary algorithms?
Defines a search space of model merging recipes in parameter space (PS) and data flow space (DFS) and applies evolutionary algorithms to search for optimal recipes
Merged general Japanese and math model surpasses parent models on Japenese math benchmarks
© Mika Senghaas 2024