Knowledge distillation can be used to distill safer, more robust behaviors from larger teacher models.
Knowledge Distillation: Architecting Safer and More Robust AI Models Introduction The race to build increasingly large Large Language Models (LLMs)…
