Results for "instruction tuning"
Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
Parameter-efficient fine-tuning (PEFT) method that injects trainable low-rank matrices into a model's layers, enabling efficient fine-tuning.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Automated detection and prevention of disallowed outputs (toxicity, self-harm, illegal instructions, etc.).
Task instruction without examples.
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.
Techniques that fine-tune small additional components rather than all weights to reduce compute and storage.
Controlling robots via language.
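
As a rough illustration of the low-rank adapter entry above, here is a minimal pure-Python sketch of a layer whose pretrained weight stays frozen while two small factor matrices carry the trainable update. The class name LowRankAdapterLayer, the helper matvec, the alpha/rank scaling, and the zero initialization of B are assumptions chosen for illustration (they follow common practice), not details taken from these results.

```python
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


class LowRankAdapterLayer:
    """Hypothetical linear layer: frozen weight W plus a low-rank update B @ A."""

    def __init__(self, d_in, d_out, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # Stand-in for a frozen pretrained weight of shape d_out x d_in.
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Trainable low-rank factors: A is rank x d_in, B is d_out x rank.
        self.A = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        # Assumed zero init for B, so the adapter changes nothing at the start.
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank

    def forward(self, x):
        base = matvec(self.W, x)                   # frozen path
        delta = matvec(self.B, matvec(self.A, x))  # low-rank path
        return [b + self.scale * d for b, d in zip(base, delta)]

    def trainable_params(self):
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])

    def frozen_params(self):
        return len(self.W) * len(self.W[0])


layer = LowRankAdapterLayer(d_in=64, d_out=64, rank=4)
print(layer.trainable_params(), layer.frozen_params())  # prints "512 4096"
```

With these sizes the adapter trains 512 values against 4096 frozen ones, which is the compute and storage saving the parameter-efficient entries describe. A real setup would also need gradients and an optimizer, which this sketch leaves out.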