Meta-Learning — Learning to Learn
Meta-learning trains models to learn new tasks quickly from few examples.
The Problem
Standard ML: Learn ONE task from MANY examples
Meta-Learning: Learn MANY tasks from FEW examples each
Example:
Standard: 10,000 images of cats → recognize cats
Meta-Learning: 5 images of new animal → recognize new animal
Approaches
MAML (Model-Agnostic Meta-Learning):
├─ Find initialization that adapts quickly
├─ Inner loop: adapt to new task
├─ Outer loop: optimize initialization
└─ Works with any gradient-based model
Prototypical Networks:
├─ Compute class prototypes (mean embeddings)
├─ Classify by nearest prototype
├─ Simple and effective
└─ Works for few-shot classification
Matching Networks:
├─ Attention-based similarity
├─ Weighted nearest neighbor
└─ Episodic training
Key Takeaways
- Meta-learning enables few-shot learning
- MAML finds good initialization for fast adaptation
- Prototypical networks use class prototypes
- Episodic training simulates few-shot scenarios
- Applications: robotics, personalization, drug discovery
- Transfer learning is a simpler alternative
- Meta-learning is data-efficient for new tasks
- Neural architecture search is related (learning to design models)