You Only Look Once (YOLO)
Redmon et al. (2015)
Why It Matters
Object detection as single regression at 45 FPS. Made real-time detection practical for robotics, surveillance, autonomous systems.
Key Ideas
- Treat object detection as one direct regression problem from image pixels to bounding boxes and class probabilities instead of a proposal stage followed by classification.
- Predict all detections in a single forward pass, making real-time object detection practical.
- Accept weaker localization and small-object recall in exchange for simplicity and speed.
- Establish the one-stage detector pattern that later real-time detectors refined rather than replaced.
Notes
- YOLO changed the culture of detection by making speed a primary design goal.
- Later YOLO variants mostly improve the same basic idea with better training, scaling, and box prediction.