Build an image captioning system. Learn how CNNs, sequence-to-sequence models, and attention mechanisms combine to generate natural language descriptions of images.