Explore how multi-head attention allows LLMs to focus on different parts of a sentence simultaneously for complex linguistic patterns.