Multimodal large language models (MLLMs) hold promise for a range of medical applications. Here, the authors apply MLLMs to 3D brain CT radiology report generation and show that combining anatomy-aware model fine-tuning with robust evaluation metrics yields a comprehensive and effective framework for this task.