Artificial intelligence (AI) models trained on large datasets are increasingly seen as the key to unlocking personalized treatments for brain disorders. An important bottleneck for scaling AI is the cost of data collection. This raises a fundamental dilemma: is it more cost-effective to scan more people for a short time, or fewer people for longer?