Designing Better Datasets for Diagnostic Analytics