❓ FAQΒΆ

  • Q: I have a clinical dataset with some patient data. Can I use this data with CliMB?

    • A: This depends on the nature and requirements of your data. Reading the πŸ” Data Privacy section of this documentation should allow you to make an informed decision on this.

  • Q: My dataset has so-and-so many variables and so-and-so many records (e.g. patients). Will a dataset of this size work with CliMB?

    • A: This depends on your hardware configuration and the details of your dataset(note: for minimal hardware requirements, please refer to the πŸ“¦ Installation).

      • If your dataset has more than a few 10s of features, the data exploration step may produce too much output, and use up the LLM context window. We recommend using datasets with no more than ~50 features.

      • A larger number of records (rows of data) will lead to a longer execution time of various tools, especially the predictive modelling steps and the feature importance steps. For a modern workstation with a GPU, please use the following very rough guide for time estimate: 5 minutes per 1,000 records for the predictive modelling step, and up to 5 times slower for the feature importance step.

  • Q: I have problems when installing or running CliMB. Where can I get help?

    • A: Please submit a GitHub issue here, or ask us on Slack, #climb channel.