Skip to content

Basic Data Preprocessing with GitHub Copilot

Prompt 1: Simple Data Preprocessing Steps

Use case: Help data scientists and ML engineers create preprocessing code for machine learning models.

Prompt: Preprocess a dataset for a machine learning model. The dataset details are:

dataset_type: {dataset_type}
target_variable: {target_variable}
basic_issues: {basic_issues}

Generate a preprocessing script that includes:
- Loading data.
- Basic cleaning.
- Feature transformations.
- Standard preprocessing steps.
`

Key points:
1. Use standard libraries like pandas, numpy, and scikit-learn.
2. Focus on common preprocessing steps.
3. Include basic data exploration.
4. Use standard approaches for handling missing values and outliers.