Implement Automated Data Cleaning and Preprocessing for Student Metrics #112

Open
Anushka-Pote opened this issue Nov 9, 2024 · 1 comment

Comments

@Anushka-Pote

Currently, the student metrics data (study hours, participation rate, assignment completion) is analyzed directly, with no automated cleaning or preprocessing. To improve data reliability and keep the analysis accurate, let's introduce automated data cleaning steps that handle inconsistent or extreme values.

What will be automated:
Create functions that preprocess and clean the student metrics data before analysis, ensuring:
Normalization: All participation rates are standardized on a 0-1 scale.
Capping and Flooring: Set reasonable upper and lower bounds for each metric (e.g., study hours should not exceed 100).
Handling Missing Values: Fill or interpolate any missing data points to avoid calculation errors.
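
A minimal sketch of how these three steps could fit together, in plain Python. Everything named here is hypothetical and not taken from the project: the column names, the `BOUNDS` table (only the 100-hour cap on study hours comes from this issue), and the guess that a rate above 1 was recorded as a percentage. It also assumes each list is one student's time-ordered series, so that interpolating between neighbours makes sense.

```python
from typing import Dict, List, Optional

Series = List[Optional[float]]

# Hypothetical bounds; only the 100-hour cap on study hours comes from the issue.
BOUNDS = {
    "study_hours": (0.0, 100.0),
    "participation_rate": (0.0, 1.0),
    "assignment_completion": (0.0, 1.0),
}
# Columns to rescale onto 0-1 (the issue only names participation rate).
RATE_COLUMNS = {"participation_rate"}


def to_unit_scale(values: Series) -> Series:
    """Assumption: a rate above 1 was recorded as a percentage, so divide it by 100."""
    return [None if v is None else (v / 100.0 if v > 1.0 else v) for v in values]


def fill_missing(values: Series) -> List[float]:
    """Linearly interpolate None entries; gaps at the edges copy the nearest known value."""
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("cannot fill a series with no known values")
    filled = list(values)
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            frac = (i - left) / (right - left)
            filled[i] = values[left] + frac * (values[right] - values[left])
    return filled


def clip(values: List[float], lower: float, upper: float) -> List[float]:
    """Cap and floor every value into [lower, upper]."""
    return [min(max(v, lower), upper) for v in values]


def clean_metrics(raw: Dict[str, Series]) -> Dict[str, List[float]]:
    """Normalize rates, fill gaps, then cap and floor each known metric."""
    cleaned = {}
    for name, values in raw.items():
        if name in RATE_COLUMNS:
            values = to_unit_scale(values)
        values = fill_missing(values)
        if name in BOUNDS:
            lower, upper = BOUNDS[name]
            values = clip(values, lower, upper)
        cleaned[name] = values
    return cleaned


if __name__ == "__main__":
    sample = {
        "study_hours": [10.0, None, 30.0, 250.0],
        "participation_rate": [80.0, 0.5, None, 120.0],
    }
    print(clean_metrics(sample))
```

Rates are rescaled before the gaps are filled so that interpolation never mixes percentage and 0-1 values, and clipping runs last so that interpolated points are bounded too. Whether to interpolate, or to fill with a per-metric mean or median instead, depends on how the data is actually stored.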

@Sankalp13353

I would be grateful if you could assign this issue to me. I am eager to contribute to this project and look forward to working on it. Please let me know if you need any additional information from my side.
