🍎 Predictive Classification of Apples 🍎

The purpose of this project is to create predictive classifiers to determine the quality of apples. These predictions are based on variables related to the quality of apples using features such as ripeness, crunchiness, weight, acidity, etc. The models can be utilized by farmers or produce sellers to determine the financial value of their product or to predict the quality of future anticipated harvests.

Data

The data was retrieved in the form of a CSV file from Kaggle.com and uploaded by Nidula Elgiriyewithana who states that the set was provided by a nameless American agriculture company. https://www.kaggle.com/datasets/nelgiriyewithana/apple-quality

Methodology

Python: EDA, modification, cleaning, and modeling

EDA

The uploader kindly took the liberty of scaling and cleaning the data. There were some remaining NA values which I cleaned prior to modeling.

Modification of Data

Column: A_id | Represents unique identifiers for each fruit. This feature was not required for modeling and was dropped from the dataframe.
Column: Quality | Initially the quality of the apples was listed as either "good" or "bad". For the sake of convenience these values were re-mapped to numerical binary values. Good apples are transformed to "1" and bad apples are "0"

Visualization

The data can be visualized utilizing a kernel density estimate (KDE) plot which covers all of the variables in the set.

Discussion of Results

Observing the results the K-Nearest Neighbor(KNN) model is the most accurate at 90.08% while our least accurate model is Naive Bayes at 72.25%. Ideally we would like to see values of >= 85% and 6 out of 10 models were able to meet this success rate. Implementation of the KNN model will lead to accurate predictions in 9 out of 10 evaluations.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Apple KDE.png		Apple KDE.png
Apples.ipynb		Apples.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍎 Predictive Classification of Apples 🍎

Table of Contents

Project Overview

Data

Methodology

EDA

Modification of Data

Visualization

Discussion of Results

About

Releases

Packages

Languages

DanielWrightGIT/predicting-apple-quality

Folders and files

Latest commit

History

Repository files navigation

🍎 Predictive Classification of Apples 🍎

Table of Contents

Project Overview

Data

Methodology

EDA

Modification of Data

Visualization

Discussion of Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages