This is a Data Science and Machine Learning project that tries to predict early diabetes type two using simple classification algorithms, such as Decision Tree, KNN and SVC. This project was done for study purposes and for an extension activity - HackoonSpace - of my university.
The implementation uses Python language with Google Colaboratory and libraries as NumPy, Matplotlib, Pandas and Scikit Learn. Also, the models are trained from the following data set:
- "Early stage diabetes risk prediction dataset", downloadable at UCI Repository
- Data collection and exploratory analysis
- Data pre-processing, coding and regularization
- Models training
- Evaluation and validation
- Download the repository
- Open and run the .ipynb files on Google Colaboratory or Jupyter Notebook
*you can do tests and change some hyperparameters of the models to try better results
- Matheus Vargas Volpon Berto (MathVolps)