Data Scientist - Portfolio
Python
Pandas
NumPy
Plotly
Scikit-Learn
XGBoost
Supervised Learning
Feature Engineering
Hyperparameter Tuning
Statistics
Probability
Data Modeling
JSON
Jupyter
Google Colab
GitHub
Markdown
Data Cleaning
Data Visualization
Teen mental health is a growing concern in an increasingly digital and high-pressure world. The “Inside Teen Minds” dataset offers a detailed simulation of daily mood, stress levels, habits, and digital behavior among high school students across more than 40 countries in 2025. This project uses clustering techniques to explore patterns in the mental health and lifestyle behaviors of teens. The goal is to identify distinct behavioral profiles and gain insights into how daily habits and technology use relate to emotional well-being.
Python
Pandas
NumPy
Plotly
Scikit-Learn
XGBoost
Supervised Learning
Feature Engineering
Hyperparameter Tuning
Statistics
Probability
Data Modeling
JSON
Jupyter
Google Colab
GitHub
Markdown
Data Cleaning
Data Visualization
Teen mental health is a growing concern in an increasingly digital and high-pressure world.
UMAP is performed with n_neighbours=20
and n_components=35
.
Then, Gaussian Mixture, Spectral Clustering, Agglomerative Clustering and DBSCAN are performed for a cluster range of 2 to 12. For each one, Silhouette Score is computed.