Victor Alejandro Leiva Espinoza

Data Scientist - Portfolio


Project maintained by Vpy7 Hosted on GitHub Pages — Theme by mattgraham

Go back to Main Page

Go back to Projects

Clustering


Teen Psychology Clustering

Python Pandas NumPy Plotly Scikit-Learn XGBoost Supervised Learning Feature Engineering Hyperparameter Tuning Statistics Probability Data Modeling JSON Jupyter Google Colab GitHub Markdown Data Cleaning Data Visualization

Teen mental health is a growing concern in an increasingly digital and high-pressure world. The “Inside Teen Minds” dataset offers a detailed simulation of daily mood, stress levels, habits, and digital behavior among high school students across more than 40 countries in 2025. This project uses clustering techniques to explore patterns in the mental health and lifestyle behaviors of teens. The goal is to identify distinct behavioral profiles and gain insights into how daily habits and technology use relate to emotional well-being.

Project

Teen Psychology Clustering: Second Approach

Python Pandas NumPy Plotly Scikit-Learn XGBoost Supervised Learning Feature Engineering Hyperparameter Tuning Statistics Probability Data Modeling JSON Jupyter Google Colab GitHub Markdown Data Cleaning Data Visualization

Teen mental health is a growing concern in an increasingly digital and high-pressure world.

UMAP is performed with n_neighbours=20 and n_components=35.

Then, Gaussian Mixture, Spectral Clustering, Agglomerative Clustering and DBSCAN are performed for a cluster range of 2 to 12. For each one, Silhouette Score is computed.

Project