Eric Villanueva | Data Analyst & Data Scientist

Hello! I'm

ERIC VILLANUEVA

Aspiring Data Analyst & Data Scientist

CS grad · Chapman University San Francisco, CA

About me

I'm a data-focused CS grad, born and raised in San Francisco, half Mexican and half Filipino, turning messy datasets into clear and decision-ready insight.

I work with Python, SQL, Tableau, Power BI, and APIs, building pipelines and analyses that people actually use. I treat data like a product: clean structure, clear logic, and outputs that make sense to non-technical stakeholders.

Outside of work, I'm building the Resale Market Price Tracker, a project at the intersection of fashion, data, and market intelligence. I care deeply about expanding access and representation in tech for Black and Latinx communities.

🥾

Outdoors and HikingBorn and raised in the Bay Area, I love being outside on trails, in parks, and anywhere that gets me off a screen.

🎵

Music ObsessedMy taste runs across all genres. Current favorites are Blood Orange, Kelela, and FKA Twigs.

📖

Currently ReadingMy favorite book is Giovanni's Room by James Baldwin.

♠️

Poker StudentWorking on sharpening my game. I love the strategy, probability, and psychology behind every hand.

Python

Pandas

NumPy

Scikit-learn

SQL / MySQL

Tableau

Power BI

Excel

Matplotlib

Seaborn

Git

Jupyter

Feature Engineering

EDA

Experience

Research & Data Operations

Jan 2025 to Present

Stealth Robotics Startup · San Francisco, CA

Collected and structured operational data from robotic systems performing household task automation to support ML model development and training pipelines.
Worked with engineering teams to apply standardized data collection protocols, keeping datasets consistent and reliable across evaluation cycles.
Contributed to iterative model evaluation by surfacing behavioral patterns and performance gaps through systematic data review.

Mail & Logistics Coordinator

Aug 2022 to Sep 2024

Chapman University · Orange, CA

Processed and sorted 350+ packages daily across Amazon and major carriers with 99% accuracy, maintaining throughput under time constraints.
Maintained detailed records of all incoming mail and packages to support inventory tracking and data integrity across 4+ campus buildings.

Front-End Web Development Intern

Jun 2020 to Sep 2020

Code Tenderloin · San Francisco, CA

Built and deployed a website using HTML, CSS, and JavaScript to highlight housing insecurity and social injustices facing San Francisco's unhoused population.
Documented and analyzed tech employee work culture to surface insights on industry accessibility and barriers for underrepresented communities.

Education

B.S. Computer Science, Minor in Business Administration

Chapman University · Orange, CA

Aug 2021 to May 2025

My Work

SQL / Analytics / Visualization

COVID-19 Global Data Analysis

Modeled and joined two tables across 200+ countries to analyze infection rates, death counts, and vaccination rollout over time.
Wrote queries using window functions, CTEs, and temp tables to track rolling vaccination totals and infection rates by country.
Built SQL views to store key metrics by continent and country for downstream visualization.

Tools SQL Excel Tableau

200+ countries analyzed across the full pandemic timeline

Machine Learning / EDA

Spotify Streams Analysis

Analyzed 4,600+ Spotify songs and found Playlist Count (R² = 0.74) was a significantly stronger predictor of streams than Playlist Reach (R² = 0.32).
Compared polynomial regression models across degrees 2, 3, and 4 on an 80/20 split; degree-2 was optimal as higher-degree models severely overfit the training data.
Applied GMM clustering with AIC selection and compared DBSCAN vs. K-Means on 2,600+ songs, selecting K-Means (k=2, silhouette = 0.49) for cleaner cluster results.

Tools Python Scikit-learn Pandas

4,600+ songs analyzed across regression and clustering models

SQL / Power BI / HR Analytics

HR Analytics Dashboard

Cleaned and structured 22,000+ employee records spanning 2000 to 2020 in MySQL Workbench, standardizing date formats, normalizing categorical fields, and creating calculated columns for age, tenure, and termination metrics.
Wrote SQL queries to aggregate workforce data by department, job title, race, gender, and state, then exported cleaned data into Power BI for visualization.
Built an interactive Power BI dashboard covering gender and race distribution, age group breakdowns, HQ vs. remote splits, employee count trends over time, and turnover rates by department.

Tools SQL MySQL Workbench Power BI

22,000+ employee records analyzed across a 20-year window

Data Engineering / Analytics

Resale Market Price Tracker

Built an end-to-end pipeline pulling live eBay sold listings for luxury fashion brands like Bottega Veneta and Acne Studios.
Parsed JSON API responses and cleaned and structured pricing data with Pandas to surface market trends for resale intelligence.
Actively expanding brand coverage and visualization layer for deeper price trend analysis.

Tools Python Pandas eBay API JSON

Live market data pipeline, actively in development

Get in touch

Let's build something together

Open to data analyst, data scientist, and associate PM roles, especially in fashion tech, robotics, or social impact. Born and raised in San Francisco.

Email LinkedIn GitHub Resume